1. 03 Mar, 2021 1 commit
  2. 01 Mar, 2021 1 commit
  3. 27 Feb, 2021 1 commit
  4. 24 Feb, 2021 1 commit
    • Andreas Marek's avatar
      Enable GPU verion with OpenMP in ELPA 2stage · ea6daa06
      Andreas Marek authored
      The GPU logic has been implemented in the OpenMP code paths in ELPA2.
      Currently, this implies that _internal_ to ELPA2, the number of OpenMP
      threads is set to one (independent of how many threads the calling
      application uses) and the original value is restored at the end of ELPA.
      Though this is not the general case, this is _not_ a limitation, since
      in the GPU case no work is done on the CPU and thus no threading is
      helpful
      ea6daa06
  5. 07 Dec, 2020 1 commit
  6. 30 Nov, 2020 1 commit
  7. 23 Nov, 2020 1 commit
  8. 25 Sep, 2020 1 commit
  9. 18 Sep, 2020 1 commit
    • Wenzhe Yu's avatar
      Update ELPA2 Householder CUDA kernel · 41c48f5f
      Wenzhe Yu authored
      One index shift is moved from outside the kernel to inside the kernel.
      Shouldn't really change anything. Just making the code more readable
      and more consistent with the paper.
      41c48f5f
  10. 11 Aug, 2020 1 commit
  11. 10 Aug, 2020 1 commit
  12. 06 Apr, 2020 1 commit
    • Andreas Marek's avatar
      AVX kernels should not use FMA · 2b2bb0d2
      Andreas Marek authored
      Due to an error, on systems where AVX-2 (FMA) was supported, also
      the "pure" AVX kernels used FMA instructions. This has been fixed
      now
      2b2bb0d2
  13. 20 Nov, 2019 1 commit
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
  14. 29 May, 2019 2 commits
  15. 07 Mar, 2019 1 commit
  16. 05 Mar, 2019 3 commits
  17. 04 Mar, 2019 1 commit
  18. 01 Mar, 2019 2 commits
  19. 25 Feb, 2019 1 commit
  20. 24 Feb, 2019 1 commit
  21. 20 Feb, 2019 1 commit
  22. 07 May, 2018 1 commit
  23. 14 Dec, 2017 2 commits
  24. 23 Nov, 2017 2 commits
  25. 20 Nov, 2017 2 commits
  26. 18 Nov, 2017 2 commits
  27. 03 Aug, 2017 1 commit
  28. 01 Aug, 2017 1 commit
  29. 30 Jul, 2017 2 commits
  30. 26 May, 2017 1 commit
  31. 23 May, 2017 1 commit