1. 28 Mar, 2020 3 commits
  2. 26 Mar, 2020 3 commits
  3. 25 Mar, 2020 2 commits
  4. 24 Mar, 2020 2 commits
  5. 19 Mar, 2020 1 commit
  6. 18 Mar, 2020 2 commits
  7. 09 Mar, 2020 1 commit
  8. 21 Feb, 2020 1 commit
  9. 19 Dec, 2019 3 commits
  10. 13 Dec, 2019 1 commit
  11. 10 Dec, 2019 3 commits
  12. 06 Dec, 2019 1 commit
  13. 04 Dec, 2019 3 commits
  14. 20 Nov, 2019 5 commits
    • Wenzhe Yu's avatar
      Use cuBLAS in multiply_a_b · 85782a1f
      Wenzhe Yu authored
      85782a1f
    • Wenzhe Yu's avatar
      GPU memory optimization in ELPA2 · af7bb4a0
      Wenzhe Yu authored
      * Removed redundant malloc, memset and memcpy
      * Use pinned host memory
      * Implemented blocking for GPU code path in step5
      * Removed unused code
      af7bb4a0
    • Wenzhe Yu's avatar
      Extend CUDA wrapper · 6e5c03a6
      Wenzhe Yu authored
      * cudaMallocHost
      * cudaFreeHost
      * cudaHostRegister
      * cudaHostUnregister
      6e5c03a6
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
    • Andreas Marek's avatar
      ELPA 2019.11.001.rc1 · 1afe1b76
      Andreas Marek authored
      1afe1b76
  15. 14 Nov, 2019 1 commit
  16. 11 Nov, 2019 1 commit
  17. 08 Nov, 2019 1 commit
  18. 05 Nov, 2019 1 commit
  19. 04 Nov, 2019 2 commits
  20. 31 Oct, 2019 1 commit
  21. 30 Oct, 2019 2 commits