1. 30 Mar, 2020 1 commit
  2. 28 Mar, 2020 6 commits
  3. 26 Mar, 2020 3 commits
  4. 25 Mar, 2020 2 commits
  5. 24 Mar, 2020 2 commits
  6. 19 Mar, 2020 1 commit
  7. 18 Mar, 2020 2 commits
  8. 09 Mar, 2020 1 commit
  9. 21 Feb, 2020 1 commit
  10. 19 Dec, 2019 3 commits
  11. 13 Dec, 2019 1 commit
  12. 10 Dec, 2019 3 commits
  13. 06 Dec, 2019 1 commit
  14. 04 Dec, 2019 3 commits
  15. 20 Nov, 2019 5 commits
    • Wenzhe Yu's avatar
      Use cuBLAS in multiply_a_b · 85782a1f
      Wenzhe Yu authored
      85782a1f
    • Wenzhe Yu's avatar
      GPU memory optimization in ELPA2 · af7bb4a0
      Wenzhe Yu authored
      * Removed redundant malloc, memset and memcpy
      * Use pinned host memory
      * Implemented blocking for GPU code path in step5
      * Removed unused code
      af7bb4a0
    • Wenzhe Yu's avatar
      Extend CUDA wrapper · 6e5c03a6
      Wenzhe Yu authored
      * cudaMallocHost
      * cudaFreeHost
      * cudaHostRegister
      * cudaHostUnregister
      6e5c03a6
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
    • Andreas Marek's avatar
      ELPA 2019.11.001.rc1 · 1afe1b76
      Andreas Marek authored
      1afe1b76
  16. 14 Nov, 2019 1 commit
  17. 11 Nov, 2019 1 commit
  18. 08 Nov, 2019 1 commit
  19. 05 Nov, 2019 1 commit
  20. 04 Nov, 2019 1 commit