1. 23 Nov, 2020 1 commit
  2. 25 Sep, 2020 1 commit
  3. 11 Aug, 2020 1 commit
  4. 10 Aug, 2020 1 commit
  5. 08 Apr, 2020 1 commit
  6. 06 Apr, 2020 1 commit
  7. 31 Mar, 2020 1 commit
  8. 28 Mar, 2020 1 commit
  9. 20 Nov, 2019 2 commits
    • Wenzhe Yu's avatar
      GPU memory optimization in ELPA2 · af7bb4a0
      Wenzhe Yu authored
      * Removed redundant malloc, memset and memcpy
      * Use pinned host memory
      * Implemented blocking for GPU code path in step5
      * Removed unused code
      af7bb4a0
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
  10. 17 Oct, 2019 1 commit
    • Andreas Marek's avatar
      Experimental feature: 64bit integer support for MPI · 043ddf39
      Andreas Marek authored
      ELPA can now be linked against a 64bit integer version of MPI and
      ScalaPack. This is an experimental feature
      
      The following points are still to be done
      - does not work with real QR-decomposition
      - generalized routines return wrong results
      - the C tests and the C Cannon algorithm implementation do not
        work (no 64bit header files for MPI *at least* with Intel MPI)
      043ddf39
  11. 14 Oct, 2019 3 commits
  12. 25 Jun, 2019 1 commit
  13. 16 Jan, 2019 1 commit
  14. 11 Jan, 2019 1 commit
  15. 18 Oct, 2018 1 commit
  16. 13 Oct, 2018 1 commit
  17. 27 Sep, 2018 1 commit
  18. 16 May, 2018 1 commit
  19. 08 May, 2018 1 commit
  20. 07 May, 2018 1 commit
  21. 04 May, 2018 1 commit
  22. 30 Apr, 2018 1 commit
  23. 11 Sep, 2017 1 commit
  24. 03 Sep, 2017 9 commits
  25. 02 Sep, 2017 2 commits
  26. 01 Sep, 2017 3 commits