1. 23 Nov, 2020 2 commits
  2. 22 Sep, 2020 1 commit
  3. 18 Jun, 2020 1 commit
  4. 10 Jun, 2020 1 commit
  5. 09 Jun, 2020 3 commits
  6. 08 Apr, 2020 1 commit
  7. 07 Apr, 2020 1 commit
  8. 06 Apr, 2020 1 commit
    • Andreas Marek's avatar
      AVX kernels should not use FMA · 2b2bb0d2
      Andreas Marek authored
      Due to an error, on systems where AVX-2 (FMA) was supported, also
      the "pure" AVX kernels used FMA instructions. This has been fixed
      now
      2b2bb0d2
  9. 05 Mar, 2020 1 commit
  10. 20 Nov, 2019 1 commit
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
  11. 11 Nov, 2019 1 commit
  12. 05 Nov, 2019 1 commit
  13. 24 Oct, 2019 1 commit
  14. 23 Oct, 2019 1 commit
  15. 21 Oct, 2019 1 commit
  16. 19 Oct, 2019 2 commits
  17. 14 Oct, 2019 1 commit
    • Andreas Marek's avatar
      Enable 64bit lapack support · d836eea2
      Andreas Marek authored
      ELPA can now link agains a 64bit integer verion of BLAS/LAPACK.
      Currently this only works if ELPA is compiled with MPI=OFF!
      
      The 64bit support is not available in the legacy interface
      d836eea2
  18. 11 Oct, 2019 1 commit
  19. 10 Oct, 2019 1 commit
    • Andreas Marek's avatar
      HETEROGENOUS_CLUSTER support · dd47b584
      Andreas Marek authored
      On heterogenous cluster, of nodes with different CPUs the _experimental_
      feature (--enable-heterogenous-cluster-support) can be used:
      
      It compares the (Intel) cpuid set of all CPUs which are used by ELPA MPI
      processes and finds the SIMD instruction set, which is supported by all
      used CPUs. The ELPA 2stage back-transformation kernel (a.k.a "kernel")
      will be set accordingly on all MPI processes.
      
      This feature, can override the setting of the kernel done previously by
      the user!
      
      At the moment it will only work for Intel CPUs, i.e. clusters consisting
      of nodes with Intel CPUs and e.g. AMD CPUs are at the moment _NOT_
      supported.
      
      Since this is an experimental feature, it might be dropped again in the
      future, if it turns out not to be useful for the users
      dd47b584
  20. 09 Oct, 2019 1 commit
  21. 26 Sep, 2019 2 commits
  22. 24 Sep, 2019 1 commit
  23. 23 Sep, 2019 2 commits
  24. 20 Sep, 2019 1 commit
  25. 01 Jul, 2019 1 commit
  26. 26 Jun, 2019 1 commit
  27. 17 Jun, 2019 1 commit
  28. 14 Jun, 2019 1 commit
  29. 09 Jun, 2019 2 commits
  30. 08 Jun, 2019 1 commit
  31. 06 Jun, 2019 2 commits
  32. 05 Jun, 2019 1 commit