1. 08 Apr, 2020 1 commit
  2. 07 Apr, 2020 1 commit
  3. 06 Apr, 2020 1 commit
    • Andreas Marek's avatar
      AVX kernels should not use FMA · 2b2bb0d2
      Andreas Marek authored
      Due to an error, on systems where AVX-2 (FMA) was supported, also
      the "pure" AVX kernels used FMA instructions. This has been fixed
      now
      2b2bb0d2
  4. 05 Mar, 2020 1 commit
  5. 20 Nov, 2019 1 commit
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
  6. 11 Nov, 2019 1 commit
  7. 05 Nov, 2019 1 commit
  8. 24 Oct, 2019 1 commit
  9. 23 Oct, 2019 1 commit
  10. 21 Oct, 2019 1 commit
  11. 19 Oct, 2019 2 commits
  12. 14 Oct, 2019 1 commit
    • Andreas Marek's avatar
      Enable 64bit lapack support · d836eea2
      Andreas Marek authored
      ELPA can now link agains a 64bit integer verion of BLAS/LAPACK.
      Currently this only works if ELPA is compiled with MPI=OFF!
      
      The 64bit support is not available in the legacy interface
      d836eea2
  13. 11 Oct, 2019 1 commit
  14. 10 Oct, 2019 1 commit
    • Andreas Marek's avatar
      HETEROGENOUS_CLUSTER support · dd47b584
      Andreas Marek authored
      On heterogenous cluster, of nodes with different CPUs the _experimental_
      feature (--enable-heterogenous-cluster-support) can be used:
      
      It compares the (Intel) cpuid set of all CPUs which are used by ELPA MPI
      processes and finds the SIMD instruction set, which is supported by all
      used CPUs. The ELPA 2stage back-transformation kernel (a.k.a "kernel")
      will be set accordingly on all MPI processes.
      
      This feature, can override the setting of the kernel done previously by
      the user!
      
      At the moment it will only work for Intel CPUs, i.e. clusters consisting
      of nodes with Intel CPUs and e.g. AMD CPUs are at the moment _NOT_
      supported.
      
      Since this is an experimental feature, it might be dropped again in the
      future, if it turns out not to be useful for the users
      dd47b584
  15. 09 Oct, 2019 1 commit
  16. 26 Sep, 2019 2 commits
  17. 24 Sep, 2019 1 commit
  18. 23 Sep, 2019 2 commits
  19. 20 Sep, 2019 1 commit
  20. 01 Jul, 2019 1 commit
  21. 26 Jun, 2019 1 commit
  22. 17 Jun, 2019 1 commit
  23. 14 Jun, 2019 1 commit
  24. 09 Jun, 2019 2 commits
  25. 08 Jun, 2019 1 commit
  26. 06 Jun, 2019 2 commits
  27. 05 Jun, 2019 1 commit
  28. 29 May, 2019 1 commit
  29. 23 May, 2019 2 commits
  30. 20 May, 2019 1 commit
  31. 18 May, 2019 1 commit
  32. 15 May, 2019 1 commit
  33. 12 May, 2019 1 commit
  34. 10 May, 2019 1 commit