- 23 Nov, 2020 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 22 Sep, 2020 1 commit
-
-
Andreas Marek authored
-
- 18 Jun, 2020 1 commit
-
-
Andreas Marek authored
-
- 10 Jun, 2020 1 commit
-
-
Andreas Marek authored
-
- 09 Jun, 2020 3 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
- 08 Apr, 2020 1 commit
-
-
Andreas Marek authored
-
- 07 Apr, 2020 1 commit
-
-
Andreas Marek authored
-
- 06 Apr, 2020 1 commit
-
-
Andreas Marek authored
Due to an error, on systems that support AVX-2 (FMA), the "pure" AVX kernels also used FMA instructions. This has now been fixed.
-
- 05 Mar, 2020 1 commit
-
-
Andreas Marek authored
-
- 20 Nov, 2019 1 commit
-
-
Wenzhe Yu authored
* Switch to a simple non-WY algorithm
* Unify real and complex cases
* Update reduction kernel
* Use __shfl_xor_sync for warp reduce (CUDA 9+)
* Support 2^n block size, n = 1,2,...,10
* Use templates when possible
* Clean up unused CUDA functions
* Increase default stripe width when using GPU
-
- 11 Nov, 2019 1 commit
-
-
Andreas Marek authored
-
- 05 Nov, 2019 1 commit
-
-
Andreas Marek authored
-
- 24 Oct, 2019 1 commit
-
-
Andreas Marek authored
-
- 23 Oct, 2019 1 commit
-
-
Andreas Marek authored
-
- 21 Oct, 2019 1 commit
-
-
Andreas Marek authored
-
- 19 Oct, 2019 2 commits
-
-
Andreas Marek authored
-
-
- 14 Oct, 2019 1 commit
-
-
Andreas Marek authored
ELPA can now link against a 64-bit integer version of BLAS/LAPACK. Currently this only works if ELPA is compiled with MPI=OFF! The 64-bit support is not available in the legacy interface.
-
- 11 Oct, 2019 1 commit
-
-
Andreas Marek authored
-
- 10 Oct, 2019 1 commit
-
-
Andreas Marek authored
On heterogeneous clusters, i.e. clusters whose nodes have different CPUs, the _experimental_ feature (--enable-heterogenous-cluster-support) can be used: it compares the (Intel) cpuid sets of all CPUs used by ELPA MPI processes and finds the SIMD instruction set that is supported by all of them. The ELPA 2stage back-transformation kernel (a.k.a. "kernel") will be set accordingly on all MPI processes. This feature can override a kernel setting made previously by the user! At the moment it only works for Intel CPUs, i.e. clusters that mix nodes with Intel CPUs and e.g. AMD CPUs are currently _NOT_ supported. Since this is an experimental feature, it might be dropped again in the future if it turns out not to be useful for users.
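The idea of "the SIMD instruction set supported by all used CPUs" can be sketched as a bitwise intersection of per-process feature masks. This is only an illustration under stated assumptions, not ELPA's actual code: the flag names, `common_simd_mask` and `widest_common_simd` are hypothetical, and the per-process masks are passed in as a plain array instead of being gathered over MPI.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical feature bits, for illustration only (not ELPA's encoding). */
enum {
    SIMD_SSE    = 1 << 0,
    SIMD_AVX    = 1 << 1,
    SIMD_AVX2   = 1 << 2,
    SIMD_AVX512 = 1 << 3
};

/* Intersect the feature masks reported by all processes:
 * a bit survives only if every process has it set. */
unsigned common_simd_mask(const unsigned *masks, size_t n)
{
    assert(n > 0);
    unsigned common = masks[0];
    for (size_t i = 1; i < n; ++i)
        common &= masks[i];
    return common;
}

/* Return the widest instruction set present in the common mask,
 * or 0 if none of the known sets is shared (generic kernel). */
unsigned widest_common_simd(unsigned common)
{
    if (common & SIMD_AVX512) return SIMD_AVX512;
    if (common & SIMD_AVX2)   return SIMD_AVX2;
    if (common & SIMD_AVX)    return SIMD_AVX;
    if (common & SIMD_SSE)    return SIMD_SSE;
    return 0;
}
```

In an MPI program the same intersection could be obtained with `MPI_Allreduce` and the `MPI_BAND` operation, so that every process ends up with the identical mask and therefore selects the same kernel.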
-
- 09 Oct, 2019 1 commit
-
-
Carolin Penke authored
-
- 26 Sep, 2019 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 24 Sep, 2019 1 commit
-
-
Andreas Marek authored
Apparently, recent versions of automake filter out a "test_" string, which led to some surprises.
-
- 23 Sep, 2019 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 20 Sep, 2019 1 commit
-
-
Andreas Marek authored
-
- 01 Jul, 2019 1 commit
-
-
Andreas Marek authored
-
- 26 Jun, 2019 1 commit
-
-
Andreas Marek authored
-
- 17 Jun, 2019 1 commit
-
-
Andreas Marek authored
-
- 14 Jun, 2019 1 commit
-
-
Andreas Marek authored
-
- 09 Jun, 2019 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 08 Jun, 2019 1 commit
-
-
Andreas Marek authored
-
- 06 Jun, 2019 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 05 Jun, 2019 1 commit
-
-
Andreas Marek authored
-