... | ... | @@ -2,12 +2,14 @@ |
|
|
|
|
|
- not yet decided
|
|
|
|
|
|
Changelog for ELPA 2022.05.001.rc1
|
|
|
Changelog for ELPA 2022.05.001
|
|
|
- implement OpenMP offloading to GPU for Intel GPU for ELPA 1 and 2 stage (
|
|
|
except for "step tridi_to_band")
|
|
|
- implement SYCL offloading to Intel GPUs for ELPA 1 and 2 stage (except for
|
|
|
step "tridi_to_band")
|
|
|
- implement SYCL offloading to Intel GPUs for ELPA 1 and 2 stage
|
|
|
|
|
|
- AMD GPU offload has been tested on Mi200 (also with MPI)
|
|
|
- can use ELPA with one individual "gpu stream" per MPI task (Nvidia and AMD
|
|
|
only)
|
|
|
- allow steps "cholesky", "invert_trm", and "multiply_ab" to be called
|
|
|
directly with GPU device pointers
|
|
|
- on error ELPA returns rather than aborting to give controll to calling
|
... | ... | @@ -17,7 +19,6 @@ Changelog for ELPA 2022.05.001.rc1 |
|
|
level > -O2
|
|
|
- better checking of user defined options in configure
|
|
|
|
|
|
|
|
|
Changelog for ELPA 2021.11.002
|
|
|
- fix an error when choosing the Nvidia GPU kernel (fallback to CPU might have
|
|
|
been selected)
|
... | ... | |