elpa merge requestshttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests2022-01-21T13:01:45Zhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/90Intel ifx2022-01-21T13:01:45ZAndreas MarekIntel ifxAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/14improving forgotten precision macros2019-05-08T12:39:27ZPavel Kusimproving forgotten precision macrosmaster_pre_stage (even though it did not solve the issue)master_pre_stage (even though it did not solve the issue)https://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/107Improvements for AMD Mi2502022-08-04T06:32:30ZAndreas MarekImprovements for AMD Mi250Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/116If user does not set "omp_threads", then...2022-10-29T14:47:02ZAndreas MarekIf user does not set "omp_threads", then...use the value specified by omp_get_max_threads()use the value specified by omp_get_max_threads()Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/109If user does not set "omp_threads", then...2022-08-08T07:30:22ZAndreas MarekIf user does not set "omp_threads", then...use the value specified by omp_get_max_threads()use the value specified by omp_get_max_threads()Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/104Gpu streams2022-06-02T05:56:20ZAndreas MarekGpu streamsAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/56Gpu in openmp code path2021-02-24T09:25:31ZAndreas MarekGpu in openmp code pathAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/78GPU device pointer in interfaces2021-09-03T06:44:23ZAndreas MarekGPU device pointer in interfacesAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/166GPU Cholesky optimization, solves #1092024-03-06T06:09:55ZPetr KarpovGPU Cholesky optimization, solves #109- Added elpa_gpu_ccl_transpose_vectors in Cholesky-GPU
- Extract memcpy of info outside of cublas?potrf
- Move nccl_group_start out of the loops
- delete unused vendor_agnostic_layer_template.F90
- Add new cusolverDnXpotrf interface (cus...- Added elpa_gpu_ccl_transpose_vectors in Cholesky-GPU
- Extract memcpy of info outside of cublas?potrf
- Move nccl_group_start out of the loops
- delete unused vendor_agnostic_layer_template.F90
- Add new cusolverDnXpotrf interface (cusolverDn?potrf is deprecated)Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/163Gpu cholesky2024-02-15T06:18:07ZAndreas MarekGpu choleskyAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/24Gitlab CI: Test for 64bit blas and 32bit MPI2019-10-30T06:01:56ZAndreas MarekGitlab CI: Test for 64bit blas and 32bit MPIAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/79Fix vsx kernels2021-09-09T13:50:27ZAndreas MarekFix vsx kernelsAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/111Fix the MPI communicators per elpa object2022-10-07T07:52:42ZAndreas MarekFix the MPI communicators per elpa objectClarification of the ELPA usage:
It has always been intended with the ELPA API that one should
only set the MPI communicators ("mpi_comm_parent", "mpi_comm_rows",
and "mpi_comm_cols") _once_ per ELPA object.
Technically, it has been poss...Clarification of the ELPA usage:
It has always been intended with the ELPA API that one should
only set the MPI communicators ("mpi_comm_parent", "mpi_comm_rows",
and "mpi_comm_cols") _once_ per ELPA object.
Technically, it has been possible to change these communicators, for
an existing ELPA object, which leads -- dependent on the exact
configuration of the ELPA object -- to correct or erroneous behaviour.
With this commit, it is _technically_ impossible to set the
communicators more than once (per ELPA object)Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/6Fix some typos in the documentation and examples2018-02-22T11:52:52ZSebastian OhlmannFix some typos in the documentation and exampleshttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/101Fix problems when compiling merge_systems with -O22022-05-07T05:38:54ZAndreas MarekFix problems when compiling merge_systems with -O2With the Intel compiler, floating-point exceptions occur if the module
merge_systems is compiled with -O2. This does not happen with -O1. A
directive was added to force the optimization level to be less or equal
to 1.
Fixes #95With the Intel compiler, floating-point exceptions occur if the module
merge_systems is compiled with -O2. This does not happen with -O1. A
directive was added to force the optimization level to be less or equal
to 1.
Fixes #95Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/9Fix problem in elpa2_print_kernels with GPU kernel2019-01-06T08:04:58ZAndreas MarekFix problem in elpa2_print_kernels with GPU kernelThe GPU kernel can only be set, if e%set("use_gpu",1) has been set
before and GPU usage was thus requested.
In the elpa2_print_kernels, this was not reflected, and the GPU kernels
were checked via can_set method without activating G...The GPU kernel can only be set, if e%set("use_gpu",1) has been set
before and GPU usage was thus requested.
In the elpa2_print_kernels, this was not reflected, and the GPU kernels
were checked via can_set method without activating GPU usage
=> GPU kernels were always reported as non exisitentAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/172Fix print settings2024-03-20T07:35:42ZPetr KarpovFix print settings- Fix elpa_print_settings for CFLAGS=-D_FORTIFY_SOURCE=2 (like in OBS GNU installation)
- Move setup_gpu() after setting runtime options in test.c
- Add DeviceSynchronize() after kernel call in [cuda|hip]_check_device_info_FromC. This fi...- Fix elpa_print_settings for CFLAGS=-D_FORTIFY_SOURCE=2 (like in OBS GNU installation)
- Move setup_gpu() after setting runtime options in test.c
- Add DeviceSynchronize() after kernel call in [cuda|hip]_check_device_info_FromC. This fixes a potential problem of not catching a problem in gpusolver?potrf, when info_dev!=0Andreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/12Fix omp2019-04-24T08:13:23ZAndreas MarekFix omphttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/135fix obs installation cxx112023-02-14T07:57:52ZPetr Karpovfix obs installation cxx11Add -std=c++11 flag to fix OBS installation for older compiler versions on cobraAdd -std=c++11 flag to fix OBS installation for older compiler versions on cobraAndreas MarekAndreas Marekhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/merge_requests/174Fix merge of hipsolver, NCCL/RCCL bugs2024-03-29T06:44:41ZPetr KarpovFix merge of hipsolver, NCCL/RCCL bugs- Fix hipsolver merge problem
- Fix RCCL bug, correctness tested on LUMI
- Fix NCCL bugs: NCCL codepath was disactivated in elpa1/tridiag_template.F90, invert_trm_template.F90, and multiply_a_b/elpa_multiply_a_b_template.F90
- Change boo...- Fix hipsolver merge problem
- Fix RCCL bug, correctness tested on LUMI
- Fix NCCL bugs: NCCL codepath was disactivated in elpa1/tridiag_template.F90, invert_trm_template.F90, and multiply_a_b/elpa_multiply_a_b_template.F90
- Change bool->int in ELPA1 tridiagonalization C-backend and Fortran interfacesAndreas MarekAndreas Marek