elpa issueshttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues2018-02-05T19:48:54Zhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/3Wrong results in complex calculation2018-02-05T19:48:54ZAndreas MarekWrong results in complex calculationIt seems that under certain conditions the ELPA2 complex case produces wrong results. This depends on the number of used MPI tasks, and only appears if the matrix size is larger by one than the used blocksize, e.g. ./elpa2_test_complex 1...It seems that under certain conditions the ELPA2 complex case produces wrong results. This depends on the number of used MPI tasks, and only appears if the matrix size is larger by one than the used blocksize, e.g. ./elpa2_test_complex 17 17 16https://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/45unify real/complex QR paths2017-07-16T17:31:50ZAndreas Marekunify real/complex QR pathshttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/43Unify GPU path for real/complex version in band_to_full2017-08-28T20:00:37ZAndreas MarekUnify GPU path for real/complex version in band_to_fullBoth cases are handled slightly necessary. Confirm whether this is necessary or is just legacyBoth cases are handled slightly necessary. Confirm whether this is necessary or is just legacyhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/102Undefined reference to elpa_skew functions2023-12-15T21:47:03ZPetr KarpovUndefined reference to elpa_skew functionsThere is a problem with undefined reference to elpa_skew functions, when the skew symmetric support is disabled (--disable-skew-symmetric-support).
Here is the reproducer for raven:
module load anaconda/3/2021.11 intel/21.6.0 impi/2021...There is a problem with undefined reference to elpa_skew functions, when the skew symmetric support is disabled (--disable-skew-symmetric-support).
Here is the reproducer for raven:
module load anaconda/3/2021.11 intel/21.6.0 impi/2021.6 mkl/2022.1 gcc/11 cuda/11.4
../configure CC=mpicc FC=mpiifort CXX=mpiicpc CFLAGS="-O3 -march=skylake-avx512" FCFLAGS="-O3 -xCORE-AVX512" SCALAPACK_FCFLAGS="-I/mpcdf/soft/SLE_15/packages/x86_64/intel_oneapi/2021.3/mkl/latest/include/intel64/lp64" SCALAPACK_LDFLAGS="-L/mpcdf/soft/SLE_15/packages/x86_64/intel_oneapi/2021.3/mkl/latest/lib/intel64 -lmkl_scalapack_lp64 -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lmkl_blacs_intelmpi_lp64 -lpthread -Wl,-rpath,/mpcdf/soft/SLE_15/packages/x86_64/intel_oneapi/2021.3/mkl/latest/lib/intel64" --disable-openmp --disable-64bit-integer-math-support --disable-64bit-integer-mpi-support --enable-mpi-module --enable-detect-mpi-launcher --enable-generic --disable-sparc64 --disable-neon-arch64 --disable-vsx --enable-sse --enable-sse-assembly --enable-avx --enable-avx2 --enable-avx512 --disable-sve128 --disable-sve256 --disable-sve512 --disable-bgp --disable-bgp --enable-assumed-size --disable-ifx-compiler --enable-Fortran2008-features --enable-option-checking=fatal --disable-heterogenous-cluster-support --enable-timings --enable-band-to-full-blocking --without-threading-support-check-during-build --disable-runtime-threading-support-checks --disable-allow-thread-limiting --disable-gpu --enable-nvidia-gpu --disable-amd-gpu --disable-intel-gpu-sycl --disable-nvidia-sm80-gpu --disable-NVIDIA-gpu-memory-debug --disable-cuda-aware-mpi --disable-gpu-streams --disable-nvtx --disable-c-tests --disable-cpp-tests --disable-skew-symmetric-support --with-mpi=yes --disable-redirect --enable-single-precision --disable-autotuning --disable-scalapack-tests --disable-autotune-redistribute-matrix --with-papi=no --with-likwid=no --disable-store-build-config --disable-python --disable-python-tests --with-cuda-path="/mpcdf/soft/SLE_15/packages/x86_64/cuda/11.4.2" --with-NVIDIA-GPU-compute-capability=sm_80 --with-cusolver
make -j 18
Here is the error message we get:
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvectors_a_h_a_f'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvalues_d_ptr_f'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvalues_a_h_a_d'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvectors_d_ptr_f'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvectors_d_ptr_d'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvectors_a_h_a_d'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvalues_a_h_a_f'
ld: ./.libs/libelpa.so: undefined reference to `elpa_skew_eigenvalues_d_ptr_d'https://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/108UCX warnings for GPU complex_double tests with OpenMPI2023-12-15T13:01:02ZPetr KarpovUCX warnings for GPU complex_double tests with OpenMPIReproducer:
```
module purge
module load cuda/11.4 gcc/11 openmpi/4 mkl/2022.1 nccl/2.11.4
export OMPI_MCA_coll=^hcoll
../configure --prefix=$HOME/soft/elpa_mpi_00 --enable-option-checking=fatal CC=mpicc FC=mpif90 CXX=mpicxx CFLAGS="-...Reproducer:
```
module purge
module load cuda/11.4 gcc/11 openmpi/4 mkl/2022.1 nccl/2.11.4
export OMPI_MCA_coll=^hcoll
../configure --prefix=$HOME/soft/elpa_mpi_00 --enable-option-checking=fatal CC=mpicc FC=mpif90 CXX=mpicxx CFLAGS="-O3 -g -march=skylake-avx512 -I$MKL_HOME/include/intel64/lp64 -I$CUDA_HOME/include" CXXFLAGS="-std=c++17 -O3 -march=skylake-avx512 -I$MKL_HOME/include/intel64/lp64 -I$CUDA_HOME/include" FCFLAGS="-O3 -g -march=skylake-avx512 -I$MKL_HOME/include/intel64/lp64 -I$CUDA_HOME/include" LDFLAGS="-L$MKL_HOME/lib/intel64 -lmkl_scalapack_lp64 -lmkl_gf_lp64 -lmkl_sequential -lmkl_core -lmkl_blacs_openmpi_lp64 -lpthread -Wl,-rpath,$MKL_HOME/lib/intel64" --with-mpi=yes --enable-assumed-size --enable-band-to-full-blocking --enable-nvidia-gpu --with-NVIDIA-GPU-compute-capability=sm_70 -with-cuda-path=$CUDA_HOME --enable-avx512 --enable-cpp-tests=no --enable-single-precision --enable-nvtx
```
The warnings like
```
[1702644215.666499] [ravg1002:132812:0] mpool.c:55 UCX WARN object 0xcf82c0 {{cpml|cb|snd_tag|rk_use} send length 41943040 ucp_proto_progress_tag_rndv_rts() comp:mca_pml_ucx_send_nbx_completion()host me was not returned to mpool ucp_requests
```
appear for complex_double tests, e.g. `validate_complex_double_eigenvectors_1stage_gpu_random` but not for real_doublePetr KarpovPetr Karpovhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/44Tunable blocking in band_to_full2017-09-03T18:04:30ZAndreas MarekTunable blocking in band_to_fullAllow run-time choices whether blocking should be used, and which blocking parameters are optimalAllow run-time choices whether blocking should be used, and which blocking parameters are optimalhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/54Trouble compiling ELPA with Intel 2018b2017-07-07T14:39:29ZLorenz HuedepohlTrouble compiling ELPA with Intel 2018bA user reported a problem compiling `elpa-2017.05.001.rc1` with the Intel 2018 Beta compiler, attached is his error message.
```
[KNL]$make
make all-am
make[1]: Entering directory `/home/elpauser/PCP/qe-6.1/qe-6.1_compilers_and_li...A user reported a problem compiling `elpa-2017.05.001.rc1` with the Intel 2018 Beta compiler, attached is his error message.
```
[KNL]$make
make all-am
make[1]: Entering directory `/home/elpauser/PCP/qe-6.1/qe-6.1_compilers_and_libraries_2018.0.061_VEC_xMIC-AVX512/elpa-2017.05.001.rc1-build_compilers_and_libraries_2018.0.061'
PPFC src/libelpa_onenode_openmp_private_la-elpa_impl.lo
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_SET_INTEGER]
manually_preprocessed__home_elpauser_PCP_qe-6.1_package_elpa-2017.05.001.rc1_src_elpa_impl.F90-src_.libs_libelpa_onenode_openmp_private_la-elpa_impl.o.F90(71): error #6136: Derived-type declared must be ABSTRACT [ELPA_IMPL_T]
type, extends(elpa_abstract_impl_t) :: elpa_impl_t
-----------------------------------------^
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_SET_DOUBLE]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_GET_INTEGER]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_GET_DOUBLE]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVECTORS_D]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVECTORS_F]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVECTORS_DC]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVECTORS_FC]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVALUES_D]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVALUES_F]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVALUES_DC]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_EIGENVALUES_FC]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_HERMITIAN_MULTIPLY_D]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_HERMITIAN_MULTIPLY_F]
/tmp/ifortTnMEgA.i90: error #8322: A deferred binding is inherited by non-abstract type; It must be overridden. [ELPA_HERMITIAN_MULTIPLY_DC]
/tmp/ifortTnMEgA.i90(1615): catastrophic error: Too many errors, exiting
compilation aborted for manually_preprocessed__home_elpauser_PCP_qe-6.1_package_elpa-2017.05.001.rc1_src_elpa_impl.F90-src_.libs_libelpa_onenode_openmp_private_la-elpa_impl.o.F90 (code 1)
Traceback (most recent call last):
File "/home/elpauser/PCP/qe-6.1/package/elpa-2017.05.001.rc1/manual_cpp", line 56, in <module>
check_call(args + [tmp_filename])
File "/home/elpauser/PCP/qe-6.1/package/elpa-2017.05.001.rc1/manual_cpp", line 16, in check_call
return subprocess.check_call(args, **kwargs)
File "/usr/lib64/python2.7/subprocess.py", line 542, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['mpiifort', '-DHAVE_CONFIG_H', '-I.', '-I/home/elpauser/PCP/qe-6.1/package/elpa-2017.05.001.rc1', '-L/opt/intel/compilers_and_libraries_2018.0.061/linux/mkl/lib/intel64', '-lmkl_scalapack_lp64', '-lmkl_intel_lp64',
'-lmkl_sequential', '-lmkl_core', '-lmkl_blacs_intelmpi_lp64', '-lpthread', '-I/opt/intel/compilers_and_libraries_2018.0.061/linux/mkl/include/intel64/lp64', '-Imodules', '-mod', 'private_modules', '-Iprivate_modules', '-fopenmp', '-g', '-c', '-fPIC', '-o',
'src/.libs/libelpa_onenode_openmp_private_la-elpa_impl.o', 'manually_preprocessed__home_elpauser_PCP_qe-6.1_package_elpa-2017.05.001.rc1_src_elpa_impl.F90-src_.libs_libelpa_onenode_openmp_private_la-elpa_impl.o.F90']' returned non-zero exit status 1
make[1]: *** [src/libelpa_onenode_openmp_private_la-elpa_impl.lo] Error 1
make[1]: Leaving directory `/home/elpauser/PCP/qe-6.1/qe-6.1_compilers_and_libraries_2018.0.061_VEC_xMIC-AVX512/elpa-2017.05.001.rc1-build_compilers_and_libraries_2018.0.061'
make: *** [all] Error 2
```Lorenz HuedepohlLorenz Huedepohlhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/81Toeplitz test cases hang for realy small matrices na=42021-05-06T12:37:33ZAndreas MarekToeplitz test cases hang for realy small matrices na=4If you use 4 MPI tasks for a setup of na=4 nev=4 nblk=1, the the test-cases for Toeplitz matrices hang.
The test-cases for other matrix setups do work, however.
It seems that the code hangs in the "solve" stepIf you use 4 MPI tasks for a setup of na=4 nev=4 nblk=1, the the test-cases for Toeplitz matrices hang.
The test-cases for other matrix setups do work, however.
It seems that the code hangs in the "solve" stephttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/26The AVX single precision kernel optimization2017-05-21T22:14:19ZAndreas MarekThe AVX single precision kernel optimizationhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/105SYCL kernels for multiply missing2023-10-25T06:03:39ZAndreas MarekSYCL kernels for multiply missingPetr KarpovPetr Karpovhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/70suspected problem for matrix of size 200k2021-02-24T09:45:56ZPavel Kussuspected problem for matrix of size 200kReported by Phillip Coles
-> check again the setupReported by Phillip Coles
-> check again the setupPavel KusPavel Kushttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/50stripe_width in trans_ev_tridi_to_band2017-09-06T19:06:07ZAndreas Marekstripe_width in trans_ev_tridi_to_bandhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/17Single precision SSE/AVX/AVX BLOCK1 kernel does not work2017-05-21T22:14:19ZAndreas MarekSingle precision SSE/AVX/AVX BLOCK1 kernel does not workDue to this, also the BLOCK2 kernels do not workDue to this, also the BLOCK2 kernels do not workhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/8Single precision GPU version2018-02-05T19:48:54ZAndreas MarekSingle precision GPU versionThe GPU version has not yet been ported to single precision calculationsThe GPU version has not yet been ported to single precision calculationshttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/7Single precision ELPA2 kernels2018-02-05T19:48:54ZAndreas MarekSingle precision ELPA2 kernelsThe assembler kernel and the kernels using gcc intrinsic assembler directives have not yet been ported to single precisionThe assembler kernel and the kernels using gcc intrinsic assembler directives have not yet been ported to single precisionhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/18Single precision AVX Block6 crashes sometime2021-04-15T06:55:36ZAndreas MarekSingle precision AVX Block6 crashes sometimeThe double precision case works fine, but single precision crashes sometimes:
e.g. 1500 50 16, or 150, 50, 16
1500 500 16 works fineThe double precision case works fine, but single precision crashes sometimes:
e.g. 1500 50 16, or 150, 50, 16
1500 500 16 works finehttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/16Setting of specific kernels at build time does not work anymore in ELPA_GPU b...2018-02-05T19:48:54ZAndreas MarekSetting of specific kernels at build time does not work anymore in ELPA_GPU branchAt least setting the AVX_BLOCK6 kernel with --with-real-avx_block6-kernel-only does not work. It produces wrong results. Not specifying this Option but calling the kernel works!At least setting the AVX_BLOCK6 kernel with --with-real-avx_block6-kernel-only does not work. It produces wrong results. Not specifying this Option but calling the kernel works!https://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/94Setting of GPU kernel depens on order of set calls2022-02-03T17:05:00ZAndreas MarekSetting of GPU kernel depens on order of set callsWhen setting
first set("solver",2stage) and then
set("kernel",GPU_KERNEL)
it uses the CPU kernel (the default kernel seems to be set)
In the other order it works correctlyWhen setting
first set("solver",2stage) and then
set("kernel",GPU_KERNEL)
it uses the CPU kernel (the default kernel seems to be set)
In the other order it works correctlyhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/73Service Desk (from dev@stellardeath.org): A gitlab test issue using the servi...2021-02-24T09:37:42ZGitLab Support BotService Desk (from dev@stellardeath.org): A gitlab test issue using the service-deskFoobarFoobarhttps://gitlab.mpcdf.mpg.de/elpa/elpa/-/issues/27Remove unecessary data copies if MPI is not used2022-12-12T07:47:11ZAndreas MarekRemove unecessary data copies if MPI is not used