- 10 Dec, 2015 3 commits
-
-
Andreas Marek authored
The user functions of ELPA are now documented with doxygen tags. At the moment the interface of ELPA 2015.11.001 is decribed. The documentation has step by step to be implemented for all functions and test programms.
-
Andreas Marek authored
As in a previous commit for elpa1.F90, for automatic generation of documentation elpa2.F90 has been splitted in two files, in order to have a lean, easy-to-understand user interface: elpa2.F90 the visible user functions, which provide the library calls. The usage is the same as before elpa2_compute.F90 all internal routines, which are used by ELPA2, but which are never called external of the library by a user. These functions are now "hidden" in the module elpa2_compute, which is used by ELPA2. The procedures in elpa2_compute.F90 are identical to the ones in elpa2.F90 before this split commit. The only -- but quite a lot of them -- changes are intendation changes.
-
Andreas Marek authored
For automatic generation of documentation, the file elpa1.F90 has been splitted into two files, in order to have a lean, easy-to-understand user interface: elpa1.F90 the visible user functios, which provide the library calls. The usage is the same as always elpa1_compute.F90 all internal routines, which are used by ELPA1 and ELPA2, but which are never called by the user. These functions are now "hidden" in the module elpa1_compute, which is used by ELPA1 and ELPA2. The procedures in elpa1_compute.F90 are identical to the ones in elpa1.F90 before this split commit. The only -- but lot of -- changes are intendation.
-
- 09 Dec, 2015 1 commit
-
-
Andreas Marek authored
This variables, do not have to be global, they can be parsed along internally in ELPA. Removing them makes debugging more easy and the public interface more lean
-
- 08 Dec, 2015 1 commit
-
-
Alexander Heinecke authored
Current fix does as much blocking as possible, which should be beneficial from both a compute and communication point of view. Additionally, a second possible fix was added which just calls the blocked version if the local matrix has a sufficient size. This might create smaller and more messages at scale.
-
- 07 Dec, 2015 1 commit
-
-
Andreas Marek authored
For some matrix/block size combinations the real case of ELPA2 crashes, e.g: mpiexec -n 1 ./elpa2_test_real 50 50 32 leads to an error message ** On entry to DGEMM parameter number 3 had an illegal value and a crash. This only seems to happen with matrix size smaller than 64*64. he code path responsible for this has been identified, but the problem tself is not yet solved! The part of the code, which causes these crashes, has been switched on as default by Intel in commit fe63372d. The rest of the commit fe63372d seems to be fine, and is performance critical. As an intermediate step, the responsible code path is switched off again as default, this will be changed again once the underlying root cause has been solved.
-
- 26 Nov, 2015 1 commit
-
-
Andreas Marek authored
The API versioning number was not updated correctly at the release. This lead to a wrong soname. This is fixed now
-
- 16 Nov, 2015 4 commits
-
-
Andreas Marek authored
Due to the efforts of Intel, ELPA features now build-in support of AVX2 and FMA for the latest Intel processors
-
Lorenz Huedepohl authored
-
Lorenz Huedepohl authored
-
Lorenz Huedepohl authored
-
- 13 Nov, 2015 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
- 11 Nov, 2015 1 commit
-
-
Andreas Marek authored
-
- 05 Nov, 2015 5 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
Point more clearly to the usage of ELPA under the terms of the LGPL.
-
Andreas Marek authored
-
- 04 Nov, 2015 3 commits
-
-
Andreas Marek authored
Point more clearly to the usage of ELPA under the terms of the LGPL.
-
Andreas Marek authored
-
Andreas Marek authored
Intel (thanks, especially A.Heinecke from Intel) there exists an optimized version of ELPA with AVX2 support. This merge includes all the optimizations done by Intel plus some smaller changes which were necessary to incorperate these modifications.
-
- 03 Nov, 2015 2 commits
-
-
Andreas Marek authored
A build without autotools is not officially supported anymore. Thus this --- broken since long time --- Makefile.example is removed
-
Andreas Marek authored
The examples, how to invoke ELPA from a c program have been updated. There are now examples for ELPA1 and ELPA2 both real and complex case. The test cases are still with less functionality than their Fortran counter parts, they are just ment as a "proof-of-concept".
-
- 28 Oct, 2015 1 commit
-
-
Alexander Heinecke authored
- enabling fusing iterations of stage 5 in ELPA2 for every configuration - Changed reduction bandwidth in ELPA2 to be at least 64 - partial OpenMP parallelization of the QR factorization in bandred_real - OpenMP parallelization of SYMM - OpenMP parallelization of SYR2K in bandred_real - OpenMP parallelization for elpa1_reduce_add_vectors and elpa1_transpose_vectors - AVX2 support in backtransformation elpa2_kernels (FMA3 instructions introduced with Haswell microarchitecture)
-
- 24 Aug, 2015 1 commit
-
-
Andreas Marek authored
Inge Gutheil from FZ Juelich pointed out, that the configure test for BGQ failed due to typos. These are corrected now
-
- 16 Jun, 2015 3 commits
-
-
Andreas Marek authored
complex cases Create automatically two independent routines for real and complex valued matrices
-
Andreas Marek authored
This commit is not ABI compatible
-
Andreas Marek authored
This commit is not ABI compatible, since it changes the interfaces of some routines Also, introduce type checking for transpose and reduce_add routines
-
- 28 May, 2015 3 commits
-
-
Lorenz Huedepohl authored
These files are always automatically generated by autooconf and should not be in version control.
-
Lorenz Huedepohl authored
-
Lorenz Huedepohl authored
-
- 26 May, 2015 3 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
Andreas Gloess informed us about a memory leak in ELPA, which was introduced in version 2013.11.008. This memory leak is removed now again. Note, that older versions of ELPA will not be fixed right now.
-
- 19 May, 2015 3 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
An "dangling" fi has been removed
-
Andreas Marek authored
-
- 29 Apr, 2015 2 commits
-
-
Andreas Marek authored
Remove variables which are not needed (anymore)
-
Andreas Marek authored
The macros which define the functionality to test for - a specific real/complex kernel (not all available kernels) are now defined in files in the m4 directory
-