- 05 Aug, 2016 3 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
- 04 Aug, 2016 4 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
A random matrix is set up, and then result = a**T * 2*b is computed this is compared with the result of plain scalapack routines and the maximum norm of "scalapack - elpa" is computed (should be zero on machine precision)
-
- 07 Jul, 2016 1 commit
-
-
Lorenz Huedepohl authored
-
- 02 Jul, 2016 7 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
Andreas Marek authored
-
- 01 Jul, 2016 1 commit
-
-
Andreas Marek authored
-
- 30 Jun, 2016 1 commit
-
-
Andreas Marek authored
-
- 30 May, 2016 1 commit
-
-
Andreas Marek authored
This allows to install several versions of the library in the same directory
-
- 18 May, 2016 1 commit
-
-
Andreas Marek authored
The QR decomposition test program can now called with parameters for matrix size, nev, and block size. This closses issue #20
-
- 17 May, 2016 2 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
Michael Bankd (debian) provided a patch, which allows to run the qr-decomposition test programs with smaller matrices, and thus allows testing in reasonable time on limited hardware (e.g. arm devices) In order to make the patch work, also Makefile.am was updated
-
- 23 Apr, 2016 1 commit
-
-
Andreas Marek authored
In case of SSE/AVX/AVX2 it could happen that more than one kernel (since some depend on other kernels, e.g. block 6 on block 4 and block 2) were called
-
- 05 Apr, 2016 2 commits
-
-
Andreas Marek authored
The SSE kernels with blocking of 2,4,6 (real case) and 1,2 (complex) case are now available by default Thus the following changes have been done - introduce new macros in configure.ac and Makefile.am - renmae the AVX kernels in AVX_AVX2 (they also support AVX2) - introduce new files with SSE kernel - introduce new kernel parameters ! - make the SSE kernels callable The results are identical with previous kernels
-
Andreas Marek authored
-
- 04 Apr, 2016 1 commit
-
-
Andreas Marek authored
- The SSE part will be available in different files. - Specify whether AVX or AVX2 was used to build
-
- 04 Mar, 2016 1 commit
-
-
Andreas Marek authored
files
-
- 26 Feb, 2016 1 commit
-
-
Andreas Marek authored
-
- 24 Feb, 2016 3 commits
-
-
Andreas Marek authored
-
Andreas Marek authored
The test programs include the same template now, the printed messages are thus unified
-
Andreas Marek authored
The configure flag "--enable-shared-memory-only" triggers a build of ELPA without MPI support: - all MPI calls are skipped (or overloaded) - all calls to scalapack functions are replaced by the corresponding lapack calls - all calls to blacs are skipped Using ELPA without MPI gives the same results as using ELPA with 1 MPI task! This version is not yet optimized for performance, here and there some unecessary copies are done. Ths version is intended for users, who do not have MPI in their application but still would like to use ELPA on one compute node
-
- 04 Jan, 2016 2 commits
-
-
Andreas Marek authored
In some C test programs the function declaration header files have been missing
-
Andreas Marek authored
The Fortran variable declerations "variable type*[4,8,16]" is non Fortran standard. It might cause problem in the future. Furthermore, the usage of Fortran and C togehther is more clean if variables are defined according to C variable types. This is done, now for all the test programs
-
- 16 Dec, 2015 2 commits
-
-
Lorenz Huedepohl authored
-
Andreas Marek authored
This commit does not change the interfaces defined in ELPA_2015.11.001 ! All functionality is available via the interface names and definitions as in ELPA_2015.11.001 But some new interfaces have been added, in order to unfiy the references from C and Fortran codes: - The procedures to create the ELPA (row/column) communicators are now available from C _and_ Fortran with the name "get_elpa_communicators". The old Fortran name "get_elpa_row_col_comms" and the old C name "elpa_get_communicators" are from now on deprecated but still available - The 1-stage solver routines are available from C _and_ Fortran via the names "solve_evp_real_1stage" and "solve_evp_complex_1stage". The old Fortran names "solve_evp_real" and "solve_evp_complex" are from now on deprecated but still functional. All documentation (man pages, doxygen, and example test programs) have been changed accordingly. This commit implies a change in the API versioning number, but no chan...
-
- 11 Dec, 2015 2 commits
-
-
Andreas Marek authored
- the contact email is now: elpa-library@mpcdf.mpg.de - the official website is now hosted at http://elpa.mpcdf.mpg.de
-
Andreas Marek authored
The Rechenzentrum Garching (RZG) has been renamed into the Max Planck Computing and Data Facility (MPCDF). This is reflected now in the adapted headers. In the near future, all references to the ELPA webside and the ELPA email will also be adapted.
-
- 13 Nov, 2015 1 commit
-
-
Andreas Marek authored
-
- 03 Nov, 2015 2 commits
-
-
Andreas Marek authored
A build without autotools is not officially supported anymore. Thus this --- broken since long time --- Makefile.example is removed
-
Andreas Marek authored
The examples, how to invoke ELPA from a c program have been updated. There are now examples for ELPA1 and ELPA2 both real and complex case. The test cases are still with less functionality than their Fortran counter parts, they are just ment as a "proof-of-concept".
-
- 28 Oct, 2015 1 commit
-
-
Alexander Heinecke authored
- enabling fusing iterations of stage 5 in ELPA2 for every configuration - Changed reduction bandwidth in ELPA2 to be at least 64 - partial OpenMP parallelization of the QR factorization in bandred_real - OpenMP parallelization of SYMM - OpenMP parallelization of SYR2K in bandred_real - OpenMP parallelization for elpa1_reduce_add_vectors and elpa1_transpose_vectors - AVX2 support in backtransformation elpa2_kernels (FMA3 instructions introduced with Haswell microarchitecture)
-