Changelog 4.27 KB
Newer Older
1 2 3 4 5 6
Changelog for upcoming release

- user can define the default kernels
- simple block4 and block6 real kernel
- ELPA versioning number is provided in the C header files

Andreas Marek's avatar
Andreas Marek committed
7
Changelog for ELPA 2018.11.001
8 9 10 11 12

- improved autotuning
- improved performance of generalized problem via Cannon's algorithm
- check pointing functionality of elpa objects
- store/read/resume of autotuning
13
- Python interface for ELPA
14 15
- more ELPA functions have an optional error argument (Fortran) or required
error argument (C) => ABI and API change
16 17


Andreas Marek's avatar
Andreas Marek committed
18
Changelog for ELPA 2018.05.001
19 20 21 22 23

- significant improved performance on K-computer
- added interface for the generalized eigenvalue problem
- extended autotuning functionality

24
Changelog for ELPA 2017.11.001
25

Andreas Marek's avatar
Andreas Marek committed
26
- significant improvement of performance of GPU version
27 28 29
- added new compute kernels for IBM Power8 and Fujistu Sparc64
  processors
- a first implementation of autotuning capability
30 31
- correct some type statements in Fortran
- correct detection of PAPI in configure step
32

33 34 35 36 37
Changelog for ELPA 2017.05.003

- remove bug in invert_triangular, which had been introduced
  in ELPA 2017.05.002

38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53
Changelog for ELPA 2017.05.002

Mainly bugfixes for ELPA 2017.05.001:
- fix memory leak of MPI communicators
- tests for hermitian_multiply, cholesky decomposition and
- deal with a problem on Debian (mawk)

Changelog for ELPA 2017.05.001

Final release of ELPA 2017.05.001
Since rc2 the following changes have been made
- more extensive tests during "make check"
- distribute missing C headers
- introduce analytic tests
- Fix stack overflow in some kernels

Andreas Marek's avatar
Andreas Marek committed
54 55 56 57 58 59 60
Changelog for ELPA 2017.05.001.rc2

This is the release candidate 2 for the ELPA 2017.05.001 version.
Additionaly to the changes from rc1, it fixes some smaller issues
- add missing script "manual_cpp"
- cleanup of code

61 62 63 64 65 66
Changelog for ELPA 2017.05.001.rc1

This is the release candidate 1 for the ELPA 2017.05.001 version.
It provides a first version of the new, more generic API of the ELPA library.
Smaller changes to the API might be possible in the upcoming release
candidates. For users, who would like to use the older API of the ELPA
Andreas Marek's avatar
Andreas Marek committed
67
library, the API as defined with release 2016.11.001.pre is frozen in and
68 69 70 71 72 73 74 75 76 77 78 79
also supported.

Apart of the API change to be more flexible for the future, this release
offers the following changes:

- faster GPU implementation, especially for ELPA 1stage
- the restriction of the block-cyclic distribution blocksize = 128 in the GPU
  case is relaxed
- Faster CPU implementation due to better blocking
- support of already banded matrices (new API only!)
- improved KNL support

80 81 82 83 84 85 86 87 88
Changelog for pre-release ELPA 2016.11.001.pre

This pre-release contains an experimental API which will most likely
change in the next stable release

- also suport of single-precision (real and complex case) eigenvalule problems
- GPU support in ELPA 1stage and 2stage (real and complex case)
- change of API (w.r.t. ELPA 2016.05.004) to support runtime-choice of GPU usage

89
Changelog for release ELPA 2016.05.004
Andreas Marek's avatar
Andreas Marek committed
90 91 92

- fix a problem with the private state of module precision
- distribute test_project with dist tarball
93
- generic driver routine for ELPA 1stage and 2stage
Andreas Marek's avatar
Andreas Marek committed
94 95 96 97 98 99 100 101 102 103
- test case for elpa_mult_at_b_real
- test case for elpa_mult_ah_b_complex
- test case for elpa_cholesky_real
- test case for elpa_cholesky_complex
- test case for elpa_invert_trm_real
- test case for elpa_invert_trm_complex
- fix building of static library
- better choice of AVX, AVX2, AVX512 kernels
- make assumed size Fortran arrays default

Andreas Marek's avatar
Andreas Marek committed
104 105 106 107 108 109 110 111 112 113
Changelog for release ELPA 2016.05.003

- fix a problem with the build of SSE kernels
- make some (internal) functions public, such that they
  can be used outside of ELPA
- add documentation and interfaces for new public functions
- shorten file namses and directory names for test programs
  in under to by pass "make agrument list too long" error

Changelog for release ELPA 2016.05.002
114 115 116

- fix problem with generated *.sh- check scripts
- name library differently if build without MPI support
Andreas Marek's avatar
Andreas Marek committed
117
- install only public modules
118 119


Andreas Marek's avatar
Andreas Marek committed
120
Changelog for release ELPA 2016.05.001
121

Andreas Marek's avatar
Andreas Marek committed
122
- support building without MPI for one node usage
123 124 125 126 127
- doxygen and man pages documentation for ELPA
- cleanup of documentation
- introduction of SSE gcc intrinsic kernels
- Remove errors due to unaligned memory
- removal of Fortran "contains functions"
Andreas Marek's avatar
Andreas Marek committed
128
- Fortran interfaces for assembly and C kernels
129 130