1. 28 Oct, 2015 1 commit
    • This commit improves ELPA's performance on Intel(R) Xeon(R) E5v2 and E5v3 series CPUs by: · fe63372d
      Alexander Heinecke authored
      - enabling the fusing of iterations of stage 5 in ELPA2 for every configuration
      - requiring the reduction bandwidth in ELPA2 to be at least 64
      - partially parallelizing the QR factorization in bandred_real with OpenMP
      - parallelizing SYMM with OpenMP
      - parallelizing SYR2K in bandred_real with OpenMP
      - parallelizing elpa1_reduce_add_vectors and elpa1_transpose_vectors with OpenMP
      - adding AVX2 support to the backtransformation elpa2_kernels (using the FMA3 instructions introduced with the Haswell microarchitecture)
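
The "at least 64" bandwidth rule can be illustrated with a minimal sketch. It assumes, which the commit message does not state, that the bandwidth is picked as a multiple of the block size; `choose_bandwidth`, `nblk`, and `min_bw` are hypothetical names and not ELPA identifiers.

```python
def choose_bandwidth(nblk, min_bw=64):
    """Return the smallest multiple of nblk that is >= min_bw.

    Assumption: the bandwidth must be a multiple of the block size nblk.
    """
    if nblk <= 0:
        raise ValueError("block size must be positive")
    # Ceiling division of min_bw by nblk, then scale back to a multiple of nblk.
    return -(-min_bw // nblk) * nblk
```

Under this assumption, a block size of 16 gives a bandwidth of 64, while a block size of 48 gives 96.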
  2. 28 May, 2015 3 commits
  3. 26 May, 2015 3 commits
  4. 19 May, 2015 3 commits
  5. 29 Apr, 2015 2 commits
  6. 28 Apr, 2015 4 commits
  7. 27 Apr, 2015 2 commits
    • Handle different OpenMP flags for Fortran and C · ba9a188f
      Lorenz Huedepohl authored
      There was an inconsistency when the OpenMP flag was different for the
      Fortran and C compiler (e.g. -openmp for ifort and -fopenmp for gcc).
      
      This led to strange errors when linking the example program with the C
      main() routine when using Intel Fortran, Intel MPI, and GCC together. A
      typical error message was:
      
        /usr/bin/ld: MPIR_Thread: TLS definition in [...]/intel64/lib/libmpi_dbg_mt.so section .tbss mismatches non-TLS definition in [...]/intel64/lib/libmpi_dbg.so section .bss
        [...]/intel64/lib/libmpi_dbg_mt.so: could not read symbols: Bad value
      
      The reason seems to be that the various MPI wrapper shell scripts
      (mpicc, mpiifort) need the correct OpenMP option to select the
      thread-safe Intel MPI debug library. Previously, OPENMP_FCFLAGS was
      always appended to LDFLAGS, which did not trigger this when linking a C
      main program with mpicc.
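
The idea behind the fix can be sketched as follows: the OpenMP flag passed at link time should match the compiler that performs the link, not always the Fortran one. This is an illustrative sketch only, not the project's build logic; `link_flags` and the dictionary keys are hypothetical names, and the two flag values are the ones quoted in the message above.

```python
def link_flags(main_language, openmp_flags, ldflags=()):
    """Return linker flags including the OpenMP flag for the linking compiler.

    main_language: language of the main program ("c" or "fortran" here).
    openmp_flags:  mapping from language to that compiler's OpenMP flag.
    """
    if main_language not in openmp_flags:
        raise KeyError(f"no OpenMP flag known for {main_language!r}")
    # Append the flag of the compiler that actually links the program.
    return list(ldflags) + [openmp_flags[main_language]]


# Flags as quoted in the commit message: -openmp for ifort, -fopenmp for gcc.
OPENMP_FLAGS = {"fortran": "-openmp", "c": "-fopenmp"}
```

With this mapping, `link_flags("c", OPENMP_FLAGS)` yields the GCC flag, so a C main program linked through mpicc would see the option it understands.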
    • Update OpenMP test for new autoconf 2.68 · 25004574
      Lorenz Huedepohl authored
      Source code passed to AC_LINK_IFELSE must now be wrapped in
      AC_LANG_SOURCE.
  8. 23 Mar, 2015 3 commits
  9. 19 Mar, 2015 1 commit
  10. 18 Mar, 2015 1 commit
  11. 16 Mar, 2015 5 commits
  12. 11 Mar, 2015 2 commits
  13. 09 Mar, 2015 3 commits
  14. 06 Mar, 2015 4 commits
  15. 03 Mar, 2015 1 commit
    • New ELPA 2015.02.001 release · c13fcc7b
      Andreas Marek authored
      If the user chose parameters for the QR decomposition that are
      not allowed, an error was produced. This error is now caught,
      and the library aborts with a message.
      
      It is now possible to switch on more debug messages via
      the environment variable "ELPA_DEBUG_MESSAGES=yes"
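
One way to set this variable from a launcher script is sketched below. The variable name and value come from the message above; the helper `env_with_debug_messages` is a hypothetical name, and how the library parses the value (for example, case sensitivity) is not stated here.

```python
import os


def env_with_debug_messages(base_env=None):
    """Return a copy of the environment with ELPA debug messages switched on."""
    # Copy so that neither the caller's mapping nor os.environ is modified.
    env = dict(os.environ if base_env is None else base_env)
    env["ELPA_DEBUG_MESSAGES"] = "yes"
    return env
```

The returned mapping can be passed as the `env` argument of `subprocess.run` when starting a program linked against ELPA.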
  16. 27 Feb, 2015 1 commit
  17. 11 Feb, 2015 1 commit