  1. 04 Nov, 2015 1 commit
    • AVX2 support for ELPA · 44244df1
      Andreas Marek authored
      Thanks to Intel (especially A. Heinecke), there is now an
      optimized version of ELPA with AVX2 support.
      
      This merge includes all the optimizations done by Intel plus some
      smaller changes which were necessary to incorporate these modifications.
  2. 03 Nov, 2015 2 commits
    • Remove of Makefile.example · d86c8c96
      Andreas Marek authored
      A build without autotools is no longer officially supported.
      The long-broken Makefile.example has therefore been removed.
    • Update of c test cases · 505004e7
      Andreas Marek authored
      The examples showing how to invoke ELPA from a C program have been updated.
      There are now examples for ELPA1 and ELPA2, for both the real and the complex case.
      The test cases still have less functionality than their Fortran
      counterparts; they are only meant as a proof of concept.
  3. 28 Oct, 2015 1 commit
    • This commit improves ELPA's performance on Intel(R) Xeon(R) E5v2 and E5v3 series CPUs by: · fe63372d
      Alexander Heinecke authored
      - enabling the fusing of stage 5 iterations in ELPA2 for every configuration
      - changing the reduction bandwidth in ELPA2 to be at least 64
      - partial OpenMP parallelization of the QR factorization in bandred_real
      - OpenMP parallelization of SYMM
      - OpenMP parallelization of SYR2K in bandred_real
      - OpenMP parallelization for elpa1_reduce_add_vectors and elpa1_transpose_vectors
      - AVX2 support in backtransformation elpa2_kernels (FMA3 instructions introduced with Haswell microarchitecture)
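To illustrate what an OpenMP parallelization of a SYR2K-style update can look like, here is a minimal sketch in C. It is not the code from this commit: the function name `syr2k_lower`, the row-major storage, and the fixed scaling factors (alpha = 1, beta = 1) are assumptions made for the example. It updates the lower triangle of C with A*B^T + B*A^T and distributes the rows of C across threads.

```c
#include <stddef.h>

/* Hypothetical helper, not ELPA code.
 * Lower-triangle rank-2k update: C := C + A*B^T + B*A^T
 * a, b: n x k matrices, row-major; c: n x n matrix, row-major.
 * Only entries c[i][j] with j <= i are touched. */
void syr2k_lower(size_t n, size_t k,
                 const double *a, const double *b, double *c)
{
    long i;

    /* Each row of C is written by exactly one thread, so no locking
     * is needed. Rows differ in length (triangular shape), hence the
     * dynamic schedule. Without an OpenMP compiler flag the pragma is
     * ignored and the loop simply runs serially. */
    #pragma omp parallel for schedule(dynamic)
    for (i = 0; i < (long)n; ++i) {
        size_t row = (size_t)i;
        for (size_t j = 0; j <= row; ++j) {
            double sum = 0.0;
            for (size_t p = 0; p < k; ++p) {
                sum += a[row * k + p] * b[j * k + p]
                     + b[row * k + p] * a[j * k + p];
            }
            c[row * n + j] += sum;
        }
    }
}
```

Calling `syr2k_lower(n, k, a, b, c)` on zero-initialized `c` leaves the strict upper triangle untouched. A production code would normally call an optimized BLAS routine per block rather than the plain triple loop shown here.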
  4. 24 Aug, 2015 1 commit
  5. 16 Jun, 2015 3 commits
  6. 28 May, 2015 3 commits
  7. 26 May, 2015 3 commits
  8. 19 May, 2015 3 commits
  9. 29 Apr, 2015 2 commits
  10. 28 Apr, 2015 4 commits
  11. 27 Apr, 2015 2 commits
    • Handle different OpenMP flags for Fortran and C · ba9a188f
      Lorenz Huedepohl authored
      There was an inconsistency when the OpenMP flag was different for the
      Fortran and C compiler (e.g. -openmp for ifort and -fopenmp for gcc).
      
      This led to strange errors when linking the example program with the C
      main() routine when using Intel Fortran, Intel MPI, and GCC together.
      A typical error message was
      
        /usr/bin/ld: MPIR_Thread: TLS definition in [...]/intel64/lib/libmpi_dbg_mt.so section .tbss mismatches non-TLS definition in [...]/intel64/lib/libmpi_dbg.so section .bss
        [...]/intel64/lib/libmpi_dbg_mt.so: could not read symbols: Bad value
      
      The reason seems to be that the various MPI wrapper shell scripts
      (mpicc, mpiifort) need the correct OpenMP option to select the
      thread-safe Intel MPI debug library. Previously, OPENMP_FCFLAGS was always
      appended to LDFLAGS, which did not trigger this when linking a C
      main program with mpicc.
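When debugging a mismatch like this, it can help to confirm whether the C side was actually compiled with an OpenMP flag. The sketch below is only an illustration and not part of the ELPA test programs; the helper names `openmp_version_or_zero` and `report_openmp` are made up for this example. It relies solely on the standard `_OPENMP` macro, which a conforming compiler defines when OpenMP is enabled.

```c
#include <stdio.h>

/* Hypothetical helper: returns the value of the _OPENMP macro
 * (a date in yyyymm form) if this file was compiled with an OpenMP
 * flag such as -fopenmp, and 0 otherwise. */
long openmp_version_or_zero(void)
{
#ifdef _OPENMP
    return (long)_OPENMP;
#else
    return 0L;
#endif
}

/* Hypothetical helper: prints a one-line summary to the given stream. */
void report_openmp(FILE *out)
{
    long v = openmp_version_or_zero();
    if (v != 0L) {
        fprintf(out, "C code compiled with OpenMP (_OPENMP = %ld)\n", v);
    } else {
        fprintf(out, "C code compiled without OpenMP\n");
    }
}
```

Calling `report_openmp(stdout)` from the C main() routine shows at a glance whether the C compiler received its own OpenMP option. It does not show which MPI library the wrapper script selected at link time.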
    • Update OpenMP test for new autoconf 2.68 · 25004574
      Lorenz Huedepohl authored
      Source code in AC_LINK_IFELSE must now be wrapped in AC_LANG_SOURCE
      statements.
  12. 23 Mar, 2015 3 commits
  13. 19 Mar, 2015 1 commit
  14. 18 Mar, 2015 1 commit
  15. 16 Mar, 2015 5 commits
  16. 11 Mar, 2015 2 commits
  17. 09 Mar, 2015 3 commits