1. 21 May, 2015 2 commits
    • Andreas Marek's avatar
      Fix out-of-bound overflow in real valued GPU version · fe5c1990
      Andreas Marek authored
      1. The dimensions of an array have been wrong in cuda calls.
      2. Start to get rid of "assumed-size" arrays in the real case
         They are a nightmare to debug and easily lead to a conceptional
         error as in 1.
      
         Furthermore, the compiler can generally optimize code better if
         "assumed-shape" arrays are used, since more information is available
         at compile time
      fe5c1990
    • Andreas Marek's avatar
      Update autotools scripts for GPU Version · d71c74e2
      Andreas Marek authored
      - The compiler options for nvcc are changed
      - The include paths are updated
      d71c74e2
  2. 29 Apr, 2015 3 commits
    • Andreas Marek's avatar
      Cleanup of configure.ac · 18c83c76
      Andreas Marek authored
      Remove variables which are not needed (anymore)
      18c83c76
    • Andreas Marek's avatar
      configure.ac: move ELPA specific macros into ./m4 · c788ec6b
      Andreas Marek authored
      The macros which define the functionality to test
      for
       - GPU support only (no CPU based kernels)
       - a specific real/complex kernel (not all available kernels)
      
      are now defined in files in the m4 directory
      c788ec6b
    • Andreas Marek's avatar
      configure.ac: treat GPU kernel as other kernels · 0a27d7c8
      Andreas Marek authored
      Configure treats the GPU kernels now as any other kernel, i. e.
      if GPU support is enabled (and it is possible to build it) then
      it will be build in ADDITION to all other possible kernels for
      the desired hardware.
      
      Also, it is possbile to configure the build process for
      the GPU version ONLY (as it was already possible to trigger the
      build for only ONE specific real/complex kernel).
      
      Note: The sources at the moment CANNOT handle this, i.e. if
      GPU support is configured, the GPU only code path is compiled.
      This will be changed in the near future.
      0a27d7c8
  3. 28 Apr, 2015 7 commits
  4. 27 Apr, 2015 2 commits
    • Lorenz Huedepohl's avatar
      Handle different OpenMP flags for Fortran and C · ba9a188f
      Lorenz Huedepohl authored
      There was an inconsistency when the OpenMP flag was different for the
      Fortran and C compiler (e.g. -openmp for ifort and -fopenmp for gcc).
      
      This led to strange errors when linking the example program with the C
      main() routine when using Intel Fortran, Intel MPI, and GCC together, a
      typical error message was
      
        /usr/bin/ld: MPIR_Thread: TLS definition in [...]/intel64/lib/libmpi_dbg_mt.so section .tbss mismatches non-TLS definition in [...]/intel64/lib/libmpi_dbg.so section .bss
        [...]/intel64/lib/libmpi_dbg_mt.so: could not read symbols: Bad value
      
      The reason seems to be that the various MPI wrapper shell scripts
      (mpicc, mpiifort) need the correct OpenMP option to select the
      thread-safe Intel MPI debug library. Previously, always OPENMP_FCFLAGS
      was appended to LDFLAGS, which did not trigger this when linking a C
      main program with mpicc.
      ba9a188f
    • Lorenz Huedepohl's avatar
      Update OpenMP test for new autoconf 2.68 · 25004574
      Lorenz Huedepohl authored
      Sourcecode in AC_LINK_IFELSE must now be guarded by AC_LANG_SOURCE
      statements.
      25004574
  5. 24 Mar, 2015 2 commits
  6. 23 Mar, 2015 3 commits
  7. 19 Mar, 2015 1 commit
  8. 18 Mar, 2015 1 commit
  9. 16 Mar, 2015 5 commits
  10. 11 Mar, 2015 2 commits
  11. 09 Mar, 2015 3 commits
  12. 06 Mar, 2015 4 commits
  13. 03 Mar, 2015 1 commit
    • Andreas Marek's avatar
      New ELPA 2015.02.001 release · c13fcc7b
      Andreas Marek authored
      If the user chooeses parameters for the QR-decomposition which are
      not allowed an error has been produced. This error is caught now,
      and the library aborts with a message
      
      It is now possible to switch on more debug messages via
      the environment variable "ELPA_DEBUG_MESSAGES=yes"
      c13fcc7b
  14. 27 Feb, 2015 1 commit
  15. 11 Feb, 2015 2 commits
    • Andreas Marek's avatar
      Error in BGQ kernel detection · 0b19d06a
      Andreas Marek authored
      Error in configure test program fixed
      0b19d06a
    • Andreas Marek's avatar
      Abort on error in QR-decomposition · ff13970b
      Andreas Marek authored
      If the QR-decomposition is used wrongly (matrix size is not a
      multiple of block size) the the execution will abort, in
      order to prevent the wrong results, discussed in a previous commit
      
      Debug messages are now available by setting the environment variable
      "ELPA_DEBUG_MESSAGES" to "yes".
      ff13970b
  16. 03 Feb, 2015 1 commit
    • Andreas Marek's avatar
      Partial roll-back of ELPA_2015.02.001 release · 3d880b65
      Andreas Marek authored
      We found a bug in the QR-decomposition, which appears for
      some matrix sizes and produces wrong results!
      
      If the QR decomposition is switched on, an appropiate warning
      is shown.
      
      This bug is still under investigation
      3d880b65