1. 31 Aug, 2016 1 commit
  2. 30 Aug, 2016 4 commits
  3. 05 Aug, 2016 1 commit
  4. 04 Aug, 2016 1 commit
  5. 07 Jul, 2016 1 commit
  6. 20 Jun, 2016 1 commit
  7. 13 Jun, 2016 1 commit
  8. 01 Jun, 2016 1 commit
  9. 31 May, 2016 1 commit
  10. 23 May, 2016 2 commits
  11. 19 May, 2016 1 commit
  12. 03 May, 2016 1 commit
  13. 02 May, 2016 1 commit
  14. 29 Apr, 2016 1 commit
  15. 25 Apr, 2016 2 commits
    • Andreas Marek's avatar
      Alignment error due to wrong stripe_width · f5feb969
      Andreas Marek authored
      In case of single precision calculations the stripe_width needs to
      be a multiple, which differs from the double precision by a factor of 2
      since one needs 32 bytes alignment and the sizeof(float) and sizeof(double)
      is different by a factor of two
      
      This commit closes issue #18
      f5feb969
    • Andreas Marek's avatar
      Error in single precision SSE BLOCK 4 kernel · 789121d6
      Andreas Marek authored
      The sub-kernels _8_ and _4_ were wrong
      This also solves problems with single precision SSE Block 6 kernel,
      since it also uses the Block 4 kernel
      789121d6
  16. 24 Apr, 2016 1 commit
  17. 20 Apr, 2016 1 commit
    • Andreas Marek's avatar
      Additional configure check for gcc SSE intrinsics · 896388e9
      Andreas Marek authored
      It turned out that if a CPU supports SSE the already existing
      test for SSE assembly instructions always passes.
      However, the compilation of gcc SSE intrinic instructions might
      nevertheless fail if gcc is not called with one of the options
      "-msse3", "-msse4" , "-msse4.1", "-msse4.2", "-mavx", or "-mavx2"!
      
      Obviously gcc does still not consider SSE as a standard on X86_64
      Intel CPUs.
      
      An additional configure test has been introduced, which test for
      gcc intrinsic sse instructions. If this test fails, the corresponding
      kernels are switched off.
      896388e9
  18. 19 Apr, 2016 3 commits
  19. 18 Apr, 2016 3 commits
  20. 15 Apr, 2016 1 commit
  21. 14 Apr, 2016 2 commits
  22. 13 Apr, 2016 4 commits
  23. 12 Apr, 2016 1 commit
  24. 08 Apr, 2016 2 commits
  25. 05 Apr, 2016 1 commit
    • Andreas Marek's avatar
      Introduction of new SSE kernels with different blocking · 69792b15
      Andreas Marek authored
      The SSE kernels with blocking of 2,4,6 (real case) and 1,2 (complex)
      case are now available by default
      
      Thus the following changes have been done
      - introduce new macros in configure.ac and Makefile.am
      - renmae the AVX kernels in AVX_AVX2 (they also support AVX2)
      - introduce new files with SSE kernel
      - introduce new kernel parameters !
      - make the SSE kernels callable
      
      The results are identical with previous kernels
      69792b15
  26. 04 Apr, 2016 1 commit