1. 10 Dec, 2019 3 commits
  2. 04 Dec, 2019 2 commits
  3. 20 Nov, 2019 1 commit
  4. 14 Nov, 2019 1 commit
  5. 11 Nov, 2019 1 commit
  6. 08 Nov, 2019 1 commit
  7. 05 Nov, 2019 1 commit
  8. 04 Nov, 2019 2 commits
  9. 31 Oct, 2019 1 commit
  10. 30 Oct, 2019 3 commits
  11. 29 Oct, 2019 1 commit
  12. 28 Oct, 2019 2 commits
    • Merge branch 'master_pre_stage' into skew · cfa307bb
      Andreas Marek authored
    • partially addressing issues with the GPU kernel · ec5b3bec
      Pavel Kus authored
      This commit addresses several issues. It essentially forbids the use of
      the GPU kernel, which has become obsolete and caused problems. However, it
      does not completely remove the related code, nor does it prevent the user
      from explicitly selecting the GPU kernel. If the user does select it,
      a warning is issued and the GENERIC kernel is used instead.
      In more detail:
      * commenting out operations in the GPU kernel that do not compile with
        CUDA 10.1. This makes the kernel definitely unusable (but that was
        already the case before)
      * removing the gpu_tridiag_band option, since the tridiag->banded routine
        is actually not ported to the GPU at all. This step will thus always be
        run on the CPU
      * removing the gpu_trans_ev_tridi_to_band option, since the GPU version
        of this step cannot run without the GPU kernel and is therefore not
        usable. This step will thus also be performed on the CPU
      * modifying REAL_GPU_KERNEL_ONLY_WHEN_GPU_IS_ACTIVE and
        COMPLEX_GPU_KERNEL_ONLY_WHEN_GPU_IS_ACTIVE such that the GPU kernel is
        not considered during autotuning
      
      * TODO: the GPU kernel can still be forced by the user. In this case,
        a warning is issued during the calculation and the kernel is switched
        to the GENERIC one. This should be improved: it should not be possible
        to choose the GPU kernel in the first place.
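The fallback described in this commit (warn, then run the GENERIC kernel when the GPU kernel is requested) can be sketched roughly as follows. This is only an illustration: `kernel_t`, `KERNEL_GENERIC`, `KERNEL_GPU` and `resolve_kernel` are hypothetical names, not identifiers taken from the library.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical kernel identifiers; the real library defines its own constants. */
typedef enum { KERNEL_GENERIC, KERNEL_GPU } kernel_t;

/* Return the kernel that will actually be run. If the obsolete GPU kernel
 * is requested, print a warning and fall back to the GENERIC kernel.
 * *was_replaced is set to 1 when the fallback happened, 0 otherwise. */
kernel_t resolve_kernel(kernel_t requested, int *was_replaced)
{
    assert(was_replaced != NULL);
    *was_replaced = 0;
    if (requested == KERNEL_GPU) {
        fprintf(stderr,
                "warning: GPU kernel is not usable, using GENERIC kernel instead\n");
        *was_replaced = 1;
        return KERNEL_GENERIC;
    }
    return requested;
}
```

Rejecting the GPU kernel already when the option is set, as the TODO suggests, would make this run-time check unnecessary.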
  13. 26 Oct, 2019 2 commits
  14. 25 Oct, 2019 1 commit
  15. 24 Oct, 2019 7 commits
  16. 23 Oct, 2019 3 commits
  17. 22 Oct, 2019 3 commits
    • fixing not freed a_dev · 57821b44
      Pavel Kus authored
      a_dev was never freed on the GPU.
      However, this might not be enough. What if bandred runs on the GPU and
      band_to_tridi on the CPU? a_dev is then not allocated. This has to be
      rethought in general.
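One possible way to handle the open question above is to record whether the buffer was ever allocated and to free it only in that case. The sketch below assumes this approach and is not the fix made in the commit. `dev_buffer`, `dev_buffer_alloc` and `dev_buffer_free` are hypothetical names, and host `malloc`/`free` stand in for the device allocation calls so that the example runs without a GPU.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical handle for a buffer such as a_dev. Host malloc/free stand in
 * for the device allocation and deallocation calls. */
typedef struct {
    double *ptr;
    int allocated;   /* 1 only if the allocating step actually ran */
} dev_buffer;

/* Allocate n elements; returns 1 on success, 0 on failure. */
int dev_buffer_alloc(dev_buffer *b, size_t n)
{
    b->ptr = malloc(n * sizeof *b->ptr);
    b->allocated = (b->ptr != NULL);
    return b->allocated;
}

/* Free the buffer only if it was allocated, so the call is harmless when
 * the step that would have allocated it ran on the CPU instead. */
void dev_buffer_free(dev_buffer *b)
{
    if (b->allocated) {
        free(b->ptr);
        b->ptr = NULL;
        b->allocated = 0;
    }
}
```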
    • Add support for NVTX profiling · bf7c6410
      Sebastian Ohlmann authored
      When profiling the GPU version, NVTX can be used to highlight the
      corresponding regions of the code in the timeline of the profiling tool
      (nvvp or Nsight Systems). This is very useful for correlating what happens
      on the GPU with the part of the code that is currently being executed.
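A minimal sketch of how code regions are typically marked with NVTX, assuming the legacy `nvToolsExt.h` header and its `nvtxRangePushA`/`nvtxRangePop` calls. The `WITH_NVTX` build flag, the wrapper names and the `range_depth` counter are hypothetical and only serve the illustration; the commit itself may wire this up differently. Without the flag, the wrappers compile to plain bookkeeping, so the code also builds where NVTX is not installed.

```c
#include <assert.h>
#include <stdio.h>

#ifdef WITH_NVTX            /* hypothetical build flag */
#include <nvToolsExt.h>     /* provides nvtxRangePushA and nvtxRangePop */
#endif

/* Number of currently open regions; kept only for this illustration. */
int range_depth = 0;

/* Open a named region; it shows up as a labelled bar in the profiler timeline. */
void profile_region_begin(const char *name)
{
#ifdef WITH_NVTX
    nvtxRangePushA(name);
#else
    (void)name;             /* no-op when built without NVTX */
#endif
    ++range_depth;
}

/* Close the most recently opened region. */
void profile_region_end(void)
{
#ifdef WITH_NVTX
    nvtxRangePop();
#endif
    --range_depth;
}
```

A region would then be bracketed as `profile_region_begin("bandred"); ... profile_region_end();`, where the label is whatever name should appear in the timeline.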
    • adding tool to check GPU memory · 82ffbd38
      Pavel Kus authored
  18. 21 Oct, 2019 1 commit
  19. 19 Oct, 2019 2 commits
  20. 17 Oct, 2019 2 commits