1. 02 Jun, 2020 1 commit
  2. 20 Nov, 2019 1 commit
    • Wenzhe Yu's avatar
      Rewrite compute_hh_trafo CUDA kernels · 6cd5a4f1
      Wenzhe Yu authored
      * Switch to a simple non-WY algorithm
      * Unify real and complex cases
      * Update reduction kernel
      * Use __shfl_xor_sync for warp reduce (CUDA 9+)
      * Support 2^n block size, n = 1,2,...,10
      * Use templates when possible
      * Clean up unused CUDA functions
      * Increase default stripe width when using GPU
      6cd5a4f1
  3. 16 Jan, 2019 1 commit
  4. 11 Jan, 2019 1 commit
  5. 03 Sep, 2017 1 commit
  6. 02 Sep, 2017 1 commit
  7. 03 Aug, 2017 1 commit
  8. 29 May, 2017 1 commit
  9. 06 Apr, 2017 1 commit
  10. 03 Apr, 2017 1 commit
  11. 29 Mar, 2017 1 commit
    • Andreas Marek's avatar
      Rename cuda functions · 22a12154
      Andreas Marek authored
      - the functions now contain the appropiate real/complex in their
        name
      - unused functions have been removed as cleanup
      22a12154
  12. 21 Mar, 2017 1 commit