- 02 Jun, 2020 1 commit
Andreas Marek authored
- 20 Nov, 2019 1 commit
Wenzhe Yu authored
* Switch to a simple non-WY algorithm
* Unify real and complex cases
* Update reduction kernel
* Use __shfl_xor_sync for warp reduce (CUDA 9+)
* Support 2^n block size, n = 1, 2, ..., 10
* Use templates when possible
* Clean up unused CUDA functions
* Increase default stripe width when using GPU
- 03 Aug, 2017 1 commit
Lorenz Huedepohl authored
Anything if it makes Andreas happy :)
- 18 Apr, 2017 1 commit
Andreas Marek authored
- 06 Apr, 2017 1 commit
Andreas Marek authored
- 21 Mar, 2017 2 commits
Andreas Marek authored
-
Andreas Marek authored
- 09 Feb, 2017 1 commit
Andreas Marek authored
- 07 Feb, 2017 1 commit
Andreas Marek authored