
Fix cublas caching for cublasGemv, cublasGemm

Petr Karpov requested to merge peter_fix_cublas_caching into master_pre_stage

Fix the problem with cublas caching for cublasGemv, cublasGemm.

It was introduced with cuBLAS 11.11.3.6 (see the CUDA 11.8.0 release notes: https://docs.nvidia.com/cuda/archive/11.8.0/cuda-toolkit-release-notes/index.html).

NVIDIA resolved the caching problem in cuBLAS 12.3.4.1: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cublas-release-12-3-update-1

For the intermediate cuBLAS versions we have to switch caching off by hand by calling cublasLtHeuristicsCacheSetCapacity(0).
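
A minimal sketch of how the version check around the workaround might look; this is not the code of this merge request. It assumes the build number (the fourth version component) can be ignored, so the affected range is taken as 11.11.3 up to but excluding 12.3.4. The names CublasVersion, encode, needs_cache_workaround, disable_heuristics_cache_if_needed and the HAVE_CUBLASLT build flag are hypothetical; cublasLtGetProperty and cublasLtHeuristicsCacheSetCapacity are the cuBLASLt calls.

```cpp
// Sketch: decide whether the cuBLASLt heuristics cache has to be disabled.
// Assumption: the build number is ignored, so the affected range is
// [11.11.3, 12.3.4).

struct CublasVersion {  // hypothetical helper type
    int ver_major;
    int ver_minor;
    int ver_patch;
};

// Encode as major*10000 + minor*100 + patch so versions compare as integers.
constexpr int encode(const CublasVersion& v) {
    return v.ver_major * 10000 + v.ver_minor * 100 + v.ver_patch;
}

// True for versions that have the caching problem and no fix yet.
constexpr bool needs_cache_workaround(const CublasVersion& v) {
    return encode(v) >= encode(CublasVersion{11, 11, 3}) &&
           encode(v) <  encode(CublasVersion{12, 3, 4});
}

#ifdef HAVE_CUBLASLT  // hypothetical flag: define it when building against cuBLASLt
#include <cublasLt.h>

// Queries the runtime library version and disables the heuristics cache if
// the version is in the affected range. Returns true only if the cache was
// actually disabled.
inline bool disable_heuristics_cache_if_needed() {
    CublasVersion v{};
    if (cublasLtGetProperty(MAJOR_VERSION, &v.ver_major) != CUBLAS_STATUS_SUCCESS ||
        cublasLtGetProperty(MINOR_VERSION, &v.ver_minor) != CUBLAS_STATUS_SUCCESS ||
        cublasLtGetProperty(PATCH_LEVEL,   &v.ver_patch) != CUBLAS_STATUS_SUCCESS) {
        return false;  // version query failed, leave the cache untouched
    }
    if (!needs_cache_workaround(v)) {
        return false;  // unaffected version, nothing to do
    }
    // A capacity of 0 turns the heuristics cache off.
    return cublasLtHeuristicsCacheSetCapacity(0) == CUBLAS_STATUS_SUCCESS;
}
#endif
```

Checking the runtime version rather than the compile-time CUBLAS_VERSION macro keeps the workaround correct when the binary is run against a different cuBLAS than it was built with.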
