Introduce new key-value runtime option 'pxtrmm_for_generalized' and...

Introduce new key-value runtime option 'pxtrmm_for_generalized' and alternative codepath for GPU that uses PxTRAN+hermitian_multiply instead

Merge request reports

Loading