as well. Has to be improved later (since maybe the whole GPU infrastructure might change)
added some more wrappers for the cublas functions