elpa
Commit 5659ad6f, authored Dec 08, 2021 by Soheil Soltani
Implement cudaDeviceSynchronize()
parent c54d95b9
Changes: 2
src/invert_trm/GPU/CUDA/elpa_invert_trm_cuda.cu
@@ -614,3 +614,7 @@ extern "C" void copy_float_complex_a_tmp1_FromC(float _Complex *a_dev, float _Co
    printf("Error in executing copy_float_complex_a_tmp1_kernel: %s\n",cudaGetErrorString(cuerr));
  }
}

extern "C" void device_synchronizeFromC(){

Andreas Marek @amarek · Dec 08, 2021 · Owner
This should not be necessary. We have device_synchronize already implemented in cudaFunctions.cu

  cudaDeviceSynchronize();
}
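
The wrapper in this hunk discards the status returned by cudaDeviceSynchronize(), whereas the kernel wrappers in the same file report failures through cudaGetErrorString(cuerr). A minimal sketch of a variant that follows that reporting pattern is given below. The name device_synchronize_checked_FromC and the 1/0 return convention are assumptions made for illustration; this is not the existing device_synchronize from cudaFunctions.cu that the review comment refers to.

```cuda
#include <stdio.h>
#include <cuda_runtime.h>

// Hypothetical variant of device_synchronizeFromC (name and return
// convention are assumptions, not part of this commit).
// Blocks until all previously issued work on the current device has
// finished, then reports any error surfaced by that work.
extern "C" int device_synchronize_checked_FromC() {
  cudaError_t cuerr = cudaDeviceSynchronize();
  if (cuerr != cudaSuccess) {
    printf("Error in executing cudaDeviceSynchronize: %s\n",
           cudaGetErrorString(cuerr));
    return 0;  // assumed convention: 0 signals failure
  }
  return 1;    // assumed convention: 1 signals success
}
```

Because kernel launches are asynchronous, an error raised inside a kernel may only become visible at the next synchronizing call, so checking the status here can catch failures that the launch-time check misses.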
src/invert_trm/mod_invert_trm_cuda.F90
@@ -50,6 +50,12 @@ module invert_trm_cuda
  implicit none

  public

  interface
    subroutine device_synchronize_c() &
             bind(C, name="device_synchronizeFromC")
    end subroutine
  end interface

  interface
    subroutine copy_double_a_tmat2_c(a_dev, tmat2_dev, nblk, matrixRows, l_cols, l_colx, l_row1, nb)&
             bind(C, name="copy_double_a_tmat2_FromC")
@@ -492,6 +498,9 @@ module invert_trm_cuda
#endif
  end subroutine

  subroutine device_synchronize()
    call device_synchronize_c()
  end subroutine device_synchronize

end module