-
- Downloads
Counter for memory bandwidth (loads + stores)
Additionally one can now also measure load and stores, and thus the memory bandwidth. Therefore, also the arithmetic intensity. One caveat, though: The user is responsible to provide a meaningful value for the amount of bytes transferred in one load/store, via the "bytes_per_ldsr" parameter of the new function %set_print_options. Till now, I have now way of obtaining this value programmatically, and it also can and will vary for different sections of a program. For example, a SSE movapd instructions loads/stores 16 byte, but is still counted as one "load and store" instruction, just as well as a 1-byte mov. Feel free to advise me on a better set of machine counters.. Also, somewhat updated documentation.
Showing
- Doxyfile.in 1 addition, 1 deletionDoxyfile.in
- doxygen.am 1 addition, 1 deletiondoxygen.am
- ftimings/ftimings.F90 280 additions, 118 deletionsftimings/ftimings.F90
- ftimings/ftimings_value.F90 7 additions, 64 deletionsftimings/ftimings_value.F90
- ftimings/papi.c 121 additions, 30 deletionsftimings/papi.c
- ftimings/resident_set_size.c 0 additions, 11 deletionsftimings/resident_set_size.c
- test/do_flops.c 9 additions, 6 deletionstest/do_flops.c
- test/test_timings.F90 20 additions, 10 deletionstest/test_timings.F90
Loading
Please register or sign in to comment