Skip to content
Commit 50668951 authored by Karl Rupp's avatar Karl Rupp
Browse files

SpGEMM: Added profiling information to CUDA implementation, using __ldg()

__ldg() is supposed to improve cache utilization.
Profiling available via VIENNACL_WITH_SPGEMM_CUDA_TIMINGS
parent 3b9df078
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment