Commit 50668951 authored Apr 17, 2015 by Karl Rupp

SpGEMM: Added profiling information to CUDA implementation, using __ldg()

__ldg() is supposed to improve cache utilization.
Profiling available via VIENNACL_WITH_SPGEMM_CUDA_TIMINGS

parent 3b9df078

Please to comment