Skip to content
Commit 575300ac authored by Karl Rupp's avatar Karl Rupp
Browse files

* Reduced generic vector kernel (av, avbv, avbv_v) startup by 10-20 percent by packing arguments

* Matrix-matrix operations for CUDA now functional. Performance is lower than with OpenCL, though...
parent a844270f
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment