Skip to content
Commit cc3cdca2 authored by Karl Rupp's avatar Karl Rupp
Browse files

Reverted to old CPU work size deduction, which is better for simple vector kernels.

Thread config 128x128 for CSR matrix-vector product is now applied right there. This gives the best of both worlds.
parent 3eae4433
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment