Commit e1617c38 authored by Karl Rupp

SpMdM: Added CUDA implementation for hyb_matrix, thus resolving #22

Now:
 C = prod(A, B); C = prod(A, trans(B));
are fully supported, where A is sparse and B and C are dense (either row- or column-major).
Some kernels can be tuned further for better-coalesced memory transfers,
or to avoid memory transfers altogether.
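
For reference, the two products above can be sketched on the CPU as follows. This is only an illustrative sketch, not the CUDA kernel from this commit: the types `HybSketch` and `DenseSketch` and the function `prod_sketch` are hypothetical names, the hybrid layout is assumed to be a simplified ELL part plus a COO overflow part (which may differ from the actual hyb_matrix storage), and the dense matrices are assumed row-major.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical, simplified hybrid sparse matrix: a fixed-width ELL part
// plus a COO part for overflow entries. Not the library's actual layout.
struct HybSketch {
  std::size_t rows, cols, ell_width;
  std::vector<std::size_t> ell_cols;  // rows * ell_width entries, row-major
  std::vector<double>      ell_vals;  // padding entries carry the value 0
  std::vector<std::size_t> coo_rows, coo_cols;
  std::vector<double>      coo_vals;
};

// Hypothetical dense row-major matrix.
struct DenseSketch {
  std::size_t rows, cols;
  std::vector<double> data;
  double& at(std::size_t i, std::size_t j)       { return data[i * cols + j]; }
  double  at(std::size_t i, std::size_t j) const { return data[i * cols + j]; }
};

// Computes C = A * B (transpose_B == false) or C = A * trans(B) (true).
DenseSketch prod_sketch(const HybSketch& A, const DenseSketch& B, bool transpose_B) {
  // The inner dimension of op(B) must match the column count of A.
  assert((transpose_B ? B.cols : B.rows) == A.cols);
  const std::size_t n = transpose_B ? B.rows : B.cols;
  DenseSketch C{A.rows, n, std::vector<double>(A.rows * n, 0.0)};

  // Reads entry (k, j) of op(B) without forming the transpose explicitly.
  auto b = [&](std::size_t k, std::size_t j) {
    return transpose_B ? B.at(j, k) : B.at(k, j);
  };

  // ELL part: every row holds exactly ell_width (possibly padded) entries.
  for (std::size_t i = 0; i < A.rows; ++i) {
    for (std::size_t s = 0; s < A.ell_width; ++s) {
      const double v = A.ell_vals[i * A.ell_width + s];
      if (v == 0.0) continue;  // skip padding
      const std::size_t k = A.ell_cols[i * A.ell_width + s];
      for (std::size_t j = 0; j < n; ++j) C.at(i, j) += v * b(k, j);
    }
  }

  // COO part: entries that did not fit into the ELL width.
  for (std::size_t e = 0; e < A.coo_vals.size(); ++e) {
    const std::size_t i = A.coo_rows[e];
    const std::size_t k = A.coo_cols[e];
    for (std::size_t j = 0; j < n; ++j) C.at(i, j) += A.coo_vals[e] * b(k, j);
  }
  return C;
}
```

Calling `prod_sketch(A, B, false)` corresponds to `C = prod(A, B)`, and `prod_sketch(A, Bt, true)` to `C = prod(A, trans(Bt))`. In a GPU version, the loop over `j` is the natural place to look for coalesced access, which is presumably what the tuning remark above refers to.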
parent b4442cd3