Commit 87edf140 authored by Karl Rupp

compressed_matrix: Optimizing host-based sparse matrix-matrix product.

Performance is now within a factor of 2 of MKL.
Possible further tweaks:
 - reduce overhead of resize()
 - use row_buffer for C when scanning the nonzero pattern instead of a temporary buffer
 - parallel exclusive scan of row_buffer for C
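
For context, a minimal sketch of the general two-phase CSR approach these tweaks refer to: scan the nonzero pattern of C to get per-row counts, turn the counts into row offsets with an exclusive scan, then fill in column indices and values. This is not the code from this commit. The `Csr` struct and `spgemm` function are hypothetical names used for illustration only, the scan here is serial, and the counts are written directly into the row array of C, roughly in the spirit of the second tweak.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical minimal CSR container (not ViennaCL's compressed_matrix).
struct Csr {
    std::size_t rows = 0, cols = 0;
    std::vector<std::size_t> row_ptr;  // rows + 1 entries
    std::vector<std::size_t> col_idx;  // column index of each nonzero
    std::vector<double> values;        // value of each nonzero
};

// Sketch of C = A * B for CSR matrices on the host.
// Column indices within each row of C are NOT sorted.
Csr spgemm(const Csr& A, const Csr& B) {
    const std::size_t none = static_cast<std::size_t>(-1);
    Csr C;
    C.rows = A.rows;
    C.cols = B.cols;
    C.row_ptr.assign(A.rows + 1, 0);

    // marker[c] == i means column c was already seen in row i of C.
    std::vector<std::size_t> marker(B.cols, none);

    // Phase 1: scan the nonzero pattern, storing per-row counts in row_ptr.
    for (std::size_t i = 0; i < A.rows; ++i) {
        std::size_t count = 0;
        for (std::size_t ja = A.row_ptr[i]; ja < A.row_ptr[i + 1]; ++ja) {
            const std::size_t k = A.col_idx[ja];
            for (std::size_t jb = B.row_ptr[k]; jb < B.row_ptr[k + 1]; ++jb) {
                const std::size_t c = B.col_idx[jb];
                if (marker[c] != i) { marker[c] = i; ++count; }
            }
        }
        C.row_ptr[i] = count;
    }

    // Serial exclusive scan: convert per-row counts into row start offsets.
    std::size_t running = 0;
    for (std::size_t i = 0; i < A.rows; ++i) {
        const std::size_t count = C.row_ptr[i];
        C.row_ptr[i] = running;
        running += count;
    }
    C.row_ptr[A.rows] = running;

    // Phase 2: compute column indices and values.
    C.col_idx.resize(running);
    C.values.resize(running);
    std::fill(marker.begin(), marker.end(), none);
    std::vector<std::size_t> pos(B.cols, 0);  // slot in C for column c of the current row

    for (std::size_t i = 0; i < A.rows; ++i) {
        std::size_t next = C.row_ptr[i];
        for (std::size_t ja = A.row_ptr[i]; ja < A.row_ptr[i + 1]; ++ja) {
            const std::size_t k = A.col_idx[ja];
            const double a = A.values[ja];
            for (std::size_t jb = B.row_ptr[k]; jb < B.row_ptr[k + 1]; ++jb) {
                const std::size_t c = B.col_idx[jb];
                const double prod = a * B.values[jb];
                if (marker[c] != i) {
                    // First contribution to C(i, c): claim the next free slot.
                    marker[c] = i;
                    pos[c] = next;
                    C.col_idx[next] = c;
                    C.values[next] = prod;
                    ++next;
                } else {
                    // Later contribution: accumulate into the existing slot.
                    C.values[pos[c]] += prod;
                }
            }
        }
    }
    return C;
}
```

In this sketch the exclusive scan is the only step with a dependency across rows; both phases otherwise process rows independently, which is presumably why a parallel scan is listed as a possible follow-up.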

SpGEMM: Improved host-based implementation.
parent a0227178