Commit 87edf140 authored Apr 01, 2015 by Karl Rupp

compressed_matrix: Optimizing host-based sparse matrix-matrix product.

Performance relatively close to MKL (within 2x).
Possible further tweaks:
 - reduce overhead of resize()
 - use row_buffer for C when scanning nonzero pattern instead of temporary buffer
 - Parallel exclusive-scan of row_buffer for C

SpGEMM: Improved host-based implementation.

parent a0227178

Show whitespace changes

Inline Side-by-side

Please to comment