sliced_ell_matrix: Reduced block size to 32.
Improves performance on NVIDIA GPUs by about 10 percent on average. Also reduces memory footprint a little.
Loading
Please register or sign in to comment
Improves performance on NVIDIA GPUs by about 10 percent on average. Also reduces memory footprint a little.