Skip to content
Commit 0f735ce7 authored by Philippe Tillet's avatar Philippe Tillet
Browse files

Generator : GEMM template : now unrolling fetch to local memory

5-10% performance improvement on AMD hardware.
Also uncommented some tests in generator_blas3-test
parent 2a0c3ab5
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment