Device-Specific / GEMM : Now letting the compiler unroll the loop.
This probably requires more investigation: it may decrease performance on some platforms and increase it on others.
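To illustrate the kind of change described, here is a minimal sketch and not the actual kernel from this commit. It contrasts a hand-unrolled inner product over one K tile with the same computation written as a plain loop whose trip count is a compile-time constant, which leaves the unrolling decision to the compiler. The names `KTILE`, `dot_tile_manual`, and `dot_tile_loop` are hypothetical, and the tile depth of 4 is an assumption.

```cpp
#include <cstddef>

// Hypothetical tile depth along K; the value is an assumption, not taken from the commit.
constexpr std::size_t KTILE = 4;

// Inner product over one K tile, unrolled by hand.
inline float dot_tile_manual(const float* a, const float* b) {
    float acc = 0.0f;
    acc += a[0] * b[0];
    acc += a[1] * b[1];
    acc += a[2] * b[2];
    acc += a[3] * b[3];
    return acc;
}

// Same computation as a plain loop. The trip count is known at compile time,
// so the compiler is free to unroll it (or not) depending on the target.
inline float dot_tile_loop(const float* a, const float* b) {
    float acc = 0.0f;
    for (std::size_t k = 0; k < KTILE; ++k) {
        acc += a[k] * b[k];
    }
    return acc;
}
```

Both versions accumulate in the same order, so they should produce the same result. Whether the loop form ends up faster or slower depends on the compiler and the device, which is why benchmarking on each platform is needed.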