Commit fd106a68 authored by Karl Rupp

Generator: Changed index type from uint to size_t for GEMM.

This improves sGEMM performance on AMD Hawaii by about a factor
of 2 with more recent drivers, which now use 64-bit addressing.
Double-precision performance increases mildly (about 10 percent).
No performance change was observed on NVIDIA devices.
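
To illustrate the kind of change described, here is a minimal sketch of a source generator that emits an OpenCL C kernel with a configurable index type. It is not the project's actual generator: the helper name `make_scale_kernel` and the toy `scale` kernel are hypothetical, chosen only to show how switching the loop index from `uint` to `size_t` appears in the generated source. The element count stays `uint` because OpenCL does not allow `size_t` as a kernel argument type.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (not the project's generator API): returns the source of
// a toy OpenCL C kernel whose loop index is declared with the requested type.
std::string make_scale_kernel(const std::string& index_type)
{
  // Only the two types discussed in the commit message are accepted here.
  assert(index_type == "uint" || index_type == "size_t");

  std::string src;
  // The element count stays uint: size_t is not a valid kernel argument type.
  src += "__kernel void scale(__global float* x, float alpha, uint n)\n";
  src += "{\n";
  // get_global_id() and get_global_size() return size_t in OpenCL C, so a
  // size_t index matches their return type without a narrowing conversion.
  src += "  for (" + index_type + " i = get_global_id(0); i < n; i += get_global_size(0))\n";
  src += "    x[i] *= alpha;\n";
  src += "}\n";
  return src;
}
```

Calling `make_scale_kernel("size_t")` yields a kernel whose loop index is `size_t`, while `make_scale_kernel("uint")` reproduces the older 32-bit form. Whether the wider index helps depends on the device and driver, as the measurements above indicate.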
parent b631a783