Added fast GEMM kernels for AMD Tahiti based on input from Philippe's...
Added fast GEMM kernels for AMD Tahiti based on input from Philippe's autotuner. Only works for square matrices with dimensions being a multiple of 256.
Loading
Please register or sign in to comment