Tests: Rewrote the GEMM test from scratch.
Also includes a fix for GEMM slice = X * X for the row-major layout. The bug went undetected before, presumably because the previous test used stride1 == stride2 for C. The same thing may be going on for GEMV.
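A minimal sketch of why equal strides can hide this kind of bug. It assumes, purely for illustration, that the bug mixes up stride1 and stride2 when addressing C; the actual defect in the kernel may be different. All names here (`index_row_major`, `index_swapped`, `write_slice`) are hypothetical and not taken from the code base.

```python
def index_row_major(i, j, start1, start2, stride1, stride2, internal_cols):
    # Flat offset of element (i, j) of a strided slice in a row-major buffer.
    return (start1 + i * stride1) * internal_cols + (start2 + j * stride2)


def index_swapped(i, j, start1, start2, stride1, stride2, internal_cols):
    # Hypothetical bug (an assumption, not the confirmed defect):
    # the two strides are exchanged.
    return (start1 + i * stride2) * internal_cols + (start2 + j * stride1)


def write_slice(index_fn, values, stride1, stride2,
                internal_rows=8, internal_cols=8):
    # Write `values` into a zeroed flat buffer through the given indexing rule.
    buf = [0.0] * (internal_rows * internal_cols)
    for i, row in enumerate(values):
        for j, v in enumerate(row):
            buf[index_fn(i, j, 0, 0, stride1, stride2, internal_cols)] = v
    return buf


values = [[1.0, 2.0], [3.0, 4.0]]

# stride1 == stride2: both indexing rules touch the same offsets.
same_equal = (write_slice(index_row_major, values, 2, 2)
              == write_slice(index_swapped, values, 2, 2))

# stride1 != stride2: the swapped rule writes to different offsets.
same_unequal = (write_slice(index_row_major, values, 2, 3)
                == write_slice(index_swapped, values, 2, 3))

print(same_equal)    # True
print(same_unequal)  # False
```

With stride1 == stride2 the correct and the swapped indexing produce identical buffers, so a test restricted to that case cannot tell them apart. Using distinct strides for C (and likewise for the GEMV result vector and operands) makes the difference observable.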