- May 09, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 08, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
removed repetition from local tile sum knl because of weird performance (compiler may be doing some kind of loop optimization
-
- May 07, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- May 06, 2018
-
-
James Stevens authored
-
- May 05, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 03, 2018
-
-
James Stevens authored
-
- Apr 28, 2018
-
-
James Stevens authored
-
- Apr 26, 2018
-
-
James Stevens authored
-
- Apr 25, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
created new dg (naive vs. tiled) demo that utilizes training kernels specifically made to match dg access patterns
-
- Apr 09, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Apr 07, 2018
-
-
James Stevens authored
-
- Apr 05, 2018
-
-
James Stevens authored
created matmul naive vs tiled demo v2, which uses a set of kernel generators specifically designed to match the matmul access patterns
-
- Apr 04, 2018
-
-
James Stevens authored
-
- Apr 03, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Mar 28, 2018
-
-
James Stevens authored
in matmul naive vs tiled demo, now using training kernels that produce access-to-footprint-ratios that more closely match test kernels
-
James Stevens authored
-
James Stevens authored
-
- Mar 24, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Mar 22, 2018
-
-
James Stevens authored
-