- Jun 22, 2018
-
-
James Stevens authored
-
- Jun 16, 2018
-
-
James Stevens authored
-
- Jun 15, 2018
-
-
James Stevens authored
-
- Jun 13, 2018
-
-
James Stevens authored
-
- Jun 11, 2018
-
-
James Stevens authored
-
- Jun 10, 2018
-
-
James Stevens authored
-
- Jun 04, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- Jun 03, 2018
-
-
James Stevens authored
-
James Stevens authored
made ALL_GENERATORS list that can be passed to KernelCollection, made two non_key_val_tag matching options (necessary vs. sufficient)
-
- May 14, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 12, 2018
-
-
James Stevens authored
-
- May 08, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 07, 2018
-
-
James Stevens authored
-
James Stevens authored
made new local load training kernel that doubles as local/global overlap training kernel (get_exec_local_directed_tile_sum_kernel)
-
- May 05, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 03, 2018
-
-
James Stevens authored
-
- Apr 28, 2018
-
-
James Stevens authored
-
- Apr 25, 2018
-
-
James Stevens authored
improved dg diff vec-prefetch algorithm, added training kernels/generators specifically designed to match dg access patterns
-
- Apr 09, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- Apr 05, 2018
-
-
James Stevens authored
-
- Mar 30, 2018
-
-
James Stevens authored
added 5 matmul access kernels and generators that exactly match the 5 different matmul access patterns found in the prefetch and no-prefetch versions of the matmul_sq kernel
-
- Mar 27, 2018
-
-
James Stevens authored
For tailored access pattern, fixed GlobalArg shape issue for cases with 0 strides. Iname associated with zero stride repaced in array index with '0', instead added for loop over each iname with 0 stride; Also now allowing string lid strides as well as int
-
James Stevens authored
-
- Mar 26, 2018
-
-
James Stevens authored
undoing last change (manually setting lsizes in tailored access pattern kernel so that arg arrays are not oversized), produces incorrect stats counting
-
James Stevens authored
-
- Mar 25, 2018
-
-
James Stevens authored
small change to empty knl problem sizes, using more keyname args to prevent bugs due to incorrect arg ordering
-
James Stevens authored
-
- Mar 23, 2018
-
-
James Stevens authored
-
- Mar 22, 2018
-
-
James Stevens authored
-
James Stevens authored
-