- May 09, 2019
-
-
James Stevens authored
-
James Stevens authored
-
- Apr 12, 2019
-
-
James Stevens authored
-
- Apr 05, 2019
-
-
James Stevens authored
using tag_data_axes to transpose elements in dg kernel (instead of stupidly changing the kernel instructions themsleves)
-
- Apr 04, 2019
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Jan 31, 2019
-
-
James Stevens authored
-
- Jan 23, 2019
-
-
James Stevens authored
-
James Stevens authored
-
- Jan 04, 2019
-
-
James Stevens authored
-
James Stevens authored
-
- Jan 03, 2019
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Jan 02, 2019
-
-
James Stevens authored
-
- Dec 19, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- Dec 06, 2018
-
-
James Stevens authored
allowing remove_work tags for dg generator (though current dg kernel construction function doesn't do anything with them)
-
James Stevens authored
allowing default tag values *different* from usual tag values to be used when user does not pass any tag vals
-
James Stevens authored
refactored generator (arg, value, type) lists as KernelArg class; now allowing a arg/tag to accept ANY_VALUE and use a default when no arg value is passed
-
- Dec 05, 2018
-
-
James Stevens authored
-
- Nov 15, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- Nov 14, 2018
-
-
James Stevens authored
-
- Nov 13, 2018
-
-
James Stevens authored
-
- Nov 12, 2018
-
-
James Stevens authored
-
James Stevens authored
added measurement kernel for the storing pattern of the dg_diff vec fetch pattern measurement kernel
-
James Stevens authored
-
James Stevens authored
moved code around for dg training kernels so that kernels used for training same variant are grouped together (one more)
-
James Stevens authored
moved code around for dg training kernels so that kernels used for training same variant are grouped together
-
- Nov 11, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Nov 05, 2018
-
-
James Stevens authored
-
- Oct 31, 2018
-
-
James Stevens authored
updated access pattern version of lg overlap kernel to use lmem shuffle instead of local ops to avoid loop math optimizations
-
- Oct 30, 2018
-
-
James Stevens authored
fixed weirdly fluctuating exec times in lg_overlap demo (possibly due to loop math optimizations) by changing local ops from math to shuffle
-
- Oct 09, 2018
-
-
James Stevens authored
-
- Sep 26, 2018
-
-
James Stevens authored
-