- Jun 23, 2018
-
-
James Stevens authored
-
- Jun 22, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- Jun 16, 2018
-
-
James Stevens authored
-
- Jun 15, 2018
-
-
James Stevens authored
-
- Jun 14, 2018
-
-
James Stevens authored
-
- Jun 13, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- Jun 04, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
updated simple matmul demo to use KernelCollection generate function rather than calling individual generator
-
- Jun 01, 2018
-
-
James Stevens authored
-
- May 30, 2018
-
-
James Stevens authored
added evaluation of lid/gid strides using kernel params, which enables more precise stride matching so that local sizes are not needed in demo global memory features
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
removed adjust_local_temp_var_storage, increased time trials, added random starting guess option, changed overlap demo size on stout
-
- May 18, 2018
-
-
James Stevens authored
-
- May 14, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 13, 2018
-
-
James Stevens authored
-
- May 12, 2018
-
-
James Stevens authored
-
James Stevens authored
added rept vals to local tile sum problem size tags to enable different measurement set for AMD card
-
- May 09, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
- May 08, 2018
-
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
-
James Stevens authored
removed repetition from local tile sum knl because of weird performance (compiler may be doing some kind of loop optimization
-
- May 07, 2018
-
-
James Stevens authored
-
James Stevens authored
-
- May 06, 2018
-
-
James Stevens authored
-