changed some DG measurement kernel problem sizes after updating the kernels to prevent compiler loop optimizations; also no longer using mknls with (load-0, store-1), since they don't seem to reflect the actual cost per store very well