Count granularity2
These changes add a count granularity parameter to MemAccess and Op that specifies whether each operation is counted once per thread, once per warp, or once per group. For example, uniform (stride-0) global memory accesses should be counted per warp, because every thread in the warp accesses the same address.
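As a rough illustration of how the granularity changes a total count, here is a minimal standalone sketch. It is not the code in this change: the `CountGranularity` enum, the `total_count` helper, and the default warp size of 32 are all assumptions made for the example.

```python
from enum import Enum
from math import ceil


class CountGranularity(Enum):
    """Hypothetical granularity levels, named after the description above."""
    THREAD = "thread"
    WARP = "warp"
    GROUP = "group"


def total_count(per_unit_count, granularity, n_groups, group_size, warp_size=32):
    """Scale a per-unit count by the number of units at the given granularity.

    per_unit_count: how often one thread/warp/group performs the operation.
    warp_size=32 is an assumed default; real hardware may differ.
    """
    if granularity is CountGranularity.THREAD:
        units = n_groups * group_size
    elif granularity is CountGranularity.WARP:
        # A partially filled warp still counts as one warp.
        units = n_groups * ceil(group_size / warp_size)
    elif granularity is CountGranularity.GROUP:
        units = n_groups
    else:
        raise ValueError(f"unknown granularity: {granularity!r}")
    return per_unit_count * units


# One access per unit, 4 groups of 64 threads each.
per_thread = total_count(1, CountGranularity.THREAD, n_groups=4, group_size=64)
per_warp = total_count(1, CountGranularity.WARP, n_groups=4, group_size=64)
per_group = total_count(1, CountGranularity.GROUP, n_groups=4, group_size=64)
```

With these inputs, a uniform access counted per warp contributes 8 to the total rather than the 256 it would contribute if counted per thread.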
Edited by James Stevens