replaced previous two kinds of local mem access measurement kernels with new...
replaced previous two kinds of local mem access measurement kernels with new single kind of local mem access/measurement kernel (local_shuffle)
replaced previous two kinds of local mem access measurement kernels with new single kind of local mem access/measurement kernel (local_shuffle)