Triangular solve: Use a local work size of 128 on OpenCL for both GPUs and CPUs.
Local work sizes of 1 or 2 on CPUs seem to cause problems with some SDKs.