BiCGStab: Applied const-correctness in OpenCL and CUDA kernels.
Some missing const qualifiers in the OpenCL kernels resulted in an almost 4x-performance drop. With equal const qualifiers, performance is now within a roughly ten percent margin.
Loading
Please register or sign in to comment