added TQL1 algorithm, improved compatibility for bisection algorithm
* added TQL1 algorithm * reduced OpenCL work group size from 512 to 256 for better compatibility for bisection algorithm * added option to reduce CUDA block dimension for bisection algorithm
Loading
Please register or sign in to comment