* Reimplementation of LU factorization in viennacl/linalg/lu.hpp. Better... (e8a6e5b3) · Commits · Kaushik Kulkarni / viennacl-dev

Commit e8a6e5b3 authored Nov 27, 2012 by Karl Rupp

* Reimplementation of LU factorization in viennacl/linalg/lu.hpp. Better performance, but still a lot of unused potential.
* Replaced the slow generic CUDA matrix-matrix multiplication kernel with several semi-automatically generated kernels. Performance is still only half that of OpenCL, although the code is virtually identical.
* Fixed a bug with C = prod(A, B) when C is a matrix_range or matrix_slice: an unnecessary temporary was created.
* CUDA benchmarks now build correctly.
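
For readers unfamiliar with the operation named in the first item, the following is a minimal sketch of an in-place LU factorization without pivoting on a dense row-major matrix. It is not the code in viennacl/linalg/lu.hpp and does not use ViennaCL types. The function name `lu_factorize_inplace` and the `std::vector<double>` storage are assumptions made only for illustration, and the absence of pivoting is an assumption as well.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of ViennaCL: in-place LU factorization
// (Doolittle scheme, no pivoting) of a dense row-major n x n matrix.
// On return, the strict lower triangle holds L (its unit diagonal is implied)
// and the upper triangle including the diagonal holds U.
// Assumes every pivot a[k*n + k] is nonzero.
void lu_factorize_inplace(std::vector<double>& a, std::size_t n)
{
  for (std::size_t k = 0; k < n; ++k)
  {
    for (std::size_t i = k + 1; i < n; ++i)
    {
      // Multiplier l_ik, stored where the eliminated entry used to be.
      a[i * n + k] /= a[k * n + k];

      // Update the rest of row i (trailing submatrix update).
      for (std::size_t j = k + 1; j < n; ++j)
        a[i * n + j] -= a[i * n + k] * a[k * n + j];
    }
  }
}
```

The trailing submatrix update in the innermost loop is where most of the arithmetic happens. Blocked variants group several columns together so that this update becomes a matrix-matrix product, which is one reason LU performance and matrix-matrix multiplication performance tend to be related.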

parent 68ec5e72
