- Nov 29, 2012
-
-
Karl Rupp authored
* Improved matrix-vector-test. Now checks all combinations of matrix/matrix-range/matrix-slice and vector/vector-range/vector-slice on rank-1-updates, matrix-vector products and triangular solves
* Removed redundant prod_impl(A, b)
-
Karl Rupp authored
* Reduced execution time of sparse-test by speeding up the reference uBLAS calculations (double-transpose-trick)
-
- Nov 28, 2012
-
-
Karl Rupp authored
* Added inplace_qr_apply_trans_Q() to compute the right-hand side Q^T b of the least-squares system R x = Q^T b without setting up Q
* Fixed overloads for inplace_solve(A, b). More tests required, though.
* Unified use of viennacl::traits::clear(result) in CG and BiCGStab
-
Karl Rupp authored
* Added missing include directives for lu.hpp in one example and one test
-
Karl Rupp authored
* Reimplementation of LU factorization in viennacl/linalg/lu.hpp. Better performance, but still a lot of unused potential.
* Replaced slow generic CUDA matrix-matrix multiplication kernel by several semi-automatically generated kernels. Performance still only half of OpenCL, although code is virtually identical.
* Fixed a bug with C = prod(A, B) if C is a matrix_range or matrix_slice. An unnecessary temporary was introduced.
* CUDA-benchmarks now build correctly
-
- Nov 21, 2012
-
-
Karl Rupp authored
* Generalized sparse_matrix_adapter. Now all types std::vector< std::map<T, U> > are supported (T was fixed to 'unsigned int' previously)
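To illustrate the std::vector< std::map<T, U> > layout mentioned above, here is a minimal standalone sketch of a sparse matrix-vector product over that format. It assumes each map holds one row, keyed by column index; the function name sparse_matvec is hypothetical and is not part of ViennaCL or its sparse_matrix_adapter.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical helper (not ViennaCL API): computes y = A * x, where row i of A
// is stored as a map from column index to nonzero value. IndexT may be any
// integral type, e.g. unsigned int or std::size_t.
template <typename IndexT, typename ValueT>
std::vector<ValueT> sparse_matvec(const std::vector<std::map<IndexT, ValueT>>& A,
                                  const std::vector<ValueT>& x)
{
  std::vector<ValueT> y(A.size(), ValueT(0));
  for (std::size_t row = 0; row < A.size(); ++row)
  {
    for (const auto& entry : A[row])
    {
      const std::size_t col = static_cast<std::size_t>(entry.first);
      assert(col < x.size());  // column index must address an entry of x
      y[row] += entry.second * x[col];
    }
  }
  return y;
}
```

Because the index type is a template parameter, the same routine accepts rows keyed by 'unsigned int' as well as by other integral types.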
-
Karl Rupp authored
* Added level scheduling to ILUT, renamed routines from multifrontal_XYZ() to level_scheduling_XYZ()
* Fixed a couple of issues in block-ILU and improved performance. Now works well with CPU/OpenCL/CUDA, with CUDA favoring block sizes larger than the default of 8.
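As a rough illustration of the level scheduling idea, the sketch below assigns each row of a lower-triangular sparsity pattern to a level, such that rows within one level depend only on rows of earlier levels and could be processed in parallel. It assumes a CSR pattern (row_ptr, col_idx); compute_levels is a hypothetical name, and this is not the code in ViennaCL's level_scheduling_XYZ() routines.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper (not ViennaCL API): given the CSR pattern of a
// lower-triangular matrix, returns level[i] for every row i, where
// level[i] = 0 if row i has no strictly-lower entries, and otherwise
// level[i] = 1 + max(level[j]) over all entries (i, j) with j < i.
std::vector<std::size_t> compute_levels(const std::vector<std::size_t>& row_ptr,
                                        const std::vector<std::size_t>& col_idx)
{
  if (row_ptr.empty())
    return std::vector<std::size_t>();

  const std::size_t n = row_ptr.size() - 1;
  std::vector<std::size_t> level(n, 0);
  for (std::size_t i = 0; i < n; ++i)
  {
    for (std::size_t k = row_ptr[i]; k < row_ptr[i + 1]; ++k)
    {
      const std::size_t j = col_idx[k];
      if (j < i)  // diagonal and upper entries carry no dependency
        level[i] = std::max(level[i], level[j] + 1);
    }
  }
  return level;
}
```

The number of distinct levels bounds the number of sequential steps in the triangular solve; few levels with many rows each is the favorable case for GPUs.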
-
- Nov 18, 2012
-
-
Karl Rupp authored
* CPU-fallback for ViennaCL-based block-ILU now working correctly.
* Removed old bicgstab-kernels (unused anyway)
* Eliminated 'potentially uninitialized variable' warnings in BiCGStab
-
- Nov 16, 2012
-
-
Karl Rupp authored
* Improved OpenCL matrix-vector performance of coordinate_matrix (factor 2 on GTX 285)
* Added restart to BiCGStab if search direction vanishes or a certain number of iterations is reached.
* Added two missing operator-overloads for vector in order to handle b - prod(A,x)
-
Karl Rupp authored
* Fixed wrong estimated residual in BiCGStab as introduced with the previous commit
* Improved performance of block-ILU.
-
- Nov 15, 2012
-
-
Karl Rupp authored
* Row- and Jacobi-preconditioner now work on CPU, OpenCL and CUDA
* Final summation in norm_1, norm_2, norm_inf is now carried out on GPU or CPU, depending on target (same as for inner_prod())
* Tweaked CG and BiCGStab to use norm_2 instead of inner_prod(v, v)
-
Karl Rupp authored
* Typesafe multi-backend transfer now working, making implementations based upon it nice and compact :-)
* Added operator= to compressed_matrix<>
* Moved viennacl::backend::memory_types to viennacl::memory_types
-
- Nov 14, 2012
- Nov 13, 2012
- Nov 12, 2012
- Nov 11, 2012
-
-
Karl Rupp authored
Reimplementation of ILU0 for compressed_matrix. Using the new multi-backend, first tests indicate speedups by a factor of 10. That rocks. :-)
-
Karl Rupp authored
Sparse triangular solver for compressed_matrix now working on CPU. Moved in place for OpenCL and CUDA, testing required.
-
- Nov 09, 2012
-
-
Karl Rupp authored
-
- Nov 08, 2012
-
-
Karl Rupp authored
Some progress with triangular solver for compressed_matrix: unit-lower-solve and upper-solve working, but performance is rather poor. This is, however, expected, because the data structure is not well suited for that.
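For context, a unit-lower triangular solve on a row-compressed (CSR) matrix can be sketched as below. This is a minimal standalone version under the assumption of plain CSR arrays (row_ptr, col_idx, values); the name unit_lower_solve_inplace is hypothetical and the code is not taken from ViennaCL. It also shows why the format is a poor fit for parallel hardware: each row needs the already-computed results of earlier rows, so the outer loop is inherently sequential.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not ViennaCL API): solves L x = b in place for a
// unit-lower triangular matrix L given in CSR form. The diagonal is taken to
// be 1 implicitly; stored entries on or above the diagonal are ignored.
// On return, b holds the solution x.
void unit_lower_solve_inplace(const std::vector<std::size_t>& row_ptr,
                              const std::vector<std::size_t>& col_idx,
                              const std::vector<double>& values,
                              std::vector<double>& b)
{
  assert(row_ptr.size() == b.size() + 1);  // one row per entry of b
  for (std::size_t i = 0; i < b.size(); ++i)
  {
    for (std::size_t k = row_ptr[i]; k < row_ptr[i + 1]; ++k)
    {
      const std::size_t j = col_idx[k];
      if (j < i)
        b[i] -= values[k] * b[j];  // b[j] already holds x[j]
    }
  }
}
```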
-
Karl Rupp authored
* Added initializer types for vectors: unit_vector, zero_vector, scalar_vector
-
- Nov 07, 2012
- Nov 06, 2012
-
-
Karl Rupp authored
-
- Nov 05, 2012
- Nov 04, 2012
-
-
Karl Rupp authored
-
- Nov 03, 2012
- Nov 02, 2012
-
-
Karl Rupp authored
-
- Nov 01, 2012
- Oct 30, 2012