- Dec 02, 2012
-
-
Karl Rupp authored
* Added Doxygen comments to all namespaces. * Updated comments on host-based implementations to clearly state (optional) OpenMP usage.
-
- Dec 01, 2012
-
-
Karl Rupp authored
* Removed MSVC-switch in tutorials and benchmarks for reading files (require users to run from build/ accross different OS) * Updated old Eigen-code to version 3.x * Fixed a few more warnings in Visual Studio, added /wd4996 flag to get rid of VC iterator advertisements * Fixed an overly strict assert() on vector-reductions with OpenCL, including a clean initialization of reduction vector * Changed STL overload of norm_X from enable-if to plain overloading, otherwise MSVC has problems.
-
Karl Rupp authored
* Added finish() before copy() in tests in order to resolve issues with AMD APP SDK
-
Karl Rupp authored
* Fixed all warnings obtained in Visual Studio 2005 and 2010 * Reverted SFINAE in CTOR for vector to separate overloads for vector_range and vector_slice (does not work with VS 2005) * Moved default-implementation for predicates to forwards.h, otherwise Visual Studio does not recognize forward definitions properly * Removed unnecessary Boost.filesystem and Boost.system components check from dist-package * Adjusted version number in Doxyfile and CMakeLists.txt
-
- Nov 30, 2012
-
-
Karl Rupp authored
* Added least_squares and iterative to CUDA-examples * Fixed a minor flaw in viennacl-info
-
Karl Rupp authored
* viennacl-info now prints informations for all available platforms. * user-provided OpenCL context is no longer free'd at exit (inc() on handle after assignment). * Added Philippe's input to changelogs
-
Karl Rupp authored
* Reformatted SPAI implementation.
-
- Nov 29, 2012
-
-
Karl Rupp authored
* Fixed several doxygen warnings, still some more left
-
Karl Rupp authored
* Split manual into three parts: Core Functionality, Addon Functionality, Miscellaneous
-
Karl Rupp authored
* Improved matrix-vector-test. Now checks all combinations of matrix/matrix-range/matrix-slice and vector/vector-range/vector-slice on rank-1-updates, matrix-vector products and triangular solves * Removed redundant prod_impl(A, b)
-
Karl Rupp authored
* Reduced execution time of sparse-test by speeding up the reference uBLAS calculations (double-transpose-trick)
-
- Nov 28, 2012
-
-
Karl Rupp authored
* Added inplace_qr_apply_trans_Q() to compute rhs of least-squares system R = Q^T b without setting up Q * Fixed overloads for inplace_solve(A, b). More tests required, though. * Unified use of viennacl::traits::clear(result) in CG and BiCGStab
-
Karl Rupp authored
* Added missing include directives for lu.hpp in one example and one test
-
Karl Rupp authored
* Reimplementation of LU factorization in viennacl/linalg/lu.hpp. Better performance, but still a lot of unused potential. * Replaced slow generic CUDA matrix-matrix multiplication kernel by several semi-automatically generated kernels. Performance still only half of OpenCL, although code is virtually identical. * Fixed a bug with C = prod(A, B) if C is a matrix_range or matrix_slice. An unnecessary temporary was introduced. * CUDA-benchmarks now build correctly
-
- Nov 21, 2012
-
-
Karl Rupp authored
* Generalized sparse_matrix_adapter. Now all types std::vector< std::map<T, U> > are supported (T was fixed to 'unsigned int' previously)
-
Karl Rupp authored
* Added level scheduling to ILUT, renamed routines from multifrontal_XYZ() to level_scheduling_XYZ() * Fixed a couple of issues in block-ILU and improved performance. Now works well with CPU/OpenCL/CUDA, with the latter striving for higher block sizes than the default 8.
-
- Nov 18, 2012
-
-
Karl Rupp authored
* CPU-fallback for ViennaCL-based block-ILU now working correctly. * Removed old bicgstab-kernels (unused anyway) * Eliminated 'potentially uninitialized variable' warnings in BiCGStab
-
- Nov 16, 2012
-
-
Karl Rupp authored
* Improved OpenCL matrix-vector performance of coordinate_matrix (factor 2 on GTX 285) * Added restart to BiCGStab if search direction vanishes or a certain number of iterations is reached. * Added two missing operator-overloads for vector in order to handle b - prod(A,x)
-
Karl Rupp authored
* Fixed wrong estimated residual in BiCGStab as introduced with the previous commit * Improved performance of block-ILU.
-
- Nov 15, 2012
-
-
Karl Rupp authored
* Row- and Jacobi-preconditioner now work on CPU, OpenCL and CUDA * Final summation in norm_1, norm_2, norm_inf is now carried out on GPU or CPU, depending on target (same as for inner_prod()) * Tweaked CG and BiCGStab to use norm_2 instead of inner_prod(v, v)
-
Karl Rupp authored
-
Karl Rupp authored
* Typesafe multi-backend transfer now working, making implementations based upon them nice and compact :-) * added operator= to compressed_matrix<> * moved viennacl::backend::memory_types to viennacl::memory_types
-
- Nov 14, 2012
- Nov 13, 2012
- Nov 12, 2012
- Nov 11, 2012
-
-
Karl Rupp authored
Reimplementation of ILU0 for compressed_matrix. Using the new multi-backend, first tests indicate speedups of 10. That rocks. :-)
-
Karl Rupp authored
-
Karl Rupp authored
Sparse triangular solver for compressed_matrix now working on CPU. Moved in place for OpenCL and CUDA, testing required.
-
- Nov 09, 2012
-
-
Karl Rupp authored
-
- Nov 08, 2012
-
-
Karl Rupp authored
Some progress with triangular solver for compressed_matrix: unit-lower-solve and upper-solve working, but performance is rather poor. This is, however, expected, because the data structure is not well suited for that.
-
Karl Rupp authored
* Added initializer types for vectors: unit_vector, zero_vector, scalar_vector
-
- Nov 07, 2012
- Nov 06, 2012
-
-
Karl Rupp authored
-
- Nov 05, 2012