- Mar 28, 2014
-
-
Karl Rupp authored
-
- Mar 02, 2014
-
-
Karl Rupp authored
This addresses issue #64, (mildly) reduces the amount of code, and simplifies certain automated testing methodologies. Reported by
-
- Feb 28, 2014
-
-
Karl Rupp authored
operator++ was incrementing the index incorrectly, while operator* was returning a proxy which did not update any values if needed. Both problems are now fixed and the respective tests are in place. Resolves #62 Reported-by: Karan Poddar via IRC.
-
- Feb 25, 2014
-
-
Philippe Tillet authored
-
- Feb 22, 2014
- Feb 21, 2014
-
-
Philippe Tillet authored
-
- Feb 20, 2014
-
-
Philippe Tillet authored
Shame on me for not taking enough care while merging my own branch
-
Philippe Tillet authored
-
- Feb 15, 2014
-
-
Karl Rupp authored
This fixes the compilation problems on MacOS X 10.6.8 (Snow Leopard)
-
- Feb 14, 2014
-
-
Karl Rupp authored
This was a copy&paste bug introduced by the recent refactoring of matrix_base<T>
-
- Feb 13, 2014
-
-
Philippe Tillet authored
-
Karl Rupp authored
Reported-by: Paul Dufort via viennacl-support
-
Philippe Tillet authored
-
Philippe Tillet authored
* More compact and cleaner code (Column-Major Internally) * New operators (reduce, row_reduce, col_reduce, element-wise, etc...) * Better files architecture
-
- Feb 12, 2014
-
-
Karl Rupp authored
The rationale for this change is to simplify the interfacing to external BLAS libraries, which take the layout as a runtime argument. Another benefit is that we can better unify kernels for row-major+non-transposed with column-major+transposed, which is essentially the same operation. Compilation times are mildly reduced for OpenMP and OpenCL, and mildly increased for CUDA. OpenMP and OpenCL tests are known to pass, CUDA is not tested. Code in the worker-backends can be further simplified in follow-up commits to obtain a more compact code base.
-
- Feb 05, 2014
-
-
Karl Rupp authored
No need to rely on OpenCL 1.1. MacOS X 10.6 for example only provides 1.0.
-
- Feb 04, 2014
-
-
Karl Rupp authored
-
- Feb 03, 2014
-
-
Karl Rupp authored
Since this is a fairly light-weight test, there's no need to have this split into two tests. Supposed to reduce compilation times a little.
-
Karl Rupp authored
Apparently the tests were split to compile float and double separately to reduce compiler load, but this was never fully completed. This reverts the split, compilation load is moderate and therefore float and double can be tested in the same run.
-
Karl Rupp authored
Reduces amount of source code, compilation times (a bit) and simplifies calls to external libraries.
-
Karl Rupp authored
This replaces compile-time dispatches with respect to row_major and column_major tags by runtime arguments (boolean flags). This reduces compilation times a little and simplifies the incorporation of external BLAS backends. Also, it reduces the amount of code we have to maintain :-)
-
- Jan 31, 2014
-
-
Josef Weinbub authored
-
- Jan 30, 2014
-
-
Josef Weinbub authored
-
- Jan 24, 2014
-
-
Karl Rupp authored
-
- Jan 20, 2014
-
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
The source files were missing in the CMakeLists.txt file and hence not copied over to the build folder.
-
Karl Rupp authored
Subtractions are currently not supported for unsigned int. However, the test used inner_prod(u+v,u-v); and hence reported failure. This problem is well-known, so the tests should only cover the functionality which is supposed to be working.
-
Karl Rupp authored
uBLAS becomes really slow in debug mode, ultimately dominating the execution times consumed by the tests. However, the purpose of these tests is not to test uBLAS, but ViennaCL. Hence, using BOOST_UBLAS_NDEBUG to get rid of the unnecessary overhead.
-
- Jan 09, 2014
-
-
Karl Rupp authored
global work size must always be divisible by local work size... Reported-by: Andreas Rost
-
Karl Rupp authored
Without the absolute value, the denominator might be negative and the check does not trigger. Reported-by: Walter Mascarenhas
-
Philippe Tillet authored
Thanks Tom Nicholson for reporting!
-
Karl Rupp authored
Row and column ID computation in kernels was incorrectly reused from the row-major case.
-
- Jan 08, 2014
- Jan 07, 2014
-
-
Karl Rupp authored
We want to uniformly use C++ functionality, hence replaced malloc() with new and added the missing delete.
-
- Dec 22, 2013