- Apr 27, 2013
-
-
Karl Rupp authored
-
Karl Rupp authored
Removed use of enable_if for matrix operations. Now everything compiles cleanly again on Visual Studio 2005.
-
Karl Rupp authored
After issues with Visual Studio 2012 were resolved, Visual Studio 2005 now has some problems with enable_if<>. Hence, a couple of enable_if were replaced by overloads or removed completely.
-
Karl Rupp authored
-
Karl Rupp authored
-
- Apr 26, 2013
- Apr 25, 2013
-
-
Karl Rupp authored
Other tutorials and tests still to be revived.
-
- Apr 15, 2013
-
-
Karl Rupp authored
Matrix operations still broken. Work in progress.
-
- Apr 14, 2013
-
-
Karl Rupp authored
Allows to support vector, vector_range, and vector_slice with less code. Currently only examples/tutorial/blas1.{cpp,cu} compiles. Work in progress.
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
Reverted 128x128 work layout for CSR SpMV. Instead, increased search range of autotuner (better effect)
-
Karl Rupp authored
Thread config 128x128 for CSR matrix-vector product is now applied right there. This gives the best of both worlds.
-
Karl Rupp authored
Old settings based on core-count were based on ancient OpenCL SDKs and are no longer appropriate.
-
- Feb 22, 2013
- Feb 21, 2013
-
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
* Added guard for not using AMD GEMM kernels if device has less than 20 kB of local memory. * Fixed a warning for GEMM kernels (fast NVIDIA version).
-
Karl Rupp authored
-
- Feb 20, 2013
-
-
Karl Rupp authored
-
Karl Rupp authored
Added fast GEMM kernels for AMD Tahiti based on input from Philippe's autotuner. Only works for square matrices with dimensions being a multiple of 256.
-
Karl Rupp authored
-
Karl Rupp authored
Fixed a warning (/* in comment) in CL/cl_gl_ext.h (staying with OpenCL 1.1. The fix is already in OpenCL 1.2 headers)
-
Karl Rupp authored
Replaced all uses of size_t by std::size_t (exception: viennacl/generator/*, which will be replaced soon anyway)
-
Karl Rupp authored
-
- Feb 19, 2013
-
-
Karl Rupp authored
* Fixed some corner cases for BLAS-1-type operations on vectors. * Removed bottleneck in sparse-test-XYZ (manually transposing ublas-matrix).
-
- Feb 18, 2013
-
-
Karl Rupp authored
* Fixed an access violation in copy() from STL to device for compressed_matrix with empty rows.
-
- Feb 13, 2013
- Feb 12, 2013
-
-
Karl Rupp authored
Added support for Xeon Phi (using 128x128 work items). Adjusted number of work groups on CPU to power of two (six-core CPUs otherwise lead to problems with reductions).
-