Commits · 605a1a9027c2f7c17b15241f0a9bc885360732ed · Kaushik Kulkarni / viennacl-dev · GitLab

Dec 11, 2014
- OpenMP: Ensured compatibility with OpenMP 2.0 (only signed integers as loop vars) · 605a1a90
  Karl Rupp authored Dec 11, 2014
  605a1a90
- Doxygen: Added reference to CSR-adaptive in manual. · 548d8b14
  Karl Rupp authored Dec 11, 2014
  
  548d8b14
- hyb_matrix: Using default workgroup sizes. · 9ce4b367
  Karl Rupp authored Dec 11, 2014
  9ce4b367
- coordinate_matrix: Fixed bug in row_info() kernels for OpenCL and CUDA. · 41189e24
  Karl Rupp authored Dec 11, 2014
  41189e24
- coordinate_matrix: Changed local thread size to 128 for better compatibility. · e6faa56e
  Karl Rupp authored Dec 11, 2014
  e6faa56e
- AMG: Fixed possibly incorrect matrix sizes of GPU-based operators. · 731a7810
  Karl Rupp authored Dec 11, 2014
  731a7810
Dec 10, 2014
- Updated version to 1.6.2 · 12c5ed2c
  Karl Rupp authored Dec 10, 2014
  
  12c5ed2c
- Device database: Removed unused member in template class. · 54450e8b
  Karl Rupp authored Dec 10, 2014
  54450e8b
- compressed_compressed_matrix: Fixed wrong buffer size in clear() · 384185b0
  Karl Rupp authored Dec 10, 2014
  384185b0
- GCC: Fixed all conversion warnings. · 3881ea8b
  Karl Rupp authored Dec 10, 2014
  3881ea8b
Dec 09, 2014
- GCC, Clang: Fixed compiler warnings · 390c47c1
  Karl Rupp authored Dec 09, 2014
  
  390c47c1
- Solver Bench: Added sliced_ell_matrix and added pipelined runs. · ecb1088a
  Karl Rupp authored Dec 09, 2014
  
  ecb1088a
- FFT: Fixed flag passed to CUDA kernel. · c09831bb
  Karl Rupp authored Dec 09, 2014
  c09831bb
- OpenMP: Fixed unspecified behavior for operations using reductions. · 71e46368
  Karl Rupp authored Dec 09, 2014
  71e46368
- OpenMP: Removed use of private and shared clauses. · 0a8e2999
  Karl Rupp authored Dec 09, 2014
  0a8e2999
- Iterative: Moved local array declaration out of CSR-adaptive kernel. · 40686dbf
  Karl Rupp authored Dec 09, 2014
  40686dbf
- Iterative: Fixed overloads for pipelined iterative solvers. · b825e5d2
  Karl Rupp authored Dec 09, 2014
  b825e5d2
Dec 06, 2014
- Direct solve bench: Removed accidental uBLAS dependency. · 8682205a
  Karl Rupp authored Dec 06, 2014
  
  8682205a
- CUDA: Fixing uses of 'uint' and performance warnings. · 4ebc5c60
  Karl Rupp authored Dec 06, 2014
  
  4ebc5c60
- CUDA: Fixed complaints about destructor in Visual Studio. · 0e2a3758
  Karl Rupp authored Dec 06, 2014
  0e2a3758
Dec 05, 2014
- vector_iterator: Fixed internal handling of smart-pointer. · 55b05254
  Karl Rupp authored Dec 05, 2014
  55b05254
Dec 04, 2014
- GMRES: Reverted to previous kernel in first pipelined stage for non-NVIDIA GPUs. · a4b7354a
  Karl Rupp authored Dec 04, 2014
  a4b7354a
- GMRES: Improved kernel first first stage of pipelined orthogonalization. · 4381e000
  Karl Rupp authored Dec 04, 2014
  4381e000
- compressed_matrix: Fixed missing context switch for CSR-adaptive metainfo. · 9d8bae24
  Karl Rupp authored Dec 04, 2014
  9d8bae24
- Pipelined solvers: Added better parameters for NVIDIA GPUs. · b6758fb9
  Karl Rupp authored Dec 04, 2014
  b6758fb9
- sliced_ell_matrix: Setting defaults for NVIDIA GPUs to 256. · acb1ca0c
  Karl Rupp authored Dec 04, 2014
  acb1ca0c
Nov 20, 2014
- Doxygen: Added symbolic link to changelog, did not work with 1.8.8. · 7a0f5794
  Karl Rupp authored Nov 20, 2014
  
  7a0f5794
- Doxygen: Now taking version number directly from CMakeLists.txt · f6856df3
  Karl Rupp authored Nov 20, 2014
  
  f6856df3
- CUDA: Added CSR-adaptive to pipelined iterative solvers. · e0d55f9e
  Karl Rupp authored Nov 20, 2014
  
  e0d55f9e
- OpenCL: Added CSR-adaptive for pipelined iterative solvers. · 8678f02f
  Karl Rupp authored Nov 20, 2014
  
  8678f02f
- CUDA: Cleanup of CSR-adaptive implemenentation, adjustment of block sizes. · bac9d4ab
  Karl Rupp authored Nov 20, 2014
  bac9d4ab
- Tests: Fixed incorrect test code in libviennacl-blas1. · 9eed5e20
  Karl Rupp authored Nov 20, 2014
  
  9eed5e20
- Changelog: Fixed incorrect co-author name of CSR-adaptive paper. · 61792c08
  Karl Rupp authored Nov 20, 2014
  
  61792c08
- Changelog: Added notes for 1.6.1 release. · bdd0ddd9
  Karl Rupp authored Nov 20, 2014
  
  bdd0ddd9
- Doxygen: Fixed warnings. · 078328ff
  Karl Rupp authored Nov 20, 2014
  
  078328ff
- Updated version to 1.6.1. · 5bf0373b
  Karl Rupp authored Nov 20, 2014
  
  5bf0373b
- Visual Studio 2012: Fixed performance warnings and a test compilation error. · 1f51ee98
  Karl Rupp authored Nov 20, 2014
  1f51ee98
Nov 19, 2014
- Direct solve: Fixed errors obtained after resolution of self-assignment problems. · df29d5f3
  Karl Rupp authored Nov 19, 2014
  
  df29d5f3
- CUDA: Fixed compilation error in triangular solve kernels. · f321e151
  Karl Rupp authored Nov 19, 2014
  f321e151
- compressed_matrix: Implemented CSR-adaptive in CUDA and OpenCL. · 7d212433
  Karl Rupp authored Nov 19, 2014
  7d212433