- Nov 08, 2014
-
-
Karl Rupp authored
Makes it much easier to e.g. only use OpenCL even though CUDA is enabled. Since this relies on a singleton, the mechanism is not thread-safe.
-
Karl Rupp authored
-
Karl Rupp authored
Migration to Doxygen 1.8.x complete. Resolves #18.
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
Results in shorter URLs.
-
Karl Rupp authored
-
Karl Rupp authored
A temporary buffer wasn't flushed, still contained old values.
-
Karl Rupp authored
Now deals correctly with very small systems for which the maximum iteration count is larger than the system size.
-
Karl Rupp authored
Uses temporaries C = trans(B) in order to avoid troubles with corner cases like A += trans(A).
-
- Nov 07, 2014
-
-
Karl Rupp authored
Enables support for operations like C = fabs(A - trans(B)) and is supposed to work with both matrix and vector expressions.
-
Karl Rupp authored
Flags used: -Wall -Wextra -Weverything -pedantic -Werror -Wno-exit-time-destructors -Wno-global-constructors -Wno-padded -Wno-weak-vtables -Wno-documentation -Wno-old-style-cast -Wno-switch-enum
-
Karl Rupp authored
-
Karl Rupp authored
For reasons of backwards compatibility and uniformity with other headers.
-
Karl Rupp authored
Flags used: -Wall -Wextra -Weverything -pedantic -Werror -Wno-exit-time-destructors -Wno-global-constructors -Wno-padded -Wno-weak-vtables
-
Karl Rupp authored
-
Karl Rupp authored
Resolves #82 to the extent possible. A bullet-proof fix for 'auto' requires substantial refactoring and internal changes, which won't happen prior to ViennaCL 2.0.0.
-
Karl Rupp authored
-
- Nov 06, 2014
-
-
Karl Rupp authored
Derived from a Radeon HD 6850 with a tuning run.
-
Karl Rupp authored
Obtained on a Radeon HD 5450. Very low-end GPU, profile aims at compatibility rather than performance.
-
Karl Rupp authored
-
Karl Rupp authored
Flags used: -Wall -Wextra -Weverything -pedantic -Werror -Wno-exit-time-destructors -Wno-global-constructors -Wno-padded -Wno-weak-vtables
-
Karl Rupp authored
-
Karl Rupp authored
Only requires four kernels per iteration, which is much better than the Householder version. Implementation follows Algorithm 2.1 in Walker, Zhou: "A Simpler GMRES" (1994)
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
- Nov 05, 2014
-
-
Toby Smithe authored
-
Karl Rupp authored
Also includes a new example showing the use case. Resolves #69. Reported-by: Pushkar Ratnalikar via viennacl-devel
-
Karl Rupp authored
Might have gotten lost during refactoring?
-
Karl Rupp authored
-
Karl Rupp authored
Vector types lead to compilation issues on NVIDIA GPUs with abs(), since x = abs(y) does not compile due to incompatible vector types.
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
-
Karl Rupp authored
Discusssion here: https://github.com/viennacl/viennacl-dev/issues/106
-
- Nov 04, 2014