Skip to content
Commit 70690439 authored by Karl Rupp's avatar Karl Rupp
Browse files

OpenMP: Performance optimization of amg_coarse_ag_stage1_mis2.

Removed unnecessary reads and writes from work arrays.
Observed performance gains of about 10 percent on an AMD A10-5800K
with four threads. Results likely to be similar on other machines.

Similar optimizations can be applied to the CUDA and OpenCL backends.
These will be added with a later commit.
parent a9158726
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment