Skip to content
Commit fa833504 authored by dhj's avatar dhj
Browse files

Updated benchmark to use equivalent work loads for cpu and gpu.

* Numpy does element-wise operations by default.  Updated the cpu
  operation to use pure numpy.

* Eliminated the loop which is not necessary to demonstrate
  parallelism on array operations.

* Made the number of workers explicit rather than gpu chosen,
  through local_size variable passed to kernel execution.

* Increased to ~8 million data points to more clearly demonstrate
  the difference between cpu and gpu based computations.
parent 0724c5b6
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment