Commit fa833504 authored Oct 23, 2012 by dhj

Updated benchmark to use equivalent workloads for CPU and GPU.

* NumPy performs element-wise operations by default.  Updated the CPU
  operation to use pure NumPy.

* Eliminated the loop, which is not necessary to demonstrate
  parallelism on array operations.

* Made the number of workers explicit rather than GPU-chosen, via the
  local_size variable passed to kernel execution.

* Increased to ~8 million data points to more clearly demonstrate
  the difference between CPU- and GPU-based computations.
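
The CPU-side change described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the operation (element-wise addition), the float32 dtype, the exact array size N = 2**23, and the function names cpu_sum_loop and cpu_sum_numpy are all assumptions made for the example.

```python
import time

import numpy as np

# Assumption: 2**23 (about 8.4 million) stands in for "~8 million data points".
N = 2**23


def cpu_sum_loop(a, b):
    """Hypothetical loop-based CPU version: one Python-level add per element."""
    out = np.empty_like(a)
    for i in range(a.shape[0]):
        out[i] = a[i] + b[i]
    return out


def cpu_sum_numpy(a, b):
    """Pure NumPy version: `+` on arrays is already element-wise."""
    return a + b


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.random(N, dtype=np.float32)
    b = rng.random(N, dtype=np.float32)

    # Time only the vectorized form; the loop form is kept for comparison
    # on small inputs, since it is very slow at this size.
    start = time.perf_counter()
    c = cpu_sum_numpy(a, b)
    elapsed = time.perf_counter() - start
    print(f"NumPy element-wise add over {N} points: {elapsed:.4f} s")
```

Both functions return the same array; the vectorized form does per element the same work a GPU kernel would, which is what makes the two timings comparable.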

parent 0724c5b6
