Reinstate OpenCL level concurrency
This previously existed, but was ripped out through !67 (merged) after causing [this]( this and pytential#75 (closed).
In addition, loopy!205 (merged) made the existing code level-sequential anyway.
A future attempt at this might use this abstraction for concurrency with arrays.