Improve cache retrieval speeds for execution by using lazy unpickling
of instructions. This requires caching the generation of invokers (which itself should also give a speed boost), since invoker generation needed to analyze the instructions in the kernel.
Showing
- loopy/kernel/__init__.py 12 additions, 1 deletionloopy/kernel/__init__.py
- loopy/kernel/instruction.py 12 additions, 0 deletionsloopy/kernel/instruction.py
- loopy/target/c/c_execution.py 6 additions, 3 deletionsloopy/target/c/c_execution.py
- loopy/target/execution.py 33 additions, 6 deletionsloopy/target/execution.py
- loopy/target/pyopencl_execution.py 6 additions, 3 deletionsloopy/target/pyopencl_execution.py
Loading
Please register or sign in to comment