Barrier domain doesn't match implemented domain
Hey all,
I know this feature (overridden_get_grid_sizes_for_insn_ids) is technically unsupported, but I imagine there are other ways to set something like this up such that there is a mismatch between a desired and implemented domain. In either case, I need to find a way to get this working (and could use some pointers in the right direction)
MWE: https://gist.github.com/arghdos/854fc7d2058824f96cc30a6f3b631844
The problem (I think):
A barrier inside a kernel "extended" by overridden_get_grid_sizes_for_insn_ids inherits the codegen_state's implemented domain (loopy/codegen/control.py:165). The implemented domain for the assignment instructions a and b then pull the updated implemented domain from the insn_inames (loopy/codegen/instruction.py:94) and then the barrier's implemented domain no longer matches the desired
How to fix:
Suggestions most welcome