Skip to content

fuse_kernels with kernel_callables

Should probably be implemented as "make new super-kernel that calls the kernels to be fused", "inline them", "analyze data flow to get finer-grained dependencies.

To make this equivalent, it needs at least two things:

  • #195
  • some form of data flow analysis

cc @kaushikcfdatA

Edited by Andreas Klöckner