
cached output allocation is broken #2002

Closed
jjsjann123 opened this issue Sep 27, 2022 · 0 comments · Fixed by #2010
@jjsjann123 (Collaborator)
🐛 Describe the bug

FusionExecutor caches output sizes: for the same input sizes, we re-use the cached output sizes instead of evaluating the outputs again.
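The caching behavior described above can be illustrated with a minimal sketch (the class and names here are hypothetical illustrations, not the actual FusionExecutor code):

```python
# Hypothetical sketch of output-size caching keyed by input sizes.
# OutputSizeCache is an illustrative name, not part of the real nvFuser API.

class OutputSizeCache:
    def __init__(self, compute_output_sizes):
        self._compute = compute_output_sizes  # the expensive evaluation
        self._cache = {}

    def get(self, input_sizes):
        # Re-use cached output sizes for previously seen input sizes.
        key = tuple(input_sizes)
        if key not in self._cache:
            self._cache[key] = self._compute(input_sizes)
        return self._cache[key]

# e.g. a fusion whose single output has the same shape as the input:
cache = OutputSizeCache(lambda sizes: [tuple(sizes)])
print(cache.get([4, 8]))  # evaluated once, then re-used for the same input sizes
```

This is exactly the assumption the bug breaks: the cache key is only the input sizes, which is unsound once output sizes can also depend on runtime scalars.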

This no longer works with our recent codegen expansion (TensorFactory methods & RNG ops). Unfortunately the issue wasn't caught earlier, since cache re-use was accidentally disabled in my refactor: https://github.com/csarofeen/pytorch/pull/1914/files#diff-3e62c8296c8362cd8c14a3d3300e5b2758d09b163ade856eefbf7361d75d7acaR373

So I'm going to add a check in the executor that walks through the fusion ops and disables cache re-use when those ops are present. There should be a more robust/targeted way to handle this: we only need to disable cached output allocation when a factory method depends on a runtime scalar input.
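The more targeted check, disabling cache re-use only when an output extent actually depends on a runtime scalar input, can be sketched roughly like this (the expression representation and function names are hypothetical, not the real nvFuser IR):

```python
# Hypothetical sketch: allow cached output allocation only when no output
# extent expression depends on a runtime scalar input.

def depends_on(expr, scalar_inputs):
    """Walk a nested-tuple expression tree such as ("mul", "i0_size", 2)."""
    if isinstance(expr, str):
        return expr in scalar_inputs          # a named scalar leaf
    if isinstance(expr, tuple):
        return any(depends_on(sub, scalar_inputs) for sub in expr[1:])
    return False                              # numeric constants

def cache_reuse_allowed(output_extents, scalar_inputs):
    return not any(depends_on(e, scalar_inputs) for e in output_extents)

# Extent derived only from tensor input sizes -> safe to re-use the cache:
print(cache_reuse_allowed([("mul", "i0_size", 2)], {"s0"}))  # True
# Factory-op extent consuming runtime scalar s0 -> must re-evaluate:
print(cache_reuse_allowed([("add", "s0", 1)], {"s0"}))       # False
```

This mirrors the eventual fix: walk every output IterDomain extent and enable re-use only when none of them is a consumer of a fusion scalar input.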

Versions

ToT devel

@jjsjann123 jjsjann123 self-assigned this Sep 27, 2022
This was referenced Sep 28, 2022
jjsjann123 added a commit that referenced this issue Sep 30, 2022
Fixes #2002

checks all IterDomains on outputs and verifies that no extent value is a consumer of fusion inputs.
pytorchmergebot pushed a commit to pytorch/pytorch that referenced this issue Oct 10, 2022
Cherry-picked from devel branch: csarofeen#2010

turns the accidentally disabled output allocation cache back on [#2002](csarofeen#2002)
Updated the safety check for the allocation cache: iterate over all IterDomains on the outputs and enable cache re-use only when no extent value is a consumer of fusion inputs (i.e., output sizes do not depend on scalar inputs).
Pull Request resolved: #86100
Approved by: https://github.com/csarofeen
jjsjann123 added a commit to jjsjann123/nvfuser that referenced this issue Oct 29, 2022
Cherry-picked from devel branch: csarofeen/pytorch#2010 (same change as above).
jjsjann123 added a commit to jjsjann123/nvfuser that referenced this issue Nov 10, 2022
Cherry-picked from devel branch: csarofeen/pytorch#2010 (same change as above).