[JIT] Cache compilation of free functions #30503
Conversation
Good change, but we need one additional thing to make it safe, I think. Compiling a redefined function will lead to weird behavior:

```python
def foo():
    return 1  # something

fn1 = torch.jit.script(foo)

def foo():
    return 2  # something else

fn2 = torch.jit.script(foo)
```

In the current implementation, fn2 will be the same as fn1. I think the fix is simple: you can use the fn PyObject pointer as the key into a compilation cache maintained from the Python side.
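A rough sketch of the suggested Python-side cache, assuming we key on the function object itself (all names here are illustrative, not the actual patch):

```python
import weakref

# Map each live Python function object to its compiled artifact.
# Keying on the object stands in for "the fn PyObject ptr": a redefined
# `foo` is a new object, so it gets a fresh cache entry, and the weak
# reference lets the entry die with the function.
_script_cache = weakref.WeakKeyDictionary()

def cached_script(fn, compile_fn):
    try:
        return _script_cache[fn]
    except KeyError:
        compiled = _script_cache[fn] = compile_fn(fn)
        return compiled
```

Using the object itself (rather than `id(fn)`) as the key also sidesteps the hazard of a raw pointer value being reused after the original function is garbage collected.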
Can you also add a test to verify that we get the corner case right?
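A minimal sketch of such a test, assuming the cache keys on the function object (illustrative, not the test that actually landed):

```python
import torch

def test_redefined_function_recompiles():
    def foo(x):
        return x + 1

    fn1 = torch.jit.script(foo)

    def foo(x):  # redefinition: a new function object with new behavior
        return x + 2

    fn2 = torch.jit.script(foo)

    x = torch.tensor(1.0)
    # fn2 must reflect the new definition rather than the cached fn1
    assert fn1(x).item() == 2.0
    assert fn2(x).item() == 3.0
```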
Good point, and yeah, the change sounds good. We could consider doing something like this for all objects which are script-convertible; this would also fix #30421. The change for modules is a little more complicated because you have to check that attributes haven't changed.
Actually, hold off on review; I need to fix something. Edit: fixed.
lgtm besides some code movement requests!
Run Constant Propagation upon compilation only on ops with non-aliasing inputs and outputs. This speeds up the first run of `torchvision.models.resnet18` by over 50% and speeds up compilation by about 25% (although the effects didn't seem additive with #30503, so I'm going to land this PR first and then see if caching still has a sizable impact). Running constant prop only with non-aliasing types does a lot of graph cleanup by removing constant ifs and a bunch of other smaller ops. It also avoids all the jitter problems we had when we tried running full constant prop previously. Because it is idempotent it doesn't jitter, and it doesn't jitter graphs constructed from tracing because tracing doesn't emit any ops that involve only non-aliasing inputs. Full constant prop isn't idempotent because which ops are run depends on the state of mutation in the alias db, which will often change upon successive iterations of constant propagation, and because it affects graphs constructed from tracing. [ghstack-poisoned]
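A minimal illustration of the kind of cleanup described, assuming the pass folds expressions built only from non-aliasing values like ints and bools (the exact printed graph depends on the pass pipeline):

```python
import torch

@torch.jit.script
def branchy(x):
    # `1 + 1 == 2` involves only non-aliasing int/bool values, so
    # constant propagation can fold the condition and drop the dead branch.
    if 1 + 1 == 2:
        return x + 1
    else:
        return x - 1

# With the pass in place, the prim::If should no longer appear here.
print(branchy.graph)
```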
@eellison has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@eellison is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Run Constant Propagation upon compilation only on ops with non-aliasing inputs and outputs. This speeds up the first run of `torchvision.models.resnet18` by over 50% and speeds up compilation by about 25% (although the effects didn't seem additive with #30503, so I'm going to land this PR first and then see if caching still has a sizable impact). Running constant prop only with non-aliasing types does a lot of graph cleanup by removing constant ifs and a bunch of other smaller ops. It also avoids all the jitter problems we had when we tried running full constant prop previously. Because it is idempotent it doesn't jitter, and it doesn't jitter graphs constructed from tracing because tracing doesn't emit any ops that involve only non-aliasing inputs. Full constant prop isn't idempotent because which ops are run depends on the state of mutation in the alias db, which will often change upon successive iterations of constant propagation, and because it affects graphs constructed from tracing. Edit: if we were okay with running constant propagation on graphs constructed from tracing (potentially making them hard to debug), an alternative would be to run constant propagation until the graph reaches a fixed point (see the sketch below). [ghstack-poisoned]
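A sketch of that fixed-point alternative driven from Python, assuming the existing constant-propagation pass binding is available (the convergence loop is illustrative; the real change would live in the C++ pass pipeline):

```python
import torch

def constant_prop_to_fixed_point(graph):
    # Re-run full constant propagation until the graph stops changing.
    # Comparing the printed form is a crude but simple convergence check.
    prev = None
    while prev != str(graph):
        prev = str(graph)
        torch._C._jit_pass_constant_propagation(graph)
    return graph
```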
Summary: Pull Request resolved: #30544 Run Constant Propagation upon compilation only on ops with non-aliasing inputs and outputs. This speeds up the first run of `torchvision.models.resnet18` by over 50% and speeds up compilation by about 25% (although the effects didn't seem additive with #30503, so I'm going to land this PR first and then see if caching still has a sizable impact). Running constant prop only with non-aliasing types does a lot of graph cleanup by removing constant ifs and a bunch of other smaller ops. It also avoids all the jitter problems we had when we tried running full constant prop previously. Because it is idempotent it doesn't jitter, and it doesn't jitter graphs constructed from tracing because tracing doesn't emit any ops that involve only non-aliasing inputs. Full constant prop isn't idempotent because which ops are run depends on the state of mutation in the alias db, which will often change upon successive iterations of constant propagation, and because it affects graphs constructed from tracing. Edit: if we were okay with running constant propagation on graphs constructed from tracing (potentially making them hard to debug), an alternative would be to run constant propagation until the graph reaches a fixed point. Test Plan: Imported from OSS Differential Revision: D18833607 Pulled By: eellison fbshipit-source-id: 92a0adb4882d67ed5a0db5c279f5e122aeeba54a
Summary: We don't have to recompile free functions if we've already compiled them. This improved compilation time of `resnet18` by 27%. Pull Request resolved: pytorch#30503 Differential Revision: D18796501 Pulled By: eellison fbshipit-source-id: 2dee0fc5fcf9adc5b92213f8cb813730d71b376f
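A quick sketch of the user-visible effect, assuming the cache keys on the Python function object as discussed above (the identity check reflects my reading of the change, not a documented guarantee):

```python
import torch

def foo(x):
    return x + 1

fn1 = torch.jit.script(foo)
fn2 = torch.jit.script(foo)

# Scripting the same, unmodified function object should now hit the
# cache instead of recompiling, so both names see one compiled artifact.
assert fn1 is fn2
```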