
PyTorch frontend: make type inference incremental #6900

Merged
merged 1 commit into apache:main from t-vi:incremental_type_inference on Dec 9, 2020

Conversation

@t-vi (Contributor) commented Nov 11, 2020

Currently, the PyTorch frontend uses the vanilla TVM type inference pass to get the types.
Combined with the incremental nature of translating the graph, this makes for quadratic (in the graph size) runtime.
This patch instead runs type inference on a subgraph, starting from the nodes that already have types. It is a bit hacky, though, because it essentially implements in-place modification where TVM does not foresee it.

For converting Huggingface BERT, this patch gives a ~10x speedup in conversion (from 31 seconds to just below 3 seconds), so it solves a very real and quite painful problem.
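
A minimal sketch of the idea (not the actual patch; the helper name is made up, and it assumes TVM's Python API with `tvm.IRModule.from_expr`, `relay.transform.InferType`, and `checked_type` raising `ValueError` while unpopulated):

```python
# Illustrative sketch only, not the code in this PR.
import tvm
from tvm import relay


def infer_type_incremental(expr):
    """Infer the type of one new (non-function) expression instead of re-checking the whole graph."""
    # Reuse the annotation if an earlier inference pass already typed this node.
    try:
        return expr.checked_type
    except ValueError:
        pass
    # Otherwise run InferType on a throwaway module wrapping only this
    # subexpression rather than the entire translated graph.
    mod = tvm.IRModule.from_expr(expr)
    mod = relay.transform.InferType()(mod)
    # from_expr puts a non-function expression into the body of "main".
    return mod["main"].body.checked_type
```

Because the wrapper module contains only the new subexpression, each call stays cheap and the overall conversion becomes roughly linear in graph size instead of quadratic.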

@t-vi (Contributor, Author) commented Nov 11, 2020

The other question is whether we would want to make the translator use a class interface so that the type inference can directly access the outputs' types.

@t-vi (Contributor, Author) commented Nov 11, 2020

@masahi (Member) commented Nov 12, 2020

> The other question is whether we would want to make the translator use a class interface so that the type inference can directly access the outputs' types.

Yes, we can consider this approach. Initially I thought a function-only approach would be cleaner (e.g. for recursively converting blocks in If and Loop), but it resulted in passing various constants such as convert_map, prelude, default_dtype, etc. around to each function.

We can introduce a shared object (an instance of a class) to hold these constants.
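
A rough sketch of what such a class interface could look like (purely hypothetical; the class name, method names, and attributes below are illustrative, not TVM's actual API):

```python
# Hypothetical sketch, not TVM's actual converter class.
from tvm import relay


class PyTorchOpConverter:
    """Holds the state that was previously threaded through free functions."""

    def __init__(self, prelude, default_dtype="float32"):
        self.prelude = prelude              # shared Relay prelude
        self.default_dtype = default_dtype  # fallback dtype for untyped tensors
        self.types = {}                      # cache of already-inferred output types
        # Map TorchScript op names to bound conversion methods (abbreviated).
        self.convert_map = {
            "aten::relu": self.relu,
        }

    def relu(self, inputs, input_types):
        # Conversion routines become methods, so they can reach self.prelude,
        # self.default_dtype, and self.types without extra arguments.
        return relay.nn.relu(inputs[0])
```

With the constants living on the instance, a type-inference helper like the one sketched in the PR description can also become a method and consult the cache of output types instead of re-running whole-module inference.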

t-vi added a commit to t-vi/tvm that referenced this pull request Dec 3, 2020
While the functional approach is pretty neat, we ended up having global state (default frontend, dtype), and there will soon be more (caching of inferred types, see apache#6900). To avoid having to pass the state around, this moves the op conversion into a class whose instances hold the state.
t-vi added a commit to t-vi/tvm that referenced this pull request Dec 4, 2020
t-vi added a commit to t-vi/tvm that referenced this pull request Dec 4, 2020
t-vi added a commit to t-vi/tvm that referenced this pull request Dec 4, 2020
masahi pushed a commit that referenced this pull request Dec 4, 2020
trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Dec 4, 2020
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request Dec 4, 2020
@t-vi force-pushed the incremental_type_inference branch from a9badc1 to 5e07318 on December 8, 2020 at 15:38
@t-vi (Contributor, Author) commented Dec 8, 2020

@masahi @siju-samuel I think this is ready for review now. For BERT conversion, I get a 10x speedup in from_pytorch. In addition to removing the O(N²) problem, the patch prunes the module before type inference, which seems essential when the prelude is involved.

@masahi (Member) commented Dec 9, 2020

@t-vi Please have a look at the CI issue. It is due to the recent change I made.

@t-vi force-pushed the incremental_type_inference branch from 5e07318 to 1046b65 on December 9, 2020 at 12:15
@t-vi (Contributor, Author) commented Dec 9, 2020

Oh, right, an undetected merge conflict. Fixed. Thank you @masahi.

@masahi merged commit db0215e into apache:main on Dec 9, 2020
@masahi (Member) commented Dec 9, 2020

Thanks @t-vi

@t-vi (Contributor, Author) commented Dec 9, 2020

Thank you @masahi for the guidance, discussion, and review!

TusharKanekiDey pushed a commit to TusharKanekiDey/tvm that referenced this pull request Jan 20, 2021
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request Jan 21, 2021
electriclilies pushed a commit to electriclilies/tvm that referenced this pull request Feb 18, 2021
electriclilies pushed a commit to electriclilies/tvm that referenced this pull request Feb 18, 2021