Regain performance by caching initializer names in ORTModule by baijumeswani · Pull Request #7685 · microsoft/onnxruntime

baijumeswani · 2021-05-13T18:20:10Z

Every call to self._graph_info.initializer_names or self._graph_info.initializer_names_to_train resulted in a O(n) lookup time. Although these two were Python sets, their lookup time was linear. This was happening most likely because on every reference, pybind made a fresh copy of the C++ unordered_set to a Python set.

While we research ways to optimize this, this pull request fixes the perf regression that #7631 introduced by caching the set of initalizer names on the frontend.

This pull request also adds more unit tests for testing the support for unused model parameters.

thiagocrepaldi

LGTM
I am assuming you ran Ravi's script which benchmarks all scenarios

baijumeswani · 2021-05-13T20:23:17Z

LGTM
I am assuming you ran Ravi's script which benchmarks all scenarios

I ran Ravi's script for one of the models and it looked good. I am assuming this will generalize to other models as well.

mrry · 2021-05-13T21:46:14Z

Thanks for figuring out the problem Baiju! I would not have expected that property access to be causing a set construction every time :).

We should definitely merge this as-is, but one possibility for fixing the TODO is to investigate the trick @codemzs used to avoid copying STL structures (vectors in this case) across the Python/C++ boundary:

onnxruntime/orttraining/orttraining/python/orttraining_pybind_state.cc

Line 19 in 7bb3f24

PYBIND11_MAKE_OPAQUE(std::vector<OrtValue>);

ytaous · 2021-05-13T22:16:52Z

do u know what's perf gain by % with Ravi's?

In reply to: 840811250

baijumeswani · 2021-05-13T22:24:27Z

@ytaous no additional gain. Just back to where we were before pull request #7631 caused the regression.

baijumeswani added 2 commits May 13, 2021 18:14

Improve perf by caching the initializer names set

e80c033

Cover more scenarios in unit tests for unused parameters

840d040

baijumeswani requested review from a team, BowenBao, liqunfu, mrry, spandantiwari and thiagocrepaldi as code owners May 13, 2021 18:20

Update layer shapes to test if correct layers are dropped

9d779e3

thiagocrepaldi approved these changes May 13, 2021

View reviewed changes

baijumeswani added component:ortmodule labels May 13, 2021

mrry approved these changes May 13, 2021

View reviewed changes

baijumeswani merged commit 37f69fc into master May 14, 2021

baijumeswani deleted the bmeswani/perf-regression branch May 14, 2021 03:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regain performance by caching initializer names in ORTModule#7685

Regain performance by caching initializer names in ORTModule#7685
baijumeswani merged 3 commits intomasterfrom
bmeswani/perf-regression

baijumeswani commented May 13, 2021 •

edited

Loading

Uh oh!

thiagocrepaldi left a comment

Uh oh!

baijumeswani commented May 13, 2021

Uh oh!

mrry commented May 13, 2021

Uh oh!

ytaous commented May 13, 2021

Uh oh!

baijumeswani commented May 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

baijumeswani commented May 13, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thiagocrepaldi left a comment

Choose a reason for hiding this comment

Uh oh!

baijumeswani commented May 13, 2021

Uh oh!

mrry commented May 13, 2021

Uh oh!

ytaous commented May 13, 2021

Uh oh!

baijumeswani commented May 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

baijumeswani commented May 13, 2021 •

edited

Loading