Add ONNXProgram.call API to run model with ONNX Runtime #113495

thiagocrepaldi · 2023-11-10T22:43:26Z

Stack from ghstack (oldest at bottom):

Currently the user can use torch.onnx.dynamo_export to export the model.
to ONNX.

import torch

class Model(torch.nn.Module):
    def forward(self, x):
        return x + 1.0

onnx_program = torch.onnx.dynamo_export(
    Model(),
    torch.randn(1, 1, 2, dtype=torch.float),
)

The next step would be instantiating a ONNX runtime to execute it.

import onnxruntime  # type: ignore[import]

onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs)
options = options or {}
providers = options.get("providers", onnxruntime.get_available_providers())
onnx_model = self.model_proto.SerializeToString()
ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers)

def to_numpy(tensor):
    return (
        tensor.detach().cpu().numpy()
        if tensor.requires_grad
        else tensor.cpu().numpy()
    )

onnxruntime_input = {
    k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input)
}

return ort_session.run(None, onnxruntime_input)

This PR provides the ONNXProgram.__call__ method as facilitator to use ONNX Runtime under the hood, similar to how torch.export.ExportedProgram.__call__ which allows the underlying torch.fx.GraphModule to be executed.

Currently the user can use torch.onnx.dynamo_export to export the model to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiate a ONNX runtime to execute it ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the ONNXProgram.__call__ method as facilitator to use ONNX Runtime under the hood, similar to how torch.export.ExportedProgram.__call__ which allows the underlying torch.fx.GraphModule to be executed [ghstack-poisoned]

pytorch-bot · 2023-11-10T22:43:31Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113495

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 25989b8 with merge base 85b9760 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-focal-py3_8-clang9-xla / test (xla, 1, 1, linux.12xlarge) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Currently the user can use torch.onnx.dynamo_export to export the model to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiate a ONNX runtime to execute it ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the ONNXProgram.__call__ method as facilitator to use ONNX Runtime under the hood, similar to how torch.export.ExportedProgram.__call__ which allows the underlying torch.fx.GraphModule to be executed ghstack-source-id: 584190c46c02d4c4a35717e14c79d49c9176d3ad Pull Request resolved: #113495

torch/onnx/_internal/exporter.py

Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. [ghstack-poisoned]

thiagocrepaldi · 2023-11-15T00:42:24Z

@BowenBao PTAL

torch/onnx/_internal/exporter.py

Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. [ghstack-poisoned]

BowenBao · 2023-11-15T23:39:11Z

torch/onnx/_internal/exporter.py

+        Args:
+            args: The positional inputs to the model.
+            kwargs: The keyword inputs to the model.
+            options: The options to use for running the model with ONNX Runtime.


Since this is public api I think we should be careful with introducing arguments. Should we make options a dataclass?

We didn't use dataclass for torch.onnx.ExportOptions, but I do agree that having a defined type instead of Anywould be a more robust solution. I will change options beclass ONNXRuntimeOption`.

In the near future we probably will add at least some of the following members to it:

sess_options: Sequence[onnxruntime.SessionOptions] | None = None, providers: Sequence[str | tuple[str, dict[Any, Any]]] | None = None, provider_options: Sequence[dict[Any, Any]] | None = None,

so that onnxruntime.InferenceSession can be instantiated with any customization we need

test/onnx/onnx_test_common.py

BowenBao

fyi here is some iobinding reference w/ dynamo_export

pytorch/benchmarks/dynamo/common.py

Line 1353 in 5d170fc

def create_iobinding(self, pt_inputs, example_outputs):

Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. [ghstack-poisoned]

Currently the user can use torch.onnx.dynamo_export to export the model to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiate a ONNX runtime to execute it ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the ONNXProgram.__call__ method as facilitator to use ONNX Runtime under the hood, similar to how torch.export.ExportedProgram.__call__ which allows the underlying torch.fx.GraphModule to be executed ghstack-source-id: 11bbe8dd1ed4543bec5c5922ca3b63f16ce0bf53 Pull Request resolved: #113495

Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. [ghstack-poisoned]

Currently the user can use torch.onnx.dynamo_export to export the model to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiate a ONNX runtime to execute it ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the ONNXProgram.__call__ method as facilitator to use ONNX Runtime under the hood, similar to how torch.export.ExportedProgram.__call__ which allows the underlying torch.fx.GraphModule to be executed ghstack-source-id: e603ee212b2effd886415779741e006eeb9ccc15 Pull Request resolved: #113495

Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(*args, **kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. [ghstack-poisoned]

thiagocrepaldi · 2023-11-22T01:46:56Z

@pytorchbot merge -f "unrelated xla failure"

pytorchmergebot · 2023-11-22T01:48:39Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

thiagocrepaldi requested review from BowenBao, abock and wschin as code owners November 10, 2023 22:43

This was referenced Nov 10, 2023

Add inheritance to ONNX's InputAdaptStep and OutputAdaptSet impl #113476

Closed

Add optional torch.export.ExportGraphSignature to ONNXProgram #113477

Closed

pytorch-bot bot added the release notes: onnx torch.onnx related changes that should show up in the release notes label Nov 10, 2023

thiagocrepaldi mentioned this pull request Nov 10, 2023

Add support for models with mutated buffer on torch.onnx.dynamo_export #112272

Closed

thiagocrepaldi mentioned this pull request Nov 10, 2023

Extend _TestONNXRuntime to reuses all tests for new model format #112289

Closed

thiagocrepaldi added module: onnx Related to torch.onnx onnx-triaged triaged by ONNX team labels Nov 10, 2023

pytorchbot added the open source label Nov 10, 2023

vadimkantorov reviewed Nov 11, 2023

View reviewed changes

torch/onnx/_internal/exporter.py Outdated Show resolved Hide resolved

Thiago Crepaldi added 2 commits November 13, 2023 23:25

This was referenced Nov 14, 2023

[ONNX] Update back the mutated buffers into the original PyTorch model #113686

Closed

[ONNX] Execute ONNX Runtime with IOBindings through ONNXProgram.__call__ #113687

Open

Thiago Crepaldi added 5 commits November 14, 2023 22:02

vadimkantorov reviewed Nov 15, 2023

View reviewed changes

torch/onnx/_internal/exporter.py Outdated Show resolved Hide resolved

vadimkantorov reviewed Nov 15, 2023

View reviewed changes

torch/onnx/_internal/exporter.py Outdated Show resolved Hide resolved

Thiago Crepaldi added 2 commits November 15, 2023 17:59

BowenBao reviewed Nov 15, 2023

View reviewed changes

test/onnx/onnx_test_common.py Outdated Show resolved Hide resolved

BowenBao reviewed Nov 15, 2023

View reviewed changes

Thiago Crepaldi added 3 commits November 16, 2023 19:19

thiagocrepaldi requested a review from BowenBao November 17, 2023 01:05

Thiago Crepaldi added 2 commits November 17, 2023 18:19

Thiago Crepaldi added 4 commits November 17, 2023 20:09

titaiwangms approved these changes Nov 21, 2023

View reviewed changes

pytorchmergebot added the merging label Nov 22, 2023

pytorchmergebot added the Merged label Nov 22, 2023

pytorchmergebot closed this in 3f736c2 Nov 22, 2023

pytorchmergebot removed the merging label Nov 22, 2023

facebook-github-bot deleted the gh/thiagocrepaldi/11/head branch November 25, 2023 15:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ONNXProgram.call API to run model with ONNX Runtime #113495

Add ONNXProgram.call API to run model with ONNX Runtime #113495

thiagocrepaldi commented Nov 10, 2023 •

edited

pytorch-bot bot commented Nov 10, 2023 •

edited

thiagocrepaldi commented Nov 15, 2023

BowenBao Nov 15, 2023

thiagocrepaldi Nov 16, 2023

BowenBao left a comment

thiagocrepaldi commented Nov 22, 2023

pytorchmergebot commented Nov 22, 2023

Add ONNXProgram.__call__ API to run model with ONNX Runtime #113495

Add ONNXProgram.__call__ API to run model with ONNX Runtime #113495

Conversation

thiagocrepaldi commented Nov 10, 2023 • edited

pytorch-bot bot commented Nov 10, 2023 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113495

✅ You can merge normally! (1 Unrelated Failure)

thiagocrepaldi commented Nov 15, 2023

BowenBao Nov 15, 2023

Choose a reason for hiding this comment

thiagocrepaldi Nov 16, 2023

Choose a reason for hiding this comment

BowenBao left a comment

Choose a reason for hiding this comment

thiagocrepaldi commented Nov 22, 2023

pytorchmergebot commented Nov 22, 2023

Merge started

Add ONNXProgram.call API to run model with ONNX Runtime #113495

Add ONNXProgram.call API to run model with ONNX Runtime #113495

thiagocrepaldi commented Nov 10, 2023 •

edited

pytorch-bot bot commented Nov 10, 2023 •

edited