Add the ability to choose the ONNX runtime execution provider in ORTModel
#137
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
Looks good to me, thanks for this addition!
```diff
@@ -130,12 +130,16 @@ class ModelArguments:
         default=None,
         metadata={"help": "Where do you want to store the pretrained models downloaded from huggingface.co"},
     )
+    ort_provider: str = field(
```
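The hunk is truncated at the new field. A hypothetical completion of that field, assuming a CPU default (the actual default value and help text in the PR may differ):

```python
from dataclasses import dataclass, field

@dataclass
class ModelArguments:
    # ... existing arguments such as cache_dir elided ...

    # Hypothetical completion of the truncated hunk above; the real
    # default and help string in the PR may differ.
    ort_provider: str = field(
        default="CPUExecutionProvider",
        metadata={"help": "ONNX Runtime execution provider used for inference."},
    )
```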
Not sure if `ort_provider` should be an attribute of `OptimizationArguments`.
Also, when `optimization_level > 1`, the optimized graph might be hardware-dependent: `optimize_for_gpu` and `ort_provider` should be set accordingly.
@echarlaix What do you think of removing the argument in the examples and simply using the following?

```python
if optim_args.opt_level > 1 and optim_args.optimize_for_gpu:
    execution_provider = "CUDAExecutionProvider"
else:
    execution_provider = "CPUExecutionProvider"
```
I think it could make sense to let the user select the execution provider, both to keep the same behavior as in the quantization examples and because the user might want to use `CUDAExecutionProvider` with an optimization level of 1.
I see, makes sense. In this case, you suggest keeping `ort_provider` as a parameter with `CPUExecutionProvider` as the default, while overwriting `optim_args.optimize_for_gpu` to `True` if `optimization_level > 1`? Is this what you were suggesting above?
My proposal is to let the user select the optimization parameters freely (and not overwrite anything), but to raise an error when those parameters are incompatible, in order to keep things as transparent and understandable as possible. What do you think?
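For illustration, a minimal sketch of what such incompatibility checks could look like, reusing the argument names from the snippet above (the exact conditions and error messages are assumptions, not the PR's actual code):

```python
# Sketch: reject provider/optimization combinations that would produce a
# graph the chosen execution provider cannot run. Names are assumptions.
if optim_args.opt_level > 1:
    if optim_args.optimize_for_gpu and model_args.ort_provider == "CPUExecutionProvider":
        raise ValueError(
            "opt_level > 1 with optimize_for_gpu=True produces a GPU-specific graph; "
            "use CUDAExecutionProvider instead of CPUExecutionProvider."
        )
    if not optim_args.optimize_for_gpu and model_args.ort_provider == "CUDAExecutionProvider":
        raise ValueError(
            "opt_level > 1 with optimize_for_gpu=False produces a CPU-specific graph; "
            "set optimize_for_gpu=True to run it with CUDAExecutionProvider."
        )
```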
Agreed. As you saw, I added the following checks: https://github.com/fxmarty/optimum/blob/638e80e2bd8e86630162a964f98ae51d771fb407/examples/onnxruntime/optimization/text-classification/run_glue.py#L202-L222. I hope it is not too heavy.
It looks great, thanks for the addition!
What does this PR do?

A small PR to add the ability to choose the ONNX Runtime execution provider in `ORTModel`. I did not change the `README.md` for readability. Also, I added the `ort_provider` parameter to `OptimizationArguments` in the examples, which is maybe suboptimal.
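For background, choosing an execution provider in plain ONNX Runtime looks like the sketch below; presumably the new `ort_provider` argument is forwarded to the underlying `InferenceSession` in some such way (the forwarding detail and the model path are assumptions):

```python
import onnxruntime as ort

# List the providers available in this onnxruntime build, e.g.
# ["CUDAExecutionProvider", "CPUExecutionProvider"] for a GPU build.
print(ort.get_available_providers())

# Create a session pinned to a preferred provider; onnxruntime falls back
# to the next provider in the list if the first one is unavailable.
# "model.onnx" is a placeholder path.
session = ort.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
```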