
[Patch] Add loss for ORT inference #152

Merged
merged 15 commits into from Apr 29, 2022

Conversation

JingyaHuang
Collaborator

What does this PR do?

  • Wrap OnnxConfig with wrap_onnx_config_for_loss to obtain the loss when using ORTTrainer in the inference_with_ort=True mode.
  • Enable DeepSpeed for ONNX Runtime training. (Tested with ZeRO stage 2; full support is in progress.)
  • Clean up unused dependencies in ORTTrainer.
  • Update the CI of onnxruntime training.
  • Update the associated tests.
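The wrapping idea in the first bullet can be illustrated with a minimal sketch. This is not optimum's actual implementation; DummyBertOnnxConfig and the wrapper body are illustrative assumptions. The point is the pattern: the wrapper extends the base config's inputs with labels and its outputs with a loss, so the exported ONNX graph computes the loss itself and ORTTrainer can read it back from an InferenceSession.

```python
from collections import OrderedDict

class DummyBertOnnxConfig:
    """Stand-in for a task-specific OnnxConfig (illustration only)."""
    @property
    def inputs(self):
        return OrderedDict([("input_ids", {0: "batch", 1: "sequence"}),
                            ("attention_mask", {0: "batch", 1: "sequence"})])
    @property
    def outputs(self):
        return OrderedDict([("logits", {0: "batch"})])

class OnnxConfigWithLossSketch:
    """Wraps a base config so the exported graph also takes labels and emits a loss."""
    def __init__(self, base_config):
        self._base = base_config
    @property
    def inputs(self):
        inputs = OrderedDict(self._base.inputs)
        inputs["labels"] = {0: "batch"}   # labels are needed to compute the loss
        return inputs
    @property
    def outputs(self):
        outputs = OrderedDict(self._base.outputs)
        outputs["loss"] = {}              # scalar loss emitted by the graph
        return outputs

def wrap_onnx_config_for_loss(base_config):
    # Same name as the helper mentioned above, but this body is a sketch.
    return OnnxConfigWithLossSketch(base_config)

wrapped = wrap_onnx_config_for_loss(DummyBertOnnxConfig())
print(list(wrapped.outputs))  # ['logits', 'loss']
```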

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@lewtun
Member

lewtun commented Apr 26, 2022

This line is why the doc build is currently failing: https://github.com/huggingface/optimum/pull/152/files#diff-3e928ce0b52f617b86cd0df9399c6bbb5804d6269c6a04b486613eb929449256R25

Until now we never actually imported OnnxConfigWithLoss etc., and doing so triggers an import error because transformers is pinned to <4.17 in the doc build (so that both the intel and onnxruntime packages can live in the same env):

from optimum.onnxruntime.trainer import ORTTrainer

ImportError: cannot import name 'TensorType' from 'transformers.utils' (/home/lewis/miniconda3/envs/optimum/lib/python3.8/site-packages/transformers/utils/__init__.py)

The error arises because TensorType was moved from transformers.file_utils to transformers.utils in >=4.17

The solution is to refactor the doc build so that we build intel and onnxruntime in separate envs. Doing so will also allow us to build the Graphcore & other hardware partner docs as well. I don't have bandwidth for this right now, but happy to review a PR if someone else has time to tackle this!
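Besides splitting the envs, a common way to tolerate this kind of relocation (TensorType moving from transformers.file_utils to transformers.utils) is a fallback import. The helper below is a generic sketch, not part of optimum; the commented TensorType line shows how it would apply here.

```python
import importlib

def import_from_first(name, *module_paths):
    """Return attribute `name` from the first listed module that provides it."""
    for path in module_paths:
        try:
            module = importlib.import_module(path)
            return getattr(module, name)
        except (ImportError, AttributeError):
            continue  # try the next candidate module
    raise ImportError(f"cannot import {name!r} from any of {module_paths}")

# For the move described above (transformers >= 4.17 vs older):
# TensorType = import_from_first("TensorType",
#                                "transformers.utils",        # new location
#                                "transformers.file_utils")   # old location
```

The plain try/except form (`try: from transformers.utils import TensorType; except ImportError: from transformers.file_utils import TensorType`) works just as well; the helper only centralizes the pattern.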

@lewtun
Member

lewtun commented Apr 26, 2022

FYI @JingyaHuang if you want to test that the docs build locally you can run:

pip install -e '.[dev,intel,onnxruntime]'
pip install git+https://github.com/huggingface/doc-builder.git
doc-builder build optimum docs/source --build_dir test-docs --version v1.0.0 --clean

You'll need a Linux machine for this, since the intel extra cannot be installed on macOS.

Review threads (resolved) on: optimum/onnxruntime/trainer.py, tests/onnxruntime/test_onnxruntime_train.py
@echarlaix
Collaborator

This PR looks great!

@JingyaHuang JingyaHuang merged commit 1c4b5b1 into main Apr 29, 2022
@JingyaHuang JingyaHuang deleted the patch-trainer-loss branch April 29, 2022 21:24