[ONNX] Add huggingface models into CI tests #107247
Conversation
[ONNX] Add transformers models into no runtime test of fx exporter [ghstack-poisoned]
[ONNX] Add transformers models into no runtime test of fx exporter ghstack-source-id: 01663e84842d7dd46fd7a72154ea1b2d5bb655d7 Pull Request resolved: #107247
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/107247
Note: Links to docs will display an error until the docs builds have been completed. ✅ 1 Unrelated Failure. As of commit 487fa54 with merge base a4eae43: UNSTABLE - the following job failed, but was likely due to flakiness present on trunk and has been marked as unstable.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
1. Add a list of HF models to CI tests. The PR intends to build them from Config, but some of them are not supported with Config. NOTE: Loading from a pre-trained model could hit a [uint8/bool conflict](huggingface/transformers#21013) when a newer version of transformers is used. - Dolly has a torch.fx.Node in an OnnxFunction attribute, which is currently not supported. - Falcon and MPT contain user code that Dynamo does not support. 2. Only GPT2 is updated to build from Config and export with real tensors, as FakeMode raises unequal-input errors between PyTorch and ORT. The reason is that [non-persistent buffers are not supported](#107211). [ghstack-poisoned]
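As a rough sketch of the Config-based construction the description refers to (the model class and config sizes here are illustrative assumptions, not the PR's exact test code), building from a Config initializes random weights and avoids downloading pretrained checkpoints in CI:

```python
# Hypothetical sketch: build a small HF model from a Config rather than
# from pretrained weights, so CI does not depend on hub downloads.
import transformers


def create_model_from_config():
    # Illustrative tiny GPT-2 sizes; the real tests may use other values.
    config = transformers.GPT2Config(
        n_layer=2, n_head=2, n_embd=64, vocab_size=1024
    )
    # from_config initializes random weights; no network access is needed.
    return transformers.AutoModelForCausalLM.from_config(config)
```

Models like Dolly, Falcon, and MPT fall outside this path for the reasons listed above, which is why the PR cannot build every entry this way.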
# TODO: From Config/Model
@pytorch_test_common.skip_in_ci(
    "Skip this test in CI because of memory issue. "
    "SymFloat in OnnxFunction attribute is not supported yet."
)
def create_kwargs(model_name=model_name):
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    return tokenizer("Hello world!", return_tensors="pt")
In _test_fake_tensor_mode_exporter, the exported ONNX model and the PyTorch model are run and their outputs are compared. However, I am wondering whether that really checks dynamic_shapes. If not, probably in the next (or second next) PR, let's modify create_args, create_kwargs, and test_fake_tensor_mode_exporter to verify that dynamic_shapes supports different batch and sequence sizes.
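One way to exercise that (a hypothetical stdlib-only sketch, with nested lists standing in for torch tensors) is to produce two input batches whose batch and sequence sizes differ, and run the exported model on both:

```python
import random


def create_dynamic_batches(vocab_size=1000):
    # Hypothetical helper: two "input_ids" batches with different batch
    # and sequence sizes. Running the exported model on both would show
    # whether dynamic_shapes really holds, rather than only testing the
    # shapes the model was exported with.
    def batch(batch_size, seq_len):
        return {
            "input_ids": [
                [random.randrange(vocab_size) for _ in range(seq_len)]
                for _ in range(batch_size)
            ]
        }

    return batch(1, 8), batch(4, 32)
```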
There is an assert_dynamic_shapes check inside _test_fake_tensor_mode_exporter that should address your concern.
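For context on what such an assertion checks: in ONNX, an input dimension is dynamic when it is recorded as a symbolic dim_param (a string) instead of a fixed dim_value (an int). A stdlib-only sketch of the idea (the real assert_dynamic_shapes operates on the exported onnx.ModelProto, not on plain lists):

```python
def assert_dynamic_shapes_sketch(input_dims):
    # Each entry in input_dims is the dim list of one graph input; a str
    # models ONNX's symbolic dim_param, an int models a fixed dim_value.
    dynamic = [d for dims in input_dims for d in dims if isinstance(d, str)]
    assert dynamic, "expected at least one symbolic (dynamic) dimension"
    return dynamic
```

For example, a GPT-2 input_ids exported with dynamic shapes would look like `[["batch_size", "seq_len"]]` and pass, while a fully static `[[1, 8]]` would fail the assertion.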
Good first steps toward testing important HF models. More work is required, especially for testing dynamic_shapes=True + FakeTensorMode, but that can be done in another PR.
Remember to create issues against the proper repos (pytorch, hf, onnx, or ort) for the errors/problems encountered during the test.
def create_model() -> nn.Module:
    return transformers.AutoModel.from_pretrained(model_name).to(device).eval()
This still needs to be cached though, right?
Oops. Added back the tiny-gpt2 cache.
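The shape of such a cache (a hypothetical sketch; the actual tests may cache differently, e.g. via HF's on-disk hub cache) is to wrap the expensive loader so repeated test cases reuse one instance instead of re-downloading or re-instantiating the model:

```python
import functools


def make_cached_loader(load_fn):
    # Hypothetical sketch of a tiny-gpt2-style cache: load_fn stands in
    # for an expensive loader such as AutoModel.from_pretrained. The
    # counter only exists to make the caching observable.
    calls = {"count": 0}

    @functools.cache
    def loader(name):
        calls["count"] += 1
        return load_fn(name)

    loader.calls = calls
    return loader
```

Calling the returned loader twice with the same model name returns the same object and invokes the underlying loader only once.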
LGTM, with a comment on tests with the pretrained model and CI caching.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):