
[ONNX] ONNX doesn't support exporting models containing non-persistent buffers in FakeMode #107211

Closed
titaiwangms opened this issue Aug 15, 2023 · 3 comments
Labels
module: onnx Related to torch.onnx onnx-triaged triaged by ONNX team triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@titaiwangms
Collaborator

titaiwangms commented Aug 15, 2023

To avoid out-of-memory issues while exporting models to ONNX, we need to detach the parameters and persistent buffers via state_dict().

  # Create the toy model with real weight.
  real_model = create_model()

  with tempfile.NamedTemporaryFile(
      prefix=model_name, suffix=".pt"
  ) as tmp_checkpoint_file:
      # Dump state_dict to a file to simulate how HuggingFace model is initialized.
      # The file will be loaded via .load_state_dict(...)
      state_dict = real_model.state_dict()
      torch.save(state_dict, tmp_checkpoint_file.name)

      with torch.onnx.enable_fake_mode() as fake_context:
          fake_args = create_args()
          fake_kwargs = create_kwargs()
          fake_model = create_model()
          if load_checkpoint_during_init:
              fake_model.load_state_dict(torch.load(tmp_checkpoint_file.name))

          # Export the model with fake inputs and parameters
          export_options = torch.onnx.ExportOptions(
              dynamic_shapes=self.dynamic_shapes,
              op_level_debug=self.op_level_debug,
              fake_context=fake_context,
          )

          export_output = torch.onnx.dynamo_export(
              fake_model,
              *fake_args,
              **fake_kwargs,
              export_options=export_options,
          )

However, some models, such as GPT2, contain non-persistent buffers, which cannot be detached via state_dict(). Consequently, the ONNX graph complains about the missing buffers, because they are absent from the external data holding the model initializers. This case can be reproduced when we use Config in create_model().
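The root of the problem can be seen in a minimal sketch (the module and buffer names below are made up for illustration): a buffer registered with persistent=False is tracked by the module but excluded from state_dict(), so a checkpoint round-trip loses it.

```python
import torch

class Toy(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # Persistent buffer: saved in state_dict() and restorable from a checkpoint.
        self.register_buffer("persistent_buf", torch.ones(2))
        # Non-persistent buffer: tracked by the module, but NOT in state_dict().
        self.register_buffer("cached_buf", torch.zeros(2), persistent=False)

m = Toy()
print(sorted(m.state_dict().keys()))          # ['persistent_buf']
print(sorted(dict(m.named_buffers()).keys())) # ['cached_buf', 'persistent_buf']
```

Because the fake-mode export pipeline recovers real weights from the saved state_dict(), cached_buf has no real counterpart to fall back on, which matches the missing-initializer complaint above.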

cc @BowenBao @thiagocrepaldi @wschin

@titaiwangms titaiwangms added module: onnx Related to torch.onnx onnx-triaged triaged by ONNX team labels Aug 15, 2023
titaiwangms added a commit that referenced this issue Aug 15, 2023
1. Add a list of HF models to CI tests. The PR intends to build them from Config, but some of them are not supported with Config. NOTE: Loading from a pre-trained model could hit the [uint8/bool conflict](huggingface/transformers#21013) when a newer version of transformers is used.
    - Dolly has torch.fx.Node in an OnnxFunction attribute, which is currently not supported.
    - Falcon and MPT contain user code unsupported by Dynamo.
2. Only update GPT2 to export with real tensors from Config, as FakeMode raises unequal-input errors between PyTorch and ORT. The reason is that [non-persistent buffers are not supported](#107211)

[ghstack-poisoned]
@soulitzer soulitzer added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Aug 17, 2023
@titaiwangms titaiwangms linked a pull request Aug 17, 2023 that will close this issue
@titaiwangms titaiwangms self-assigned this Aug 18, 2023
pytorchmergebot pushed a commit that referenced this issue Aug 23, 2023
Pull Request resolved: #107247
Approved by: https://github.com/wschin, https://github.com/BowenBao
@titaiwangms
Collaborator Author

titaiwangms commented Aug 24, 2023

Brainstorming on this: an alternative might be to randomly generate the buffer, as its exact values are not crucial.
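The brainstormed workaround could look like the following sketch; the helper name and the use of torch.rand are assumptions for illustration, not an actual implementation. Any registered buffer missing from state_dict() must be non-persistent, so after load_state_dict() it can be refilled with placeholder data of the right shape, dtype, and device.

```python
import torch

def refill_non_persistent_buffers(module: torch.nn.Module) -> None:
    # Hypothetical helper: non-persistent buffers never appear in
    # state_dict(), so any named buffer absent from it is refilled here.
    persistent_keys = set(module.state_dict().keys())
    for name, buf in module.named_buffers():
        if name in persistent_keys:
            continue
        # Exact values are assumed not to be crucial; random data with the
        # original shape/dtype/device is enough to populate the export.
        *path, attr = name.split(".")
        owner = module
        for part in path:
            owner = getattr(owner, part)
        new_buf = torch.rand(buf.shape, device=buf.device).to(buf.dtype)
        owner.register_buffer(attr, new_buf, persistent=False)
```

Re-registering with persistent=False keeps the buffer out of future state_dict() calls, so the checkpoint format is unchanged.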

@thiagocrepaldi
Collaborator

@titaiwangms IIRC this should work with the torch.export.export approach, as the non-persistent buffers would be passed in as inputs, right?

@titaiwangms
Collaborator Author

Based on your PR #115380, it does look like this issue is gone with ExportedProgram. I think we can track it with your issue #115745.

@titaiwangms titaiwangms closed this as not planned (won't fix, can't repro, duplicate, stale) Jan 9, 2024