[export] Initial deserialization v2 #102716
Conversation
🔗 Helpful Links: see artifacts and rendered test results at hud.pytorch.org/pr/102716
Note: links to docs will display an error until the doc builds have completed.
❌ 3 New Failures as of commit bf91008.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Summary: v2 of #102125 because of git issues; corresponding deserialization diff: #102716.

Implementing serialization of the exported program to a Python dataclass, and then from that dataclass to JSON. This is split into a couple of sections:

- `serialize(ep: ep.ExportedProgram, opset_version: Dict[str, int]) -> Tuple[bytes, bytes]` -- takes an exported program object and a dictionary mapping opset namespaces to versions, and returns the serialized exported program in bytes, and separately the state dict serialized in bytes
- `GraphModuleSerializer` class that serializes torch.fx.GraphModule to the schema.GraphModule dataclass
- `ExportedProgramSerializer` class that serializes torch._export.exported_program.ExportedProgram to the schema.ExportedProgram dataclass

Serialization TODOs:
- [x] pytree spec: #102577
- [ ] higher order ops
- [ ] node metadata (specifically nn_module_stack/source_fn)
- [ ] constraints
- [ ] graph module metadata

The tests are not super comprehensive, but that's because I think it'll be better tested, and easier to test, once deserialization is implemented.

Pull Request resolved: #102707
Reviewed By: zhxchen17
Differential Revision: D46362466
Pulled By: angelayi
fbshipit-source-id: 1d3fc157a7a5c2e615dbcc7f0e87d76f2f4c43ed
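The two-stage pipeline the summary describes (dataclass, then JSON) can be sketched as follows. This is a minimal stand-in, not the actual `torch._export` implementation; the `GraphModule`/`ExportedProgram` classes and the body of `serialize` here are assumptions for illustration only, with the real schema living in the PR's `schema` module.

```python
import dataclasses
import json
from typing import Dict, Tuple

# Stand-in for schema.GraphModule (hypothetical; real schema is richer).
@dataclasses.dataclass
class GraphModule:
    nodes: list

# Stand-in for schema.ExportedProgram (hypothetical).
@dataclasses.dataclass
class ExportedProgram:
    graph_module: GraphModule

def serialize(ep: ExportedProgram, opset_version: Dict[str, int]) -> Tuple[bytes, bytes]:
    # Stage 1: exported program -> plain dict via the dataclass layer.
    payload = dataclasses.asdict(ep)
    payload["opset_version"] = opset_version
    # Stage 2: dict -> JSON bytes.
    serialized_ep = json.dumps(payload).encode("utf-8")
    state_dict_bytes = b""  # the state dict is serialized separately
    return serialized_ep, state_dict_bytes

ep = ExportedProgram(GraphModule(nodes=["add", "relu"]))
ep_bytes, sd_bytes = serialize(ep, {"aten": 11})
assert json.loads(ep_bytes)["opset_version"] == {"aten": 11}
```

Returning the program bytes and state-dict bytes as separate values matches the signature given in the summary, which keeps (potentially large) tensor data out of the JSON payload.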
ret["stack_trace"] = stack_trace
# Need an explicit None check instead of walrus operator, because
# module_fqn can be the empty string if the node belongs to the root.
# The walrus operator returns False on an empty string :(
lol, I don't like the walrus operator anyway. Too many falsy values, including empty lists and dicts as well...
module_fqn = metadata.get("module_fqn")
if module_fqn is not None:
    ret["module_fqn"] = module_fqn
# TODO(angelayi) add nn_module_stack and source_fn
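The pitfall the comment describes can be shown directly (a minimal sketch, not code from the PR): with a walrus operator in the condition, an empty-string `module_fqn` is falsy and gets dropped, while the explicit `None` check preserves it.

```python
metadata = {"module_fqn": ""}  # node belongs to the root module

# Walrus in the condition: "" is falsy, so the root FQN is silently lost.
ret = {}
if module_fqn := metadata.get("module_fqn"):
    ret["module_fqn"] = module_fqn
assert "module_fqn" not in ret

# Explicit None check, as in the diff: the empty string survives.
ret = {}
module_fqn = metadata.get("module_fqn")
if module_fqn is not None:
    ret["module_fqn"] = module_fqn
assert ret["module_fqn"] == ""
```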
Are there existing tests for `nn_module_stack` preservation? Not here, but for the future: it might be sweet to run an existing set of tests through a serde roundtrip "automatically," say by wrapping `export` with `export` + `serde` via a test-only decorator.
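The test-only decorator idea above could look something like this sketch. The `serialize`/`deserialize` functions here are trivial JSON stand-ins (assumptions, not the PR's actual serde), and `with_serde_roundtrip` is a hypothetical name; the point is only the wrapping pattern, where every exported program is forced through a roundtrip before tests see it.

```python
import functools
import json

# Stand-ins for the PR's serialize/deserialize (assumption for illustration).
def serialize(obj):
    return json.dumps(obj).encode("utf-8")

def deserialize(data):
    return json.loads(data)

def with_serde_roundtrip(export_fn):
    """Test-only decorator: every export result must survive serde."""
    @functools.wraps(export_fn)
    def wrapper(*args, **kwargs):
        ep = export_fn(*args, **kwargs)
        # Any metadata (e.g. nn_module_stack) dropped by serde would
        # surface as failures in the existing test suite.
        return deserialize(serialize(ep))
    return wrapper

@with_serde_roundtrip
def export(model):
    # Hypothetical export returning a JSON-friendly program description.
    return {"graph": ["add"], "model": model}

assert export("m") == {"graph": ["add"], "model": "m"}
```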
Yup, I have an upcoming diff once this lands that checks for it.
I feel we should leave the symbolic int parts out of the scope of this diff.
For example, we try to deserialize sym ints and bools from the serialized buffer, but I don't think we have a clear plan for serializing them yet. Another example: we're trying to serialize "_operator" ops, but I would simply leave those out until we're clearer on what to do with them next.
clicked twice
After discussing offline, I think we can move forward for now once all the comments are addressed. Next time, please consider splitting up one big PR that touches different parts.
@pytorchbot merge -f "failures appear on master"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
v2 of #102126, mentally stacked on top of #102707.
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10