[Checkpoint] Update docstring for DCP `save_state_dict` and `load_state_dict` #91209

wz337 · 2022-12-20T22:37:00Z

As title.

pytorch-bot · 2022-12-20T22:37:02Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/91209

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 6f78cc0:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

fduwjj

LGTM. I left some comments.

fduwjj · 2022-12-21T21:35:05Z

torch/distributed/checkpoint/state_dict_loader.py

@@ -22,34 +22,39 @@ def load_state_dict(
    planner: LoadPlanner = None,
 ) -> None:
    """
-    Load a distributed state_dict in SPMD style.
+    Loads a distributed state_dict in SPMD style.


Nit: ``state_dict``

fduwjj · 2022-12-21T21:35:43Z

torch/distributed/checkpoint/state_dict_loader.py

-
-    When loading ShardedTensor instances, each rank only
-    reads data for their local shards.
+    to fullfill the requested `state_dict`. When loading ShardedTensor


:class:`ShardedTensor`
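
Both nits above ask for Sphinx/reStructuredText markup in the docstring: double backticks render `state_dict` as an inline literal, and the `:class:` role turns `ShardedTensor` into a cross-reference. As a minimal sketch of what the suggested markup looks like in a docstring, the stub below uses a hypothetical function name (`load_example`) and illustrative wording; it is not the text that was merged in this PR.

```python
def load_example(state_dict: dict) -> None:
    """
    Loads a distributed ``state_dict`` in SPMD style.

    When loading :class:`ShardedTensor` instances, each rank only
    reads data for its local shards.
    """
    # Hypothetical stub: only the docstring markup matters here,
    # so the body intentionally does nothing.
    return None


# The raw docstring keeps the markup verbatim; Sphinx interprets it
# only when the documentation is built.
doc = load_example.__doc__
print("``state_dict``" in doc)
print(":class:`ShardedTensor`" in doc)
```

Single backticks, as in the original diff line, are rendered by Sphinx with the default role, which is why reviewers often ask for the explicit literal or `:class:` form.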

wz337 · 2022-12-22T06:40:35Z

@pytorchmergebot merge

pytorchmergebot · 2022-12-22T06:42:21Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2022-12-22T07:07:35Z

Merge failed

Reason: 3 additional jobs have failed, first few of them are: trunk, trunk / linux-focal-rocm5.3-py3.8 / test (default, 1, 2, linux.rocm.gpu), trunk / linux-focal-rocm5.3-py3.8 / test (default, 2, 2, linux.rocm.gpu)

Details for Dev Infra team

Raised by workflow job

wz337 · 2022-12-22T17:34:00Z

@pytorchmergebot rebase

pytorchmergebot · 2022-12-22T17:35:45Z

@pytorchbot successfully started a rebase job. Check the current status here

pytorchmergebot · 2022-12-22T17:35:50Z

Successfully rebased update_state_dict_saver_loader_docstring onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout update_state_dict_saver_loader_docstring && git pull --rebase)

wz337 · 2022-12-22T17:50:22Z

@pytorchmergebot merge

pytorchmergebot · 2022-12-22T17:53:01Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

…state_dict`` (pytorch#91209) As title. Pull Request resolved: pytorch#91209 Approved by: https://github.com/fduwjj

wz337 marked this pull request as ready for review December 20, 2022 22:37

wz337 requested review from mrshenli, zhaojuanmao, pritamdamania87, rohan-varma, H-Huang, awgu, kwen2501 and wanchaol as code owners December 20, 2022 22:37

wz337 requested a review from fduwjj December 20, 2022 22:37

wz337 changed the title ~~[Checkpoint] Update docstring for save_state_dict and load_state_dict~~ [Checkpoint] Update docstring for DCP save_state_dict and load_state_dict Dec 20, 2022

fduwjj approved these changes Dec 21, 2022


pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 22, 2022

wz337 added 4 commits December 22, 2022 17:35

update docstring

bae1d73

update docstring

2f01e93

add returns

ae2b2f5

address nit

6f78cc0

pytorchmergebot added the Merged label Dec 22, 2022

pytorchmergebot closed this in 0149467 Dec 22, 2022
