[PT-D][Checkpoint] Rename DCP storage layer init() #92869

Closed
wz337 wants to merge 1 commit

wz337 (Contributor) commented Jan 23, 2023

Rename DCP storage layer init() and update tests accordingly.

pytorch-bot (bot) commented Jan 23, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/92869

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures

As of commit f0975e1:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

wz337 marked this pull request as ready for review on January 23, 2023, 23:33
wz337 requested review from kumpera and removed request for kumpera on January 23, 2023, 23:35

kumpera (Contributor) left a comment

LGTM

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/92869/head returned non-zero exit code 1

Rebasing (1/2)
Auto-merging torch/distributed/checkpoint/filesystem.py
Auto-merging torch/distributed/checkpoint/state_dict_loader.py
CONFLICT (content): Merge conflict in torch/distributed/checkpoint/state_dict_loader.py
Auto-merging torch/distributed/checkpoint/state_dict_saver.py
CONFLICT (content): Merge conflict in torch/distributed/checkpoint/state_dict_saver.py
error: could not apply 0dc155b2c0... rename storage init()
hint: Resolve all conflicts manually, mark them as resolved with
hint: "git add/rm <conflicted_files>", then run "git rebase --continue".
hint: You can instead skip this commit: run "git rebase --skip".
hint: To abort and get back to the state before "git rebase", run "git rebase --abort".
Could not apply 0dc155b2c0... rename storage init()

Raised by https://github.com/pytorch/pytorch/actions/runs/3998826624

wz337 (Contributor, Author) commented Jan 24, 2023

@pytorchmergebot merge

pytorch-bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) on Jan 24, 2023

pytorchmergebot (Collaborator) commented

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

pytorchmergebot (Collaborator) commented

Merge failed

Reason: 1 job has failed: trunk / win-vs2019-cuda11.6-py3 / test (default, 1, 5, windows.g5.4xlarge.nvidia.gpu)

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/92869/head returned non-zero exit code 1

Raised by https://github.com/pytorch/pytorch/actions/runs/4001553632

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/92869/head returned non-zero exit code 1

Raised by https://github.com/pytorch/pytorch/actions/runs/4001584873

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/92869/head returned non-zero exit code 1

Raised by https://github.com/pytorch/pytorch/actions/runs/4003351680

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/92869/head returned non-zero exit code 1

Raised by https://github.com/pytorch/pytorch/actions/runs/4008219597

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot merge

pytorchmergebot (Collaborator) commented

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

pytorchmergebot (Collaborator) commented

Merge failed

Reason: 1 mandatory check failed (Rule Distributed).

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

pytorchmergebot (Collaborator) commented

@pytorchbot successfully started a rebase job.

pytorchmergebot (Collaborator) commented

Tried to rebase and push PR #92869, but it was already up to date

wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot merge -f "unrelated test failures https://github.com/pytorch/pytorch/actions/runs/4008358242/jobs/6882601831"

pytorchmergebot (Collaborator) commented

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Labels: ciflow/trunk (Trigger trunk jobs on your pull request), Merged
3 participants