Skip to content

Conversation

ankitageorge
Copy link
Contributor

@ankitageorge ankitageorge commented Jun 13, 2025

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Jun 13, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155940

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 33 Pending

As of commit 9696fce with merge base 08dae94 (image):

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added oncall: distributed Add this issue/PR to distributed oncall triage queue release notes: distributed (checkpoint) labels Jun 13, 2025
ankitageorge added a commit that referenced this pull request Jun 13, 2025
Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

ghstack-source-id: 290317312
Pull Request resolved: #155940
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D76602613

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Pull Request resolved: #155940


ghstack-source-id: 292259595
@exported-using-ghexport

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D76602613

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Pull Request resolved: #155940


ghstack-source-id: 292339908
@exported-using-ghexport

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D76602613

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Pull Request resolved: #155940


ghstack-source-id: 292340529
@exported-using-ghexport

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D76602613

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Pull Request resolved: #155940


ghstack-source-id: 292346941
@exported-using-ghexport

Differential Revision: [D76602613](https://our.internmc.facebook.com/intern/diff/D76602613/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D76602613

ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

ghstack-source-id: 292353372
Pull Request resolved: #156705
@ankitageorge
Copy link
Contributor Author

closing b/c this got into a bad detached state and re-creating in #156705

ankitageorge added a commit that referenced this pull request Jun 24, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 24, 2025
…h step

Pull Request resolved: #156705

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state
ghstack-source-id: 292356884
@exported-using-ghexport

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…h step

Pull Request resolved: #156705

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state
ghstack-source-id: 293476800
@exported-using-ghexport

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…es to full tensors in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…es to full tensors in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…h step

Pull Request resolved: #156705

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state
ghstack-source-id: 293491492
@exported-using-ghexport

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…es to full tensors in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…h step

Pull Request resolved: #156705

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state
ghstack-source-id: 293497464
@exported-using-ghexport

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…es to full tensors in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jun 30, 2025
…h step

Pull Request resolved: #156705

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state
ghstack-source-id: 293520075
@exported-using-ghexport

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)
pytorchmergebot pushed a commit that referenced this pull request Jul 1, 2025
…h step (#156705)

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

Pull Request resolved: #156705
Approved by: https://github.com/saumishr
ghstack dependencies: #154743
ankitageorge added a commit that referenced this pull request Jul 1, 2025
…es to full tensors in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
ankitageorge added a commit that referenced this pull request Jul 1, 2025
…rs in finish step"

Title - we can consolidate the shards to a full tensors, optionally behind a flag, in the finish step of DCP.save
also adds the thread count argument which is configurable for users, before we were just using the default of 1.
Re-creating #155940 bc it got into a bad detached state

Differential Revision: [D77231774](https://our.internmc.facebook.com/intern/diff/D77231774/)

cc H-Huang awgu wanchaol fegin fduwjj wz337 wconstab d4l3k

[ghstack-poisoned]
@github-actions github-actions bot deleted the gh/ankitageorge/8/head branch July 25, 2025 02:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

fb-exported oncall: distributed Add this issue/PR to distributed oncall triage queue release notes: distributed (checkpoint)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants