Skip to content

Conversation

@fegin
Copy link
Contributor

@fegin fegin commented Nov 10, 2025

Stack from ghstack (oldest at bottom):

It is not correct as JobConfig has changed.

[ghstack-poisoned]
[ghstack-poisoned]
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 10, 2025
@fegin fegin mentioned this pull request Nov 10, 2025
Copy link
Contributor

@tianyu-l tianyu-l left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please fix typo

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@fegin fegin changed the base branch from gh/fegin/28/base to main November 11, 2025 17:12
@fegin fegin merged commit 11d73a2 into main Nov 11, 2025
7 checks passed
fegin added a commit that referenced this pull request Nov 11, 2025
Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0)
(oldest at bottom):
* __->__ #2012
* #2011

Summary:
The current configuration validation requires torchx and GPUs. It can
waste time, resources, ane engery. Polar bears are crying. Let's fix
this by providing a dry run mode. This PR doesn't verify everything. In
theory, we should be able to verify parallelisms settings as well. This
PR is just a start but it at least can let us catch the typos quickly.
ahoffman-aws pushed a commit to drcanchi-aws/torchtitan that referenced this pull request Nov 11, 2025
Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0)
(oldest at bottom):
* pytorch#2012
* __->__ pytorch#2011

It is not correct as JobConfig has changed.
ahoffman-aws pushed a commit to drcanchi-aws/torchtitan that referenced this pull request Nov 11, 2025
Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0)
(oldest at bottom):
* __->__ pytorch#2012
* pytorch#2011

Summary:
The current configuration validation requires torchx and GPUs. It can
waste time, resources, ane engery. Polar bears are crying. Let's fix
this by providing a dry run mode. This PR doesn't verify everything. In
theory, we should be able to verify parallelisms settings as well. This
PR is just a start but it at least can let us catch the typos quickly.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants