Skip to content

fix CI#3548

Merged
vkuzo merged 4 commits intomainfrom
gh/vkuzo/192/head
Dec 26, 2025
Merged

fix CI#3548
vkuzo merged 4 commits intomainfrom
gh/vkuzo/192/head

Conversation

@vkuzo
Copy link
Copy Markdown
Contributor

@vkuzo vkuzo commented Dec 26, 2025

Summary:

pytorch/pytorch#168140 added a new op,
fixing various training callsites to properly handle it

Test Plan:

./test/float8/test_dtensor.sh
with-proxy python test/test_low_bit_optim.py TestFSDP2.test_fsdp2

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
@vkuzo
Copy link
Copy Markdown
Contributor Author

vkuzo commented Dec 26, 2025

Stack from ghstack (oldest at bottom):

vkuzo added a commit that referenced this pull request Dec 26, 2025
Summary:

pytorch/pytorch#168140 added a new op,
fixing various training callsites to properly handle it

Test Plan:

```
./test/float8/test_dtensor.sh
with-proxy python test/test_low_bit_optim.py TestFSDP2.test_fsdp2
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 8b1fbb1
ghstack-comment-id: 3692922247
Pull-Request: #3548
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot bot commented Dec 26, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3548

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit f3cee71 with merge base a2b4750 (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 26, 2025
@vkuzo vkuzo added the topic: bug fix Use this tag for PRs that fix bugs label Dec 26, 2025
[ghstack-poisoned]
vkuzo added a commit that referenced this pull request Dec 26, 2025
Summary:

pytorch/pytorch#168140 added a new op,
fixing various training callsites to properly handle it

Test Plan:

```
./test/float8/test_dtensor.sh
with-proxy python test/test_low_bit_optim.py TestFSDP2.test_fsdp2
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 2e48744
ghstack-comment-id: 3692922247
Pull-Request: #3548
[ghstack-poisoned]
vkuzo added a commit that referenced this pull request Dec 26, 2025
Summary:

pytorch/pytorch#168140 added a new op,
fixing various training callsites to properly handle it

Test Plan:

```
./test/float8/test_dtensor.sh
with-proxy python test/test_low_bit_optim.py TestFSDP2.test_fsdp2
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: f0fe934
ghstack-comment-id: 3692922247
Pull-Request: #3548
[ghstack-poisoned]
vkuzo added a commit that referenced this pull request Dec 26, 2025
Summary:

pytorch/pytorch#168140 added a new op,
fixing various training callsites to properly handle it

Test Plan:

```
./test/float8/test_dtensor.sh
with-proxy python test/test_low_bit_optim.py TestFSDP2.test_fsdp2
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 93c11ac
ghstack-comment-id: 3692922247
Pull-Request: #3548
_test_mxfp8_mlp_tensor_parallelism_dim1_triton,
]
if is_sm_at_least_100():
tests.append(_test_mxfp8_mlp_tensor_parallelism_dim1_triton)
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not related to this PR, but also needed to fix 4xh100 CI

@vkuzo
Copy link
Copy Markdown
Contributor Author

vkuzo commented Dec 26, 2025

need to also fix nf4 tests

@vkuzo
Copy link
Copy Markdown
Contributor Author

vkuzo commented Dec 26, 2025

we can do that in a future PR

@vkuzo vkuzo merged commit 415e0e8 into main Dec 26, 2025
56 of 60 checks passed
@vkuzo vkuzo mentioned this pull request Dec 29, 2025
vkuzo added a commit that referenced this pull request Dec 29, 2025
Summary:

Same as #3548 for one more callsite.

Test Plan:

```bash
with-proxy python test/dtypes/test_nf4.py TestComm.test_comm
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: eb0ce84
ghstack-comment-id: 3696758775
Pull-Request: #3555
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/4xh100 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: bug fix Use this tag for PRs that fix bugs

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants