Skip to content

[mxfp8 moe training] mxfp8 a2a_dispatch autograd function#3579

Merged
danielvegamyhre merged 1 commit intomainfrom
danielvegamyhre/stack/103
Jan 9, 2026
Merged

[mxfp8 moe training] mxfp8 a2a_dispatch autograd function#3579
danielvegamyhre merged 1 commit intomainfrom
danielvegamyhre/stack/103

Conversation

@danielvegamyhre
Copy link
Copy Markdown
Contributor

@danielvegamyhre danielvegamyhre commented Jan 6, 2026

@pytorch-bot
Copy link
Copy Markdown

pytorch-bot bot commented Jan 6, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3579

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 9172461 with merge base 8d47813 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 6, 2026
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/103 branch from 571e134 to a02364f Compare January 6, 2026 22:37
@danielvegamyhre danielvegamyhre added mx topic: new feature Use this tag if this PR adds a new feature moe labels Jan 6, 2026
@danielvegamyhre danielvegamyhre marked this pull request as draft January 6, 2026 23:28
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/103 branch from a02364f to e981b56 Compare January 6, 2026 23:28
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 6, 2026 23:29
@danielvegamyhre danielvegamyhre marked this pull request as draft January 6, 2026 23:43
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 6, 2026 23:44
@danielvegamyhre danielvegamyhre marked this pull request as draft January 6, 2026 23:47
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 6, 2026 23:48
@danielvegamyhre danielvegamyhre marked this pull request as draft January 6, 2026 23:57
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 6, 2026 23:58
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 00:07
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 00:08
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 00:20
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 00:21
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 00:28
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 00:29

class _A2ADispatch(torch.autograd.Function):
"""
All-to-all dispatch with MXFP8 quantization in forward.
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

object name should reflect this

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated

return grad_input, None, None, None, None, None


def a2a_dispatch(
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

function name should explain the quantization part

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated

stack-info: PR: #3579, branch: danielvegamyhre/stack/103
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 18:27
@danielvegamyhre danielvegamyhre force-pushed the danielvegamyhre/stack/103 branch from e981b56 to 9172461 Compare January 7, 2026 18:27
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 18:28
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 18:59
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 19:00
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 21:57
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 21:57
@danielvegamyhre danielvegamyhre marked this pull request as draft January 7, 2026 22:07
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 7, 2026 22:07
@danielvegamyhre danielvegamyhre marked this pull request as draft January 8, 2026 01:29
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 8, 2026 01:30
@danielvegamyhre danielvegamyhre marked this pull request as draft January 9, 2026 04:35
@danielvegamyhre danielvegamyhre marked this pull request as ready for review January 9, 2026 04:36
Copy link
Copy Markdown
Contributor

@vkuzo vkuzo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

didn't read closely, stamp for prototype

@danielvegamyhre danielvegamyhre merged commit 575d4bb into main Jan 9, 2026
21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. moe mx topic: new feature Use this tag if this PR adds a new feature

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants