Skip to content

tiled mlp: auto-monkeypatch#290

Merged
sfc-gh-sbekman merged 4 commits intomainfrom
stas/auto_tiled_mlp2
Oct 9, 2025
Merged

tiled mlp: auto-monkeypatch#290
sfc-gh-sbekman merged 4 commits intomainfrom
stas/auto_tiled_mlp2

Conversation

@sfc-gh-sbekman
Copy link
Copy Markdown
Collaborator

@sfc-gh-sbekman sfc-gh-sbekman commented Oct 6, 2025

For TiledMLP

  1. auto-monkey patch - borrowing from @winglian's code
    https://github.com/axolotl-ai-cloud/axolotl/blob/ce74c20109d60df4cb023254f3a58b80b6a4cfc8/src/axolotl/monkeypatch/tiled_mlp/patch.py#L22-L27
  2. supporting MoE MLP where the x.shape is not [bs, seqlen, hidden] but [seqlen, hidden]

This also requires deepspeedai/DeepSpeed#7622 which has been merged and ds=0.18.0 has been released.

Signed-off-by: Stas Bekman <stas.bekman@snowflake.com>
Signed-off-by: Stas Bekman <stas.bekman@snowflake.com>
@sfc-gh-sbekman sfc-gh-sbekman marked this pull request as ready for review October 6, 2025 23:08
@sfc-gh-sbekman sfc-gh-sbekman requested review from sfc-gh-truwase and removed request for sfc-gh-jrasley October 6, 2025 23:09
@sfc-gh-sbekman
Copy link
Copy Markdown
Collaborator Author

sfc-gh-sbekman commented Oct 8, 2025

@sfc-gh-truwase, this is now ready to be merged if you could review please. Thank you.

@sfc-gh-sbekman sfc-gh-sbekman enabled auto-merge (squash) October 8, 2025 03:58
@sfc-gh-sbekman sfc-gh-sbekman merged commit b738b73 into main Oct 9, 2025
5 checks passed
@sfc-gh-sbekman sfc-gh-sbekman deleted the stas/auto_tiled_mlp2 branch October 9, 2025 12:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants