Pull requests: pytorch/torchtitan
[DSV3] Explicitly convert to bfloat16 when use grouped mm
CLA Signed
This label is managed by the Meta Open Source bot.
#1367
opened Jul 3, 2025 by
wwwjn
non parallelized basic validator implementation [WIP]
CLA Signed
#1362
opened Jul 2, 2025 by
wesleytruong
Minimal reproduce of error when saving state dict after looping
CLA Signed
#1358
opened Jun 30, 2025 by
wesleytruong
Add support for saving HF format tensors with DCP
CLA Signed
#1351
opened Jun 27, 2025 by
ankitageorge
Draft
[WIP] Document MX FP8 recipe
CLA Signed
#1350
opened Jun 27, 2025 by
lessw2020
Autoparallel support for DP-only, DP+TP, or TP-only
CLA Signed
#1349
opened Jun 27, 2025 by
wconstab
[WIP] Enable causal block mask for sdpa
CLA Signed
[SimpleFSDP] Add support for hsdp+tp
CLA Signed
#1343
opened Jun 26, 2025 by
ruisizhang123
Refactor Tokenizer -> BaseTokenizer
CLA Signed
#1333
opened Jun 24, 2025 by
H-Huang
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
#1327
opened Jun 22, 2025 by
lessw2020
dp2ep Expert Parallel
CLA Signed
#1324
opened Jun 21, 2025 by
tianyu-l
Support finetuning from a pretrained model
CLA Signed
#1321
opened Jun 20, 2025 by
vwxyzjn
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
#1317
opened Jun 18, 2025 by
vkuzo
Do not submit: Multinode training seems to be working
CLA Signed
#1314
opened Jun 17, 2025 by
ahmadsharif1
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
#1313
opened Jun 17, 2025 by
ahmadsharif1
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
#1304
opened Jun 16, 2025 by
hann-wang
Finetune from pre-trained models
CLA Signed
#1300
opened Jun 15, 2025 by
vwxyzjn
[not for land] Use new AC
CLA Signed
#1294
opened Jun 13, 2025 by
soulitzer
WIP: Try to use monarch to run torchtitan.
CLA Signed
#1288
opened Jun 12, 2025 by
ahmadsharif1
Draft
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
#1286
opened Jun 12, 2025 by
ahmadsharif1
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
#1276
opened Jun 8, 2025 by
lessw2020
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
#1274
opened Jun 8, 2025 by
lessw2020