-
Notifications
You must be signed in to change notification settings - Fork 29.8k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Allow custom hf_quantizer in from_pretrained
#39690
opened Jul 26, 2025 by
tanuj-rai
Loading…
2 of 4 tasks
Fix issue #39191 respect accelerate config to disable torch.dynamo compilation
#39683
opened Jul 25, 2025 by
bonpiedlaroute
Loading…
[BugFix]: Support dict and config file path for deepspeed
#39675
opened Jul 25, 2025 by
yeshsurya
Loading…
1 of 5 tasks
Fix loss scaling and token aggregation to use only data parallel group
#39674
opened Jul 25, 2025 by
Krish0909
Loading…
skip
Glm4MoeModelTest::test_torch_compile_for_training
#39670
opened Jul 25, 2025 by
ydshieh
Loading…
Reduce atol values in test_dynamic_cache_exportability
#39667
opened Jul 25, 2025 by
st81
Loading…
1 of 5 tasks
Update
QAPipelineTests::test_large_model_course
after #39193
#39666
opened Jul 25, 2025 by
ydshieh
Loading…
fix(trainer): Correct loss scaling for incomplete gradient accumulation steps
#39659
opened Jul 25, 2025 by
hutaiHang
Loading…
2 of 5 tasks
Add self-hosted runner scale set workflow for mi325 CI
#39651
opened Jul 24, 2025 by
jitesh-gupta
•
Draft
5 tasks
🌐 [i18n-KO] Translated
deepseek_v3.md
to Korean
#39649
opened Jul 24, 2025 by
ssum21
Loading…
5 of 10 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-06-26.