-
Notifications
You must be signed in to change notification settings - Fork 29.3k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add
SepCache
[An efficient and easy-to-use Cache from the SepLLM paper - ICML 2025 (https://arxiv.org/abs/2412.12094) ] to the cache_utils.py
and __init__.py
#38824
opened Jun 14, 2025 by
GaussonTschen
Loading…
5 tasks
Corrections to PR #38642 and enhancements to Wav2Vec2Processor __call__ and pad docstrings
#38822
opened Jun 13, 2025 by
renet10
Loading…
3 tasks
Codespace organic succotash 5rqgw4j5xqv376pr
#38821
opened Jun 13, 2025 by
nodoubtz
Loading…
5 tasks
Update trainer.py: add multiprocessing_context for mps devices
#38819
opened Jun 13, 2025 by
AmitMY
Loading…
1 of 5 tasks
Add kwargs support in WhisperForConditionalGeneration
#38810
opened Jun 13, 2025 by
Tanuj-rai
Loading…
2 of 5 tasks
feat: Add granite architectures to auto tokenizer name mappings
#38802
opened Jun 12, 2025 by
gabe-l-hart
Loading…
1 of 5 tasks
GraniteMoeHybrid: Allow for only shared expert case.
#38801
opened Jun 12, 2025 by
shawntan
Loading…
LlamaAttention forward function type hint is incorrect #38739
#38795
opened Jun 12, 2025 by
ArkVex
Loading…
Provide clearer instructions on how to specify target language.
#38786
opened Jun 12, 2025 by
khof312
Loading…
Fix(informer): Correct tensor shape for input_size=1
#38780
opened Jun 12, 2025 by
Flink-ddd
Loading…
fix(generation): stop beam search per-instance when heuristic satisfied
#38778
opened Jun 12, 2025 by
guang-yng
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.