-
Notifications
You must be signed in to change notification settings - Fork 28.3k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Can not use flash-attention and flash-varlen-attention on Ascend NPU
#36618
opened Mar 9, 2025 by
FightingZhen
Add support for StableAdamW optimizer in Trainer
Feature request
Request for a new feature
#36564
opened Mar 5, 2025 by
capemox
Support Distill Depth Anything
contributions-welcome
New model
Vision
#36499
opened Mar 2, 2025 by
oxysoft
2 tasks done
Set non_blocking=True When moving data from the CPU to the GPU
bug
#36384
opened Feb 25, 2025 by
Hukongtao
2 of 4 tasks
warning bug in Qwen2DecoderLayer in transformers ==4.49
bug
#36361
opened Feb 24, 2025 by
Kyrie666
2 of 4 tasks
TypeError: CustomTrainer.compute_loss() got an unexpected keyword argument 'num_items_in_batch'
bug
#36331
opened Feb 21, 2025 by
ruidazeng
2 of 4 tasks
The output tensor's data type is not torch.long when the input text is empty.
bug
#36277
opened Feb 19, 2025 by
wangzhen0518
2 of 4 tasks
[bug] use_gather_object is not respected after the first eval in trainer
#36213
opened Feb 15, 2025 by
ducha-aiki
Request to add DINO object detector
contributions-welcome
New model
Vision
#36205
opened Feb 14, 2025 by
tcourat
2 tasks done
Dedicated tokenizer for byte level transformers
Feature request
Request for a new feature
#36202
opened Feb 14, 2025 by
apehex
ValueError: Unrecognized image processor in Qwen/Qwen2.5-VL-3B-Instruct.
bug
#36193
opened Feb 14, 2025 by
SkalskiP
add Flash Attention Support for Helsinki-NLP/opus models
Feature request
Request for a new feature
Good Second Issue
Issues that are more difficult to do than "Good First" issues - give it a try if you want!
#36169
opened Feb 13, 2025 by
AghaDurrani
TypeError: ModernBertModel.forward() got an unexpected keyword argument 'num_items_in_batch'
bug
#36074
opened Feb 6, 2025 by
Bachstelze
4 tasks
modeling_phi3 errors with AttributeError: 'DynamicCache' object has no attribute 'get_max_length'
bug
#36071
opened Feb 6, 2025 by
doctorpangloss
1 of 4 tasks
Add Deepseek AI's Janus model
Good Difficult Issue
New model
#35928
opened Jan 28, 2025 by
ArthurZucker
2 tasks done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-02-20.