Issues: huggingface/transformers
Issues list
BatchEncoding.to(device, dtype) could be worked!!
Feature request
#38096
opened May 13, 2025 by
HERIUN
Please add RIFE - Real-Time Intermediate Flow Estimation
New model
#38082
opened May 12, 2025 by
jozefchutka
2 tasks done
autoawq has been deprecated. Is it possible to support the use of llm-compresser as an alternative to autoawq
#38078
opened May 12, 2025 by
0O0OwO0O0
transformers showing decoder model architecture detected so padding should be left
bug
#38071
opened May 11, 2025 by
sleepingcat4
2 of 4 tasks
Adding native support to load GGUF models using transformers
Feature request
#38063
opened May 10, 2025 by
sleepingcat4
Weights not initialized correctly when instantiating model with a pretrained backbone
bug
#38061
opened May 10, 2025 by
matteot11
1 of 4 tasks
Attention mask for multi-image input in gemma3
bug
#38053
opened May 9, 2025 by
deval281shah
1 of 4 tasks
Modernbert 3D attention mask
Feature request
#38040
opened May 9, 2025 by
meetdoshi-iitb
Trainer API doesnt stop after the training has been completed
bug
#38039
opened May 9, 2025 by
Awaisn25
2 of 4 tasks
Removing the modification of loss value due to rounding off to 4 digits
bug
#38032
opened May 9, 2025 by
harish6696
2 of 4 tasks
TimeSformer assumes a fixed number of frames in its layers even though it interpolates temporal embeddings based on the input
bug
#38027
opened May 8, 2025 by
kamila-chay
1 of 4 tasks
while using trainer to train mnist model, 'ValueError: Found input variables with inconsistent numbers of samples: [10000, 8750]'
bug
#38024
opened May 8, 2025 by
HaoyaWHL
2 of 4 tasks
Maybe the vocab_size can be duplicated to the mainconfig for PEFT to pick up
#38017
opened May 8, 2025 by
lancercat
Trainer Stuck at 0% Progress during Training on Multi-GPU Setup
bug
#38008
opened May 8, 2025 by
yanho824
2 of 4 tasks
Does Qwen_2_5_VL support variable length attention computation?
Feature request
#38007
opened May 8, 2025 by
yingtongxiong
[bug] use_sliding_window doesn't work as expected
bug
#38002
opened May 7, 2025 by
ZhiyuLi-Nvidia
1 of 4 tasks