Issues: huggingface/transformers
Issues list
Maybe the vocab_size can be duplicated to the mainconfig for PEFT to pick up
#38017
opened May 8, 2025 by
lancercat
special_image_mask handling can get hit by accidental same embedding value at certain dims
#38012
opened May 8, 2025 by
lancercat
Trainer Stuck at 0% Progress during Training on Multi-GPU Setup
bug
#38008
opened May 8, 2025 by
yanho824
2 of 4 tasks
Does Qwen_2_5_VL support variable length attention computation?
Feature request
#38007
opened May 8, 2025 by
yingtongxiong
[bug] use_sliding_window doesn't work as expected
bug
#38002
opened May 7, 2025 by
ZhiyuLi-Nvidia
1 of 4 tasks
RuntimeError when converting and saving Flax ViT model to PyTorch
bug
Flax
#37999
opened May 7, 2025 by
nobodyPerfecZ
4 tasks
Versions greater than 4.49 are not compatible with Ascend NPU
bug
#37992
opened May 7, 2025 by
1737686924
4 tasks
Bug Report: Unexpected Keyword Argument 'padding_side' in PreTrainedTokenizerFast
bug
#37989
opened May 7, 2025 by
yunqianluo
1 of 4 tasks
Support saving tensors to a file in Model addition debuggers
Feature request
#37983
opened May 6, 2025 by
RyanMullins
Add pruna integration for loading model through transformers.from_pretrained / pipeline.
Feature request
#37971
opened May 6, 2025 by
davidberenstein1957
Inconsistency in installation instructions for venv and uv
#37956
opened May 5, 2025 by
arjunaskykok
jinja2.exceptions.UndefinedError: 'list object' has no attribute 'startswith'
bug
#37954
opened May 5, 2025 by
lucasjinreal
4 tasks done
Add examples that showcase the use of Hyperparameter search with Transformers
#37947
opened May 4, 2025 by
ParagEkbote
Alternative to trainer.hyperparameter_search for models used with custom optimizer / lrscheduler etc.
#37945
opened May 4, 2025 by
ieshaan12
[Bug] Gemma3Processor.apply_chat_template returns Tensor instead of dict with long multimodal few-shot inputs
bug
#37943
opened May 3, 2025 by
Canticle929
2 of 4 tasks
ConditionalDetrImageProcessor still accepts the deprecated parameter max_size
#37939
opened May 3, 2025 by
arjunaskykok
Different DataLoader workers share the same seed and lose randomness
bug
#37932
opened May 2, 2025 by
gathierry
4 tasks
Updates in type-checking specifications have broken transformers' types
bug
#37928
opened May 2, 2025 by
thfrkielikone
4 tasks done
Request training support for QuantizationMethod.FP8
bug
#37927
opened May 2, 2025 by
edoproch
4 tasks
Training Qwen2.5 VL with dynamic image size using more balanced Sampler for each GPU mem usage
Feature request
#37914
opened May 1, 2025 by
OpenJarvisAI
DynamicCache results in too many torch recompiles after 4.51
bug
#37908
opened May 1, 2025 by
flishwang
2 of 4 tasks
Error in input expansion for generate with num_return_sequences > 1 for multi-image inputs to AutoModelForImageTextToText
bug
#37900
opened Apr 30, 2025 by
saujasv
2 of 4 tasks