-
Notifications
You must be signed in to change notification settings - Fork 28.4k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
In the _speculative_sampling function, it seems that the "squeeze" method is being used incorrectly.
#36810
by ZipECHO
was closed Mar 19, 2025
ValueError: weight is on the meta device, we need a
value
to put in on 0. Gemma3
bug
#36766
by akhilpandey95
was closed Mar 17, 2025
1 of 4 tasks
When what needs to be loaded is in the cache directory, there is no need to make a request to the remote
Feature request
Request for a new feature
#36762
by JinFish
was closed Mar 19, 2025
LlamaAttention has no attribute
rotary_emb
(4.50.0.dev0)
bug
#36758
by efsotr
was closed Mar 20, 2025
1 of 4 tasks
SFTConfig.__init__() got an unexpected keyword argument 'optimizers'
#36749
by Sneakr
was closed Mar 16, 2025
num_items_in_batch unexpected in vision encoder decoder
bug
#36744
by eljandoubi
was closed Mar 20, 2025
4 tasks
[bug] fast_image_processor register error
bug
#36715
by JJJYmmm
was closed Mar 19, 2025
1 of 4 tasks
ValueError: The checkpoint you are trying to load has model type
gemma3
but Transformers does not recognize this architecture.
bug
#36709
by JohnConnor123
was closed Mar 14, 2025
AttributeError: 'Gemma3Config' object has no attribute 'vocab_size'
bug
#36683
by jumelet
was closed Mar 19, 2025
4 tasks
NotImplementedError: aten::_log_softmax_backward_data with SparseCUDA backend
bug
#36674
by rangehow
was closed Mar 14, 2025
2 of 4 tasks
Error faced during Finetuning Deepseek-vl2
bug
#36633
by keertika-11
was closed Mar 11, 2025
2 of 4 tasks
save_only_model with FSDP throws FileNotFoundError error
bug
#36626
by kmehant
was closed Mar 13, 2025
4 tasks
Why are there so many variables named layrnorm in the codebase?
#36623
by jere357
was closed Mar 10, 2025
Llama3 tokenizer decode is incorrect for ' ...' with leading space
bug
#36622
by Naqu6
was closed Mar 9, 2025
1 of 4 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.