Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
WandbCallback always (!) uploads entire model checkpoint to wandb
#30896
opened May 19, 2024 by
mgerstgrasser
2 of 4 tasks
add_generation_prompt=False in Tokenizer.apply_chat_template has no effect
#30893
opened May 18, 2024 by
AndreiMuresanu
3 of 4 tasks
RuntimeError: Failed to import transformers.generation.utils because of the following error (look up to see its traceback): cannot import name 'GenerateOutput' from partially initialized module 'transformers.generation.utils' (most likely due to a circular import)
#30888
opened May 18, 2024 by
sysuls1
4 tasks
Wav2vec2 model has unknown attributes weight_g/weight_v when DeepSpeed ZeRO-3 is enabled
Audio
DeepSpeed
#30881
opened May 17, 2024 by
jonnyli1125
1 of 4 tasks
Have
_is_peft_model
check if there's any peft submodule/Allow quantised training
PEFT
#30878
opened May 17, 2024 by
ambroser53
Kosmos-2.5 implementation in transformers
Multimodal
New model
#30877
opened May 17, 2024 by
Natyren
2 tasks done
Owlv2 model keeps crashing
Examples
Which is related to examples in general
Vision
#30874
opened May 17, 2024 by
preethiseshadri518
Unsuppressable warning: "<model> will not detect padding tokens in
inputs_embeds
"
#30871
opened May 16, 2024 by
naimenz
2 of 4 tasks
Cache problem while runing on multiple nodes with GPU
#30859
opened May 16, 2024 by
yuane4
2 of 4 tasks
scores_for_ground_truths Error for deepset/roberta-base-squad2 model and squad_v2 dataset
Examples
Which is related to examples in general
TensorFlow
Anything TensorFlow
#30856
opened May 16, 2024 by
rahuljauhari3
2 of 4 tasks
Mamba:
use_cache
is not passed through in prepare_inputs_for_generation
#30849
opened May 16, 2024 by
uwu-420
[BLIP2] BLIP2QFormerLayer is missing the self.intermediate parameter, which makes training from scratch impossible
#30846
opened May 16, 2024 by
tongda
1 of 4 tasks
Significant performance degradation with multi-GPU training on newer torch/transformers
#30840
opened May 15, 2024 by
abdulfatir
2 of 4 tasks
Rewriting usage of
torch.bucketize
with more elementary functions
#30839
opened May 15, 2024 by
EricLBuehler
Cannot import name 'WhisperForAudioClassification -Already installed transformers==4.40.2
Audio
#30834
opened May 15, 2024 by
manjualoshious
RecurrentGemma not compatible with autocast / AMP training
#30830
opened May 15, 2024 by
xplip
4 tasks done
Unable to run generation tests for Mamba & Jamba models
#30828
opened May 15, 2024 by
amyeroberts
1 of 4 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.