Insights: huggingface/transformers
Overview
2 Releases published by 1 person
-
v4.52.4 Patch release: v4.52.4
published
May 30, 2025 -
v4.52.4-ColQwen2-preview ColQwen2 (based on v4.52.4)
published
Jun 2, 2025
61 Pull requests merged by 39 people
-
Updated deprecated typing imports with equivalents for Python 3.9+
#38546 merged
Jun 4, 2025 -
New gpt neo model card
#38505 merged
Jun 4, 2025 -
tests/roformer: fix couple roformer tests on gpus
#38570 merged
Jun 4, 2025 -
[Dinov2] Enable device_map="auto" support
#38487 merged
Jun 4, 2025 -
Add Expectations for three AMD tests
#38581 merged
Jun 4, 2025 -
Pin Scipy version to >=1.12.0
#38469 merged
Jun 4, 2025 -
Janus seamless transfer
#38580 merged
Jun 4, 2025 -
feat: add repository field to benchmarks table
#38582 merged
Jun 4, 2025 -
Docs: fix code formatting in torchao docs
#38504 merged
Jun 4, 2025 -
allow custom head_dim for qwen2_moe
#37188 merged
Jun 4, 2025 -
fix(attention_visualizer): add default value for image_seq_length
#38577 merged
Jun 4, 2025 -
[FlexAttn] Fix models with unique characteristics
#38433 merged
Jun 4, 2025 -
Expectation changes and more AMD expectations
#38529 merged
Jun 4, 2025 -
Fix deepseekv3
#38562 merged
Jun 4, 2025 -
update utils/notification_service.py for AMD vs Nvidia
#38563 merged
Jun 4, 2025 -
Fix chameleon tests
#38565 merged
Jun 4, 2025 -
Add support for MiniMax's MiniMax-Text-01
#35831 merged
Jun 4, 2025 -
Relaxed the output_attention condition for ValueError
#38560 merged
Jun 4, 2025 -
[janus] Fix failing tests on mi3XX
#38426 merged
Jun 4, 2025 -
Added guards against device mismatch errors in some LogitsProcessor classes
#38558 merged
Jun 4, 2025 -
Fixed a multiple-devices issue in SmolVLMModel
#38557 merged
Jun 4, 2025 -
[docs] Format fix
#38414 merged
Jun 3, 2025 -
Fix hqq issue
#38551 merged
Jun 3, 2025 -
Name change AOPermod -> ModuleFqn
#38456 merged
Jun 3, 2025 -
Fix utils/notification_service.py
#38556 merged
Jun 3, 2025 -
Explicitly setting encoding in tokenization_utils_base.py
#38553 merged
Jun 3, 2025 -
[TP] Change command in tests to python3
#38555 merged
Jun 3, 2025 -
[bugfix] fix apply_rotary_emb error on Ascend NPU
#38491 merged
Jun 3, 2025 -
Update docker image to use av==10.0.0
#38548 merged
Jun 3, 2025 -
update emu3 test
#38543 merged
Jun 3, 2025 -
Don't use default attn if pre-set in sub-config
#38526 merged
Jun 3, 2025 -
[tests] expand flex-attn test for vision models
#38434 merged
Jun 3, 2025 -
Fix blip2 tests
#38510 merged
Jun 2, 2025 -
Fix Gemma2IntegrationTest
#38492 merged
Jun 2, 2025 -
Remove type annotation in Siglip Attention Module
#38503 merged
Jun 2, 2025 -
Num parameters in model.safetensors.index.json
#38531 merged
Jun 2, 2025 -
[flax/mistral] support sliding_window: null in config
#37402 merged
Jun 2, 2025 -
Fix amp deprecation issue
#38100 merged
Jun 2, 2025 -
remove unhandled parameter
#38145 merged
Jun 2, 2025 -
Add ColQwen2 to 🤗 transformers
#35778 merged
Jun 2, 2025 -
[generate] move SinkCache to a custom_generate repo
#38399 merged
Jun 2, 2025 -
[generate] add soft deprecations on custom generation methods
#38406 merged
Jun 2, 2025 -
Update Loss Functions to Accept Tensor num_items_in_batch
#38029 merged
Jun 2, 2025 -
[seamless_m4t] Skip some tests when speech is not available
#38430 merged
Jun 2, 2025 -
Fix setting FLASH_ATTENTION_DETERMINISTIC after importing
#37185 merged
Jun 2, 2025 -
Remove deprecated use_flash_attention_2 parameter
#37131 merged
Jun 2, 2025 -
[docs] add xpu environment variable for gpu selection
#38194 merged
May 30, 2025 -
protect dtensor import
#38496 merged
May 30, 2025 -
Align TP check
#38328 merged
May 30, 2025 -
[Tests] Reduced model size for albert-test model
#38480 merged
May 30, 2025 -
Bump torch from 2.2.0 to 2.6.0 in /examples/flax/vision
#37618 merged
May 30, 2025 -
Fix incorrect bbox_embed initialization when decoder_bbox_embed_share=False in GroundingDINO
#38238 merged
May 30, 2025 -
Fix convert_internvl_weights_to_hf.py to support local paths
#38264 merged
May 30, 2025 -
Make patch helper more helpful
#38409 merged
May 30, 2025 -
fix: handle no scheduler passed by user
#38407 merged
May 30, 2025 -
[Qwen2.5-Omni] Fix dtype of cos,sin when used with flash attention
#38453 merged
May 29, 2025 -
Fix Gemma3IntegrationTest
#38471 merged
May 29, 2025 -
Cleanup BatchFeature and BatchEncoding
#38459 merged
May 29, 2025 -
Fix TypeError in save_pretrained error handling (fixes #38422)
#38449 merged
May 29, 2025 -
🔴 [VLM] modeling updates
#38317 merged
May 29, 2025 -
[Tests] Clean up test cases for few models
#38315 merged
May 29, 2025
53 Pull requests opened by 40 people
-
Add glpn fast processor
#38461 opened
May 29, 2025 -
fix torch_dtype on awq
#38463 opened
May 29, 2025 -
Fix trainer.py not showing signature columns
#38465 opened
May 29, 2025 -
Fix HQQ model param device transfer issue
#38466 opened
May 29, 2025 -
[VLMs] support passing embeds along with pixels
#38467 opened
May 29, 2025 -
Add detailed ConvBERT model card with usage, architecture, and refere…
#38470 opened
May 29, 2025 -
Updated Aria model card
#38472 opened
May 29, 2025 -
docs: Add Turkish translation for README
#38473 opened
May 29, 2025 -
Avoid overwrite existing local implementation when loading remote custom model
#38474 opened
May 29, 2025 -
Refactor DBRX tests to use CausalLMModelTest base classes
#38475 opened
May 29, 2025 -
Fix meta tensor copy error
#38478 opened
May 29, 2025 -
[static cache] fix device map per layer in VLMs
#38488 opened
May 30, 2025 -
lazy cache init
#38495 opened
May 30, 2025 -
Add ZoeDepthImageProcessorFast: PyTorch-native Fast Image Preprocessing for ZoeDepth
#38497 opened
May 30, 2025 -
Add fast imageprocessor vitpose
#38502 opened
May 31, 2025 -
Fixed markdown for BertTokenizer's '[CLS]' token.
#38506 opened
May 31, 2025 -
Fix initialization of a pretrained backbone
#38512 opened
Jun 1, 2025 -
Update blip model card
#38513 opened
Jun 1, 2025 -
added fast image processor for ZoeDepth and expanded tests accordingly
#38515 opened
Jun 1, 2025 -
Fix `return_dict=False` giving errors in a few VLM models
#38519 opened
Jun 1, 2025 -
Add QuasarV4 model
#38520 opened
Jun 1, 2025 -
[qwen-omni] fix sliding window
#38525 opened
Jun 2, 2025 -
Logging message for `is_bitsandbytes_available()`
#38528 opened
Jun 2, 2025 -
Fix to make vllm happy
#38530 opened
Jun 2, 2025 -
On branch fix-void-segment-mask-input [WIP]
#38532 opened
Jun 2, 2025 -
another way to use shift_labels
#38533 opened
Jun 2, 2025 -
fixed a bug which was causing only partial files to be imported
#38534 opened
Jun 2, 2025 -
Update data collator to support sequence_length
#38536 opened
Jun 2, 2025 -
Allow `mlm_probability` to be set to `None` when `mlm=False` in DataCollatorForLanguageModeling (#38522)
#38537 opened
Jun 2, 2025 -
[docs] transformers-cli command
#38539 opened
Jun 2, 2025 -
Image processor compile fix
#38540 opened
Jun 3, 2025 -
[WIP chat template] return assistant mask in processors
#38545 opened
Jun 3, 2025 -
Fix CTRL model DataParallel compatibility
#38547 opened
Jun 3, 2025 -
Improve GPTNeoX model card following standardization guidelines
#38550 opened
Jun 3, 2025 -
Better CI
#38552 opened
Jun 3, 2025 -
[masking utils] check `None` instead of try/except
#38561 opened
Jun 3, 2025 -
Fix ModernBERT tokenizer issue with is_split_into_words flag
#38564 opened
Jun 3, 2025 -
Fix `FalconMambaIntegrationTests`
#38566 opened
Jun 3, 2025 -
Update Wav2Vec2 documentation to create model cards
#38568 opened
Jun 3, 2025 -
Add Bagel
#38569 opened
Jun 3, 2025 -
enable more test cases on xpu
#38572 opened
Jun 4, 2025 -
Fix `MiniMax` (docs and integration tests checkpoint)
#38575 opened
Jun 4, 2025 -
Disable custom MRA kernels for ROCm
#38578 opened
Jun 4, 2025 -
blt wip
#38579 opened
Jun 4, 2025 -
update `ColQwen2ModelIntegrationTest`
#38583 opened
Jun 4, 2025 -
Fix zero rotary dim
#38584 opened
Jun 4, 2025 -
[don't merge yet] Fix RAG
#38585 opened
Jun 4, 2025 -
docs: fix dark mode logo display.
#38586 opened
Jun 4, 2025 -
Refactor Bamba tests to inherit from CausalLMModelTester base classes
#38587 opened
Jun 4, 2025 -
Remove custom pytest and pluggy
#38589 opened
Jun 4, 2025 -
Fix: Correctly handle integer device_map for NPU devices in _load_sta…
#38591 opened
Jun 4, 2025 -
[docs] Add int4wo + 2:4 sparsity example to TorchAO README
#38592 opened
Jun 4, 2025 -
Remove IPEX requirement for bitsandbytes on CPU
#38594 opened
Jun 4, 2025
50 Issues closed by 22 people
-
Qwen2.5-VL using ascend NPU with flash-attention-2 raises error
#38189 closed
Jun 4, 2025 -
Community contribution: enabling `device_map="auto"` support for more vision and multimodal models
#29786 closed
Jun 4, 2025 -
Offline mode doesn't work with models that require `trust_remote_code=True`
#34855 closed
Jun 4, 2025 -
allow custom head_dim for qwen2_moe
#37187 closed
Jun 4, 2025 -
Support for excel files
#38567 closed
Jun 4, 2025 -
404 Client Error when accessing https://router.huggingface.co/nebius/v1/chat/completions endpoint
#38524 closed
Jun 4, 2025 -
TapasTokenizer Produces All Zero token_type_ids Even with Tutorial Data
#37183 closed
Jun 4, 2025 -
Whisper chunking algorithm increases WER
#37789 closed
Jun 4, 2025 -
AutomaticMaskGeneration does not work with batch_size greater than 1
#37805 closed
Jun 4, 2025 -
ValueError: size must contain 'shortest_edge' and 'longest_edge' keys.
#37811 closed
Jun 4, 2025 -
Add support for MiniMax-Text-01 and MiniMax-VL-01 from MiniMaxAI
#35710 closed
Jun 4, 2025 -
1
#38574 closed
Jun 4, 2025 -
register_quantizer or register_quantization_config does not add new method to QuantizationMethod
#38462 closed
Jun 4, 2025 -
Hang in quantized_phi::ModelWeights::forward() with Phi-2 GGUF on CPU (Candle main branch)
#38516 closed
Jun 3, 2025 -
Why do you remove sample_indices_fn for processor.apply_chat_template?
#38527 closed
Jun 3, 2025 -
torch.compile fails for gemma-3-1b-it
#38501 closed
Jun 2, 2025 -
num_items_in_batch should be moved to logits.device in ForCausalLMLoss too
#37886 closed
Jun 2, 2025 -
ImportError: cannot import name 'amp' from 'apex'
#38095 closed
Jun 2, 2025 -
Release Tag Changed, Breaking Checksums, and AUR Package Building
#37090 closed
Jun 2, 2025 -
Loading and Saving Pretrained model to the same directory raises SafeTensorError: IOError
#37713 closed
Jun 2, 2025 -
Failed to load model with transformers 4.51.3 when WORLD_SIZE set to 1 on nvidia gpu
#37737 closed
Jun 2, 2025 -
[Trainer] As gradient_accumulation_steps increases, the loss also increases
#37766 closed
Jun 2, 2025 -
error: subprocess-exited-with-error when installing transformers
#37775 closed
Jun 2, 2025 -
DTensor Import Path Changes in PyTorch 2.5 Causing Compatibility Issues
#38251 closed
Jun 2, 2025 -
TypeError: CustomTrainer.compute_loss() got an unexpected keyword argument 'num_items_in_batch'
#36331 closed
Jun 1, 2025 -
Can't perform inference with images on Gemma-3-12b-it-qat-int4.0
#37710 closed
Jun 1, 2025 -
Very slow model instantiation
#37712 closed
Jun 1, 2025 -
Model Request: SLaM (Sparse Latent Mixer) – Multimodal Flamingo Alternative
#38508 closed
May 31, 2025 -
bug in new prefill_chunk_size implementation
#38028 closed
May 31, 2025 -
CVE-2024-11392 - AWS Scanner and Trivy Flagging Transformers 4.48.1 as Vulnerable
#36041 closed
May 31, 2025 -
Object detection tutorial uses buggy dataset, may lead to crash during training
#36516 closed
May 31, 2025 -
Tokenizing with `apply_chat_template` behaves differently from regular tokenizing
#37686 closed
May 31, 2025 -
num_items_in_batch larger than the actual useful token when computing loss
#38448 closed
May 31, 2025 -
ImportError: cannot import name 'DTensor' from 'torch.distributed.tensor'
#38494 closed
May 30, 2025 -
[Tests] Testing for ALBERT is quite slow
#38344 closed
May 30, 2025 -
Unexpected Zero Probabilities with siglip2-base-patch16-224 Model
#38175 closed
May 30, 2025 -
VLM reverse mapping logic in modeling_utils.py save_pretrained not doing anything?
#38489 closed
May 30, 2025 -
TypeError in Llama-4-Maverick-17B-128E-Instruct-FP8 Resolved with Workaround
#38283 closed
May 30, 2025 -
A shallow copy in groundingdino
#37333 closed
May 30, 2025 -
convert_internvl_weights_to_hf.py does not support model in a local path
#38200 closed
May 30, 2025 -
[Bug - Qwen2.5-Omni] FlashAttention 2 BF16 dtype mismatch persists in `apply_rotary_pos_emb_flashatt`
#38451 closed
May 29, 2025 -
Bug in error handling routine in save_pretrained
#38422 closed
May 29, 2025 -
add Flash Attention Support for Helsinki-NLP/opus models
#36169 closed
May 29, 2025 -
Gemma3 (and Paligemma) position_ids 1-indexed?
#36856 closed
May 29, 2025 -
bitnet
#37632 closed
May 29, 2025 -
Error message is misleading for missing protobuf
#37641 closed
May 29, 2025 -
phi-4-mm HF format
#38120 closed
May 29, 2025 -
_register_pytree_node error in torch2.1.0 and bf16 assertion error for XPU and NPU
#37838 closed
May 29, 2025
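A recurring theme in the closed issues is loss normalization (#38448, #37886): the denominator should count only label positions that actually contribute to the loss, not padding or prompt positions. A pure-Python sketch of that counting rule, using the conventional `-100` ignore index (the helper itself is illustrative, not transformers code):

```python
IGNORE_INDEX = -100  # conventional "skip this position" label value

def count_loss_tokens(labels: list[list[int]]) -> int:
    """Count label positions contributing to the loss, i.e. everything
    in the batch except positions marked with IGNORE_INDEX."""
    return sum(1 for row in labels for label in row if label != IGNORE_INDEX)
```

Dividing a summed cross-entropy by this count, rather than by the raw batch-times-sequence-length, is what keeps the reported loss comparable across padding regimes.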
32 Issues opened by 32 people
-
Consider Deprecating Sigopt from Hyperparameter Search
#38593 opened
Jun 4, 2025 -
"facebook/opt-125m" gives wrong results
#38590 opened
Jun 4, 2025 -
Transformers fail to load deepseek-ai/DeepSeek-V3 with vllm
#38588 opened
Jun 4, 2025 -
Request to add the small-doge model
#38573 opened
Jun 4, 2025 -
Oneke not utilizing much from GPU(Nvidia L20)
#38571 opened
Jun 4, 2025 -
Possible Typo in "Mask2FormerLoss"
#38559 opened
Jun 3, 2025 -
hidden_states, self_attn_weights = self.self_attn( ValueError: too many values to unpack (expected 2)
#38554 opened
Jun 3, 2025 -
Clarification on default top_k sampling parameter
#38549 opened
Jun 3, 2025 -
Paligemma model card needs update
#38544 opened
Jun 3, 2025 -
enable GraniteMoeHybridIntegrationTest in UT
#38542 opened
Jun 3, 2025 -
`eager_attention_forward` and `repeat_kv` code duplication
#38541 opened
Jun 3, 2025 -
Hidden states are different for model() and model.generate()
#38538 opened
Jun 2, 2025 -
Streaming mode support on HF vs kyutai-labs for the mimi model
#38535 opened
Jun 2, 2025 -
"Size mismatch" error when trying to download pretrained ChatGPT-4 using transformers
#38523 opened
Jun 2, 2025 -
Allow `mlm_probability` to be set to None when `mlm`=False in `DataCollatorForLanguageModeling`
#38522 opened
Jun 2, 2025 -
Error for `return_assistant_tokens_mask` in MLLM processor
#38521 opened
Jun 2, 2025 -
Failed to export PyTorch traced graph of Mixtral-8x7B-Instruct-v0.1 due to the PR #32429
#38518 opened
Jun 1, 2025 -
model_type = self._reverse_config_mapping[key.__name__] KeyError: 'Qwen2RMConfig'
#38517 opened
Jun 1, 2025 -
Can not reproduce Blip2ForImageTextRetrieval example from docs, getting different results
#38514 opened
Jun 1, 2025 -
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference
#38509 opened
May 31, 2025 -
id2label assignment problem in run_glue.py
#38507 opened
May 31, 2025 -
Unable to deploy Gemma 3 on AWS SageMaker due to lack of support in transformers release
#38500 opened
May 30, 2025 -
ModernBERT for MLM outputs incorrect hidden state shape.
#38499 opened
May 30, 2025 -
[Florence-2] SyntaxWarning: invalid escape sequence '\d' in processing_florence2.py
#38498 opened
May 30, 2025 -
Clarification on per_device_train_batch_size in Trainer
#38484 opened
May 30, 2025 -
Transformers 4.41.0 does not recognize 'gemma2' model type for google/gemma-2-2b
#38482 opened
May 29, 2025 -
Token shape issue in LLaVA-onevision fine-tuning
#38481 opened
May 29, 2025 -
ImportError: DLL load failed while importing _safetensors_rust: The specified module could not be found
#38479 opened
May 29, 2025 -
Pickle error when downloading DeepSeek model
#38476 opened
May 29, 2025 -
AssertionError: Torch not compiled with CUDA enabled when using device_map="auto" in Ascend NPU
#38468 opened
May 29, 2025 -
We now require users to upgrade torch to at least v2.6 in order to use the function.
#38464 opened
May 29, 2025 -
Incorrect API call
#38457 opened
May 28, 2025
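Issue #38549 asks for clarification on the default `top_k` sampling parameter. As background, top-k filtering keeps only the k highest-scoring logits before sampling; a self-contained sketch of the idea (illustrative only, not the actual transformers logits-processor code):

```python
import math

def top_k_filter(logits: list[float], k: int) -> list[float]:
    """Set all but the k largest logits to -inf so softmax ignores them.
    Ties at the threshold may keep more than k entries."""
    threshold = sorted(logits, reverse=True)[k - 1]
    return [x if x >= threshold else -math.inf for x in logits]

def softmax(logits: list[float]) -> list[float]:
    m = max(logits)                              # subtract max for stability
    exps = [math.exp(x - m) for x in logits]     # exp(-inf) evaluates to 0.0
    total = sum(exps)
    return [e / total for e in exps]
```

After filtering, sampling from the softmax output can only ever pick one of the surviving k tokens, which is the whole effect of the parameter.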
116 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Add support for Florence-2
#38188 commented on
Jun 4, 2025 • 73 new comments -
GLM-4-0414 Change
#38431 commented on
Jun 4, 2025 • 39 new comments -
Split `transformers chat` and `transformers serve`
#38443 commented on
Jun 3, 2025 • 31 new comments -
Add EoMT Model
#37610 commented on
Jun 4, 2025 • 27 new comments -
support MiniCPM-o2.6
#37917 commented on
Jun 3, 2025 • 14 new comments -
Add LightGlue model
#31718 commented on
Jun 4, 2025 • 11 new comments -
Add Fast Image Processor for mobileViT
#37143 commented on
Jun 2, 2025 • 7 new comments -
Add X-Codec model
#38248 commented on
May 30, 2025 • 7 new comments -
Encoder-Decoder Gemma
#38332 commented on
May 30, 2025 • 6 new comments -
fix total batch size calculation in trainer
#38286 commented on
Jun 4, 2025 • 5 new comments -
support overlapping masks in mask2former image processor
#37357 commented on
Jun 3, 2025 • 3 new comments -
Skip non-selected experts for qwen3_moe
#38133 commented on
Jun 2, 2025 • 3 new comments -
[WIP] Perception lm
#37878 commented on
May 31, 2025 • 2 new comments -
Add Ovis2 model and processor implementation
#37088 commented on
Jun 4, 2025 • 2 new comments -
Add kernelize to transformers
#38205 commented on
Jun 3, 2025 • 2 new comments -
Enable tracing for Moshi
#36894 commented on
Jun 2, 2025 • 2 new comments -
Fix Whisper inference regression with backward-compatible logprob calculation
#38388 commented on
Jun 3, 2025 • 2 new comments -
Add Dia model
#38405 commented on
Jun 4, 2025 • 2 new comments -
Refactor `MambaCache` to `modeling_mamba.py` (parity with Zamba)
#38086 commented on
Jun 2, 2025 • 1 new comment -
feat: support indivisible shards for TP model loading and TPlizing.
#37220 commented on
Jun 2, 2025 • 1 new comment -
[trainer] ensure special tokens in model configs are aligned with tokenizer at train time
#38441 commented on
Jun 3, 2025 • 1 new comment -
[Validation] First implementation of `@strict` from `huggingface_hub`
#36534 commented on
May 30, 2025 • 1 new comment -
Update tokenization_utils_base.py
#37512 commented on
Jun 2, 2025 • 0 new comments -
internalize build_inputs_with_special_tokens and prepare_for_model
#37522 commented on
Jun 2, 2025 • 0 new comments -
Fix interpolation of convnext image processor
#37460 commented on
Jun 4, 2025 • 0 new comments -
Fast tokenizer encoding doesn't handle empty string input
#37537 commented on
Jun 2, 2025 • 0 new comments -
[Cache] Support compilable cache reuse with smaller batch sizes
#37394 commented on
Jun 2, 2025 • 0 new comments -
Add configurable normalization schemes to SigLIP image processors
#38444 commented on
May 29, 2025 • 0 new comments -
handle training summary when creating modelcard but offline mode is set
#37095 commented on
Jun 2, 2025 • 0 new comments -
[draft] random tests order
#37082 commented on
Jun 2, 2025 • 0 new comments -
Adding a stub for MiniCPM-o to the models
#37049 commented on
Jun 3, 2025 • 0 new comments -
Add Fast Segformer Processor
#37024 commented on
Jun 4, 2025 • 0 new comments -
Add Fast SamImageProcessor
#36999 commented on
Jun 1, 2025 • 0 new comments -
Add support for specifying revisions when pushing to Hub via internal Trainer call
#36852 commented on
Jun 2, 2025 • 0 new comments -
Support loading custom code objects (`trust_remote_code=True`) in offline mode from local
#36808 commented on
Jun 4, 2025 • 0 new comments -
Add Aimv2 model
#36625 commented on
May 29, 2025 • 0 new comments -
Fix edge case for tokenize (#36277)
#36555 commented on
May 29, 2025 • 0 new comments -
[Qwen2.5-VL] Fix empty string input crash in processor
#38421 commented on
May 29, 2025 • 0 new comments -
Lag kv cache
#38364 commented on
Jun 3, 2025 • 0 new comments -
align xpu's autocast behavior w/ cuda by using device agnostic torch APIs
#38284 commented on
Jun 4, 2025 • 0 new comments -
Add zero dim tensor check when using flash_attention
#38280 commented on
May 30, 2025 • 0 new comments -
[docs] Tensor parallelism
#38241 commented on
Jun 2, 2025 • 0 new comments -
Add SVE implementation for Mamba Sequential Scan Algorithm
#38185 commented on
Jun 3, 2025 • 0 new comments -
[WIP] new BLT
#38173 commented on
May 30, 2025 • 0 new comments -
Fix FSDP + llava-next/llava-onevision
#38141 commented on
Jun 4, 2025 • 0 new comments -
Cache System Refactor: Layered Architecture
#38077 commented on
Jun 4, 2025 • 0 new comments -
update loss computation in modeling code
#37993 commented on
Jun 2, 2025 • 0 new comments -
Add dia
#37941 commented on
Jun 3, 2025 • 0 new comments -
[WIP] Add MM Grounding DINO
#37925 commented on
May 30, 2025 • 0 new comments -
Add DEIM object detection model
#37875 commented on
Jun 1, 2025 • 0 new comments -
[WiP] Add xcodec2 model
#37868 commented on
Jun 4, 2025 • 0 new comments -
Fixed a bug calculating cross entropy loss in `JetMoeForCausalLM`
#37830 commented on
May 30, 2025 • 0 new comments -
qwen null pointer check.
#37810 commented on
Jun 2, 2025 • 0 new comments -
Update ruff to 0.11.7 and some fixes
#37809 commented on
Jun 4, 2025 • 0 new comments -
fix qwen2.5-omini cant be loaded from AutoModel
#37795 commented on
Jun 2, 2025 • 0 new comments -
Adding features like Tokenizer evaluation/benchmarking
#37792 commented on
Jun 2, 2025 • 0 new comments -
Updated Albert model Card
#37753 commented on
Jun 4, 2025 • 0 new comments -
refactor create_token_type_ids_from_sequences
#37681 commented on
Jun 2, 2025 • 0 new comments -
Non model inits
#37653 commented on
Jun 2, 2025 • 0 new comments -
Add DeepSeek V2 Model into Transformers
#36400 commented on
Jun 3, 2025 • 0 new comments -
💡 Proposal: Add temporal-grounding pipeline for video-language tasks
#38450 commented on
Jun 1, 2025 • 0 new comments -
Convnext image preprocessor raises an AssertionError when comparing logits
#37461 commented on
Jun 1, 2025 • 0 new comments -
Weights not initialized correctly when instantiating model with a pretrained backbone
#38061 commented on
Jun 1, 2025 • 0 new comments -
Please support GGUF format for UMT5EncoderModel
#36774 commented on
May 31, 2025 • 0 new comments -
Any plans on adding Flash Attention 3?
#33373 commented on
May 31, 2025 • 0 new comments -
401 Unauthorized Error: "Invalid credentials" on POST requests to Inference API from multiple services
#38289 commented on
May 31, 2025 • 0 new comments -
[BUG] Batch inference DDP + zero stage 3 = inference code hangs
#36638 commented on
May 31, 2025 • 0 new comments -
ModernBert Tokenizer flag `is_split_into_words` not working
#37883 commented on
May 31, 2025 • 0 new comments -
Error in input expansion for `generate` with `num_return_sequences` > 1 for multi-image inputs to `AutoModelForImageTextToText`
#37900 commented on
May 31, 2025 • 0 new comments -
Object detection training/fine-tuning for Owl-vit/Owlv2
#33664 commented on
May 31, 2025 • 0 new comments -
OWL-ViT training / fine-tuning code
#20091 commented on
May 31, 2025 • 0 new comments -
Gibberish generations with FSDP2 and MixedPrecisionPolicy
#38190 commented on
May 30, 2025 • 0 new comments -
A type error in the Template writing document
#37524 commented on
May 30, 2025 • 0 new comments -
ImageInput doesn't include JAX ndarray and TensorFlow tensor
#37857 commented on
May 30, 2025 • 0 new comments -
BUG: ModernBERT flash-attention2 incompatible on Ascend NPU
#37859 commented on
May 30, 2025 • 0 new comments -
Llama2 can output scores normally, but Llama3 outputs full inf
#37862 commented on
May 30, 2025 • 0 new comments -
WhisperForCTC
#26242 commented on
May 30, 2025 • 0 new comments -
Potential mix-up with IMAGENET_STANDARD and IMAGENET_DEFAULT values
#38318 commented on
May 30, 2025 • 0 new comments -
Version 4.52.3 leads to error after bundling with pyinstaller
#38402 commented on
May 29, 2025 • 0 new comments -
Memory saving by upcasting logits for only non-ignored positions
#38452 commented on
May 29, 2025 • 0 new comments -
accelerate + device_map auto = error
#38408 commented on
May 29, 2025 • 0 new comments -
Allow video objects (np array etc.) in apply_chat_template (not just paths or urls)
#36560 commented on
May 29, 2025 • 0 new comments -
The same situation as #31377 occurred when using Qwen/Qwen2-VL-7B-Instruct
#33399 commented on
May 29, 2025 • 0 new comments -
Gemma3: Cuda error: misaligned address
#36961 commented on
May 29, 2025 • 0 new comments -
Decoder Attention Mask is not passed to the VisionEncoderDecoderModel during training!!
#37823 commented on
May 29, 2025 • 0 new comments -
AttentionMaskVisualizer hard-code sliding_window to 5 in transformers code.
#37851 commented on
May 29, 2025 • 0 new comments -
Will Trainer.predict() return data in the same order as the original dataset during multi-machine and multi-gpus inference?
#33728 commented on
May 29, 2025 • 0 new comments -
Add support for BAGEL from ByteDance
#38267 commented on
May 29, 2025 • 0 new comments -
Add evolla rebase main
#36232 commented on
Jun 3, 2025 • 0 new comments -
Add Doge model
#35891 commented on
Jun 3, 2025 • 0 new comments -
Integrate xlstm cleanly.
#35377 commented on
Jun 4, 2025 • 0 new comments -
Correctly support resuming from checkpoint with a dataset without length
#33544 commented on
Jun 3, 2025 • 0 new comments -
Add Segment Anything 2 (SAM2)
#32317 commented on
Jun 4, 2025 • 0 new comments -
[Community contributions] Model cards
#36979 commented on
Jun 4, 2025 • 0 new comments -
Beit image classification have different results compared from versions prior to 4.43.0
#34446 commented on
Jun 4, 2025 • 0 new comments -
Processor multiprocessing error when load custom processor
#37637 commented on
Jun 4, 2025 • 0 new comments -
MedGemma worked fine prior to 4.52.3 release but now errors
#38333 commented on
Jun 4, 2025 • 0 new comments -
LagKV for key-value compression
#38312 commented on
Jun 4, 2025 • 0 new comments -
`ConditionalDetrImageProcessor` still accepts the deprecated parameter `max_size`
#37939 commented on
Jun 4, 2025 • 0 new comments -
Errors using TinyLlama-1.1B-Chat-v1.0 and DirectML
#38340 commented on
Jun 4, 2025 • 0 new comments -
Add RoMa keypoint matcher
#36718 commented on
Jun 3, 2025 • 0 new comments -
Maybe the vocab_size can be duplicated to the mainconfig for PEFT to pick up
#38017 commented on
Jun 3, 2025 • 0 new comments -
Shape Error in Llama4VisionMLP2
#37321 commented on
Jun 3, 2025 • 0 new comments -
[Bug] Gemma3Processor.apply_chat_template returns Tensor instead of dict with long multimodal few-shot inputs
#37943 commented on
Jun 3, 2025 • 0 new comments -
Alternative to trainer.hyperparameter_search for models used with custom optimizer / lrscheduler etc.
#37945 commented on
Jun 3, 2025 • 0 new comments -
Add examples that showcase the use of Hyperparameter search with Transformers
#37947 commented on
Jun 3, 2025 • 0 new comments -
[Contributions Welcome] Add Fast Image Processors
#36978 commented on
Jun 2, 2025 • 0 new comments -
Model implmenetation using Liger Kernel layers
#38416 commented on
Jun 2, 2025 • 0 new comments -
quantizer_hqq should not require a gpu/cuda device to run
#38439 commented on
Jun 2, 2025 • 0 new comments -
Add Gemma 3 For Sequence Classification
#36755 commented on
Jun 2, 2025 • 0 new comments -
Recomputed tensor size does not match when using activation checkpointing when using FSDP and accelerate
#34928 commented on
Jun 2, 2025 • 0 new comments -
request the support for training support for QuantizationMethod.FP8
#37927 commented on
Jun 2, 2025 • 0 new comments -
Updates in type-checking specifications have broken transformers' types
#37928 commented on
Jun 2, 2025 • 0 new comments -
Is Llama4TextL2Norm meant to be RMS norm?
#37934 commented on
Jun 2, 2025 • 0 new comments -
[i18n-TR] Translating docs to Turkish
#27088 commented on
Jun 1, 2025 • 0 new comments -
transformers showing decoder model architecture detected so padding should be left
#38071 commented on
Jun 1, 2025 • 0 new comments