AutoModelForObjectDetection isnt working due to wrong output size #36464

nikitabelooussovbtis · 2025-02-27T21:43:56Z

System Info

new:

transformers version: 4.50.0.dev0
Platform: Linux-6.11.0-17-generic-x86_64-with-glibc2.39
Python version: 3.12.3
Huggingface_hub version: 0.29.1
Safetensors version: 0.5.3
Accelerate version: 1.4.0
Accelerate config: not found
DeepSpeed version: not installed
PyTorch version (GPU?): 2.6.0+cu124 (False)
Tensorflow version (GPU?): not installed (NA)
Flax version (CPU?/GPU?/TPU?): not installed (NA)
Jax version: not installed
JaxLib version: not installed
Using distributed or parallel set-up in script?:

old:

transformers version: 4.50.0.dev0
Platform: Linux-6.11.0-17-generic-x86_64-with-glibc2.39
Python version: 3.12.3
Huggingface_hub version: 0.29.0
Safetensors version: 0.5.2
Accelerate version: 1.4.0
Accelerate config: not found
DeepSpeed version: not installed
PyTorch version (GPU?): 2.6.0+cu124 (False)
Tensorflow version (GPU?): not installed (NA)
Flax version (CPU?/GPU?/TPU?): not installed (NA)
Jax version: not installed
JaxLib version: not installed
Using distributed or parallel set-up in script?:

Who can help?

No response

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

I was using the repisotry 2 days ago and it was working fine. With the new updates something has changed and now when I try to load a model it is telling me that the output size is incorrect: size mismatch for class_labels_classifier.weight: copying a param with shape torch.Size([92, 256]) from checkpoint, the shape in current model is torch.Size([11, 256]). The ignore_mismatched_sizes has been set to true and I am using the facebook/detr-resnet-50 model specefically. I am pretty sure that it is due to a change in the repository since I copied the code and just reinstalled the repo in a new enviroment. In the old enviroment it works fine in the new one it is breaking. This is with the run_object_detection.py script.

Expected behavior

that the model is able to adapt to the new model size and train

The text was updated successfully, but these errors were encountered:

nikitabelooussovbtis · 2025-02-27T22:10:15Z

The old comment id is: 7c5bd24
The new one is 222505c

qubvel · 2025-02-28T10:17:31Z

Hey @nikitabelooussovbtis! The fix is coming

Fix loading models with mismatched sizes #36463

nikitabelooussovbtis · 2025-02-28T10:24:09Z

@qubvel Awesome thanks

qubvel · 2025-02-28T10:54:13Z

It should be fixed now! Let me know if any other issues left 🤗

nikitabelooussovbtis added the bug label Feb 27, 2025

qubvel mentioned this issue Feb 28, 2025

Fix loading models with mismatched sizes #36463

Merged

qubvel closed this as completed Feb 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AutoModelForObjectDetection isnt working due to wrong output size #36464

AutoModelForObjectDetection isnt working due to wrong output size #36464

nikitabelooussovbtis commented Feb 27, 2025

nikitabelooussovbtis commented Feb 27, 2025

qubvel commented Feb 28, 2025

nikitabelooussovbtis commented Feb 28, 2025

qubvel commented Feb 28, 2025

AutoModelForObjectDetection isnt working due to wrong output size #36464

AutoModelForObjectDetection isnt working due to wrong output size #36464

Comments

nikitabelooussovbtis commented Feb 27, 2025

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

nikitabelooussovbtis commented Feb 27, 2025

qubvel commented Feb 28, 2025

nikitabelooussovbtis commented Feb 28, 2025

qubvel commented Feb 28, 2025