
Fuyu training fails with a padding error for the same inputs as the test case #27255

Closed
Carolinabanana opened this issue Nov 2, 2023 · 3 comments
System Info

transformers-4.36.0.dev0 (commit 552ff24)

Who can help?

@molbap

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

import io
from datasets import Dataset
from PIL import Image
import requests
import torch
from transformers import AutoTokenizer, Trainer, TrainingArguments, FuyuImageProcessor, FuyuProcessor, FuyuForCausalLM

pretrained_path = "adept/fuyu-8b"
tokenizer = AutoTokenizer.from_pretrained(pretrained_path, pad_token_id=0, padding=True, truncation=True)
tokenizer.pad_token = tokenizer.eos_token
image_processor = FuyuImageProcessor()
processor = FuyuProcessor(image_processor=image_processor, tokenizer=tokenizer)

text_prompt = "Answer the following DocVQA question based on the image. \n Which is the metro in California that has a good job Outlook?"
jobs_image_url = "https://huggingface.co/datasets/hf-internal-testing/fixtures-captioning/resolve/main/jobs.png"
jobs_image_pil = Image.open(io.BytesIO(requests.get(jobs_image_url).content))
second_text_prompt = "Answer the following DocVQA question based on the image. \n What if the maximum male life expectancy?"
chart_image_url = "https://huggingface.co/datasets/hf-internal-testing/fixtures-captioning/resolve/main/chart.png"
chart_image_pil = Image.open(io.BytesIO(requests.get(chart_image_url).content))

model_inputs = processor(text=[text_prompt, second_text_prompt], images=[jobs_image_pil, chart_image_pil]).to("cuda:0")

model = FuyuForCausalLM.from_pretrained(pretrained_path, device_map='cuda:0', torch_dtype=torch.bfloat16)

generation = processor.tokenizer.batch_decode(model.generate(
    **model_inputs, max_new_tokens=10)[:, -10:], skip_special_tokens=True)

for batched_generation in generation:
    answer = batched_generation.split('\x04 ', 1)[1] if '\x04' in batched_generation else ''
    print(answer)

# Results: Los Angeles, 80.7

tokenized_dataset = Dataset.from_dict(model_inputs)

trainer = Trainer(
    model=model,
    train_dataset=tokenized_dataset,
    tokenizer=tokenizer
)

trainer.train()

#ValueError: Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True' to have batched tensors with the same length. Perhaps your features (`image_patches` in this case) have excessive nesting (inputs type `list` where type `int` is expected).
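One way to work around the batching failure (a minimal sketch, not a maintainer-confirmed fix): pass a custom data collator to Trainer that pads every ragged field to the longest example in the batch. The `pad_collator` name and the `pad_value=0` default are my own assumptions, not part of the transformers API.

```python
import torch
from torch.nn.utils.rnn import pad_sequence

def pad_collator(features, pad_value=0):
    """Pad each field across the batch to the longest example.

    Hypothetical sketch: assumes every field is a (possibly nested)
    sequence whose trailing dimensions already match, e.g. 1-D token
    ids or 2-D (num_patches, patch_dim) image patches.
    """
    batch = {}
    for key in features[0]:
        # Convert each example's field to a tensor, then pad along
        # the first (variable-length) dimension.
        seqs = [torch.as_tensor(f[key]) for f in features]
        batch[key] = pad_sequence(seqs, batch_first=True, padding_value=pad_value)
    return batch
```

One could then pass `data_collator=pad_collator` to `Trainer(...)`; whether plain zero-padding of `image_patches` is semantically correct for Fuyu is untested here.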

Expected behavior

When running Fuyu model training, the same inputs that work correctly in model.generate (the ones from the test case) fail with ValueError: Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True' to have batched tensors with the same length. Perhaps your features (image_patches in this case) have excessive nesting

However, I have already enabled 'padding=True' and 'truncation=True' on the tokenizer, and I have used the exact output of FuyuProcessor as the inputs.

I provide example code that modifies the existing test case and test data to add a Trainer.
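For context, the tokenizer flags don't reach `image_patches`: after `Dataset.from_dict`, each example's patches are stored as plain nested Python lists of different lengths, and the default collator cannot stack ragged lists back into one tensor. A minimal illustration of that failure mode, using made-up values of the same shape mismatch:

```python
import torch

# Two examples whose patch counts differ, like two images that produce
# a different number of patches; they cannot be stacked into one tensor.
ragged = [
    [[1.0, 2.0], [3.0, 4.0]],  # example 1: two "patches"
    [[5.0, 6.0]],              # example 2: one "patch"
]
try:
    torch.tensor(ragged)
    raised = False
except ValueError:
    raised = True
print(raised)  # ragged input raises ValueError, as in the Trainer error above
```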

@molbap
Copy link
Contributor

molbap commented Nov 7, 2023

Hi @Carolinabanana, I answered another training script issue here: #26997 (comment) — can you check if it helps your case?

@leo4life2
Copy link

Hey, did you get your script working? I'm trying to fine-tune Fuyu as well and would appreciate an example script.

@BlackHandsomeLee
Copy link

Hey, I came across your fine-tuned language model repository and am very interested in learning about the fine-tuning process. Would you be willing to share any details about the techniques or code used to fine-tune the model? Understanding how others have approached fine-tuning would be really helpful, as I'm new to this area. Thank you for your time.
