Fix Base Model Name of LlamaForQuestionAnswering #29258

lenglaender · 2024-02-23T17:51:48Z

What does this PR do?

The LlamaForQuestionAnswering currently has the LlamaModel in the transformer variable. This does not match the base_model_prefix set in LlamaPreTrainedModel, which is "model".

This Pull Request changes the name from transformer to model in LlamaForQuestionAnswering

Who can review?

text models: @ArthurZucker and @younesbelkada

younesbelkada

Thanks for the PR ! Unfortunately this is a breakign change - you could overwrite the base_model_prefix only for that class though, what do you think?

lenglaender · 2024-02-27T18:36:58Z

True, I didn't think about whether renaming the variable would be a breaking change. In this case, setting base_model_prefix to the right value is sufficient. I changed it.

ArthurZucker

Might be "breaking" but since it was not reported it means it was not used as you mentioned you cannot save + load

ArthurZucker · 2024-02-29T10:35:52Z

@younesbelkada feel free to merge if it is alright with you

younesbelkada

Looks great, thanks !

HuggingFaceDocBuilderDev · 2024-03-01T02:18:43Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Changes: - HF changed parts of the Llama model implementation - HF added a `LlamaForQuestionAnswering`. However, this model has a wrong base model name. I added a workaround that solves this problem until this is fixed in Transformers (huggingface/transformers#29258) --------- Co-authored-by: calpt <calpt@mail.de>

lenglaender added 2 commits February 23, 2024 18:41

LlamaForQuestionAnswering self.transformer->self.model

9c4e1e4

fix "Copied from" string

137aef6

lenglaender mentioned this pull request Feb 23, 2024

Upgrade Transformers to v4.38.x adapter-hub/adapters#654

Merged

lenglaender marked this pull request as ready for review February 23, 2024 23:20

younesbelkada reviewed Feb 27, 2024

View reviewed changes

Llama QA model: set base_model_prefix = "transformer"

5c961b1

ArthurZucker approved these changes Feb 29, 2024

View reviewed changes

ArthurZucker requested a review from younesbelkada February 29, 2024 10:35

younesbelkada approved these changes Mar 1, 2024

View reviewed changes

younesbelkada merged commit 2858d6c into huggingface:main Mar 1, 2024
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Base Model Name of LlamaForQuestionAnswering #29258

Fix Base Model Name of LlamaForQuestionAnswering #29258

lenglaender commented Feb 23, 2024 •

edited

younesbelkada left a comment

lenglaender commented Feb 27, 2024 •

edited

ArthurZucker left a comment

ArthurZucker commented Feb 29, 2024

younesbelkada left a comment

HuggingFaceDocBuilderDev commented Mar 1, 2024

Fix Base Model Name of LlamaForQuestionAnswering #29258

Fix Base Model Name of LlamaForQuestionAnswering #29258

Conversation

lenglaender commented Feb 23, 2024 • edited

What does this PR do?

Who can review?

younesbelkada left a comment

Choose a reason for hiding this comment

lenglaender commented Feb 27, 2024 • edited

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker commented Feb 29, 2024

younesbelkada left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Mar 1, 2024

lenglaender commented Feb 23, 2024 •

edited

lenglaender commented Feb 27, 2024 •

edited