fix(camembert): add tie_word_embeddings=True to CamembertConfig (#44931)
In v5, `modeling_utils.get_expanded_tied_weights_keys()` checks `config.tie_word_embeddings` and returns an empty dict (skipping all weight tying) when the attribute is absent or False. `CamembertConfig` was missing `tie_word_embeddings: bool = True`, causing `lm_head.decoder.weight` to be randomly initialized instead of being tied to `roberta.embeddings.word_embeddings.weight`. This produced near-uniform, near-zero logits for fill-mask (and all masked-LM tasks), making the model completely non-functional in v5. Sibling configs `RobertaConfig` and `BertConfig` already declare `tie_word_embeddings: bool = True` — this commit brings CamemBERT in line with them. Fixes huggingface#44671
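A hedged sketch of the gate described above (a simplified stand-in, not the actual `modeling_utils` implementation; the class and function names below are illustrative): when the config lacks the flag, or it is False, the helper returns an empty mapping and no tying happens.

```python
# Simplified stand-in for the v5 tying gate (illustrative only): when the
# config lacks tie_word_embeddings (or it is False), every tied-weight
# mapping is dropped and the LM head stays randomly initialized.

def get_expanded_tied_weights_keys_sketch(config, tied_weights_keys):
    if not getattr(config, "tie_word_embeddings", False):
        return {}  # skip all weight tying
    return dict(tied_weights_keys)

class BuggyCamembertConfig:      # missing the flag, as in the regression
    pass

class PatchedCamembertConfig:    # with the one-line fix from this PR
    tie_word_embeddings: bool = True

mapping = {"lm_head.decoder.weight": "roberta.embeddings.word_embeddings.weight"}
print(get_expanded_tied_weights_keys_sketch(BuggyCamembertConfig(), mapping))   # {}
print(get_expanded_tied_weights_keys_sketch(PatchedCamembertConfig(), mapping))
```

With the buggy config the mapping is silently discarded even though the modeling code declares it, which is exactly why the fix lives in the config rather than the model.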
[For maintainers] Suggested jobs to run (before merge): run-slow: camembert
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=44931&sha=c35409
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
run-slow: camembert
This comment contains models: ["models/camembert"]
cc @tarekziade!! This was surely forgotten in a previous refactor. Could we add the following rule to our modeling format linter: if
Cyrilvallez left a comment
Indeed we need the flag! Thanks!
This was forgotten in #41541 @zucchini-nlp! With the linter rule, @tarekziade will be able to determine if we missed it at other locations as well!
What does this PR do?
Fixes a v5 regression where `CamembertForMaskedLM` (and all CamemBERT masked-LM tasks) produces near-zero, near-uniform logits, making the model completely non-functional.

Root cause

In v5, `modeling_utils.get_expanded_tied_weights_keys()` gates all weight tying on `config.tie_word_embeddings`. `CamembertConfig` was missing `tie_word_embeddings: bool = True`, so the method returned `{}` and `lm_head.decoder.weight` was randomly initialized instead of being tied to `roberta.embeddings.word_embeddings.weight`.

Sibling configs `RobertaConfig` and `BertConfig` already declare `tie_word_embeddings: bool = True`. CamemBERT is RoBERTa-based and its modeling code already defines `_tied_weights_keys` mapping `lm_head.decoder.weight → roberta.embeddings.word_embeddings.weight`, but that mapping was silently ignored.

Before fix (transformers v5.3.0)

The LOAD REPORT also showed `lm_head.decoder.weight: MISSING` (randomly initialized).

After fix

Change

One line added to `CamembertConfig`:

Fixes #44671
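A hedged sketch of the change and its effect, using toy stand-ins rather than the real transformers classes (all names below are illustrative): with the flag present, the masked-LM head aliases the embedding weights at load time; without it, the head keeps its own randomly initialized weights, which is the regression described above.

```python
import random

class OldCamembertConfig:
    pass  # the v5 bug: no tie_word_embeddings attribute at all

class FixedCamembertConfig:
    tie_word_embeddings: bool = True  # the one-line fix from this PR

class ToyMaskedLM:
    """Toy model mimicking the tie/no-tie load paths (not real transformers code)."""
    def __init__(self, config):
        # Stands in for roberta.embeddings.word_embeddings.weight.
        self.word_embeddings = [random.random() for _ in range(4)]
        if getattr(config, "tie_word_embeddings", False):
            # Tied: the head shares the embedding weights.
            self.decoder_weight = self.word_embeddings
        else:
            # Untied: the head is randomly initialized -> near-uniform logits.
            self.decoder_weight = [random.random() for _ in range(4)]

broken = ToyMaskedLM(OldCamembertConfig())
fixed = ToyMaskedLM(FixedCamembertConfig())
print(broken.decoder_weight is broken.word_embeddings)  # False: the regression
print(fixed.decoder_weight is fixed.word_embeddings)    # True: weights tied
```

The identity check (`is`) is the point: tying means the head and the embeddings are the same tensor, not two tensors with equal values.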
Before submitting
Who can review?
@ArthurZucker @Cyrilvallez