Parameters of `BertOnlyMLMHead` missing from the released `DeepPavlov/rubert-base-cased` model #1148

jzbjyb · 2020-03-08T19:58:52Z

Thanks for releasing the Russian BERT model! However, I found that the released model does not include the parameters for masked language modeling (i.e. the BertOnlyMLMHead layers). I manually checked the weight file downloaded from hugging face and found that it only contains weights for 12 transformer layers. As a result, every time I load the model using the following code, the BertOnlyMLMHead is randomly initialized and the prediction differs.

from transformers import *  # transformers 2.4.1
tokenizer = AutoTokenizer.from_pretrained('DeepPavlov/rubert-base-cased')
model = AutoModelWithLMHead.from_pretrained('DeepPavlov/rubert-base-cased')
inp = 'Он [MASK] человек.'
inp = tokenizer.encode(inp)
print(tokenizer.convert_ids_to_tokens(inp))  # print the tokenized input
# ['[CLS]', 'он', '[MASK]', 'человек', '.', '[SEP]']
out = model(torch.tensor([inp]))[0]
tokenizer.convert_ids_to_tokens(out[0, 2].max(0)[1]) # the most plausible prediction differs when I reload the model

Actually, I found exactly the same issue for Greek BERT (here), and they managed to fix it (here). I guess you can follow the same method.

The text was updated successfully, but these errors were encountered:

yurakuratov · 2020-05-14T09:19:31Z

Thank you!

We have weights for LM head in TensorFlow checkpoint. If you still need them you can find link on this page: http://docs.deeppavlov.ai/en/master/features/pretrained_vectors.html#downloads) and then convert it to PyTorch.

We will update our models in Transformers.

jzbjyb · 2020-05-17T01:09:34Z

Thanks for updating the tf checkpoint! I converted it to PyTorch.

yoptar assigned yurakuratov Mar 16, 2020

jzbjyb closed this as completed May 17, 2020

jzbjyb mentioned this issue May 17, 2020

Parameters of BertOnlyMLMHead missing in monologg/kobert monologg/DistilKoBERT#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parameters of `BertOnlyMLMHead` missing from the released `DeepPavlov/rubert-base-cased` model #1148

Parameters of `BertOnlyMLMHead` missing from the released `DeepPavlov/rubert-base-cased` model #1148

jzbjyb commented Mar 8, 2020

yurakuratov commented May 14, 2020

jzbjyb commented May 17, 2020

Parameters of BertOnlyMLMHead missing from the released DeepPavlov/rubert-base-cased model #1148

Parameters of BertOnlyMLMHead missing from the released DeepPavlov/rubert-base-cased model #1148

Comments

jzbjyb commented Mar 8, 2020

yurakuratov commented May 14, 2020

jzbjyb commented May 17, 2020

Parameters of `BertOnlyMLMHead` missing from the released `DeepPavlov/rubert-base-cased` model #1148

Parameters of `BertOnlyMLMHead` missing from the released `DeepPavlov/rubert-base-cased` model #1148