
Error when loading via RoFormerConfig #6

Closed
renjunxiang opened this issue May 25, 2021 · 4 comments


@renjunxiang

Thank you very much for open-sourcing this project. While using roformer_chinese_char_base I wanted to enlarge max_position_embeddings, so I tried loading the weights via RoFormerConfig, which raised an error.

import torch
from roformer.modeling_roformer import RoFormerModel, RoFormerConfig

myconfig = RoFormerConfig.from_pretrained('D:/pretrain/pytorch/roformer_chinese_char_base')
myconfig.max_position_embeddings = 2000  # enlarge the position-embedding length
model = RoFormerModel(config=myconfig)
ckpt = torch.load('D:/pretrain/pytorch/roformer_chinese_char_base/pytorch_model.bin')
model.load_state_dict(ckpt, strict=False)

Missing key(s) in state_dict: "embeddings.word_embeddings.weight", "embeddings.token_type_embeddings.weight", "embeddings.LayerNorm.weight", "embeddings.LayerNorm.bias", "encoder.embed_positions.weight", "encoder.layer.0.attention.self.query.weight", "encoder.layer.0.attention.self.query.bias", "encoder.layer.0.attention.self.key.weight", "encoder.layer.0.attention.self.key.bias", "encoder.layer.0.attention.self.value.weight", "encoder.layer.0.attention.self.value.bias", "encoder.layer.0.attention.output.dense.weight", "encoder.layer.0.attention.output.dense.bias", "encoder.layer.0.attention.output.LayerNorm.weight", "encoder.layer.0.attention.output.LayerNorm.bias", "encoder.layer.0.intermediate.dense.weight", "encoder.layer.0.intermediate.dense.bias", "encoder.layer.0.output.dense.weight", "encoder.layer.0.output.dense.bias"
Looking at the layers of RoFormerModel, none of them seem to carry the "roformer" prefix. Do I need to rename the layers inside RoFormerModel?

@renjunxiang (Author)

Renaming the layers in the ckpt fixed it. That said, the length turns out to be fixed at pretraining time; not having read the paper, I had assumed it could be changed dynamically in the model.
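
For reference, a minimal sketch of that renaming, assuming the checkpoint keys carry a "roformer." prefix (as the missing-key list above suggests) while RoFormerModel expects bare names; model is the instance built from myconfig earlier:

import torch
from collections import OrderedDict

ckpt = torch.load('D:/pretrain/pytorch/roformer_chinese_char_base/pytorch_model.bin')

renamed = OrderedDict()
for k, v in ckpt.items():
    # Strip the assumed "roformer." prefix so keys line up with RoFormerModel;
    # keys without it (e.g. the "cls." MLM head) pass through and are ignored
    # thanks to strict=False.
    renamed[k[len('roformer.'):] if k.startswith('roformer.') else k] = v

model.load_state_dict(renamed, strict=False)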

@renjunxiang (Author)

Actually no: I tried roformer_chinese_base and it does seem to support loading via RoFormerConfig and modifying the length. Could it be that when you converted roformer_chinese_char_base to torch, an extra "max_position_embeddings": 512 got written into the config?
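
One way to check that suspicion (a hypothetical snippet, assuming a config.json sits next to the converted weights):

import json

with open('D:/pretrain/pytorch/roformer_chinese_char_base/config.json') as f:
    cfg = json.load(f)

print(cfg.get('max_position_embeddings'))  # a hard-coded 512 here would confirm it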

renjunxiang reopened this May 25, 2021
JunnYu (Owner) commented May 25, 2021

@renjunxiang

Delete embed_positions.weight from pytorch_model.bin locally:

import torch
from collections import OrderedDict

s = OrderedDict()
state_dict = torch.load("pytorch_model.bin")

for k, v in state_dict.items():
    # Drop the position-embedding table so it can be rebuilt at the new length.
    if "embed_positions" in k:
        continue
    s[k] = v

torch.save(s, "new_pytorch_model.bin", _use_new_zipfile_serialization=False)

The new_pytorch_model.bin weights then no longer contain embed_positions.weight.

import torch
from roformer.modeling_roformer import RoFormerModel, RoFormerConfig

myconfig = RoFormerConfig.from_pretrained('./config.json')
myconfig.max_position_embeddings = 2000
model = RoFormerModel.from_pretrained("./", config=myconfig)

Alternatively, load the model this way and override max_position_embeddings directly:

from roformer.modeling_roformer import RoFormerModel
model = RoFormerModel.from_pretrained("./", max_position_embeddings=2000)

'''
Some weights of the model checkpoint at ./ were not used when initializing RoFormerModel: ['cls.predictions.bias', 'cls.predictions.transform.LayerNorm.weight', 'cls.predictions.transform.dense.weight', 'cls.predictions.transform.LayerNorm.bias', 'cls.predictions.transform.dense.bias', 'cls.predictions.decoder.bias', 'cls.predictions.decoder.weight']
- This IS expected if you are initializing RoFormerModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing RoFormerModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Some weights of RoFormerModel were not initialized from the model checkpoint at ./ and are newly initialized: ['roformer.encoder.embed_positions.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
'''
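
As a quick sanity check (a sketch with made-up token ids, not from the thread), a sequence longer than the original 512 limit should now pass through:

import torch

input_ids = torch.randint(0, model.config.vocab_size, (1, 1024))  # dummy batch, length > 512
with torch.no_grad():
    out = model(input_ids=input_ids)

print(out.last_hidden_state.shape)  # expected: torch.Size([1, 1024, hidden_size])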

@renjunxiang (Author)

Thanks for the explanation!
