运行./test.sh时发生报错： #24

Asuka0002 · 2021-04-26T08:29:19Z

你好我在训练网络完毕之后将test.py中的m_path改为了结果中最新的checkpoint的地址

但是在运行./test.sh时发生报错：
Traceback (most recent call last):
File "test.py", line 34, in
lm_model, lm_vocab, lm_args = init_model(m_path, gpu, "./model/vocab.txt")
File "test.py", line 28, in init_model
lm_model.load_state_dict(ckpt['model'])
File "/usr/local/anaconda3/envs/GPT/lib/python3.6/site-packages/torch/nn/modules/module.py", line 1052, in load_state_dict
self.class.name, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for BIGLM:
size mismatch for tok_embed.weight: copying a param with shape torch.Size([6410, 768]) from checkpoint, the shape in current model is torch.Size([28781, 768]).
size mismatch for out_proj.weight: copying a param with shape torch.Size([6410, 768]) from checkpoint, the shape in current model is torch.Size([28781, 768]).
size mismatch for out_proj.bias: copying a param with shape torch.Size([6410]) from checkpoint, the shape in current model is torch.Size([28781]).

请问这是什么原因导致的呢？非常感谢

lipiji · 2021-04-26T14:19:17Z

你好，加载的哪个模型，我看看

Asuka0002 · 2021-04-28T02:40:27Z

我自己训的一个模型，是vocab维度出了问题。已经解决，非常感谢

Asuka0002 closed this as completed Apr 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

运行./test.sh时发生报错： #24

运行./test.sh时发生报错： #24

Asuka0002 commented Apr 26, 2021

lipiji commented Apr 26, 2021

Asuka0002 commented Apr 28, 2021

运行./test.sh时发生报错： #24

运行./test.sh时发生报错： #24

Comments

Asuka0002 commented Apr 26, 2021

lipiji commented Apr 26, 2021

Asuka0002 commented Apr 28, 2021