Error when using your pretrained language model for the 10kGNAD data #1

Dude43 · 2019-06-20T17:33:05Z

Hey,
unfortunately, I am getting the following error when running your exemplary code for the 10kGNAD data set. It occurs when loading the model (learn_lm.load('ulmfit_for_german_jfilter')). I hope you can help me.

Best, Jacob

RuntimeError: Error(s) in loading state_dict for SequentialRNN:
size mismatch for 0.rnns.0.weight_hh_l0_raw: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
size mismatch for 0.rnns.0.module.weight_ih_l0: copying a param with shape torch.Size([4600, 400]) from checkpoint, the shape in current model is torch.Size([4608, 400]).
size mismatch for 0.rnns.0.module.weight_hh_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
size mismatch for 0.rnns.0.module.bias_ih_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
size mismatch for 0.rnns.0.module.bias_hh_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
size mismatch for 0.rnns.1.weight_hh_l0_raw: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
size mismatch for 0.rnns.1.module.weight_ih_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
size mismatch for 0.rnns.1.module.weight_hh_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
size mismatch for 0.rnns.1.module.bias_ih_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
size mismatch for 0.rnns.1.module.bias_hh_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
size mismatch for 0.rnns.2.module.weight_ih_l0: copying a param with shape torch.Size([1600, 1150]) from checkpoint, the shape in current model is torch.Size([1600, 1152]).

jfilter · 2019-07-20T14:20:28Z

There is yet another breaking change with the FastAI lib. There seem to be a work around though: https://forums.fast.ai/t/ulmfit-german/22529/106

ismailbbm · 2019-11-14T11:27:25Z

The workaround is not complete. However the following has worked for me:
config = awd_lstm_lm_config.copy() config['n_hid'] = 1150 learn_lm = language_model_learner(data_lm, AWD_LSTM, drop_mult=0.5, pretrained=False, config=config) learn_lm.load('ulmfit_for_german_jfilter')

jfilter · 2020-06-24T19:22:56Z

Yes, you are right. Thanks.

config = awd_lstm_lm_config.copy()
config['n_hid'] = 1150
learn_lm = language_model_learner(data_lm, AWD_LSTM, drop_mult=0.5, pretrained=False, config=config)
learn_lm.load('ulmfit_for_german_jfilter')

jfilter closed this as completed Jun 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error when using your pretrained language model for the 10kGNAD data #1

Error when using your pretrained language model for the 10kGNAD data #1

Dude43 commented Jun 20, 2019 •

edited

Loading

jfilter commented Jul 20, 2019

ismailbbm commented Nov 14, 2019

jfilter commented Jun 24, 2020 •

edited

Loading

Error when using your pretrained language model for the 10kGNAD data #1

Error when using your pretrained language model for the 10kGNAD data #1

Comments

Dude43 commented Jun 20, 2019 • edited Loading

jfilter commented Jul 20, 2019

ismailbbm commented Nov 14, 2019

jfilter commented Jun 24, 2020 • edited Loading

Dude43 commented Jun 20, 2019 •

edited

Loading

jfilter commented Jun 24, 2020 •

edited

Loading