Getting issue while loading model #1

vijender412 · 2020-03-09T08:40:01Z

Getting below issue while loading the model in local system. Model was trained on colab.

Traceback (most recent call last):
  File "app.py", line 74, in <module>
    MODEL.load_state_dict(torch.load(config.MODEL_PATH, map_location=torch.device('cpu'))) #New created
  File "C:\Users\Vijender\Downloads\bert_sentiment\lib\site-packages\torch\nn\modules\module.py", line 830, in load_state_dict
    self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for BERTBaseUncased:
        Missing key(s) in state_dict: "bert.embeddings.word_embeddings.weight", "bert.embeddings.position_embeddings.weight", "bert.embeddings.token_type_embeddings.weight", "bert.embeddings.LayerNorm.weight", "bert.embeddings.LayerNorm.bias", "bert.encoder.layer.0.attention.self.query.weight", "bert.encoder.layer.0.attention.self.query.bias",

The text was updated successfully, but these errors were encountered:

abhishekkrthakur · 2020-03-09T08:42:25Z

did you use data parallel to train the model?

vijender412 · 2020-03-09T08:44:50Z

@abhishekkrthakur Yes used data parallel while training

abhishekkrthakur · 2020-03-09T08:46:57Z

does your bert_base_path has bert base uncased model files?

vijender412 · 2020-03-09T08:48:50Z

@abhishekkrthakur Fixed the issue was with data parallel. I local i was not making use of "MODEL = nn.DataParallel(MODEL)". Now working with this. Can you help me understand the use of DataParallel and if it does require after training also?
Thanks for quick replying. Good to close the issue

abhishekkrthakur · 2020-03-09T08:51:03Z

DataParallel is used only when you have multiple GPUs during training.
If you used it in training, you have to use in inference but there are other ways too.

Closing this issue for now. :)

nipunsadvilkar · 2020-03-27T07:16:00Z

@abhishekkrthakur : Can you give any leads on how to load DataParallel GPU model on CPU?
As per pytorch docs tried following but still raises above RuntimeError

device = torch.device('cpu')
model = TheModelClass(*args, **kwargs)
model.load_state_dict(torch.load(PATH, map_location=device))

abhishekkrthakur closed this as completed Mar 9, 2020

nipunsadvilkar mentioned this issue Mar 29, 2020

Loading DataParallel GPU model on CPU #4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting issue while loading model #1

Getting issue while loading model #1

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

nipunsadvilkar commented Mar 27, 2020

Getting issue while loading model #1

Getting issue while loading model #1

Comments

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

vijender412 commented Mar 9, 2020

abhishekkrthakur commented Mar 9, 2020

nipunsadvilkar commented Mar 27, 2020