RuntimeError: size mismatch (when using -brnn option) #4

irshadbhat · 2017-02-27T09:34:29Z

When I train a model with default parameters and translate with translate.py it works fine. When I use the brnn option for training, the train code works fine but the translate.py throws a RuntimeError: size mismatch. Please help me out if I am doing anything wrong. These are the codes that I executed:

python preprocess.py -train_src data/src-train.txt -train_tgt data/trg-train.txt -valid_src data/src-val.txt -valid_tgt data/trg-val.txt -save_data data/ehtrans

python train.py -data data/ehtrans-train.pt -save_model eh/model -brnn -brnn_merge concat -epochs 1000 -cuda

python translate.py -model eh/model_e7_13.28.pt -src data/src-val.txt -tgt data/trg-val.txt -output file-tgt.tok -cuda

The preprocess and train codes work fine but the translate code throws the following error:

Traceback (most recent call last):
  File "translate.py", line 121, in <module>
    main()
  File "translate.py", line 74, in main
    predBatch, predScore, goldScore = translator.translate(srcBatch, tgtBatch)
  File "/DATA/USERS/irshad/OpenNMT-py/onmt/Translator.py", line 190, in translate
    pred, predScore, attn, goldScore = self.translateBatch(batch)
  File "/DATA/USERS/irshad/OpenNMT-py/onmt/Translator.py", line 87, in translateBatch
    tgtBatch[:-1], decStates, context, initOutput)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 202, in __call__
    result = self.forward(*input, **kwargs)
  File "/DATA/USERS/irshad/OpenNMT-py/onmt/Models.py", line 119, in forward
    output, hidden = self.rnn(emb_t, hidden)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 202, in __call__
    result = self.forward(*input, **kwargs)
  File "/DATA/USERS/irshad/OpenNMT-py/onmt/Models.py", line 61, in forward
    h_1_i, c_1_i = layer(input, (h_0[i], c_0[i]))
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 202, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/rnn.py", line 472, in forward
    self.bias_ih, self.bias_hh,
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/_functions/rnn.py", line 22, in LSTMCell
    gates = F.linear(input, w_ih, b_ih) + F.linear(hx, w_hh, b_hh)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/functional.py", line 381, in linear
    return bias and state(input, weight, bias) or state(input, weight)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/_functions/linear.py", line 10, in forward
    output.addmm_(0, 1, input, weight.t())
RuntimeError: size mismatch, m1: [30 x 250], m2: [500 x 2000] at /home/soumith/local/builder/wheel/pytorch-src/torch/lib/TH/generic/THTensorMath.c:862

The text was updated successfully, but these errors were encountered:

beichao1314 · 2017-02-28T09:42:37Z

I test it and work well, maybe you can update the OpenNMT-py

guillaumekln · 2017-02-28T10:01:14Z

I'm unable to reproduce it as well. Update the project and let us know if the issue persists.

irshadbhat · 2017-02-28T11:32:43Z

Updated and works fine for me as well.

States in translation

Make comments compatible with doc format

added sample command for training multi-encoder model with multi30k data

Make comments compatible with doc format

Option to set the number of threads for translation

irshadbhat changed the title ~~RuntimeError: size mismatch~~ RuntimeError: size mismatch (when using -brnn option) Feb 27, 2017

irshadbhat closed this as completed Feb 28, 2017

HendrikStrobelt pushed a commit to HendrikStrobelt/OpenNMT-py that referenced this issue Mar 17, 2018

Merge pull request OpenNMT#4 from HendrikStrobelt/states_in_translation

8c49727

States in translation

srush pushed a commit that referenced this issue Jun 26, 2018

Merge pull request #4 from francoishernandez/transformer_comments

043d115

Make comments compatible with doc format

Gldkslfmsd pushed a commit to Gldkslfmsd/OpenNMT-py that referenced this issue Sep 6, 2018

Merge pull request OpenNMT#4 from chrishokamp/multi-enc

2e59202

added sample command for training multi-encoder model with multi30k data

saikat107 pushed a commit to saikat107/OpenNMT-py that referenced this issue Mar 12, 2019

Merge pull request OpenNMT#4 from francoishernandez/transformer_comments

2b32f29

Make comments compatible with doc format

michaelguan1992 mentioned this issue Apr 8, 2019

ZeroDivisionError and RuntimeError #1390

Closed

avaucher referenced this issue in rxn4chemistry/OpenNMT-py Jun 17, 2022

Merge pull request #4 from rxn/translation_num_threads

4ac517c

Option to set the number of threads for translation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: size mismatch (when using -brnn option) #4

RuntimeError: size mismatch (when using -brnn option) #4

irshadbhat commented Feb 27, 2017 •

edited

beichao1314 commented Feb 28, 2017

guillaumekln commented Feb 28, 2017

irshadbhat commented Feb 28, 2017

RuntimeError: size mismatch (when using -brnn option) #4

RuntimeError: size mismatch (when using -brnn option) #4

Comments

irshadbhat commented Feb 27, 2017 • edited

beichao1314 commented Feb 28, 2017

guillaumekln commented Feb 28, 2017

irshadbhat commented Feb 28, 2017

irshadbhat commented Feb 27, 2017 •

edited