Errors during loading Dictionary for word embedding incremental training #3

Shiki-H · 2018-07-27T20:20:08Z

First of all, thank you very much for the great work; I really appreciate this feature. However, when I tried it out, I could not get started on retraining a word embedding model. Here are the steps to reproduce the error:

# run the sample script first to generate a model
$ bash word-vector-example.sh

# after the model is generated from the sample script
# we run
./fasttext skipgram -input data/fil9 -inputModel result/fil9.bin -output retrained -incr

The output displayed was:

Update args
Load dict from trained model
Load dict from training data
Read 124M words
Number of words:  218316
Number of labels: 0
Merge dict
Read 124M words
Number of words:  0
Number of labels: 0
terminate called after throwing an instance of 'std::invalid_argument'
  what():  Empty vocabulary. Try a smaller -minCount value.
Aborted

I tried adjusting the -minCount option, but that did not help.

After looking at the code, I feel that the error has something to do with

dict_->addDict(dictInData, false);

on line 744 of fasttext.cc, but I am not sure about the exact cause of the problem. In fact, I am a bit confused about why we load a dictionary from the raw text we are supposed to train on.
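
On the dictionary question: the log above shows three steps ("Load dict from trained model", "Load dict from training data", "Merge dict"). The new training text can contain words the existing model has never seen, so a vocabulary built from that text would have to be merged into the model's vocabulary before training continues. The sketch below only illustrates that idea. It is not the code from fasttext.cc: `Vocab` and `mergeVocab` are hypothetical names, and the policy of always keeping model words while applying `minCount` only to new words is an assumption.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for a dictionary: word -> occurrence count.
using Vocab = std::unordered_map<std::string, int64_t>;

// Merge the vocabulary read from new training data into the vocabulary
// restored from a trained model (assumed policy, not fastText's own):
//  - counts of words present in both are summed;
//  - words already in the model are always kept;
//  - words seen only in the new data are added if they reach minCount.
Vocab mergeVocab(const Vocab& fromModel, const Vocab& fromData,
                 int64_t minCount) {
  assert(minCount >= 0);
  Vocab merged = fromModel;
  for (const auto& entry : fromData) {
    auto it = merged.find(entry.first);
    if (it != merged.end()) {
      it->second += entry.second;  // known word: accumulate its count
    } else if (entry.second >= minCount) {
      merged.emplace(entry.first, entry.second);  // new, frequent enough
    }
  }
  return merged;
}
```

Under this policy the merged vocabulary can never be smaller than the model's own, so a merge step that reports "Number of words: 0", as in the output above, suggests the merge itself is dropping entries rather than -minCount being too high.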

ericxsun · 2018-08-04T10:59:07Z

This was a bug; it is fixed in the latest commit. Since it is solved, I'll close this issue. Feel free to re-open it.

Shiki-H · 2018-08-19T16:02:52Z

@ericxsun thanks for the help. Everything works fine now.

ericxsun self-assigned this Aug 5, 2018

ericxsun added the bug label Aug 5, 2018

ericxsun closed this as completed Aug 5, 2018
