vocabulary_inv #18

linWujl · 2017-05-09T03:19:31Z

Hello, thanks for sharing the implementation of the paper. When I read the code in w2v.py, I noticed that vocabulary_inv is a list, while the docstring says it's a dict. I wonder if something is wrong.

Another question: the word2vec model is trained on sentences made up of words, so why does the code first load the data by turning the words into numbers and later turn the numbers back into words for training? Is that necessary?

Thank you!

alexander-rakhlin · 2017-05-09T09:44:37Z

Hi,
Thank you for your interest. vocabulary_inv is a list. There was a typo in the docstring.

On the second question: it is not necessary in general, but it is done this way to reuse the Keras IMDB dataset functionality, which loads sentences as numeric arrays.
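To illustrate both points, here is a minimal sketch of the round trip, assuming toy data. The names `vocabulary`, `sentence_matrix`, and `indices_to_words` are hypothetical and are not taken from w2v.py; only `vocabulary_inv` comes from the thread.

```python
# Hypothetical toy data for illustration only; not taken from w2v.py.
# vocabulary_inv is a list: position i holds the word with index i.
vocabulary_inv = ["<PAD/>", "the", "movie", "was", "great"]

# The forward mapping (word -> index) is the dict in this sketch.
vocabulary = {word: idx for idx, word in enumerate(vocabulary_inv)}

# Sentences as a numeric loader would return them: rows of word indices,
# padded with index 0 to a common length.
sentence_matrix = [
    [1, 2, 3, 4],
    [1, 2, 0, 0],
]


def indices_to_words(sentence_matrix, vocabulary_inv):
    """Turn rows of word indices back into lists of word strings."""
    return [[vocabulary_inv[idx] for idx in row] for row in sentence_matrix]


# Word-level sentences, the form a word2vec trainer typically expects.
sentences = indices_to_words(sentence_matrix, vocabulary_inv)
print(sentences)
```

Because `vocabulary_inv[idx]` reads the same whether the container is a list or a dict keyed by integers, the lookup code is unaffected by the docstring typo. If the raw text were available as words from the start, the numeric step could be skipped entirely.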

alexander-rakhlin · 2017-06-08T13:32:53Z

Please see the updated version.

alexander-rakhlin closed this as completed May 10, 2017

chunjoe mentioned this issue May 30, 2017

About embedding_weights #22

Closed

alexander-rakhlin reopened this Jun 4, 2017

alexander-rakhlin closed this as completed Jun 8, 2017
