tests for word2vec's train(). Continuing #1139 #1237

robotcator · 2017-03-23T13:42:32Z

Add tests to make sure that ValueError is indeed raised when the required parameters are not supplied.

Continuing #1139


tmylk · 2017-03-24T00:43:38Z

gensim/test/test_word2vec.py

+
+        with self.assertRaises(ValueError):
+            model.train(sentences, epochs=model.iter)
+


Please add a simple model.train(sentences) example as well. Thanks.
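A minimal sketch of the kind of test being discussed, including the bare train(sentences) case requested above. To stay runnable without gensim, it uses `ToyModel`, a hypothetical stand-in that only mimics the parameter checks this PR describes (an explicit corpus size and an explicit `epochs`). The class, its error messages, and its return value are assumptions for illustration, not gensim's code.

```python
import unittest


class ToyModel:
    """Hypothetical stand-in for a Word2Vec-like model (NOT gensim code)."""

    def __init__(self, iter=5):
        self.iter = iter          # default number of epochs, mirroring model.iter
        self.corpus_count = 0     # filled in by build_vocab()

    def build_vocab(self, sentences):
        self.corpus_count = len(sentences)

    def train(self, sentences, total_examples=None, total_words=None, epochs=None):
        # Assumed checks: the caller must state the corpus size and the epochs.
        if total_examples is None and total_words is None:
            raise ValueError("specify total_examples or total_words")
        if epochs is None:
            raise ValueError("specify an explicit epochs count")
        # A real model would update weights; here we only count examples seen.
        return sum(1 for _ in range(epochs) for _ in sentences)


class TestTrainRequiresExplicitParams(unittest.TestCase):
    def setUp(self):
        self.sentences = [["human", "interface"], ["graph", "trees"]]
        self.model = ToyModel()
        self.model.build_vocab(self.sentences)

    def test_missing_params_raise(self):
        with self.assertRaises(ValueError):
            self.model.train(self.sentences)  # nothing supplied
        with self.assertRaises(ValueError):
            self.model.train(self.sentences, epochs=self.model.iter)
        with self.assertRaises(ValueError):
            self.model.train(self.sentences, total_examples=self.model.corpus_count)

    def test_explicit_params_ok(self):
        seen = self.model.train(
            self.sentences,
            total_examples=self.model.corpus_count,
            epochs=self.model.iter,
        )
        self.assertEqual(seen, 10)  # 2 sentences x 5 epochs
```

Saved to a file, the test case can be run with `python -m unittest`. Each missing-parameter combination gets its own `assertRaises` block so that a failure points at the specific call that did not raise.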

tmylk · 2017-03-27T20:17:33Z

Thanks. Could you please extend this PR by adding the ipynb changes?
Also, please merge in the develop branch to resolve the conflicts.

robotcator · 2017-03-28T12:54:54Z

@tmylk OK, I will resolve the conflicts. Since some of the code in the ipynb notebooks may take several hours to complete, I am looking for a remote server to run it on.

tmylk · 2017-03-28T13:02:30Z

Just updating the code will be fine. There is no need to re-run them.


tmylk · 2017-03-29T23:43:46Z

@robotcator Are all the notebooks updated here or is it a work in progress?

robotcator · 2017-03-30T02:28:02Z

@tmylk All the notebooks have been updated. Commit c9eab32 is the notebook update; in it I also removed my fix for the reset_from() bug. I found that bug while updating the notebooks, but the fix duplicates PR #1241.
Commit 8024eb5 makes the calls in testOnlineLearningAfterSave() use an explicit count and epochs.

tmylk · 2017-03-30T13:50:44Z

@robotcator Thanks for finishing this PR

piskvorky · 2017-04-09T07:11:30Z

gensim/models/word2vec.py

@@ -474,8 +474,8 @@ def __init__(
            if isinstance(sentences, GeneratorType):
                raise TypeError("You can't pass a generator as the sentences argument. Try an iterator.")
            self.build_vocab(sentences, trim_rule=trim_rule)
-            self.train(sentences)
-
+            self.train(sentences, total_examples=self.corpus_count, epochs=self.iter,


No vertical indent -- use hanging indent.

piskvorky · 2017-04-09T07:11:43Z

gensim/models/word2vec.py

-    def train(self, sentences, total_words=None, word_count=0,
-              total_examples=None, queue_factor=2, report_delay=1.0):
+    def train(self, sentences, total_examples=None, total_words=None,
+              epochs=None, start_alpha=None, end_alpha=None,


No vertical indent -- use hanging indent.

gojomo · 2017-04-09

For clarity, what does 'hanging indent' mean in this context? Does the line with epochs=None, … move right or left? Are the lines broken differently?

(FWIW, this "aligned with opening delimiter" style is the first 'yes' example given in PEP8, and used a lot through gensim already.)

piskvorky · 2017-04-10

Ah, this is a function definition, so vertical indent is probably OK here. I misread it as a function call when reviewing.

Hanging indent would be arguments on separate lines, with extra indentation (see the PEP8 examples).
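For readers unfamiliar with the two styles in this exchange, here is a short sketch of both, applied to a definition and to a call. The functions `scale_aligned` and `scale_hanging` are hypothetical and exist only to show the layouts; they are not part of gensim.

```python
# Style 1: continuation lines aligned with the opening delimiter
# (the "vertical" style used in the train() definition in the diff).
def scale_aligned(values, factor=1.0,
                  offset=0.0, clip=None):
    scaled = [v * factor + offset for v in values]
    if clip is not None:
        scaled = [min(v, clip) for v in scaled]
    return scaled


# Style 2: hanging indent. Nothing follows the opening parenthesis, and the
# arguments sit on their own lines with extra indentation so they stand
# apart from the function body.
def scale_hanging(
        values, factor=1.0,
        offset=0.0, clip=None):
    return scale_aligned(values, factor=factor, offset=offset, clip=clip)


# The same two styles applied to a function call.
result_aligned = scale_aligned([1, 2, 3], factor=2.0,
                               offset=1.0, clip=6.0)
result_hanging = scale_hanging(
    [1, 2, 3], factor=2.0,
    offset=1.0, clip=6.0,
)
```

Both layouts appear among the accepted examples in PEP 8, so the choice between them is a project convention rather than a correctness issue.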

robotcator and others added 5 commits March 17, 2017 22:53

fix the compatibility between python2 & 3

1aa3f33

Merge https://github.com/RaRe-Technologies/gensim into fix-word2vec-notebook

24e6331

require explicit corpus size, epochs for train()

f6f571f

make all train() calls use explicit count, epochs

5e9529b

add tests to make sure that ValueError is indeed thrown

5c24a90

robotcator mentioned this pull request Mar 23, 2017

[WIP][DNM] error-resistant train(). Fix #1052 #1139

Closed

1 task

tmylk reviewed Mar 24, 2017


tmylk changed the title ~~tests for word2vec's train()~~ tests for word2vec's train(). Continuing #1139 Mar 24, 2017

robotcator added 2 commits March 24, 2017 22:14

update test

c89f285

fix the word2vec's reset_from()

10ff8a5

robotcator and others added 7 commits March 29, 2017 18:38

Merge branch 'fix-word2vec' into fix-word2vec-notebook

a6312ca

Merge branch 'develop' of https://github.com/RaRe-Technologies/gensim into fix-word2vec-notebook (Conflicts: gensim/test/test_word2vec.py)

be5216a

require explicit corpus size, epochs for train()

504bd09

make all train() calls use explicit count, epochs

43f9689

update notebooks

49e3d00

fix some error

c9eab32

fix test error

8024eb5

tmylk merged commit becc6d3 into piskvorky:develop Mar 30, 2017

piskvorky reviewed Apr 9, 2017



robotcator commented Mar 23, 2017 •

edited by tmylk

tmylk Mar 24, 2017

tmylk commented Mar 27, 2017

robotcator commented Mar 28, 2017

tmylk commented Mar 28, 2017

tmylk commented Mar 29, 2017

robotcator commented Mar 30, 2017

tmylk commented Mar 30, 2017

piskvorky Apr 9, 2017

piskvorky Apr 9, 2017

gojomo Apr 9, 2017 •

edited

piskvorky Apr 10, 2017 •

edited

