Word2vec example #270

unnonouno · 2015-07-27T14:36:44Z

I implemented word2vec example including skip-gram architecture and continuous-bow architecture with HSM and negative sampling.
Note that this implementation is slower than the original even when you use GPU.

Please review this PR after #264

unnonouno · 2015-08-04T06:13:57Z

We'll release this PR without GPU mode (#264). GPU mode will be supported in 1.2.0.

delta2323 · 2015-08-04T10:19:49Z

Although this PR is added to v1.1.2 Milestone, it must be preceded by #264, which is a feature PR.
So we must defer merge of this PR at least until next minor release.

unnonouno · 2015-08-05T02:53:14Z

No, I removed #264 from this PR. It can work with HSM (only CPU), NS(both CPU and GPU), original (both CPU and GPU).
So, it cannot run in GPU mode when a user select HSM (cause NotImplementError). It will work correctly in 1.2.0.

delta2323 · 2015-08-05T04:42:14Z

@unnonouno OK, I understand it.

beam2d · 2015-08-05T05:13:32Z

examples/word2vec/train_word2vec.py

+
+index2word = {}
+word2index = {}
+counts = collections.defaultdict(lambda: 0)


int can be used instead of lambda: 0. You can also use collections.Counter.

beam2d · 2015-08-05T05:19:42Z

LGTM except two comments!

beam2d · 2015-08-05T06:14:12Z

Word2vec example

beam2d · 2015-08-05T06:14:43Z

Thank you!

hido mentioned this pull request Jul 30, 2015

NegativeSampling and WalkerAlias do not implement to_cpu() #276

Closed

unnonouno added the cat:example Example, e.g. the MNIST example. label Aug 4, 2015

unnonouno added this to the v1.1.2 milestone Aug 4, 2015

unnonouno added 3 commits August 4, 2015 15:18

Make word2vec example

0fde668

Write readme for word2vec

bec900d

Fix readme

51a2992

unnonouno force-pushed the word2vec branch from 0bab97c to bec900d Compare August 4, 2015 06:22

beam2d self-assigned this Aug 5, 2015

beam2d reviewed Aug 5, 2015
View reviewed changes

unnonouno added 2 commits August 5, 2015 14:58

Remove comma

c184486

Use Counter instead of defaultdict

cb28a2b

beam2d added a commit that referenced this pull request Aug 5, 2015

Merge pull request #270 from pfnet/word2vec

8809242

Word2vec example

beam2d merged commit 8809242 into master Aug 5, 2015

beam2d deleted the word2vec branch August 5, 2015 06:14

delta2323 mentioned this pull request Aug 11, 2015

gpu error when train word2vec #305

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Word2vec example #270

Word2vec example #270

unnonouno commented Jul 27, 2015

unnonouno commented Aug 4, 2015

delta2323 commented Aug 4, 2015

unnonouno commented Aug 5, 2015

delta2323 commented Aug 5, 2015

beam2d Aug 5, 2015

beam2d commented Aug 5, 2015

beam2d commented Aug 5, 2015

beam2d commented Aug 5, 2015

Word2vec example #270

Word2vec example #270

Conversation

unnonouno commented Jul 27, 2015

unnonouno commented Aug 4, 2015

delta2323 commented Aug 4, 2015

unnonouno commented Aug 5, 2015

delta2323 commented Aug 5, 2015

beam2d Aug 5, 2015

Choose a reason for hiding this comment

beam2d commented Aug 5, 2015

beam2d commented Aug 5, 2015

beam2d commented Aug 5, 2015