Skip to content
Word2Vec in C++ 11
Branch: master
Clone or download
Latest commit 3b1e888 Jul 17, 2014
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
flatbuffers use flatbuffers to encode the model Jul 14, 2014
utf8cpp add utf8-cpp for gcc Jul 15, 2014
README.md Update README.md Dec 28, 2013
main.cc fix namespace Jul 15, 2014
main_zh.cc t Jul 16, 2014
model.fbs fix namespace Jul 15, 2014
model_generated.h fix namespace Jul 15, 2014
v.h
word2vec.h cleanup Jul 15, 2014

README.md

word2vec

Word2Vec in C++ 11

See main.cc for building instructions and usage. (NOTE: openmp is used in the newest version and thus g++ is required for multithreading)

Results with OMP_NUM_THREADS=8: (save model is understandably slow as it stores text)

loadvocab: 1.9952 seconds    
train: 33.5145 seconds
save model: 4.7554 seconds

Machine configuration

jackdeng-mac:word2vec jack$ sysctl -n machdep.cpu.brand_string
Intel(R) Core(TM) i7-4850HQ CPU @ 2.30GHz
You can’t perform that action at this time.