Keras implementation of Continuous Bag-of-Words Word2Vec.
https://research.google/pubs/pub41224/
Using About Wikipedia page.
https://en.wikipedia.org/wiki/Wikipedia:About
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov, Kai Chen, Greg S. Corrado, Jeffrey Dean
International Conference on Learning Representations, 2013
python TrainingWeight.py train_data_file output_weight_file output_tokenizer_file
ex)
python TrainingWeight.py ./data/train_data.txt ./weight/embed_weight.pickle ./weight/tokenizer.pickle
python PredictSimilar.py input_weight_file input_tokenizer_file input_word output_word_count
ex)
python PredictSimilar.py ./weight/embed_weight.pickle ./weight/tokenizer.pickle wikipedia 10