NOTE: This project is an optional excercise from Deep Learning Nanodegree by Udacity.
My solution: Jupyter notebook
PyTorch implementation of Word2Vec algorithm using the Skip-gram architecture and Negative Sampling to improve both the quality of the vectors and the training speed. The main porpuse of this project is for me to learn about embedding words for use in Natural Language Processing.
- Conceptual overview of Word2Vec from Chris McCormick
- First Word2Vec paper from Mikolov et al.
- Neural Information Processing Systems, paper with improvements for Word2Vec also from Mikolov et al.