Check out the app live at Streamlit Cloud: StreamLit link
The webapp is based on the Efficient Estimation of Word Representations in Vector Space paper. Read it here.
screencast.webm
- Upload your own text corpus, or even a CSV dataset.
- Train the Word2Vec on the fly using custom parameters.
- Choose either PCA or TSNE as your dimensionality reduction technique.
- Visualize the word in either 2-D or 3-D space.
- Get similar words for each word, with similarity scores.
- Option to tune the number of words you wish to see for each input.
- Setup a virtual environment using Conda or any other method you prefer.
- Install the dependencies from
requirements.txt
. - Run the following in the terminal.
pip install -U pip setuptools wheel pip install -U spacy python -m spacy download en_core_web_sm
- Run
streamlit run app.py
in the terminal to launch the web app.