Using Latent Dirichlet Allocation to Organize Tweets From Indonesia's 2014 Election
This notebook requires the following packages to be installed:
* pandas
* matplotlib
* csv
* re
* gensim
The notebook also needs the Indonesian stemmer Sastrawi installed, which can be done with the following code in your terminal:
$ pip install Sastrawi
To create the environment, type the following:
$ conda create --name py35 python=3.5 pandas matplotlib csv re gensim Sastrawi
You can then activate the environment and launch the Jupyter Notebook.
- Twitter corpus - courtesy of Ali Akbar S.
- List of Indonesian stopwords - courtesy of Kaggle
- Sastrawi stemming toolLICENSE
This project is licensed under the MIT License - see the LICENSE.md file for details