Logistic Regression & Naive Bayes on Amazon review data
Thida Aung, Sanjay Kaliyur, Erik Trewitt
Jupyter Notebook with a Python 3.5.2 kernel. Tested on Mac OS X, but it should run on any platform that supports Python 3.5+ and Jupyter.
- Clone this project using
    git clone git@github.com:sanjaykaliyur/CSCI183-Project.git
  or download the zip file from the GitHub page.
- If you don't already have it, install Python 3.5+ from python.org or, on Debian/Ubuntu, using
    sudo apt-get install python3
- Install pip3:
  - Unix (Debian/Ubuntu):
      sudo apt-get install python3-pip
  - Other: Follow pip's manual installation instructions.
- Ensure pip is up to date:
    pip3 install --upgrade pip
- Install Jupyter:
    pip3 install jupyter
- Install required dependencies:
    pip3 install $(cat install-requires.txt)
- Download the compressed data files from source (Video Game and Musical Instrument 5-core), decompress them, and place them in
    data/
- Open the project notebook:
    python3 -m jupyter notebook sentiment-analysis--amazon_commented.ipynb
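Once the data files are in data/, a quick sanity check can confirm they decompressed correctly before opening the notebook. The sketch below is only an illustration, not code from the notebook: it assumes each line of a decompressed file is a standalone JSON object, that ratings are stored under an `overall` key, and that the file is named `reviews_Video_Games_5.json` (a hypothetical name; use whatever your download is called). It relies only on the standard library.

```python
import json
from collections import Counter
from pathlib import Path


def load_reviews(path, limit=None):
    """Read a file with one JSON object per line and return a list of dicts.

    Blank lines are skipped. If `limit` is given, stop after that many records.
    """
    reviews = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            reviews.append(json.loads(line))
            if limit is not None and len(reviews) >= limit:
                break
    return reviews


def rating_counts(reviews, rating_key="overall"):
    """Count how often each rating value occurs (the key name is an assumption)."""
    return Counter(r[rating_key] for r in reviews if rating_key in r)


if __name__ == "__main__":
    # Hypothetical file name; adjust to match the file you placed in data/.
    data_file = Path("data") / "reviews_Video_Games_5.json"
    if data_file.exists():
        sample = load_reviews(data_file, limit=1000)
        print(len(sample), "reviews loaded")
        print(rating_counts(sample))
    else:
        print("Not found:", data_file)
```

If the script reports that the file is missing, or `json.loads` raises an error on the first line, the file is probably still compressed or is in a different location than data/.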
- matplotlib: Data plotting tool
- nltk: Natural language toolkit, used for stemming
- pandas: Data processing toolkit
- sklearn: Machine learning algorithm library
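To confirm that the packages listed above are visible to the Python interpreter that will back the notebook kernel, something like the following can be run first. It is a minimal sketch using only the standard library; the helper name `missing_modules` is made up for this example and is not part of the project.

```python
import importlib.util

# Import names of the dependencies listed above.
REQUIRED = ["matplotlib", "nltk", "pandas", "sklearn"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

Note that the import name `sklearn` is provided by the package published on PyPI as scikit-learn, so a missing `sklearn` usually means that package still needs to be installed.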