Sentiment Analysis for 400,000 Amazon Reviews

Description

In this project, the goal is to perform sentiment analysis to determine whether a review is positive or negative. I implemented 3 different machine learning algorithms to build text classifiers for Amazon reviews. The three algorithms are: neural networks (LSTM to be specific), decision tree and Naive Bayes.

Dataset

The data I'm using comes from the Kaggle Amazon review competition.

Analysis Result

The LSTM model performs the best (AUC 0.96) but took the longest to train.

Please refer to the .py files for my code, and analysis report.pdf for detailed description of how I pre-processed the data, built up the models and compared the performance of the three methods.

In this 5-min video, I described in detail the dataset, preprocesssing, classifications, results and discussion of the problem.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Decision tree.py		Decision tree.py
README.md		README.md
analysis report.pdf		analysis report.pdf
myLSTM.py		myLSTM.py
myNB.py		myNB.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis for 400,000 Amazon Reviews

Description

Dataset

Analysis Result

About

Releases

Packages

Languages

ChubingZeng/Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis for 400,000 Amazon Reviews

Description

Dataset

Analysis Result

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages