Predicting the Usefulness of Yelp Reviews

What makes a useful Yelp review? Can we predict if a review will be useful based on its text content?

Goals

The goals of this project are to:

predict the usefulness of Yelp reviews as a classification problem using machine learning models
use topic modeling/decomposition to improve the accuracy of those models
evaluate the effectiveness of the models by assessing the validity of the models' predictions
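
As a rough illustration of the first two goals, the sketch below chains TF-IDF features, a low-rank decomposition, and a logistic regression classifier in scikit-learn. It is not the project's actual pipeline: the toy reviews, the hypothetical `is_useful` labels, the use of `TruncatedSVD` as the decomposition step, and the choice of three components are all assumptions made for the example.

```python
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy reviews invented for illustration; not taken from the Yelp dataset.
reviews = [
    "The pasta was fresh and the waiter explained every dish on the menu.",
    "Parking is behind the building; arrive before 6pm to avoid the line.",
    "Great!",
    "Meh.",
    "The brunch menu changes weekly and the patio is dog friendly.",
    "Bad.",
    "Ask for the corner booth, it is quieter and the service is faster.",
    "Loved it!!!",
]
# Hypothetical labels: 1 = useful, 0 = not useful.
is_useful = [1, 1, 0, 0, 1, 0, 1, 0]

model = make_pipeline(
    # Turn raw text into a sparse TF-IDF matrix.
    TfidfVectorizer(stop_words="english"),
    # Project TF-IDF vectors onto a few latent components (assumed choice).
    TruncatedSVD(n_components=3, random_state=0),
    # Classify reviews from their low-dimensional representation.
    LogisticRegression(),
)
model.fit(reviews, is_useful)

predictions = model.predict(reviews)          # one 0/1 label per review
probabilities = model.predict_proba(reviews)  # columns: P(not useful), P(useful)
```

On real data the model would be fit on a training split and scored on held-out reviews; other decompositions such as NMF or LDA could take the place of `TruncatedSVD`.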

Technical Report

An in-depth discussion of this project can be found in the technical report (Technical_Report.md).

Technologies Used

All statistical analysis was done on a t2.2xlarge AWS EC2 instance.

NLP: spaCy, textacy, scikit-learn
Modeling: scikit-learn - Logistic Regression, Random Forest Classifier
Data Management: NumPy, pandas, PostgreSQL

gd32/yelp_topic_modeling
