Skip to content

This project was my capstone project for General Assembly's Data Science Immersive (DSI). Using the Yelp Academic Dataset, I used Latent Semantic Analysis to generate topics to be used in classification models predicting the usefulness of Yelp reviews.

Notifications You must be signed in to change notification settings

gd32/yelp_topic_modeling

Repository files navigation

Predicting the Usefulness of Yelp Reviews

What makes a useful Yelp review? Can we predict if a review will be useful based on its text content?

Goals

The goals of this project are to:

  1. predict the usefulness of Yelp reviews as a classification problem using machine learning models
  2. use topic modeling/decomposition to improve the accuracy of those models
  3. evaluate the effectiveness of the models by assessing the validity of the models' predictions

Technical Report

An indepth discussion of this project is found in the technical report.

Technologies Used

All statistical analysis was done a t2.2xlarge AWS EC2 instance.

About

This project was my capstone project for General Assembly's Data Science Immersive (DSI). Using the Yelp Academic Dataset, I used Latent Semantic Analysis to generate topics to be used in classification models predicting the usefulness of Yelp reviews.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published