Skip to content

This project uses a small subset of the data from Kaggle's Yelp Business Rating Prediction competition to predict the Rating based on reviews published by people.

Notifications You must be signed in to change notification settings

garvitkhurana/Yelp_review_prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Yelp_review_prediction

This project uses a small subset of the data from Kaggle's Yelp Business Rating Prediction competition to predict the Rating based on reviews published by people.

Introduction

This exercise uses a small subset of the data from Kaggle's Yelp Business Rating Prediction competition.

Description of the data:

yelp.csv contains the dataset. It is stored in the repository (in the data directory), so there is no need to download anything from the Kaggle website. Each observation (row) in this dataset is a review of a particular business by a particular user. The stars column is the number of stars (1 through 5) assigned by the reviewer to the business. (Higher stars is better.) In other words, it is the rating of the business by the person who wrote the review. The text column is the text of the review.

Goal: Predict the star rating of a review using only the review text.

Comments on Accuracy:-

At first glance, 48.6% accuracy does not seem very good, given that it is not much higher than the null accuracy. However, I would consider the 48.6% accuracy to be quite impressive, given that humans would also have a hard time precisely identifying the star rating for many of these reviews.

About

This project uses a small subset of the data from Kaggle's Yelp Business Rating Prediction competition to predict the Rating based on reviews published by people.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published