Skip to content

Exploration and visualizations on the Yelp dataset 🍔

Notifications You must be signed in to change notification settings

dehaoterryzhang/Yelp_EDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Exploration and Visualizations on the Yelp Dataset 🍔

Medium Post

Check out my Medium post " Discover Your Next Favorite Restaurant — Exploration and Visualization on Yelp Dataset" here.

Kernel

Check out my Kaggle kernel here.

Table of Content

Overview

I explored the business data within the Yelp dataset, and examined the restaurant ratings among some of the fast food restaurant chains. We then took a look at different restaurant attributes and their relationships. At last, there was an example of how we can find top restaurants that fit our needs and used the tip data to create visualizations that can help us better understand the restaurant tips.

Motivation

As of March 2020, there are 211 million cumulative reviews on Yelp. With this massive amount of data, Yelp also releases a subset of their businesses, reviews, and user data for educational and academic purposes. There is a lot of information that can be mined in this dataset and can be used to infer meaning, business attributes, and sentiment.

Procedure

  • Peek at the Business Data
  • Geographic Visualizations
  • Rating Comparisons Amond Popular Restaurant Chains
  • Examine Relationships between Attributes
  • Discover Restaurants According to Our Needs and Create Visualizations

Installation

I did my analysis through Kaggle kernel and I recommended you to do so as well, mostly based on two reasons:

  1. The size of Yelp dataset is quite large but it is pre-loaded through Kaggle kernel so you don't need to download it locally.
  2. Most libraries are already available in this environment so no need to install more libraries locally.

About

Exploration and visualizations on the Yelp dataset 🍔

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages