Natural Language Processing and Computer Vision for the Yelp reviews database

This project is a feasibility study of automatically detecting topics of dissatisfaction in Yelp reviews (NLP) and automatically labeling photos uploaded to Yelp (CV).

The dataset used is the Yelp dataset which can be downloaded freely here: https://www.yelp.com/dataset/ The 32k negative reviews filtered from the original reviews can be found in 'bad_reviews.csv' All necessary librairies are located in 'requirements.txt'

Natural Language Processing

Filtering out negative reviews from a sample (5000 reviews)
Preprocessing reviews to a format compatible with the NLP model
Extraction of topics from negative reviews using Gensim LDA
Results analysis

Computer Vision

Equalizing the histograms for each photo in the sample (100 photos per label, 500 photos total)
Testing ORB for feature extraction
Dimensionality reduction and KMeans clustering
Using transfer learning with VGG16 for feature extraction
Dimensionality reduction and KMeans clustering
Visualizing and analyzing results
Analying some examples of mislabeled photos

A synthesis of the project can be found here: https://katrinmisel.github.io/project_synthesis.html

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
Ponomarjova_Katrin_1_csv_092022.csv		Ponomarjova_Katrin_1_csv_092022.csv
README.md		README.md
bad_reviews.csv		bad_reviews.csv
helpers.py		helpers.py
notebook_1_api.ipynb		notebook_1_api.ipynb
notebook_2_text_cleaning.ipynb		notebook_2_text_cleaning.ipynb
notebook_3_topic_modeling.ipynb		notebook_3_topic_modeling.ipynb
notebook_4_images.ipynb		notebook_4_images.ipynb
project_synthesis.html		project_synthesis.html
project_synthesis.ipynb		project_synthesis.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natural Language Processing and Computer Vision for the Yelp reviews database

Natural Language Processing

Computer Vision

About

Releases

Packages

Languages

katrinmisel/cv_nlp

Folders and files

Latest commit

History

Repository files navigation

Natural Language Processing and Computer Vision for the Yelp reviews database

Natural Language Processing

Computer Vision

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages