Cuisine Classification by Ingredients

Team: Ronak Desai, Shidhesh Supekar, Kalven Bonin

Is it possible to classify a recipe’s cuisine type just from a list of ingredients? Our project seeks to answer this question and does so using some basic tools of Natural Language Processing. We take a dataset from Kaggle.com that has a list of ~40,000 recipes with a cuisine type classification. One method we employ is called a Bag of Words (BoW) model which take all words found in the ingredients list and builds a classifier based on the occurrences of those words in the training set. The other method is the Term Frequency-Inverse Document Frequency (TF-IDF) which considers the frequencies of individual words in the training set. Both methods produced a testing accuracy of greater than 60%, which is good considering that we implemented the most naïve NLP models.

The included Jupyter Notebooks showcase our approaches to solving this problem. Another person from Kaggle solved this problem with a 78.8% accuracy using the NLTK toolkit's built-in TF-IDF vectorizer (https://www.kaggle.com/code/rahulsridhar2811/cuisine-classification-with-accuracy-78-88/notebook). We attempted some simpler models that are more easily understood for the purposes of learning the basics of NLP, so we speculate that a more advanced classifier could be easily capable of producing more than 80% accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Scratch Work		Scratch Work
Cuisine Prediction from Top Key Words.ipynb		Cuisine Prediction from Top Key Words.ipynb
README.md		README.md
TF-IDF Model.ipynb		TF-IDF Model.ipynb
neighbors_comp.png		neighbors_comp.png
test.json		test.json
train.json		train.json
words_comp.png		words_comp.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

Scratch Work

Scratch Work

Cuisine Prediction from Top Key Words.ipynb

Cuisine Prediction from Top Key Words.ipynb

README.md

README.md

TF-IDF Model.ipynb

TF-IDF Model.ipynb

neighbors_comp.png

neighbors_comp.png

test.json

test.json

train.json

train.json

words_comp.png

words_comp.png

Repository files navigation

Cuisine Classification by Ingredients

Team: Ronak Desai, Shidhesh Supekar, Kalven Bonin

About

Releases

Packages

Contributors 3

Languages

ronak-n-desai/cuisine-classfication

Folders and files

Latest commit

History

Repository files navigation

Cuisine Classification by Ingredients

Team: Ronak Desai, Shidhesh Supekar, Kalven Bonin

About

Resources

Stars

Watchers

Forks

Languages