Welcome to ChefNet

What can we teach computer's about food? ChefNet is a convolutional neural network that identifies the ingredients in a dish by analyzing a picture of it.

For example, what ingredients do you think are in this delicious looking cake?

Data

To create a labeled dataset of images of food, I scraped the recipes and user submitted photos of 17,000 recipes, totaling 230,000 user photos, from allrecipes.com. The scraper was run on an AWS instance and took about a day to run. Massive speedups may be attributed to parallelizing and threading the scraping process. Code is in the Web Scrapers folder under Scripts.

Ingredient label wrangling

In order to train a neural net, I needed to create consistent labels for ingredients. I took two approaches. My first approach was to start with the scraped list of ingredients, and identify the keyword using the indico keyword extraction api, while iteratively remove all words not critical to the underlying food item (Code may be found in Recipe Generation RNN). My second approach, which was ultimately used to train the Ingredent Identifier, was to start with a cleaned list of ingredients initially scraped from enchantedlearning.com

It is critical that image labels are as clean as possible, otherwise the neural network will have difficulty learning. It was also important to allow the model to have multi-word labels to represent items such as bell pepper. A useful extension of this project may be to vectorize the labels, so that the net will learn that the ingredient similarity, for example, misclassifying beef and steak is closer than chicken and peas. Vectorization methods require tokenization of text first. This may be an area worth explore further.

Image Processing (Code)

Neural networks were trained with raw image data, and convolved imaged data that was passed though the 2014 image net winner, VGG-16 from Oxford. Transfer learning proved more fruitful given the limited size of my dataset. Activations were taken at the end of layer 30, before flattening to dense layers. It would be interesting to compare results using activations taken after these dense layers, but I did not have time to explore this comparison. Here is a rough illustration of how the image vectorization process:

Images were downsized to 100x100 so that I could iterate through training multiple models, in the time allotted for capstone projects.

Neural Network Architecture (Code)

My architecture went though multiple iterations, ultimately I settled on preprocessing images with VGG-16, and passing those activations into 3 hidden dense layers. My output layer consists of a sigmoid activation for each ingredient, and uses binary crossentropy loss.

Results

This section provides some high level results for how my final net performed, and then some examples of how it predicts on a particular image.

Overall results (Code)

ChefNet: 48% Recall, 38% Precision

ChefNet without VGG-16 Processing: 46% Recall, 35% Precision

Naive simulation: 6% Recall, 21% Precision

The algorithm was able to perform better for some ingredients than other. Below you may see what classes had best recall (top 10 ranged from 75%-100%). The net had better recall for those ingredients that were more frequent in the dataset:

Below represents the top classes in terms of Precision (top 10 range form 60%-100%), note that these classes are different. In general, the net was more precise with ingredients it only predicted a few times:

Example Predictions

Here are some examples of how well ChefNet predicted:

Carrot Cake

My Lunch last Monday

Data Issues

The data is not perfect, below is a slide that shows two different images for the same recipe. Not only can the sugar cookie look completely different based on the decoration decisions, but there are also misplaced pictures.

As an example, below you may see two user submitted images for the same sugar cookie recipe. The first illustrates how a cooking may look like just about anything, the second shows how users misplace images:

Try ChefNet yourself

First you will need to install these dependencies, in addition to Conda:

You will need to download the weights of my trained convolutional neural net and place the .h5 in the Models folder

My weights: CNN Weights

You will also need to download the weights of trained VGG-16 and place the .h5 file in the VGG Weights folder.

VGG-16 weights: vgg16_weights.h5

Next you should move any image file you would like to predict on into the images folder.

Now you can run predict_ingredients.py to have the model make your predictions (make sure you navigate to the ingredient identifier before running ipython).

If you run the script in ipython, you may just run predict_user_photo(model, vocab) to predict additional photos without reloading the model.

Here is an example of how it should look:

Next Steps

There are a number of next steps that can be taken with this project.

The model may benefit from further tuning, and more neural network structures could be explored. It may also benefit from training on full size images.
Additional data may be scraped from other recipe websites to create a larger dataset.
Another extension may involve true image captioning at a character or word level. I started exploring this option, but found that is was less useful toward my motivation of predicting underlying ingredients.

Recipe Generation with Recurrent Neural Networks

Separate from the ingredient identifier, I've also developed a model that will generate novel ingredients character by character, based on the cleaned recipes scraped from allrecipes.com. Here are a couple examples, I'm not sure how well they would turn out:

banana, lemon juice, cream mushroom soup, milk, cheese sauce
garlic, cheese, cheddar cheese, salt, cheese, sausage, garlic, pork, bacon hotme, chicken, sesame, asparagus, bread, cheese, bacon, hamburger bun

You may notice bacon hotme is not an actual ingredient. This is an odd result that can come out of character by character text generation.

All scripts for this maybe found in Recipe Generation RNN.

Thank you

Big thank you to Jesse Lieman-Sifry for the inspiration behind this project, as well as to my Galvanize Instructors and peers for the continuous help and suggestions along the way. It was a pleasure to work with all of you.

References

VGG Net Representation:

K. Simonyan, A. Zisserman
arXiv:1409.1556 ```

Name		Name	Last commit message	Last commit date
Latest commit History 177 Commits
Scripts		Scripts
figures		figures
images		images
models		models
vgg_weights		vgg_weights
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scripts

Scripts

figures

figures

images

images

models

models

vgg_weights

vgg_weights

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Welcome to ChefNet

Table of Contents

Data

Ingredient label wrangling

Image Processing (Code)

Neural Network Architecture (Code)

Results

Overall results (Code)

Example Predictions

Carrot Cake

My Lunch last Monday

Data Issues

Try ChefNet yourself

Next Steps

Recipe Generation with Recurrent Neural Networks

Thank you

References

About

Releases

Packages

Languages

Mikelew88/ChefNet

Folders and files

Latest commit

History

Repository files navigation

Welcome to ChefNet

Table of Contents

Data

Ingredient label wrangling

Image Processing (Code)

Neural Network Architecture (Code)

Results

Overall results (Code)

Example Predictions

Carrot Cake

My Lunch last Monday

Data Issues

Try ChefNet yourself

Next Steps

Recipe Generation with Recurrent Neural Networks

Thank you

References

About

Resources

Stars

Watchers

Forks

Languages