https://github.com/meng1994412/Smile_Detection
Implement a convolutional neural network capable of detecting whether a person is smiling:
- Constructed the LeNet architecture from scratch.
- Trained the model on a dataset of face images labeled as smiling or not smiling.
- Developed a script to detect smiles in real time.
The dataset, named SMILES, combines images from Daniel Hromada (check reference, ~8,000 images) and the WIKI dataset (check reference, ~5,000 images), for about 13,000 images in total. Each image is 64 x 64 and tightly cropped around the face. The GitHub open-source data was already labeled and resized, but the WIKI data was not; it was therefore resized, labeled, and then mixed with the GitHub open-source data.
Figure 1 shows some examples of smiling images, and Figure 2 shows some examples of not-smiling images.
Figure 1: Positive examples from the dataset (smiling).
Figure 2: Negative examples from the dataset (not smiling).
The LeNet architecture can be found in lenet.py inside the pipeline/nn/conv/ directory. The model's input is defined by the image dimensions (height, width, depth) and the number of classes. In this project, the input is (width = 32, height = 32, depth = 1, classes = 2).
Table 1 summarizes the LeNet architecture. Activation layers are omitted from the table; a ReLU activation follows each CONV layer.
| Layer Type | Output Size | Filter Size / Stride |
|---|---|---|
| Input Image | 32 x 32 x 1 | |
| CONV | 28 x 28 x 4 | 5 x 5, K = 4 |
| CONV | 24 x 24 x 8 | 5 x 5, K = 8 |
| POOL | 12 x 12 x 8 | 2 x 2 |
| CONV | 8 x 8 x 16 | 5 x 5, K = 16 |
| POOL | 4 x 4 x 16 | 2 x 2 |
| FC | 120 | |
| Dropout | 0.2 (20%) | |
| softmax | 2 | |
Table 1: Summary of the LeNet architecture.
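The architecture in Table 1 can be sketched in Keras roughly as follows. This is a reconstruction from the table, not the exact contents of lenet.py; layer choices such as `MaxPooling2D` (rather than average pooling) and the ReLU on the FC layer are assumptions:

```python
from tensorflow.keras import layers, models

def build_lenet(width=32, height=32, depth=1, classes=2):
    """LeNet variant from Table 1: three CONV layers, two POOL layers,
    a 120-unit FC layer, 20% dropout, and a softmax classifier."""
    model = models.Sequential([
        layers.Input(shape=(height, width, depth)),
        layers.Conv2D(4, (5, 5), activation="relu"),   # 28 x 28 x 4
        layers.Conv2D(8, (5, 5), activation="relu"),   # 24 x 24 x 8
        layers.MaxPooling2D((2, 2)),                   # 12 x 12 x 8
        layers.Conv2D(16, (5, 5), activation="relu"),  # 8 x 8 x 16
        layers.MaxPooling2D((2, 2)),                   # 4 x 4 x 16
        layers.Flatten(),
        layers.Dense(120, activation="relu"),
        layers.Dropout(0.2),
        layers.Dense(classes, activation="softmax"),
    ])
    return model
```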
train_model.py runs the training process. The trained weights are saved after training (check here). The saved model can then be used to detect smiles in real time.
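A minimal sketch of the compile/fit/save steps follows. The optimizer, loss, dummy data, and output filename are all assumptions standing in for what train_model.py actually does:

```python
import numpy as np
from tensorflow.keras import layers, models

# Tiny stand-in model (the real project builds LeNet from lenet.py).
model = models.Sequential([
    layers.Input(shape=(32, 32, 1)),
    layers.Flatten(),
    layers.Dense(2, activation="softmax"),
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])

# Dummy arrays standing in for the SMILES faces and their one-hot labels.
X = np.random.rand(16, 32, 32, 1).astype("float32")
y = np.eye(2)[np.random.randint(0, 2, 16)]

history = model.fit(X, y, epochs=1, batch_size=8, verbose=0)
model.save("lenet_smiles.h5")  # hypothetical filename for the saved weights
```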
Figure 3 shows the loss and accuracy curves for the training and validation sets. As the figure shows, the validation loss begins to stagnate past the 6th epoch, and further training past the 20th epoch may result in overfitting. Applying data augmentation to the training set would be a good next step.
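For that next step, one possible augmentation setup is sketched below. The specific ranges are assumptions; small rotations and shifts are plausible for tightly cropped faces, and any augmentation would be applied to the training set only:

```python
import numpy as np
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Hypothetical augmentation settings for 32 x 32 grayscale face crops.
aug = ImageDataGenerator(
    rotation_range=10,       # small in-plane rotations
    width_shift_range=0.1,   # horizontal jitter
    height_shift_range=0.1,  # vertical jitter
    horizontal_flip=True,    # faces are roughly left-right symmetric
)

X = np.random.rand(8, 32, 32, 1).astype("float32")
batch = next(aug.flow(X, batch_size=8, shuffle=False))
```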
Figure 3: Plot of loss and accuracy for the training and validation set.
Figure 4 illustrates the evaluation of the network, which achieves about 90% classification accuracy on the validation set.
Figure 4: Evaluation of the network.