Solution writeup to the comma.ai Driver Monitoring Challenge

Input

4x 60 s videos at 20 Hz

Output

Annotated face tracking video
Head pose feature vector

Dependencies

numpy
sklearn
skimage
cv2

Layout

Core
- Frame preprocessing
- Facial detection
- Facial landmark identification
- Geometric orientation
- Rendering
- Main
Support
- Configuration
- SVM preprocessing
- SVM training
- File utilities
- Numpy utilities
- Image utilities
Data
- Trained SVM
- Input
  - Video files
- Intermediate
  - Preprocessing (optional)
  - imagesToFaces
  - videosToFrames
- Output
  - Annotated video
  - Head pose estimation feature vectors
- Haar cascade classifier
Spike
- Random excursions
Tests
- TBD

Method

video -> video preprocess + dataset preprocess -> train svm -> face detection -> retrain svm -> face detection -> find landmarks -> calculate geometry -> render

Pipeline

read frame -> frame preprocess -> face detection -> find landmarks -> calculate geometry -> render

SVM Preprocessing

HEVC video dataset -> frames
Yale faces dataset -> cropped

SVM Training

Cropped yale faces -> positive samples
256 object categories dataset -> negative samples
Annotate samples
Train linear SVM
Save SVM model
(After sliding window): Retrain SVM with hard-negative mining
Save new SVM model
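
The training steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function names are hypothetical, the feature vectors are synthetic stand-ins for HOG descriptors, and scikit-learn's LinearSVC is assumed as the linear SVM.

```python
import numpy as np
from sklearn.svm import LinearSVC


def build_training_set(pos_features, neg_features):
    """Stack positive and negative descriptors and attach 1/0 labels."""
    X = np.vstack([pos_features, neg_features])
    y = np.concatenate([np.ones(len(pos_features)), np.zeros(len(neg_features))])
    return X, y


def train_with_hard_negatives(pos_features, neg_features, extra_negatives, C=1.0):
    """Train a linear SVM, then retrain with the false positives it produces.

    extra_negatives are descriptors known to contain no face (for example,
    sliding-window crops taken from face-free images).
    """
    X, y = build_training_set(pos_features, neg_features)
    model = LinearSVC(C=C, max_iter=10000).fit(X, y)

    # Hard negatives: face-free samples that the first model labels as faces.
    hard = extra_negatives[model.predict(extra_negatives) == 1]
    if len(hard) > 0:
        X = np.vstack([X, hard])
        y = np.concatenate([y, np.zeros(len(hard))])
        model = LinearSVC(C=C, max_iter=10000).fit(X, y)
    return model, len(hard)


# Synthetic stand-ins for HOG descriptors (assumption: real ones come from skimage).
rng = np.random.default_rng(0)
pos = rng.normal(loc=2.0, size=(165, 16))
neg = rng.normal(loc=-2.0, size=(1000, 16))
extra = rng.normal(loc=-2.0, size=(500, 16))
model, n_hard = train_with_hard_negatives(pos, neg, extra)
```

One possible way to save the resulting model is joblib.dump(model, path), which is a common choice for scikit-learn estimators.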

Face Detection

Sliding window over image pyramid
Non-maximum suppression
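
A rough sketch of the two detection steps, assuming NumPy only. The function names, window size, step, and IoU threshold are illustrative choices rather than values from this project, and the pyramid uses plain subsampling where a smoothed downscale (such as skimage.transform.pyramid_gaussian) would normally be preferred.

```python
import numpy as np


def image_pyramid(image, scale=2, min_size=(64, 64)):
    """Yield progressively smaller copies of image by plain subsampling.

    Assumption: striding stands in for a smoothed downscale.
    """
    while image.shape[0] >= min_size[0] and image.shape[1] >= min_size[1]:
        yield image
        image = image[::scale, ::scale]


def sliding_window(image, window=(64, 64), step=16):
    """Yield (x, y, patch) for every full window position in image."""
    win_h, win_w = window
    for y in range(0, image.shape[0] - win_h + 1, step):
        for x in range(0, image.shape[1] - win_w + 1, step):
            yield x, y, image[y:y + win_h, x:x + win_w]


def non_max_suppression(boxes, scores, iou_threshold=0.3):
    """Greedily keep the highest-scoring boxes, dropping overlapping ones.

    boxes is an (N, 4) array of [x1, y1, x2, y2]; returns the kept indices.
    """
    boxes = np.asarray(boxes, dtype=float)
    order = np.argsort(scores)[::-1]
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    keep = []
    while order.size > 0:
        best = order[0]
        keep.append(int(best))
        rest = order[1:]
        # Intersection of the best box with every remaining box.
        x1 = np.maximum(boxes[best, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[best, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[best, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[best, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        iou = inter / (areas[best] + areas[rest] - inter)
        order = rest[iou <= iou_threshold]
    return keep
```

In a detector, each patch would be described (for example with HOG), scored by the SVM, and mapped back to the original frame's coordinates before suppression.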

Face Alignment and Head Pose

Facial landmark alignment
2D-3D point mapping
Compute head orientation
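
For the last step, one common route is to fit a generic 3D face model to the 2D landmarks (OpenCV's cv2.solvePnP returns a rotation vector, which cv2.Rodrigues converts to a 3x3 matrix) and then read the head angles off that matrix. The sketch below covers only the final decomposition, in NumPy. The axis convention (x = pitch, y = yaw, z = roll, composed as Rz @ Ry @ Rx) and the function names are assumptions, not something this writeup specifies.

```python
import numpy as np


def rotation_matrix(pitch, yaw, roll):
    """Build R = Rz(roll) @ Ry(yaw) @ Rx(pitch) from angles in radians."""
    cx, sx = np.cos(pitch), np.sin(pitch)
    cy, sy = np.cos(yaw), np.sin(yaw)
    cz, sz = np.cos(roll), np.sin(roll)
    rx = np.array([[1, 0, 0], [0, cx, -sx], [0, sx, cx]])
    ry = np.array([[cy, 0, sy], [0, 1, 0], [-sy, 0, cy]])
    rz = np.array([[cz, -sz, 0], [sz, cz, 0], [0, 0, 1]])
    return rz @ ry @ rx


def euler_angles(R):
    """Recover (pitch, yaw, roll) in radians, assuming |yaw| < 90 degrees."""
    pitch = np.arctan2(R[2, 1], R[2, 2])
    yaw = np.arctan2(-R[2, 0], np.hypot(R[0, 0], R[1, 0]))
    roll = np.arctan2(R[1, 0], R[0, 0])
    return pitch, yaw, roll
```

The three angles form a compact head pose feature vector per frame. Near a yaw of 90 degrees the decomposition becomes ambiguous, which is unlikely to matter for a forward-facing driver camera.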

Render Tracking and Pose

Future:

Pupil detection
- CDF
- Feature Extraction and Normalization
Gaze Classification and Decision Pruning

Method:

Using the comma.ai dataset:
- Take in HEVC video
- Extract frames from 60 s of 20 Hz video (~1200 frames)

Using the Yale faces dataset:
- Convert to JPG and grayscale
- Crop the images using the built-in Haar cascades
- Resize to a uniform size
- Write to disk
- Generate ~165 positive samples for the SVM using the skimage HOG descriptor

Using the 256_object_categories dataset:
- Generate ~30600 negative samples for the SVM using skimage HOG, in batches of 1000, saving each batch to disk
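
Batching keeps memory use bounded when describing tens of thousands of images. The sketch below shows one way to do it, with hypothetical names throughout: describe stands for any callable that returns a 1-D feature vector (for example a wrapper around skimage.feature.hog), and the file naming scheme is invented for illustration.

```python
import os
import tempfile

import numpy as np


def save_descriptors_in_batches(items, describe, out_dir, batch_size=1000):
    """Apply describe to each item and write one .npy file per batch.

    describe is a hypothetical callable returning a 1-D feature vector.
    """
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    for start in range(0, len(items), batch_size):
        chunk = items[start:start + batch_size]
        batch = np.stack([describe(item) for item in chunk])
        path = os.path.join(out_dir, f"negatives_{start // batch_size:04d}.npy")
        np.save(path, batch)
        paths.append(path)
    return paths


# Toy run: 2500 fake "images" and a stand-in descriptor that flattens them.
fake_images = [np.full((8, 8), i, dtype=float) for i in range(2500)]
with tempfile.TemporaryDirectory() as tmp:
    paths = save_descriptors_in_batches(fake_images, lambda im: im.ravel(), tmp)
    batch_sizes = [len(np.load(p)) for p in paths]
```

With roughly 30600 negatives and a batch size of 1000, this scheme would produce 31 files, the last one partially filled.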

- Arrange the data and add labels
- Train the SVM with the positive and negative samples
- Save the trained SVM

- Sliding window
- Image pyramid
- Non-maximum suppression
- Hard-negative mining
- Retrain

- Find face
- Find eyes
- Geometric transformation for facial plane
- Generate vector
