GitHub - jguoaj/face-detection

Face Detection

General Idea

The task given by this assignment is to implement a multi-scale sliding window face detector based on concepts presented in Dalal-Triggs 2005 and Viola-Jones 2001. The algorithm will be evaluated using a common benchmark for face detection (Caltech).

Reference Paper

Histograms of Oriented Gradients for Human Detection

Methodology

Use HOG descriptor to genearte positive face images with cell size 3
Horizontally flip face images, contrast original face images with darker faces images( img*0.8 ), use HOG descriptor to generate postive images from these images and add these images into the postive face samples
Use HOG descriptor to generate negative cropped images (50000 images) from the database
Train the linear SVM Classifier
For each test image, for each position at each scale in the image, create a window and run the classifier to determine whether or not there is a face at that location
Step 5 will result in many overlapping bounding boxes for the the same face, which must then be combined or suppressed into one final bounding box (non maximum suppression).

Experiments and Results

We conducted several experiments to indentify the best approach to detect the face.

Cell Size 6, flip and contrast the faces, without hard negative mining

++The average accuracy is 0.840, and it is very quick.++

Cell Size 3, flip and contrast the faces, without hard negative mining

++The average accuracy is 0.916, and it takes about 20 minutes.++

Cell Size 3, without hard negative mining

++The average accuracy is 0.823, and it takes about 30-40 minutes.++

Cell Size 3, flip and contrast the faces, with hard negative mining

++The average accuracy is 0.901, and it takes about 90 minutes.++

Discussion

Notice that when the cell size gets smaller, the HOG descriptor can persent more details about the gradient and edge direction at each pixel, which leads to higher accuracy.
With more negative samples, the accuracy can be improved a bit, but at the same time, the detection speed becomes slow.
When we horizontally flip face images and make images darker (i.e. add more positive training data), we can largley improve the accuracy since we found that some faces in very dark background could not be detected and some side faces can not be recognized.
LBP

Bonus

Hard Negative Mining.

Train on the original datasets, collect images which are falsely detected as faces and add them into the negative samples. Retrain again.

Find and utilize alternative positive training data.

Horizontally Flip face images
Generate contrast images (image * 0.8)

Implement an interesting feature

Implement Local Binary Pattern (LBP). Instead of using HOG descriptor, we use LBP to extract features from images.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
code		code
result		result
README.md		README.md
dalal_triggs_cvpr_2005.pdf		dalal_triggs_cvpr_2005.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Face Detection

General Idea

Reference Paper

Methodology

Experiments and Results

Discussion

Bonus

Hard Negative Mining.

Find and utilize alternative positive training data.

Implement an interesting feature

About

Releases

Packages

Languages

jguoaj/face-detection

Folders and files

Latest commit

History

Repository files navigation

Face Detection

General Idea

Reference Paper

Methodology

Experiments and Results

Discussion

Bonus

Hard Negative Mining.

Find and utilize alternative positive training data.

Implement an interesting feature

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages