This web page contains information of face detection works. As a first step to encourage researchers to embark on this topic, we also provide some sample code, scripts, and plots to develop face detection systems. Consequently, this page does not contain all the detailed information that you may need. Nevertheless, we think it contains "substantial" information that you will find useful in developing face detection methods (01/21/2001).
ReferenceRecent methods (after March, 2001) that are not included in the survey paper due to publication deadline
In this page, you will find the following information:
Please send comments or suggestions to: mhyang@vision.ai.uiuc.edu , or visit my web page
Data Set |
Location |
Description |
MIT Media Labs Database |
Faces of 16 people, 27 of each person under various illumination conditions, scale and head orientation. |
|
MIT CBCL Face Data Set |
http://www.ai.mit.edu/projects/cbcl/software-datasets/FaceData2.html |
A training set consists of 6,977 cropped images (2,429 faces and 4,548 nonfaces), and the test set consists of 24,045 images (472 faces and 23,573 nonfaces). |
FERET Database |
A large collection of male and female faces. Each image contains a single person with certain expression. |
|
UMIST |
http://images.ee.umist.ac.uk/danny/database.html |
564 images of 20 subjects. Each subject covers a range of poses from profile to frontal views. |
University of Bern |
300 frontal view face images of 30 people (10 images per person) and 150 profile face images (5 images per person). |
|
Yale Database |
Face images with expressions, glasses under different illumination conditions. |
|
AT&T (Olivettti) Database |
40 subjects, 10 images per subject. (10 images per person) and 150 profile face images (5 images per person) |
|
Harvard Database |
Cropped, masked frontal face images under a wide range of illumination conditions. |
|
M2VTS Database |
A multimodal database containing various image sequences. |
|
Purdue Database |
3,276 face images with different facial expressions and occlusions under different illuminations. |
Data Set |
Location |
Description |
MIT Test Set |
Two sets of high and low resolution gray scale images with multiple faces in complex background. Subsumed by CMU frontal face test set. |
|
CMU Frontal Face Test Set |
130 gray scale images with a total of 507 frontal 507 frontal faces. |
|
CMU Profile Face Test Set |
ftp://eyes.ius.cs.cmu.edu/usr20/ftp/testing_face_images.tar.gz |
208 gray scale images with faces in profile views. |
Kodak Data Set |
|
Faces of multiple size, pose and under varying illumination
in color images. Designed for face detection and recognition. |
Note: The section provides a sample code for face detection in gray-scale images. It is not an full-fledged system that you can retrieve detected faces in an image. Our intention is to provide a first step for researchers to get some exposure to this subject. The code is provided as is since we may not have time to update/modify the code. We hope and encourage researchers to develop better evaluation protocol for face detection.
We use the the normalized intensity values as representations for SVM with linear and polynomial kernels. We use the SvmFu package (available at http://five-percent-nation.mit.edu/SvmFu/) and MIT CBCL Face Data to train our classifiers. In our experiments, the parameter C has been set to 10.
The training set consists of 6,977 images (2,429 faces and 4,548 nonfaces), and the test set consists of 24,045 images (472 faces and 23,573 nonfaces). (We would like to thank MIT CBCL for distributing the face data set). Our training and test sets are similar to the ones used in some recent works which show that SVMs with the feature representation of normalized intensity values perform better than the ones with Harr wavelet and gradient representations. In our experiment, each image is normalized to 20 by 20 pixels and processed with histogram equalization and quantization (50 rather than 256 scales). The following figure shows some face images in the training and test sets.
We train SVMs using linear and 2nd degree polynomial kernels. The empirical results on a test set of 24,045 images (472 faces and 23,573 nonfaces) are shown using the ROC (Receiver Operator Characteristic) curves.