Haar Cascade Face Detector Quality Dependence on Training Dataset Variability





Face detector, Haar cascades, Training set, Boosting, Training set composition


Background. Training a generalized face detector based on Haar cascades takes a long time, and the resulting cascades perform poorly. In practice, frontal and profile face detectors are therefore trained separately, which makes recognition systems more complex.

Objective. The aim of the paper is to compare how training sets composed of faces at different inclination angles affect the quality of the trained detectors.

Methods. A series of face detectors is trained on subsets that cover different ranges of face angles, with all other training parameters fixed. The training time and the quality of the resulting cascades are then compared.
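The subset construction described above can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: it assumes each sample carries a yaw annotation in degrees, and the angle ranges, the names `ANGLE_RANGES` and `build_subsets`, and the equal-size trimming rule are all hypothetical choices made for the example.

```python
import random
from typing import Dict, List, Sequence, Tuple

# Hypothetical absolute-yaw ranges in degrees; the actual ranges used in the
# study are not given in this abstract.
ANGLE_RANGES: Dict[str, Tuple[float, float]] = {
    "frontal": (0.0, 15.0),
    "half_profile": (15.0, 45.0),
    "profile": (45.0, 90.0),
}


def build_subsets(
    samples: Sequence[Tuple[str, float]],
    ranges: Dict[str, Tuple[float, float]] = ANGLE_RANGES,
    seed: int = 0,
) -> Dict[str, List[str]]:
    """Group (image_path, yaw_degrees) pairs by angle range, then trim every
    group to the size of the smallest one so that only composition varies."""
    buckets: Dict[str, List[str]] = {name: [] for name in ranges}
    for path, yaw in samples:
        angle = abs(yaw)
        for name, (low, high) in ranges.items():
            # Half-open interval [low, high); a yaw of exactly 90 degrees is
            # also accepted by a range that ends at 90.
            if low <= angle < high or (angle == high == 90.0):
                buckets[name].append(path)
                break
        # Samples outside every range are silently skipped.

    size = min(len(paths) for paths in buckets.values())
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    return {name: sorted(rng.sample(paths, size)) for name, paths in buckets.items()}
```

Each resulting list could then be passed to the cascade trainer with identical settings, so that differences in training time and quality can be attributed to the angle composition alone.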

Results. The quality and training time of the face classifiers are evaluated as a function of the composition of the training subsets. The quality of the frontal and profile face classifiers is also compared for training sets of the same size. The AUC of the frontal and profile face detectors differs by 0.003.
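For readers unfamiliar with the metric, the sketch below shows one common way to compute AUC and the gap between two detectors. It assumes the AUC refers to the area under a ROC curve built from per-window detector scores and binary face/non-face labels; the abstract does not specify this. The function name `roc_auc` and the scores are made up for illustration and are not the paper's data.

```python
from typing import Sequence


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation:
    the fraction of (face, non-face) pairs in which the face scores higher,
    counting ties as one half."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Made-up scores for illustration only; these are not the paper's data.
labels = [1, 1, 1, 0, 0, 0]
frontal_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
profile_scores = [0.9, 0.8, 0.35, 0.4, 0.3, 0.2]
auc_gap = roc_auc(frontal_scores, labels) - roc_auc(profile_scores, labels)
```

The pairwise form is quadratic in the number of samples, which is acceptable for a small test set; a rank-based implementation would be preferable for large ones.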

Conclusions. The experiments show that, for the same number of training samples, the more variation the object dataset contains (side views of faces compared with frontal views), the longer the Haar cascade takes to train and the harder it is to train. With the proposed approach, the quality of the final classifier can be controlled by choosing an appropriate composition of the training set.

Author Biographies

Sergii S. Nikolaiev, Igor Sikorsky Kyiv Polytechnic Institute

Сергій Сергійович Ніколаєв 

Yurii O. Tymoshenko, Igor Sikorsky Kyiv Polytechnic Institute

Юрій Олександрович Тимошенко 

Kateryna Yu. Matviiv, Igor Sikorsky Kyiv Polytechnic Institute

Катерина Юріївна Матвіїв 

