تشخیص چهره در تصویر با استفاده از روش ویولا-جونز و تحلیل بافت تصویر
محورهای موضوعی : پردازش چند رسانه ای، سیستمهای ارتباطی، سیستمهای هوشمندمهدی حریری 1 , نرمینه حیدرزاده 2
1 - استادیار، گروه مهندسی برق و کامپیوتر ، دانشگاه زنجان، زنجان، ایران
2 - کارشناسی ارشد، مهندسی کامپیوتر، واحد زنجان، دانشگاه آزاد اسلامی، زنجان، ایران
کلید واژه: تصویر یکپارچه, تشخیص چهره, الگوریتم ویولا-جونز, تطبیق چشم و بینی, شبکه عصبی رگرسیونی,
چکیده مقاله :
شناسایی چهره از مهمترین فناوریهای بیومتریک برای شناسایی افراد است که در کنترل دسترسی هم استفاده دارد. تشخیص چهره یکی از مراحل مهم قبل از شناسایی هویت می باشد. معمولا برای تشخیص وجود چهره در تصاویر از یک روش استفاده شده است، اما در این تحقیق برای افزایش صحت تشخیص از ترکیب دو روش الگوریتم ویولا-جونز و تطبیق اجزا و بافت تصویر با اجزا چهره و پوست برای بهبود عملکرد تشخیص استفاده میشود. در مرحله اول با استفاده از الگوریتم ویولا-جونز به تشخیص اجزای صورت میپردازیم و در مرحله بعدی با شبکه های عصبی رگرسیونی ویژگی های بافت چشم و بینی را مورد بررسی قرارداده و باتطبیق ویژگی های بافت صورت اجزاء صورت بهتر تشخیص داده می شوند. در این تحقیق از ویژگی های بافت مربوط به چشم راست و چپ و بینی درصورت برای افزایش دقت تطبیق استفاده می شود. برای ارزیابی روش پیشنهادی از چهره های مجموعه داده FDD-Fold استفاده نمودهایم. در مقایسه عملکرد این روش با روش شبکه عمیقRCNN با تعداد خیلی کمتر داده های آموزشی نسبت به آن به صحت 96.36% ، بیشتر از شبکهی یادگیری عمیق رسیدیم. این روش در سیستمهای با توانایی محاسباتی محدود با تعداد دادهی متوسط نتیجه مطلوبی می دهد.
Introduction: Face recognition is one of the most important biometric technologies for people identification, also used in access control. Face recognition is one of the important steps before identity recognition. Usually, one method is used to detect the presence of faces in images. Still, in this research, to increase the accuracy of detection, the combination of two methods is used to improve the detection performance: Viola-Jones algorithm and the matching of image components and texture with face and skin components. In the first step, we use the Viola-Jones algorithm to detect the facial features. In the next step, the features of the eye and nose tissues are analyzed with regression neural networks, and facial features are recognized better according to the facial features. In this research, the texture features of the right and left eyes and the nose of the face are used to increase the matching accuracy. We have used the faces of the FDD-Fold dataset to evaluate the proposed method. Comparing the performance of this method with the RCNN deep network method with a much smaller number of training data, we reached an accuracy of 96.36%, more than the deep learning network. This method gives good results in systems with limited computing ability and average amount of data.The face recognition system is one of the biometric identification systems and one of the most important technologies for people identification, which is also used in access control. Face identification is one of the few biometric methods that, with the advantages of high accuracy and low level of human intervention, is used in cases such as information security, law enforcement and monitoring, traffic control, and registration in attendance systems. This method creates more convenience and development with fewer requirements. then, this method has received more attention during the last twenty years.Face detection is a local binary classification problem that shows the presence of faces in the given image using boxes surrounding them. Although the Viola-Jones method is less accurate than modern methods such as convolutional neural networks; Its much lower efficiency and training parameters compared to the millions of parameters of a typical CNN result in faster training, better accuracy with limited data, and its use in devices with limited computing power such as cameras and mobile phones. The innovation of this method is matching the geometric pattern of the edges to identify the presence of the face in the image, along with matching the skin texture. This method seems to be faster and more accurate than the previous ones.
Method: In this research, in the first step, we use Viola-Jones, one of the optimal face recognition algorithms in the image, to detect facial components. In the next step, we use the adaptation of the general shape of facial parts such as eyes, and match the textures in the image with the predicted texture for human skin, to improve the recognition performance and increase the recognition accuracy, in such a way that the regression neural networks examine the eye and nose tissue characteristics and according to the characteristics of the facial tissue, the facial components are recognized by the regression neural network. The investigated features in the texture include minimum and maximum color intensity, mean and median, and variance of the image. The data is given to the regression neural network for training. Here Remarkable thing is matching the overall shape of the human head and face, and in the next step matching the overall shape of the facial parts such as the eyes to improve the accuracy of the presented method. We also use the matching of textures in the image with the texture predicted for human skin to further improve the accuracy of the program's performance.
S. J. McKenna, S. Gong, and J. Collins, "Face Tracking and Pose Representation," in BMVC, 1996, pp. 1-10.
[2] D. Comaniciu, V. Ramesh, and P. Meer, "Real-time tracking of non-rigid objects using mean shift," in Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No. PR00662), 2000, pp. 142-149.
[3] K. Nummiaro, E. Koller-Meier, and L. Van Gool, "An adaptive color-based particle filter," Image and vision computing, vol. 21, pp. 99-110, 2003.
[4] R. Paúl, E. Aguirre, M. García-Silvente, and R. Munoz-Salinas, "A new fuzzy based algorithm for solving stereo vagueness in detecting and tracking people," International Journal of Approximate Reasoning, vol. 53, pp. 693-708, 2012.
[5] F. J. Aherne, N. A. Thacker, and P. I. Rockett, "The Bhattacharyya metric as an absolute similarity measure for frequency coded data," Kybernetika, vol. 34, pp. [363]-368, 1998.
[6] T. Darrell, G. Gordon, M. Harville, and J. Woodfill, "Integrated person tracking using stereo, color, and pattern detection," International Journal of Computer Vision, vol. 37, pp. 175-185, 2000.
[7] D. Grest and R. Koch, "Realtime multi-camera person tracking for immersive environments," in IEEE 6th Workshop on Multimedia Signal Processing, 2004., 2004, pp. 387-390.
[8] L. R. Cerna, G. Camara-Chavez, and D. Menotti, "Face detection: Histogram of oriented gradients and bag of feature method," in Proceedings of the International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV), 2013, p. 1.
[9] V. Manian and A. Ross, "A texture-based approach to face detection," in Biometric Consortium Conference (BCC), 2002, pp. 342-347.
[10] P. Bose and S. Bandyopadhyay, "Human Face and Facial Parts Detection using Template Matching Technique," International Journal of Engineering and Advanced Technology (IJE, 9 (4), 2020.
[11] Hasani Ahangar, M. (2019). Triple-Sequential Deep Convolutional Neural Networks in Face Recognition, The 5th National Conference of Electrical and Mechatronics Engineering of Iran, Tehran, Iran. [Persian]
[12] R. C. Gonzalez and R. E. Woods, "Digital image processing 4th edition, global edition," 2018.
[13] L. I. Kuncheva, Combining pattern classifiers: methods and algorithms: John Wiley & Sons, 2014.
[14] M. T. Hagan, H. B. Demuth, M. H. Beale, and O. De Jesús, "Neural network design. 2nd edition," Oklahoma: Martin Hagan, 2014.
[15] M. H. Beale, M. T. Hagan, and H. B. Demuth, "Neural network toolbox," User’s Guide, MathWorks, vol. 2, pp. 77-81, 2010.
[16] Y.-Q. Wang, "An analysis of the Viola-Jones face detection algorithm," Image Processing On Line, vol. 4, pp. 128-148, 2014.
[17] P. Viola and M. Jones, "Rapid object detection using a boosted cascade of simple features," in Proceedings of the 2001 IEEE Computer Society Conference on computer vision and pattern recognition. CVPR 2001, 2001, pp. I-I.
[18] V. Jain and E. Learned-Miller, "Fddb: A benchmark for face detection in unconstrained settings," UMass Amherst technical report 2010.
[19] B. J. Boom, G. Beumer, L. J. Spreeuwers, and R. N. Veldhuis, "The effect of image resolution on the performance of a face recognition system," in 2006 9Th international conference on control, automation, robotics and vision, 2006, pp. 1-6.
[20] X. Sun, P. Wu, and S. C. Hoi, "Face detection using deep learning: An improved faster RCNN approach," Neurocomputing, vol. 299, pp. 42-50, 2018.