A. D. Egorov, Almaz F. Idiyatullin, Artur D. Zakirov
{"title":"参数优化实现的Viola-Jones目标检测方法与MTCNN的比较","authors":"A. D. Egorov, Almaz F. Idiyatullin, Artur D. Zakirov","doi":"10.1109/CTS53513.2021.9562926","DOIUrl":null,"url":null,"abstract":"Face detection on images is a classical computer vision task. Solution to this problem is used in a wide range of apps, mainly in those connected with providing for the elements of administration (access to the data based on a person's face) and elements of control (perimeter safety control systems). The most spread object's face detecting methods may be divided in two groups: classical one (common example is Viola–Jones method) and one based on a neural network (common example is a cascaded convolutional neural network). There are plenty comparisons of face detection methods that, however, do not usually consider the potential for optimization within the parameters of realization methods in question. This paper focuses on the comparison of the parametrically optimized ways of implementation of methods which are typical for each class. Optimization is being carried out in accordance with the algorithm suggested in this paper. It reveals that the optimized Viola–Jones method is inferior to MTCNN by 20–50 % in view of the quality metrics but outruns it 7–14 times when it comes to the processing speed per image. It shows that without using algorithm optimization by parameters, it is barely impossible to get a fair quality and capacity just as the average value of the quality and capacity metrics are visibly lower than the optimal ones.","PeriodicalId":371882,"journal":{"name":"2021 IV International Conference on Control in Technical Systems (CTS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparison of the Parametrically Optimized Implementation of Viola–Jones Object Detection Method and MTCNN\",\"authors\":\"A. D. Egorov, Almaz F. Idiyatullin, Artur D. Zakirov\",\"doi\":\"10.1109/CTS53513.2021.9562926\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face detection on images is a classical computer vision task. Solution to this problem is used in a wide range of apps, mainly in those connected with providing for the elements of administration (access to the data based on a person's face) and elements of control (perimeter safety control systems). The most spread object's face detecting methods may be divided in two groups: classical one (common example is Viola–Jones method) and one based on a neural network (common example is a cascaded convolutional neural network). There are plenty comparisons of face detection methods that, however, do not usually consider the potential for optimization within the parameters of realization methods in question. This paper focuses on the comparison of the parametrically optimized ways of implementation of methods which are typical for each class. Optimization is being carried out in accordance with the algorithm suggested in this paper. It reveals that the optimized Viola–Jones method is inferior to MTCNN by 20–50 % in view of the quality metrics but outruns it 7–14 times when it comes to the processing speed per image. It shows that without using algorithm optimization by parameters, it is barely impossible to get a fair quality and capacity just as the average value of the quality and capacity metrics are visibly lower than the optimal ones.\",\"PeriodicalId\":371882,\"journal\":{\"name\":\"2021 IV International Conference on Control in Technical Systems (CTS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IV International Conference on Control in Technical Systems (CTS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CTS53513.2021.9562926\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IV International Conference on Control in Technical Systems (CTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CTS53513.2021.9562926","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of the Parametrically Optimized Implementation of Viola–Jones Object Detection Method and MTCNN
Face detection on images is a classical computer vision task. Solution to this problem is used in a wide range of apps, mainly in those connected with providing for the elements of administration (access to the data based on a person's face) and elements of control (perimeter safety control systems). The most spread object's face detecting methods may be divided in two groups: classical one (common example is Viola–Jones method) and one based on a neural network (common example is a cascaded convolutional neural network). There are plenty comparisons of face detection methods that, however, do not usually consider the potential for optimization within the parameters of realization methods in question. This paper focuses on the comparison of the parametrically optimized ways of implementation of methods which are typical for each class. Optimization is being carried out in accordance with the algorithm suggested in this paper. It reveals that the optimized Viola–Jones method is inferior to MTCNN by 20–50 % in view of the quality metrics but outruns it 7–14 times when it comes to the processing speed per image. It shows that without using algorithm optimization by parameters, it is barely impossible to get a fair quality and capacity just as the average value of the quality and capacity metrics are visibly lower than the optimal ones.