{"title":"一种多任务人脸检测和人脸质量评估方法","authors":"Rod Izadi, Chen Liu","doi":"10.1109/IWBF57495.2023.10157540","DOIUrl":null,"url":null,"abstract":"Face in Video Recognition (FiVR) commonly follows a sequential pipeline of face detection, face quality assessment, and face recognition. However, performing these often machine learning-based tasks sequentially in real-time is a challenge when considering the excessive overhead caused by convolution and other feature extraction operations typically seen in neural networks employed across these stages. To overcome this challenge, a process that can perform these operations in parallel is needed. In this paper, we propose a methodology that can alleviate the constraints of real-time processing found in the sequential pipeline of FiVR. We exploit the similarities in features used in face detection and face quality assessment, hence designing a multi-tasked face detection and quality assessment network which can perform our FiVR operations with less inference time without sparing prediction accuracy. We evaluated the face quality prediction performance of our proposed approach in comparison with a stand-alone face quality network. We also evaluated the reduction in inference time by comparing the prediction speed of our multi-tasked face detection and quality network against its sequential counterparts. Our experimental results show that our multi-tasked model can successfully meet real-time processing demand while performing at the same level of accuracy as the sequential stand-alone models.","PeriodicalId":273412,"journal":{"name":"2023 11th International Workshop on Biometrics and Forensics (IWBF)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Multi-Tasked Approach Towards Face Detection and Face Quality Assessment\",\"authors\":\"Rod Izadi, Chen Liu\",\"doi\":\"10.1109/IWBF57495.2023.10157540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face in Video Recognition (FiVR) commonly follows a sequential pipeline of face detection, face quality assessment, and face recognition. However, performing these often machine learning-based tasks sequentially in real-time is a challenge when considering the excessive overhead caused by convolution and other feature extraction operations typically seen in neural networks employed across these stages. To overcome this challenge, a process that can perform these operations in parallel is needed. In this paper, we propose a methodology that can alleviate the constraints of real-time processing found in the sequential pipeline of FiVR. We exploit the similarities in features used in face detection and face quality assessment, hence designing a multi-tasked face detection and quality assessment network which can perform our FiVR operations with less inference time without sparing prediction accuracy. We evaluated the face quality prediction performance of our proposed approach in comparison with a stand-alone face quality network. We also evaluated the reduction in inference time by comparing the prediction speed of our multi-tasked face detection and quality network against its sequential counterparts. Our experimental results show that our multi-tasked model can successfully meet real-time processing demand while performing at the same level of accuracy as the sequential stand-alone models.\",\"PeriodicalId\":273412,\"journal\":{\"name\":\"2023 11th International Workshop on Biometrics and Forensics (IWBF)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 11th International Workshop on Biometrics and Forensics (IWBF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWBF57495.2023.10157540\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 11th International Workshop on Biometrics and Forensics (IWBF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWBF57495.2023.10157540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Multi-Tasked Approach Towards Face Detection and Face Quality Assessment
Face in Video Recognition (FiVR) commonly follows a sequential pipeline of face detection, face quality assessment, and face recognition. However, performing these often machine learning-based tasks sequentially in real-time is a challenge when considering the excessive overhead caused by convolution and other feature extraction operations typically seen in neural networks employed across these stages. To overcome this challenge, a process that can perform these operations in parallel is needed. In this paper, we propose a methodology that can alleviate the constraints of real-time processing found in the sequential pipeline of FiVR. We exploit the similarities in features used in face detection and face quality assessment, hence designing a multi-tasked face detection and quality assessment network which can perform our FiVR operations with less inference time without sparing prediction accuracy. We evaluated the face quality prediction performance of our proposed approach in comparison with a stand-alone face quality network. We also evaluated the reduction in inference time by comparing the prediction speed of our multi-tasked face detection and quality network against its sequential counterparts. Our experimental results show that our multi-tasked model can successfully meet real-time processing demand while performing at the same level of accuracy as the sequential stand-alone models.