{"title":"基于人脸生物特征的多镜头人体再识别关键帧提取","authors":"Agus Gunawan, D. H. Widyantoro","doi":"10.1109/ICACSIS47736.2019.8979799","DOIUrl":null,"url":null,"abstract":"Although achieving a good accuracy, the multi-shot human re-identification (MS Re-ID) experiences long processing time and large memory consumption, because it uses all of the frames it receives. The usage of all of the frames for reidentification does not only weigh the system’s process, but also gives redundant information. To enable a faster and lighter execution of the MS Re-ID system, the present study proposed a key frame extraction method using face biometric feature for MS Re-ID system. Key frames were responsible to provide important information of an individual’s face for re-identification, and avoid collecting the redundant features. The proposed key frame extraction method consists of two phases: face detection with facial landmark extraction and face tilt angle calculation. Multitask Cascaded Convolutional Networks (MTCNN) was used for face detection with facial landmark extraction, while 10 was used as the optimal number of key frames and the optimal face tilt angle range was -10° until 10°. We also compared MS Re-ID system with the proposed method to the normal single-shot and multi-shot systems to examine the proposed method’s impact on the performance. We found that the proposed key frame extraction method successfully retained 100% accuracy while reducing the memory consumption by 2% and the execution time by 19%, compared to MS Re-ID system without key frame extraction.","PeriodicalId":165090,"journal":{"name":"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Key Frame Extraction with Face Biometric Features in Multi-shot Human Re-identification System\",\"authors\":\"Agus Gunawan, D. H. Widyantoro\",\"doi\":\"10.1109/ICACSIS47736.2019.8979799\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although achieving a good accuracy, the multi-shot human re-identification (MS Re-ID) experiences long processing time and large memory consumption, because it uses all of the frames it receives. The usage of all of the frames for reidentification does not only weigh the system’s process, but also gives redundant information. To enable a faster and lighter execution of the MS Re-ID system, the present study proposed a key frame extraction method using face biometric feature for MS Re-ID system. Key frames were responsible to provide important information of an individual’s face for re-identification, and avoid collecting the redundant features. The proposed key frame extraction method consists of two phases: face detection with facial landmark extraction and face tilt angle calculation. Multitask Cascaded Convolutional Networks (MTCNN) was used for face detection with facial landmark extraction, while 10 was used as the optimal number of key frames and the optimal face tilt angle range was -10° until 10°. We also compared MS Re-ID system with the proposed method to the normal single-shot and multi-shot systems to examine the proposed method’s impact on the performance. We found that the proposed key frame extraction method successfully retained 100% accuracy while reducing the memory consumption by 2% and the execution time by 19%, compared to MS Re-ID system without key frame extraction.\",\"PeriodicalId\":165090,\"journal\":{\"name\":\"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACSIS47736.2019.8979799\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advanced Computer Science and information Systems (ICACSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACSIS47736.2019.8979799","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
多帧人再识别(multiple -shot human Re-ID, MS Re-ID)虽然精度较高,但由于要使用接收到的所有帧,处理时间长,内存消耗大。使用所有的帧进行重新识别不仅权衡了系统的进程,而且提供了冗余信息。为了使MS Re-ID系统更快、更轻地执行,本研究提出了一种基于人脸生物特征的MS Re-ID系统关键帧提取方法。关键帧负责提供人脸的重要信息以供重新识别,避免收集冗余特征。提出的关键帧提取方法包括两个阶段:人脸检测与人脸特征提取和人脸倾斜角计算。采用多任务级联卷积网络(Multitask cascade Convolutional Networks, MTCNN)进行人脸检测,提取人脸地标,最优关键帧数为10帧,最优人脸倾斜角度范围为-10°~ 10°。我们还将采用该方法的MS Re-ID系统与普通的单发和多发系统进行了比较,以检验该方法对性能的影响。我们发现,与没有关键帧提取的MS Re-ID系统相比,所提出的关键帧提取方法成功地保持了100%的准确率,同时减少了2%的内存消耗和19%的执行时间。
Key Frame Extraction with Face Biometric Features in Multi-shot Human Re-identification System
Although achieving a good accuracy, the multi-shot human re-identification (MS Re-ID) experiences long processing time and large memory consumption, because it uses all of the frames it receives. The usage of all of the frames for reidentification does not only weigh the system’s process, but also gives redundant information. To enable a faster and lighter execution of the MS Re-ID system, the present study proposed a key frame extraction method using face biometric feature for MS Re-ID system. Key frames were responsible to provide important information of an individual’s face for re-identification, and avoid collecting the redundant features. The proposed key frame extraction method consists of two phases: face detection with facial landmark extraction and face tilt angle calculation. Multitask Cascaded Convolutional Networks (MTCNN) was used for face detection with facial landmark extraction, while 10 was used as the optimal number of key frames and the optimal face tilt angle range was -10° until 10°. We also compared MS Re-ID system with the proposed method to the normal single-shot and multi-shot systems to examine the proposed method’s impact on the performance. We found that the proposed key frame extraction method successfully retained 100% accuracy while reducing the memory consumption by 2% and the execution time by 19%, compared to MS Re-ID system without key frame extraction.