Ear Recognition In The Wild with Convolutional Neural Networks

2021 XLVII Latin American Computing Conference (CLEI) Pub Date : 2021-10-25 DOI:10.1109/CLEI53233.2021.9640083

Solange Ramos-Cooper, Guillermo Cámara Chávez

{"title":"Ear Recognition In The Wild with Convolutional Neural Networks","authors":"Solange Ramos-Cooper, Guillermo Cámara Chávez","doi":"10.1109/CLEI53233.2021.9640083","DOIUrl":null,"url":null,"abstract":"Ear recognition has gained attention in recent years. The possibility of being captured from a distance, contactless, without the cooperation of the subject and not be affected by facial expressions makes ear recognition a captivating choice for surveillance and security applications, and even more in the current COVID-19 pandemic context where modalities like face recognition fail due to mouth and facial covering masks usage. Applying any deep learning (DL) algorithm usually demands a large amount of training data and appropriate network architectures, therefore we introduce a large-scale database and explore fine-tuning pre-trained convolutional neural networks (CNNs) looking for a robust representation of ear images taken under uncontrolled conditions. Taking advantage of the face recognition field, we built an ear dataset based on the VGGFace dataset and use the Mask-RCNN for ear detection. Besides, adapting the VGGFace model to the ear domain leads to a better performance than using a model trained for general image recognition. Experiments on the UERC dataset have shown that fine-tuning from a face recognition model and using a larger dataset leads to a significant improvement of around 9% compared to state-of-the-art methods on the ear recognition field. In addition, we have explored score-level fusion by combining matching scores of the fine-tuning models which leads to an improvement of around 4% more. Open-set and close-set experiments have been performed and evaluated using Rank-1 and Rank-5 recognition rate metrics.","PeriodicalId":6803,"journal":{"name":"2021 XLVII Latin American Computing Conference (CLEI)","volume":"29 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 XLVII Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI53233.2021.9640083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Ear recognition has gained attention in recent years. The possibility of being captured from a distance, contactless, without the cooperation of the subject and not be affected by facial expressions makes ear recognition a captivating choice for surveillance and security applications, and even more in the current COVID-19 pandemic context where modalities like face recognition fail due to mouth and facial covering masks usage. Applying any deep learning (DL) algorithm usually demands a large amount of training data and appropriate network architectures, therefore we introduce a large-scale database and explore fine-tuning pre-trained convolutional neural networks (CNNs) looking for a robust representation of ear images taken under uncontrolled conditions. Taking advantage of the face recognition field, we built an ear dataset based on the VGGFace dataset and use the Mask-RCNN for ear detection. Besides, adapting the VGGFace model to the ear domain leads to a better performance than using a model trained for general image recognition. Experiments on the UERC dataset have shown that fine-tuning from a face recognition model and using a larger dataset leads to a significant improvement of around 9% compared to state-of-the-art methods on the ear recognition field. In addition, we have explored score-level fusion by combining matching scores of the fine-tuning models which leads to an improvement of around 4% more. Open-set and close-set experiments have been performed and evaluated using Rank-1 and Rank-5 recognition rate metrics.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

卷积神经网络在野外的耳朵识别

近年来，耳朵识别技术引起了人们的关注。在没有受试者合作的情况下，可以从远距离、非接触式、不受面部表情影响的方式进行捕捉，这使得耳朵识别成为监控和安全应用的一个迷人选择，在当前COVID-19大流行的背景下，人脸识别等方式由于使用嘴巴和面部覆盖口罩而失效，这种选择就更有吸引力了。应用任何深度学习(DL)算法通常需要大量的训练数据和适当的网络架构，因此我们引入了一个大规模的数据库，并探索微调预训练卷积神经网络(cnn)，寻找在不受控制的条件下拍摄的耳朵图像的鲁棒表示。利用人脸识别领域的优势，我们在VGGFace数据集的基础上构建了一个耳朵数据集，并使用Mask-RCNN进行耳朵检测。此外，将VGGFace模型应用于耳域比使用一般图像识别训练的模型具有更好的性能。在UERC数据集上的实验表明，与耳朵识别领域最先进的方法相比，面部识别模型的微调和使用更大的数据集可以显著提高约9%。此外，我们还通过结合微调模型的匹配分数来探索分数级融合，这导致了大约4%的改进。使用Rank-1和Rank-5识别率指标进行了开集和闭集实验并进行了评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 XLVII Latin American Computing Conference (CLEI)

自引率

0.00%

发文量

期刊最新文献

Structured Text Generation for Spanish Freestyle Battles using Neural Networks Learning factory for the Software Engineering area: First didactic transformation An Early Alert System for Software Vulnerabilities based on Vulnerability Repositories and Social Networks Data Quality Management oriented to the Electronic Medical Record Program Committees