Weak saliency ensemble network for person Re-identification using infrared light images

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2024-10-23 DOI:10.1016/j.engappai.2024.109517

Min Su Jeong, Seong In Jeong, Dong Chan Lee, Seung Yong Jung, Kang Ryoung Park

{"title":"Weak saliency ensemble network for person Re-identification using infrared light images","authors":"Min Su Jeong, Seong In Jeong, Dong Chan Lee, Seung Yong Jung, Kang Ryoung Park","doi":"10.1016/j.engappai.2024.109517","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, person re-identification (re-id) has primarily been studied using visible light (VL) images. However, the challenges of employing VL images in nighttime environments have prompted research into using infrared light (IR) images. Yet, the utilization of both VL and IR images in person re-id has resulted in increased computational cost and processing time in multi-modality systems, leading to studies focusing solely on IR images. Nevertheless, IR images, lacking color and texture information, generally yield lower recognition performance in existing person re-id studies. In addition, previous studies have shown that person re-id performance suffers in the presence of complex background noise. To tackle these challenges, this study proposes a new weak saliency ensemble network (WSE-Net) for person re-id using IR images. WSE-Net incorporates a channel reduction of feature (CRF) method to reduce computational cost in the ensemble network, a technique for converting input images into group of patch images and feeding them into the ensemble model to enhance the reduced feature information, and a grouped convolution ensemble network (GCE-Net) that enables the fusion of features extracted from original and attention-guided ensemble models.</div><div>The performance of person re-id using WSE-Net was evaluated on the Dongguk body-based person recognition database version 1 (DBPerson-Recog-DB1) and the Sun Yat-sen university multiple modality re-identification version 1 (SYSU-MM01). Experimental results demonstrated that on DBPerson-Recog-DB1, WSE-Net achieved 93.65% in rank 1, 95.28% in mean average precision (mAP), and 93.52% in the harmonic mean of precision and recall. Additionally, on SYSU-MM01, WSE-Net achieved 86.85% in rank 1, 44.58% in mAP, and 40.06% in the harmonic mean of precision and recall. Furthermore, the accuracy of WSE-Net on both datasets surpassed that of state-of-the-art methods.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109517"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016750","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, person re-identification (re-id) has primarily been studied using visible light (VL) images. However, the challenges of employing VL images in nighttime environments have prompted research into using infrared light (IR) images. Yet, the utilization of both VL and IR images in person re-id has resulted in increased computational cost and processing time in multi-modality systems, leading to studies focusing solely on IR images. Nevertheless, IR images, lacking color and texture information, generally yield lower recognition performance in existing person re-id studies. In addition, previous studies have shown that person re-id performance suffers in the presence of complex background noise. To tackle these challenges, this study proposes a new weak saliency ensemble network (WSE-Net) for person re-id using IR images. WSE-Net incorporates a channel reduction of feature (CRF) method to reduce computational cost in the ensemble network, a technique for converting input images into group of patch images and feeding them into the ensemble model to enhance the reduced feature information, and a grouped convolution ensemble network (GCE-Net) that enables the fusion of features extracted from original and attention-guided ensemble models.

The performance of person re-id using WSE-Net was evaluated on the Dongguk body-based person recognition database version 1 (DBPerson-Recog-DB1) and the Sun Yat-sen university multiple modality re-identification version 1 (SYSU-MM01). Experimental results demonstrated that on DBPerson-Recog-DB1, WSE-Net achieved 93.65% in rank 1, 95.28% in mean average precision (mAP), and 93.52% in the harmonic mean of precision and recall. Additionally, on SYSU-MM01, WSE-Net achieved 86.85% in rank 1, 44.58% in mAP, and 40.06% in the harmonic mean of precision and recall. Furthermore, the accuracy of WSE-Net on both datasets surpassed that of state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用红外光图像进行人物再识别的弱显著性集合网络

近年来，对人员重新识别（re-id）的研究主要使用可见光（VL）图像。然而，在夜间环境中使用可见光图像所面临的挑战促使人们开始研究使用红外图像。然而，在人员重新识别中同时使用可见光和红外图像会增加多模态系统的计算成本和处理时间，导致研究只关注红外图像。然而，红外图像缺乏颜色和纹理信息，在现有的人员重识别研究中识别率通常较低。此外，以往的研究表明，在存在复杂背景噪声的情况下，人物再识别性能也会受到影响。为了应对这些挑战，本研究提出了一种新的弱显著性集合网络（WSE-Net），用于利用红外图像进行人物重识别。WSE-Net 采用了一种通道特征还原（CRF）方法来降低集合网络的计算成本；一种将输入图像转换为一组补丁图像并将其输入集合模型以增强还原特征信息的技术；以及一种分组卷积集合网络（GCE-Net），该网络可将从原始集合模型和注意力引导集合模型中提取的特征进行融合。使用 WSE-Net 进行的人物再识别性能评估是在基于人体的东国人物识别数据库第一版（DBPerson-Recog-DB1）和中山大学多模态再识别第一版（SYSU-MM01）上进行的。实验结果表明，在 DBPerson-Recog-DB1 上，WSE-Net 取得了 93.65% 的排名第一、95.28% 的平均精确度 (mAP)，以及 93.52% 的精确度和召回率谐波平均值。此外，在 SYSU-MM01 上，WSE-Net 的排名第一的准确率为 86.85%，mAP 为 44.58%，精确度和召回率的调和平均值为 40.06%。此外，WSE-Net 在这两个数据集上的准确率都超过了最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.