KiRV: Robust Human Identification via Multimodal Learning Based on Kinetic Gait Features of Radar and Vision
Lang Deng; Jifang Pei; Yuansen Song; Weibo Huo; Yin Zhang; Yulin Huang
IEEE Internet of Things Journal, vol. 12, no. 12, pp. 22224-22242
DOI: 10.1109/JIOT.2025.3550532
Published: 2025-03-12
https://ieeexplore.ieee.org/document/10924240/
Citations: 0
Abstract
Gait is an appealing biometric pattern that aims to identify individuals by the way they walk. Gait recognition, a passive human identification technology that works at a distance without subject cooperation, plays a considerable role in life monitoring, crime prevention, security assurance, and other identity recognition applications. Although vision-based methods dominate the state of the art, their performance degrades under poor illumination. In contrast, radar signals are unaffected by light and are more sensitive to micro-motion information. In this article, we design a Kinetic feature-based Radar-Vision fused (KiRV) gait recognition method, which leverages millimeter-wave radar echo signals and video for illumination-robust human identification. In KiRV, we propose a novel kinetic gait feature representation framework based on radar micro-Doppler and visual optical flow information, which are direct expressions of the gait motion process. The physical meaning of the kinetic features under the two modalities is similar, while their semantic information is complementary; therefore, the two features can be effectively fused. To learn robust gait information, we propose two lightweight 2-D residual CNN-based backbone networks to encode the kinetic features of each modality, and further propose a two-stream cross-correlated fusion method comprising radar-vision cross-correlated fusion (RVCF) and radar-vision gate unit (RVGU) modules. The RVCF adaptively adjusts the attention paid to radar and vision for better recognition performance, while the RVGU controls the contribution of each modality to the fused feature to improve the robustness of the model. Finally, the gait retrieval task is achieved through this model together with joint loss calculation at different feature levels. Extensive experiments in real-world and semi-simulated settings demonstrate that KiRV outperforms state-of-the-art gait recognition methods while remaining robust to illumination changes.
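The gating idea behind the RVGU can be illustrated with a minimal sketch. Everything below is hypothetical (the function name, scalar weights, and toy vectors are not from the paper, whose RVGU operates on learned CNN feature maps); it only shows how a sigmoid gate blends two modality features so that each modality's contribution is bounded:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gated_fusion(radar_feat, vision_feat, w_r=1.0, w_v=1.0, b=0.0):
    """Fuse two feature vectors with a per-element sigmoid gate:
    g = sigmoid(w_r * r + w_v * v + b), fused = g * r + (1 - g) * v.
    Because g lies in (0, 1), each fused value is a convex combination
    of the radar and vision features."""
    fused = []
    for r, v in zip(radar_feat, vision_feat):
        g = sigmoid(w_r * r + w_v * v + b)
        fused.append(g * r + (1.0 - g) * v)
    return fused

# Toy radar/vision embeddings (illustrative values only).
radar = [0.2, -1.0, 0.5]
vision = [1.0, 0.3, -0.4]
fused = gated_fusion(radar, vision)
```

Since the gate stays strictly between 0 and 1, every fused element lies between the corresponding radar and vision values, so neither modality can dominate unboundedly; in the actual model the gate parameters are learned, letting the network down-weight the less reliable modality (e.g., vision under poor illumination).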
Journal description:
The IEEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impact on sensor technologies, big data management, and future Internet design for applications such as smart cities and smart homes. Fields of interest include IoT architecture, such as things-centric, data-centric, and service-oriented IoT architecture; IoT enabling technologies and systematic integration, such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds, such as IoT service middleware, IoT application programming interfaces (APIs), IoT application design, and IoT trials/experiments; and IoT standardization activities and technology development in standards development organizations (SDOs) such as IEEE, IETF, ITU, 3GPP, and ETSI.