Biao Jin;Hao Wu;Zhenkai Zhang;Zhuxian Lian;Xiangqun Zhang;Genyuan Du
{"title":"SRDST: Effective Dynamic Gesture Recognition With Sparse Representation and Dual-Stream Transformers in mmWave Radar","authors":"Biao Jin;Hao Wu;Zhenkai Zhang;Zhuxian Lian;Xiangqun Zhang;Genyuan Du","doi":"10.1109/TII.2024.3455419","DOIUrl":null,"url":null,"abstract":"Millimeter-wave radar holds significant potential for dynamic gesture recognition in contactless human-computer interaction, particularly in the Internet of Things and consumer electronics applications. However, a considerable challenge persists in filtering vast amounts of extraneous data from millimeter-wave radar echoes to isolate meaningful gesture features. We present a novel approach based on sparse representation principles to address this. We first generate a range-Doppler map of gestures using a two-dimensional (2-D) fast Fourier transform, then construct a Doppler-Time trajectory from aggregated data across multiple frames. Capitalizing on the intrinsic sparsity in the Doppler-time domain, we employ the orthogonal matching pursuit algorithm to refine a multidimensional feature sequence across time, Doppler, and range dimensions. Central to our approach is a dual-stream Transformer network that explores complex 2-D correlations in feature sequences via multihead self-attention mechanisms. This technique significantly improves gesture feature extraction efficiency and reduces data redundancy. The experimental results show that our model has an average recognition accuracy of 99.17% and a size of 0.17M, which is very suitable for application in embedded devices.","PeriodicalId":13301,"journal":{"name":"IEEE Transactions on Industrial Informatics","volume":"21 1","pages":"604-612"},"PeriodicalIF":9.9000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Informatics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10682117/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Millimeter-wave radar holds significant potential for dynamic gesture recognition in contactless human-computer interaction, particularly in the Internet of Things and consumer electronics applications. However, a considerable challenge persists in filtering vast amounts of extraneous data from millimeter-wave radar echoes to isolate meaningful gesture features. We present a novel approach based on sparse representation principles to address this. We first generate a range-Doppler map of gestures using a two-dimensional (2-D) fast Fourier transform, then construct a Doppler-Time trajectory from aggregated data across multiple frames. Capitalizing on the intrinsic sparsity in the Doppler-time domain, we employ the orthogonal matching pursuit algorithm to refine a multidimensional feature sequence across time, Doppler, and range dimensions. Central to our approach is a dual-stream Transformer network that explores complex 2-D correlations in feature sequences via multihead self-attention mechanisms. This technique significantly improves gesture feature extraction efficiency and reduces data redundancy. The experimental results show that our model has an average recognition accuracy of 99.17% and a size of 0.17M, which is very suitable for application in embedded devices.
期刊介绍:
The IEEE Transactions on Industrial Informatics is a multidisciplinary journal dedicated to publishing technical papers that connect theory with practical applications of informatics in industrial settings. It focuses on the utilization of information in intelligent, distributed, and agile industrial automation and control systems. The scope includes topics such as knowledge-based and AI-enhanced automation, intelligent computer control systems, flexible and collaborative manufacturing, industrial informatics in software-defined vehicles and robotics, computer vision, industrial cyber-physical and industrial IoT systems, real-time and networked embedded systems, security in industrial processes, industrial communications, systems interoperability, and human-machine interaction.