{"title":"Recognition Method with Deep Contrastive Learning and Improved Transformer for 3D Human Motion Pose","authors":"Datian Liu, Haitao Yang, Zhang Lei","doi":"10.1007/s44196-023-00351-1","DOIUrl":null,"url":null,"abstract":"Abstract Three-dimensional (3D) human pose recognition techniques based on spatial data have gained attention. However, existing models and algorithms fail to achieve desired precision. We propose a 3D human motion pose recognition method using deep contrastive learning and an improved Transformer. The improved Transformer removes noise between human motion RGB and depth images, addressing orientation correlation in 3D models. Two-dimensional (2D) pose features are extracted from de-noised RGB images using a kernel generation module in a graph convolutional network (GCN). Depth features are extracted from de-noised depth images. The 2D pose features and depth features are fused using a regression module in the GCN to obtain 3D pose recognition results. The results demonstrate that the proposed method captures RGB and depth images, achieving high recognition accuracy and fast speed. The proposed method demonstrates good accuracy in 3D human motion pose recognition.","PeriodicalId":54967,"journal":{"name":"International Journal of Computational Intelligence Systems","volume":"2011 3","pages":"0"},"PeriodicalIF":2.9000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Intelligence Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s44196-023-00351-1","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Three-dimensional (3D) human pose recognition techniques based on spatial data have gained attention. However, existing models and algorithms fail to achieve desired precision. We propose a 3D human motion pose recognition method using deep contrastive learning and an improved Transformer. The improved Transformer removes noise between human motion RGB and depth images, addressing orientation correlation in 3D models. Two-dimensional (2D) pose features are extracted from de-noised RGB images using a kernel generation module in a graph convolutional network (GCN). Depth features are extracted from de-noised depth images. The 2D pose features and depth features are fused using a regression module in the GCN to obtain 3D pose recognition results. The results demonstrate that the proposed method captures RGB and depth images, achieving high recognition accuracy and fast speed. The proposed method demonstrates good accuracy in 3D human motion pose recognition.
期刊介绍:
The International Journal of Computational Intelligence Systems publishes original research on all aspects of applied computational intelligence, especially targeting papers demonstrating the use of techniques and methods originating from computational intelligence theory. The core theories of computational intelligence are fuzzy logic, neural networks, evolutionary computation and probabilistic reasoning. The journal publishes only articles related to the use of computational intelligence and broadly covers the following topics:
-Autonomous reasoning-
Bio-informatics-
Cloud computing-
Condition monitoring-
Data science-
Data mining-
Data visualization-
Decision support systems-
Fault diagnosis-
Intelligent information retrieval-
Human-machine interaction and interfaces-
Image processing-
Internet and networks-
Noise analysis-
Pattern recognition-
Prediction systems-
Power (nuclear) safety systems-
Process and system control-
Real-time systems-
Risk analysis and safety-related issues-
Robotics-
Signal and image processing-
IoT and smart environments-
Systems integration-
System control-
System modelling and optimization-
Telecommunications-
Time series prediction-
Warning systems-
Virtual reality-
Web intelligence-
Deep learning