{"title":"基于多媒体信息处理技术的运动训练动作可视化","authors":"Yanle Li","doi":"10.1007/s12652-024-04767-1","DOIUrl":null,"url":null,"abstract":"<p>The rapid development of multimedia information processing technology provides development opportunities for digitization in sports, among which motion capture technology, as the latest achievement of multimedia information processing technology, has gradually gained the attention of scholars and started to be used for visualization of sports movements. Therefore, this paper introduces a monocular video motion capture method and optimizes it for the problems of reconstructing human movements such as floating, ground penetration and sliding, which provides a technical path for the specific application of motion capture technology in the field of sports training and also provides a technical guarantee for the visualization of sports training movements. Introduced a new motion capture optimization method. This method captures human motion trajectories from monocular videos, and trajectory operations combine human pose estimation and physical constraints. The proposed method uses foot contact judgment to obtain foot contact events for each motion frame. Then, it optimizes the overall body motion trajectory of the key points based on the obtained contact conditions, making the generated motion visually closer to reality. This article proposes LiteHumanPose Net with a inference speed of up to 22FPS, and conducts experimental analysis and comparison of several popular pose estimation methods from the perspectives of frame rate and average accuracy, such as Sim pleBaseline, HRNet, and Hourglass Net. LiteHumanPose Net outperforms Hourglass Net in terms of frame rate and accuracy, while HRNet has high accuracy due to its multiple parameters but low frame rate. The LiteHumanPose network proposed in this article has a good balance between accuracy and frame rate, and has obvious landing advantages.</p>","PeriodicalId":14959,"journal":{"name":"Journal of Ambient Intelligence and Humanized Computing","volume":"63 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visualization of movements in sports training based on multimedia information processing technology\",\"authors\":\"Yanle Li\",\"doi\":\"10.1007/s12652-024-04767-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The rapid development of multimedia information processing technology provides development opportunities for digitization in sports, among which motion capture technology, as the latest achievement of multimedia information processing technology, has gradually gained the attention of scholars and started to be used for visualization of sports movements. Therefore, this paper introduces a monocular video motion capture method and optimizes it for the problems of reconstructing human movements such as floating, ground penetration and sliding, which provides a technical path for the specific application of motion capture technology in the field of sports training and also provides a technical guarantee for the visualization of sports training movements. Introduced a new motion capture optimization method. This method captures human motion trajectories from monocular videos, and trajectory operations combine human pose estimation and physical constraints. The proposed method uses foot contact judgment to obtain foot contact events for each motion frame. Then, it optimizes the overall body motion trajectory of the key points based on the obtained contact conditions, making the generated motion visually closer to reality. This article proposes LiteHumanPose Net with a inference speed of up to 22FPS, and conducts experimental analysis and comparison of several popular pose estimation methods from the perspectives of frame rate and average accuracy, such as Sim pleBaseline, HRNet, and Hourglass Net. LiteHumanPose Net outperforms Hourglass Net in terms of frame rate and accuracy, while HRNet has high accuracy due to its multiple parameters but low frame rate. The LiteHumanPose network proposed in this article has a good balance between accuracy and frame rate, and has obvious landing advantages.</p>\",\"PeriodicalId\":14959,\"journal\":{\"name\":\"Journal of Ambient Intelligence and Humanized Computing\",\"volume\":\"63 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Ambient Intelligence and Humanized Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s12652-024-04767-1\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ambient Intelligence and Humanized Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12652-024-04767-1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
Visualization of movements in sports training based on multimedia information processing technology
The rapid development of multimedia information processing technology provides development opportunities for digitization in sports, among which motion capture technology, as the latest achievement of multimedia information processing technology, has gradually gained the attention of scholars and started to be used for visualization of sports movements. Therefore, this paper introduces a monocular video motion capture method and optimizes it for the problems of reconstructing human movements such as floating, ground penetration and sliding, which provides a technical path for the specific application of motion capture technology in the field of sports training and also provides a technical guarantee for the visualization of sports training movements. Introduced a new motion capture optimization method. This method captures human motion trajectories from monocular videos, and trajectory operations combine human pose estimation and physical constraints. The proposed method uses foot contact judgment to obtain foot contact events for each motion frame. Then, it optimizes the overall body motion trajectory of the key points based on the obtained contact conditions, making the generated motion visually closer to reality. This article proposes LiteHumanPose Net with a inference speed of up to 22FPS, and conducts experimental analysis and comparison of several popular pose estimation methods from the perspectives of frame rate and average accuracy, such as Sim pleBaseline, HRNet, and Hourglass Net. LiteHumanPose Net outperforms Hourglass Net in terms of frame rate and accuracy, while HRNet has high accuracy due to its multiple parameters but low frame rate. The LiteHumanPose network proposed in this article has a good balance between accuracy and frame rate, and has obvious landing advantages.
期刊介绍:
The purpose of JAIHC is to provide a high profile, leading edge forum for academics, industrial professionals, educators and policy makers involved in the field to contribute, to disseminate the most innovative researches and developments of all aspects of ambient intelligence and humanized computing, such as intelligent/smart objects, environments/spaces, and systems. The journal discusses various technical, safety, personal, social, physical, political, artistic and economic issues. The research topics covered by the journal are (but not limited to):
Pervasive/Ubiquitous Computing and Applications
Cognitive wireless sensor network
Embedded Systems and Software
Mobile Computing and Wireless Communications
Next Generation Multimedia Systems
Security, Privacy and Trust
Service and Semantic Computing
Advanced Networking Architectures
Dependable, Reliable and Autonomic Computing
Embedded Smart Agents
Context awareness, social sensing and inference
Multi modal interaction design
Ergonomics and product prototyping
Intelligent and self-organizing transportation networks & services
Healthcare Systems
Virtual Humans & Virtual Worlds
Wearables sensors and actuators