Shanshan Ji, Qiwei Meng, Wen Wang, Zheyuan Lin, Te Li, Minhong Wan, Chunlong Zhang, J. Gu
{"title":"MSMB-GCN:用于三维人体姿态估计的多尺度多分支融合图卷积网络","authors":"Shanshan Ji, Qiwei Meng, Wen Wang, Zheyuan Lin, Te Li, Minhong Wan, Chunlong Zhang, J. Gu","doi":"10.1109/ROBIO58561.2023.10354638","DOIUrl":null,"url":null,"abstract":"In human-robot interaction (HRI), human pose estimation is a necessary technology for the robot to perceive the dynamic environment and make interactive actions. Recently, graph convolutional networks (GCNs) have been increasingly used for 2D to 3D pose estimation tasks since the skeleton topologies can be viewed as graph structures. In this paper, we propose a novel graph convolutional network architecture, Multi-scale Multi-branch Fusion Graph Convolutional Networks (MSMB-GCN), for 3D Human Pose Estimation(3D HPE) task. The proposed model consists of multiple GCN blocks with a multi-branch architecture. This multi-branch architecture enables the model to get multi-scale features for human skeletal representations. The group of GCN blocks, which has strong multi-level feature extraction capabilities, allows the model to learn global and local features, lower-level and higher-level features. Experiment results on the HumanPose benchmark demonstrate that our model outperforms the state-of-the-art and ablation studies validate the effectiveness of our approach.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"72 5","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MSMB-GCN: Multi-scale Multi-branch Fusion Graph Convolutional Networks for 3D Human Pose Estimation\",\"authors\":\"Shanshan Ji, Qiwei Meng, Wen Wang, Zheyuan Lin, Te Li, Minhong Wan, Chunlong Zhang, J. Gu\",\"doi\":\"10.1109/ROBIO58561.2023.10354638\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In human-robot interaction (HRI), human pose estimation is a necessary technology for the robot to perceive the dynamic environment and make interactive actions. Recently, graph convolutional networks (GCNs) have been increasingly used for 2D to 3D pose estimation tasks since the skeleton topologies can be viewed as graph structures. In this paper, we propose a novel graph convolutional network architecture, Multi-scale Multi-branch Fusion Graph Convolutional Networks (MSMB-GCN), for 3D Human Pose Estimation(3D HPE) task. The proposed model consists of multiple GCN blocks with a multi-branch architecture. This multi-branch architecture enables the model to get multi-scale features for human skeletal representations. The group of GCN blocks, which has strong multi-level feature extraction capabilities, allows the model to learn global and local features, lower-level and higher-level features. Experiment results on the HumanPose benchmark demonstrate that our model outperforms the state-of-the-art and ablation studies validate the effectiveness of our approach.\",\"PeriodicalId\":505134,\"journal\":{\"name\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"volume\":\"72 5\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROBIO58561.2023.10354638\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO58561.2023.10354638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MSMB-GCN: Multi-scale Multi-branch Fusion Graph Convolutional Networks for 3D Human Pose Estimation
In human-robot interaction (HRI), human pose estimation is a necessary technology for the robot to perceive the dynamic environment and make interactive actions. Recently, graph convolutional networks (GCNs) have been increasingly used for 2D to 3D pose estimation tasks since the skeleton topologies can be viewed as graph structures. In this paper, we propose a novel graph convolutional network architecture, Multi-scale Multi-branch Fusion Graph Convolutional Networks (MSMB-GCN), for 3D Human Pose Estimation(3D HPE) task. The proposed model consists of multiple GCN blocks with a multi-branch architecture. This multi-branch architecture enables the model to get multi-scale features for human skeletal representations. The group of GCN blocks, which has strong multi-level feature extraction capabilities, allows the model to learn global and local features, lower-level and higher-level features. Experiment results on the HumanPose benchmark demonstrate that our model outperforms the state-of-the-art and ablation studies validate the effectiveness of our approach.