{"title":"改进的堆叠沙漏网络用于稳健的6D目标姿态估计","authors":"Kun Li, Hui Zhang, Lei Peng","doi":"10.1145/3459104.3459177","DOIUrl":null,"url":null,"abstract":"In this article, we introduce an accurate yet robust method to recover the 6D pose of the object from an RGB image. The core of our method is using the farthest point sampling algorithm to design a set of representative keypoints on the object model surface, and then use the improved stacked hourglass network (ISHN) with multi-scale aggregation module to localize them in the 2D image by predicting the keypoints heatmaps. Finally, the PnP algorithm can recover the 6D pose according to the 3D-2D relationship of keypoints. Besides, when the object is partially occluded, we can successfully recover the pose of the object by selecting the most confident keypoints. Our method can simultaneously detect and recover the 6D pose of the instance object in the RGB image without additional post-processing steps. Experimental results show that compared with the state-of-the-art RGB-based pose estimation methods, our method can achieve competitive or more superior performance on two benchmark datasets.","PeriodicalId":142284,"journal":{"name":"2021 International Symposium on Electrical, Electronics and Information Engineering","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Stacked Hourglass Network for Robust 6D Object Pose Estimation\",\"authors\":\"Kun Li, Hui Zhang, Lei Peng\",\"doi\":\"10.1145/3459104.3459177\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this article, we introduce an accurate yet robust method to recover the 6D pose of the object from an RGB image. The core of our method is using the farthest point sampling algorithm to design a set of representative keypoints on the object model surface, and then use the improved stacked hourglass network (ISHN) with multi-scale aggregation module to localize them in the 2D image by predicting the keypoints heatmaps. Finally, the PnP algorithm can recover the 6D pose according to the 3D-2D relationship of keypoints. Besides, when the object is partially occluded, we can successfully recover the pose of the object by selecting the most confident keypoints. Our method can simultaneously detect and recover the 6D pose of the instance object in the RGB image without additional post-processing steps. Experimental results show that compared with the state-of-the-art RGB-based pose estimation methods, our method can achieve competitive or more superior performance on two benchmark datasets.\",\"PeriodicalId\":142284,\"journal\":{\"name\":\"2021 International Symposium on Electrical, Electronics and Information Engineering\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Symposium on Electrical, Electronics and Information Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3459104.3459177\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Electrical, Electronics and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459104.3459177","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Stacked Hourglass Network for Robust 6D Object Pose Estimation
In this article, we introduce an accurate yet robust method to recover the 6D pose of the object from an RGB image. The core of our method is using the farthest point sampling algorithm to design a set of representative keypoints on the object model surface, and then use the improved stacked hourglass network (ISHN) with multi-scale aggregation module to localize them in the 2D image by predicting the keypoints heatmaps. Finally, the PnP algorithm can recover the 6D pose according to the 3D-2D relationship of keypoints. Besides, when the object is partially occluded, we can successfully recover the pose of the object by selecting the most confident keypoints. Our method can simultaneously detect and recover the 6D pose of the instance object in the RGB image without additional post-processing steps. Experimental results show that compared with the state-of-the-art RGB-based pose estimation methods, our method can achieve competitive or more superior performance on two benchmark datasets.