{"title":"AB-PointNet for 3D Point Cloud Recognition","authors":"J. Komori, K. Hotta","doi":"10.1109/DICTA47822.2019.8945926","DOIUrl":null,"url":null,"abstract":"Semantic segmentation of 3D point clouds is difficult task due to its unordered representation. PointNet is a pioneering work which used 3D point clouds directly to predict 3D point semantic labels. However, it has a problem that it predicts labels without using local structure in metric space. Recent researches tackled this problem and achieved better performance. In addition to the problem, we considered that treating all channels with the same weight is obstacle to improve the accuracy. Therefore, we propose AB-PointNet which has been modified to predict 3D point semantic labels by considering the importance of channels. To emphasize the important channels, we used attention module which emphasizes channels that are useful for prediction and suppresses unimportant channels. This makes it possible to learn more effective features. In experiments, we evaluate our method on the large-scale indoor spaces 3D point cloud dataset with 13 semantic labels. Our proposed AB-PointNet has advanced performance of 3.2% in mean IoU in comparison with the conventional PointNet.","PeriodicalId":6696,"journal":{"name":"2019 Digital Image Computing: Techniques and Applications (DICTA)","volume":"57 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA47822.2019.8945926","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Semantic segmentation of 3D point clouds is difficult task due to its unordered representation. PointNet is a pioneering work which used 3D point clouds directly to predict 3D point semantic labels. However, it has a problem that it predicts labels without using local structure in metric space. Recent researches tackled this problem and achieved better performance. In addition to the problem, we considered that treating all channels with the same weight is obstacle to improve the accuracy. Therefore, we propose AB-PointNet which has been modified to predict 3D point semantic labels by considering the importance of channels. To emphasize the important channels, we used attention module which emphasizes channels that are useful for prediction and suppresses unimportant channels. This makes it possible to learn more effective features. In experiments, we evaluate our method on the large-scale indoor spaces 3D point cloud dataset with 13 semantic labels. Our proposed AB-PointNet has advanced performance of 3.2% in mean IoU in comparison with the conventional PointNet.