RangeBird: Multi View Panoptic Segmentation of 3D Point Clouds with Neighborhood Attention

Fabian Duerr, H. Weigel, J. Beyerer
Published in: 2022 International Conference on Robotics and Automation (ICRA)
Publication date: 2022-05-23
DOI: https://doi.org/10.1109/icra46639.2022.9811998
Citation count: 3

Abstract

Panoptic segmentation of point clouds is one of the key challenges of 3D scene understanding, requiring the simultaneous prediction of semantics and object instances. Tasks like autonomous driving strongly depend on this information to gain a holistic understanding of their 3D environment. This work presents a novel proposal-free framework for lidar-based panoptic segmentation, which exploits three different point cloud representations, leveraging their strengths and compensating for their weaknesses. The efficient projection-based range view and bird's eye view are combined and further extended by a point-based network with a novel attention-based neighborhood aggregation for improved semantic features. Cluster-based object recognition in bird's eye view enables efficient, high-quality instance segmentation. Semantic and instance segmentation are fused and further refined by a novel instance classification to produce the final panoptic segmentation. Results on two challenging large-scale datasets, nuScenes and SemanticKITTI, show the success of the proposed framework, which outperforms all existing approaches on nuScenes and achieves state-of-the-art results on SemanticKITTI.
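The two projection-based representations named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the image size, the vertical field of view, and the grid extents are all assumed values chosen for illustration, and the abstract does not state the settings used in the paper. The sketch only shows how a point is commonly mapped to a range-view pixel via spherical coordinates and to a bird's-eye-view cell via a top-down grid.

```python
import numpy as np


def range_view_indices(points, height=32, width=1024,
                       fov_up_deg=10.0, fov_down_deg=-30.0):
    """Map each point (x, y, z) to a (row, col) pixel of a spherical range image.

    Image size and field of view are assumed values, not taken from the paper.
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    depth = np.linalg.norm(points[:, :3], axis=1)
    yaw = np.arctan2(y, x)                              # azimuth in [-pi, pi]
    pitch = np.arcsin(z / np.maximum(depth, 1e-8))      # elevation angle
    fov_down = np.radians(fov_down_deg)
    fov = np.radians(fov_up_deg) - fov_down
    col = 0.5 * (1.0 - yaw / np.pi) * width             # azimuth -> column
    row = (1.0 - (pitch - fov_down) / fov) * height     # elevation -> row
    col = np.clip(np.floor(col), 0, width - 1).astype(np.int64)
    row = np.clip(np.floor(row), 0, height - 1).astype(np.int64)
    return row, col, depth


def bev_indices(points, x_range=(-50.0, 50.0), y_range=(-50.0, 50.0), cell=0.5):
    """Map each point to a (row, col) cell of a top-down grid.

    Grid extents and cell size are assumed values. Also returns a mask
    marking the points that fall inside the grid.
    """
    x, y = points[:, 0], points[:, 1]
    col = np.floor((x - x_range[0]) / cell).astype(np.int64)
    row = np.floor((y - y_range[0]) / cell).astype(np.int64)
    n_cols = int(round((x_range[1] - x_range[0]) / cell))
    n_rows = int(round((y_range[1] - y_range[0]) / cell))
    in_grid = (col >= 0) & (col < n_cols) & (row >= 0) & (row < n_rows)
    return row, col, in_grid


# Three toy points: straight ahead, to the left, and outside the grid.
points = np.array([[10.0, 0.0, 0.0],
                   [0.0, 10.0, -1.0],
                   [-60.0, 0.0, 0.0]])
rv_row, rv_col, depth = range_view_indices(points)
bev_row, bev_col, in_grid = bev_indices(points)
```

Because every point keeps its own pixel and cell index, features computed in either 2D view can be gathered back to the points, which is what makes combining the views with a point-based network possible in principle.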