Progressive 3D Scene Understanding with Stacked Neural Networks

Youcheng Song, Zhengxing Sun
{"title":"Progressive 3D Scene Understanding with Stacked Neural Networks","authors":"Youcheng Song, Zhengxing Sun","doi":"10.2312/pg.20181280","DOIUrl":null,"url":null,"abstract":"3D scene understanding is difficult due to the natural hierarchical structures and complicated contextual relationships in the 3d scenes. In this paper, a progressive 3D scene understanding method is proposed. The scene understanding task is decomposed into several different but related tasks, and semantic objects are progressively separated from coarse to fine. It is achieved by stacking multiple segmentation networks. The former network segments the 3D scene at a coarser level and passes the result as context to the latter one for a finer-grained segmentation. For the network training, we build a connection graph (vertices indicating objects and edges’ weights indicating contact area between objects), and calculate a maximum spanning tree to generate coarse-to-fine labels. Then we train the stacked network by hierarchical supervision based on the generated coarseto-fine labels. Finally, using the trained model, we can not only obtain better segmentation accuracy at the finest-grained than directly using the segmentation network, but also obtain a hierarchical understanding result of the 3d scene as a bonus. CCS Concepts •Computing methodologies → Scene understanding; Neural networks; Shape representations;","PeriodicalId":88304,"journal":{"name":"Proceedings. Pacific Conference on Computer Graphics and Applications","volume":"30 1","pages":"57-60"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. Pacific Conference on Computer Graphics and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2312/pg.20181280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

3D scene understanding is difficult due to the natural hierarchical structures and complicated contextual relationships in the 3d scenes. In this paper, a progressive 3D scene understanding method is proposed. The scene understanding task is decomposed into several different but related tasks, and semantic objects are progressively separated from coarse to fine. It is achieved by stacking multiple segmentation networks. The former network segments the 3D scene at a coarser level and passes the result as context to the latter one for a finer-grained segmentation. For the network training, we build a connection graph (vertices indicating objects and edges’ weights indicating contact area between objects), and calculate a maximum spanning tree to generate coarse-to-fine labels. Then we train the stacked network by hierarchical supervision based on the generated coarseto-fine labels. Finally, using the trained model, we can not only obtain better segmentation accuracy at the finest-grained than directly using the segmentation network, but also obtain a hierarchical understanding result of the 3d scene as a bonus. CCS Concepts •Computing methodologies → Scene understanding; Neural networks; Shape representations;
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
渐进式3D场景理解与堆叠神经网络
由于三维场景具有自然的层次结构和复杂的上下文关系,给理解三维场景带来了困难。本文提出了一种渐进的三维场景理解方法。场景理解任务被分解为多个不同但又相互关联的任务,语义对象由粗到细逐步分离。它是通过叠加多个分割网络来实现的。前者在较粗的层次上对3D场景进行分割,并将结果作为上下文传递给后者进行更细粒度的分割。对于网络训练,我们建立一个连接图(顶点表示对象,边的权重表示对象之间的接触面积),并计算一个最大生成树来生成粗到细的标签。然后基于生成的粗-精标签,采用分层监督的方法对堆叠网络进行训练。最后,利用训练好的模型,我们不仅可以在最细粒度处获得比直接使用分割网络更好的分割精度,而且还可以获得三维场景的分层理解结果。•计算方法→场景理解;神经网络;形状表示;
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cloud-Assisted Hybrid Rendering for Thin-Client Games and VR Applications Interactive Deformable Image Registration with Dual Cursor DFGA: Digital Human Faces Generation and Animation from the RGB Video using Modern Deep Learning Technology Aesthetic Enhancement via Color Area and Location Awareness Learning a Style Space for Interactive Line Drawing Synthesis from Animated 3D Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1