3D Scene Graph Generation Using Prior Knowledge from Large Language Model (LLM)

Ho-Jun Baek, Incheol Kim
{"title":"3D Scene Graph Generation Using Prior Knowledge from Large Language Model (LLM)","authors":"Ho-Jun Baek, Incheol Kim","doi":"10.9717/kmms.2023.26.8.859","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel 3D scene graph generation model, L3DSG, which can make use of rich prior knowledge obtained from large language model (LLM) by prompt engineering. The proposed model is built upon our previous 3D scene graph generation model, C3DSG, that adopts Point Transformer as 3D geometric feature extractor and uses the NE-GAT graph neural network as context reasoner. The new proposed model addresses the inability of C3DSG to utilize prior knowledge on indoor physical environments. It focuses on issues of how to obtain prior knowledge from LLM and how to make use of it for predicting objects and their relations effectively. The proposed model is extended from C3DSG by adding several elaborate modules to prompt, encode, and fuse prior knowledge from LLM. Through various experiments using the benchmark dataset 3DSSG, we show the superiority of the proposed model.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korea Multimedia Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9717/kmms.2023.26.8.859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we propose a novel 3D scene graph generation model, L3DSG, which can make use of rich prior knowledge obtained from large language model (LLM) by prompt engineering. The proposed model is built upon our previous 3D scene graph generation model, C3DSG, that adopts Point Transformer as 3D geometric feature extractor and uses the NE-GAT graph neural network as context reasoner. The new proposed model addresses the inability of C3DSG to utilize prior knowledge on indoor physical environments. It focuses on issues of how to obtain prior knowledge from LLM and how to make use of it for predicting objects and their relations effectively. The proposed model is extended from C3DSG by adding several elaborate modules to prompt, encode, and fuse prior knowledge from LLM. Through various experiments using the benchmark dataset 3DSSG, we show the superiority of the proposed model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于LLM先验知识的三维场景图生成
本文提出了一种新的三维场景图生成模型L3DSG,该模型可以利用大语言模型(large language model, LLM)中丰富的先验知识。该模型是在我们之前的三维场景图生成模型C3DSG的基础上建立的,C3DSG采用Point Transformer作为三维几何特征提取器,并使用NE-GAT图神经网络作为上下文推理器。新提出的模型解决了C3DSG无法利用室内物理环境的先验知识的问题。重点研究了如何从LLM中获取先验知识,以及如何利用先验知识有效地预测对象及其关系。该模型是在C3DSG的基础上扩展而来的,通过添加一些精细的模块来提示、编码和融合来自LLM的先验知识。通过使用基准数据集3DSSG的各种实验,我们证明了所提出模型的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Usability Study of GAN-based Webtoon Background Image Data Augmentation A Smart Sensor for Sleep Posture Measurement Using Pressure Sensors LNG and HFO Fuel Consumption Forecasting Modeling Using LightGBM Input Data Processing Methods to Improve Point Cloud Completion Model for Dental Prosthesis Low-Resolution Image Upsampling Method Using Super Resolution Based Adaptive Pixel Shuffle
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1