布局结构辅助室内图像生成

Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang
{"title":"布局结构辅助室内图像生成","authors":"Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang","doi":"10.1109/MIPR51284.2021.00061","DOIUrl":null,"url":null,"abstract":"The existing methods can generate images in accord with scene graph, but the obtained images may appear blurs at the edges and disorders in the structure, due to the lacks of the structure information. In this paper, by considering the indoor images contain more layout structures than outdoor ones, we focus on the indoor image generation assisted with the layout structures. In the proposed method, through fusing the scene graph features together with the layout structure, the graph convolutional network is employed to convert the fused semantic information into the feature representation of scenes. Subsequently, a refined encoder-decoder network is also used for generating the final images. In the experiments, we compare the proposed method with the existing works on the indoor image dataset in terms of subjective and objective evaluations. The experimental results show that our method can achieve better IoU metric, and the visualized results also illustrate that the proposed approach can generate more clear indoor images with better layout structures.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Layout Structure Assisted Indoor Image Generation\",\"authors\":\"Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang\",\"doi\":\"10.1109/MIPR51284.2021.00061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The existing methods can generate images in accord with scene graph, but the obtained images may appear blurs at the edges and disorders in the structure, due to the lacks of the structure information. In this paper, by considering the indoor images contain more layout structures than outdoor ones, we focus on the indoor image generation assisted with the layout structures. In the proposed method, through fusing the scene graph features together with the layout structure, the graph convolutional network is employed to convert the fused semantic information into the feature representation of scenes. Subsequently, a refined encoder-decoder network is also used for generating the final images. In the experiments, we compare the proposed method with the existing works on the indoor image dataset in terms of subjective and objective evaluations. The experimental results show that our method can achieve better IoU metric, and the visualized results also illustrate that the proposed approach can generate more clear indoor images with better layout structures.\",\"PeriodicalId\":139543,\"journal\":{\"name\":\"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MIPR51284.2021.00061\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR51284.2021.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

现有的方法可以生成符合场景图的图像,但由于缺乏结构信息,得到的图像可能出现边缘模糊和结构混乱的情况。考虑到室内图像比室外图像包含更多的布局结构,本文重点研究了在布局结构辅助下的室内图像生成。该方法通过将场景图特征与布局结构融合,利用图卷积网络将融合后的语义信息转化为场景的特征表示。随后,还使用改进的编码器-解码器网络来生成最终图像。在实验中,我们将所提出的方法与现有的室内图像数据集的主观和客观评价进行了比较。实验结果表明,该方法可以获得更好的IoU度量,可视化结果也表明,该方法可以生成更清晰的室内图像和更好的布局结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Layout Structure Assisted Indoor Image Generation
The existing methods can generate images in accord with scene graph, but the obtained images may appear blurs at the edges and disorders in the structure, due to the lacks of the structure information. In this paper, by considering the indoor images contain more layout structures than outdoor ones, we focus on the indoor image generation assisted with the layout structures. In the proposed method, through fusing the scene graph features together with the layout structure, the graph convolutional network is employed to convert the fused semantic information into the feature representation of scenes. Subsequently, a refined encoder-decoder network is also used for generating the final images. In the experiments, we compare the proposed method with the existing works on the indoor image dataset in terms of subjective and objective evaluations. The experimental results show that our method can achieve better IoU metric, and the visualized results also illustrate that the proposed approach can generate more clear indoor images with better layout structures.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
XM2A: Multi-Scale Multi-Head Attention with Cross-Talk for Multi-Variate Time Series Analysis Demo Paper: Ad Hoc Search On Statistical Data Based On Categorization And Metadata Augmentation An Introduction to the JPEG Fake Media Initiative Augmented Tai-Chi Chuan Practice Tool with Pose Evaluation Exploring the Spatial-Visual Locality of Geo-tagged Urban Street Images
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1