Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang
{"title":"Layout Structure Assisted Indoor Image Generation","authors":"Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang","doi":"10.1109/MIPR51284.2021.00061","DOIUrl":null,"url":null,"abstract":"The existing methods can generate images in accord with scene graph, but the obtained images may appear blurs at the edges and disorders in the structure, due to the lacks of the structure information. In this paper, by considering the indoor images contain more layout structures than outdoor ones, we focus on the indoor image generation assisted with the layout structures. In the proposed method, through fusing the scene graph features together with the layout structure, the graph convolutional network is employed to convert the fused semantic information into the feature representation of scenes. Subsequently, a refined encoder-decoder network is also used for generating the final images. In the experiments, we compare the proposed method with the existing works on the indoor image dataset in terms of subjective and objective evaluations. The experimental results show that our method can achieve better IoU metric, and the visualized results also illustrate that the proposed approach can generate more clear indoor images with better layout structures.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR51284.2021.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The existing methods can generate images in accord with scene graph, but the obtained images may appear blurs at the edges and disorders in the structure, due to the lacks of the structure information. In this paper, by considering the indoor images contain more layout structures than outdoor ones, we focus on the indoor image generation assisted with the layout structures. In the proposed method, through fusing the scene graph features together with the layout structure, the graph convolutional network is employed to convert the fused semantic information into the feature representation of scenes. Subsequently, a refined encoder-decoder network is also used for generating the final images. In the experiments, we compare the proposed method with the existing works on the indoor image dataset in terms of subjective and objective evaluations. The experimental results show that our method can achieve better IoU metric, and the visualized results also illustrate that the proposed approach can generate more clear indoor images with better layout structures.