Jorge Gonzalez Escribano, Susana Rauno, A. Swaminathan, David Smyth, A. Smolic
24th Irish Machine Vision and Image Processing Conference, 31 August 2022. DOI: 10.56541/soww6683
Texture improvement for human shape estimation from a single image
Current techniques for digitizing a human from a single image show promising results in the quality of the estimated geometry, but they often fall short in the texture of the generated 3D model, especially on the occluded side of the person; some methods do not output a texture at all. Our goal in this paper is to improve the predicted texture of these models without requiring any input beyond the original image used to generate the 3D model in the first place. To that end, we propose a novel way to predict the back view of the person by incorporating semantic and positional information, outperforming state-of-the-art techniques. Our method is based on a general-purpose image-to-image translation algorithm with conditional adversarial networks, adapted to predict the back view of a human. Furthermore, we use the predicted image to improve the texture of the estimated 3D model, and we provide a 3D dataset, V-Human, to train our method as well as any 3D human shape estimation algorithm that uses meshes, such as PIFu.
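The back-view prediction described above follows the conditional image-to-image translation (pix2pix-style) pattern: a generator maps the front view plus auxiliary condition channels to a candidate back view, and a patch-wise discriminator scores (condition, image) pairs. The sketch below is a minimal illustration of that pattern in PyTorch; the layer sizes, channel counts, and the six "condition" channels (RGB plus hypothetical semantic/positional maps) are assumptions for illustration, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class TinyUNetGenerator(nn.Module):
    """Toy encoder-decoder: maps a front view plus condition channels
    (e.g. semantic/positional maps) to a predicted back view."""
    def __init__(self, in_ch=6, out_ch=3):  # 3 RGB + 3 hypothetical condition channels
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(in_ch, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
        )
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, out_ch, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, x):
        return self.dec(self.enc(x))

class PatchDiscriminator(nn.Module):
    """Conditional discriminator: scores (condition, image) pairs patch-wise,
    as in pix2pix, rather than with a single global real/fake score."""
    def __init__(self, in_ch=9):  # 6 condition channels + 3-channel candidate back view
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(32, 1, 4, stride=2, padding=1),  # one score per patch
        )

    def forward(self, cond, img):
        return self.net(torch.cat([cond, img], dim=1))

cond = torch.randn(1, 6, 64, 64)       # front view + condition maps
back = TinyUNetGenerator()(cond)       # predicted back view, same spatial size
score = PatchDiscriminator()(cond, back)  # patch-wise realism scores
```

In a full training loop, the discriminator would be trained on real (condition, ground-truth back view) pairs against generated ones, with an additional L1 reconstruction term on the generator, following the standard conditional-GAN recipe.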