Creating LEGO Figurines from Single Images

IF 4.7 | Region 2 (Chemistry) | JCR Q2, MATERIALS SCIENCE, MULTIDISCIPLINARY | ACS Applied Polymer Materials | Pub Date: 2024-07-19 | DOI: 10.1145/3658167

Jiahao Ge, Mingjun Zhou, Wenrui Bao, Hao Xu, Chi-Wing Fu


Citations: 0

Abstract

This paper presents a computational pipeline for creating personalized, physical LEGO® figurines from user-input portrait photos. The generated figurine is an assembly of coherently connected LEGO® bricks detailed with UV-printed decals, capturing prominent features such as hairstyle, clothing style, and garment color, as well as intricate details such as logos, text, and patterns. This task is non-trivial because of the substantial domain gap between unconstrained user photos and the stylistically consistent LEGO® figurine models. To ensure assemble-ability with LEGO® bricks while capturing prominent features and intricate details, we design a three-stage pipeline: (i) we formulate a CLIP-guided retrieval approach to connect the domains of user photos and LEGO® figurines, then output physically assemble-able LEGO® figurines with decals excluded; (ii) we then synthesize decals on the figurines via a symmetric U-Nets architecture conditioned on appearance features extracted from user photos; and (iii) we next reproject and UV-print the decals on the associated LEGO® bricks for physical model production. We evaluate the effectiveness of our method against eight hundred expert-designed figurines, using a comprehensive set of metrics that includes a novel GPT-4V-based evaluation metric, demonstrating superior performance of our method in visual quality and resemblance to input photos. We also show our method's robustness by generating LEGO® figurines from diverse inputs and physically fabricating and assembling several of them.
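The abstract does not spell out how the CLIP-guided retrieval in stage (i) works, so the following is only a minimal sketch of the general idea of embedding-based retrieval: score each candidate part against a photo embedding by cosine similarity and keep the best match. The catalog, the part names, and the three-dimensional vectors are hypothetical toy values chosen for illustration; they are not CLIP outputs and not the authors' data or method.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_best(query, catalog):
    """Return the name of the catalog entry whose embedding is most similar to the query."""
    return max(catalog, key=lambda name: cosine_similarity(query, catalog[name]))


# Hypothetical toy embeddings standing in for real image/text features.
hair_catalog = {
    "short_hair": [0.9, 0.1, 0.0],
    "long_hair": [0.1, 0.9, 0.0],
    "ponytail": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of the hair region of a user photo.
photo_embedding = [0.2, 0.8, 0.1]

best_part = retrieve_best(photo_embedding, hair_catalog)
print(best_part)
```

In a real system the vectors would come from a shared image-text encoder, and one such lookup could plausibly be run per part category (hair, torso, legs) so that the result is always assembled from existing, physically buildable components.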


Source journal

ACS Applied Polymer Materials

CiteScore: 7.20

Self-citation rate: 6.00%

Articles published: 810

About the journal: ACS Applied Polymer Materials is an interdisciplinary journal publishing original research covering all aspects of engineering, chemistry, physics, and biology relevant to applications of polymers. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrates fundamental knowledge in the areas of materials, engineering, physics, bioscience, polymer science, and chemistry into important polymer applications. The journal is specifically interested in work that addresses relationships among structure, processing, morphology, chemistry, properties, and function, as well as work that provides insights into mechanisms critical to the performance of the polymer for applications.