Aditya Ganeshan, Ryan Y. Huang, Xianghao Xu, R. Kenny Jones, Daniel Ritchie
{"title":"ParSEL: Parameterized Shape Editing with Language","authors":"Aditya Ganeshan, Ryan Y. Huang, Xianghao Xu, R. Kenny Jones, Daniel Ritchie","doi":"arxiv-2405.20319","DOIUrl":null,"url":null,"abstract":"The ability to edit 3D assets from natural language presents a compelling\nparadigm to aid in the democratization of 3D content creation. However, while\nnatural language is often effective at communicating general intent, it is\npoorly suited for specifying precise manipulation. To address this gap, we\nintroduce ParSEL, a system that enables controllable editing of high-quality 3D\nassets from natural language. Given a segmented 3D mesh and an editing request,\nParSEL produces a parameterized editing program. Adjusting the program\nparameters allows users to explore shape variations with a precise control over\nthe magnitudes of edits. To infer editing programs which align with an input\nedit request, we leverage the abilities of large-language models (LLMs).\nHowever, while we find that LLMs excel at identifying initial edit operations,\nthey often fail to infer complete editing programs, and produce outputs that\nviolate shape semantics. To overcome this issue, we introduce Analytical Edit\nPropagation (AEP), an algorithm which extends a seed edit with additional\noperations until a complete editing program has been formed. Unlike prior\nmethods, AEP searches for analytical editing operations compatible with a range\nof possible user edits through the integration of computer algebra systems for\ngeometric analysis. Experimentally we demonstrate ParSEL's effectiveness in\nenabling controllable editing of 3D objects through natural language requests\nover alternative system designs.","PeriodicalId":501033,"journal":{"name":"arXiv - CS - Symbolic Computation","volume":"149 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Symbolic Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.20319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The ability to edit 3D assets from natural language presents a compelling
paradigm to aid in the democratization of 3D content creation. However, while
natural language is often effective at communicating general intent, it is
poorly suited for specifying precise manipulation. To address this gap, we
introduce ParSEL, a system that enables controllable editing of high-quality 3D
assets from natural language. Given a segmented 3D mesh and an editing request,
ParSEL produces a parameterized editing program. Adjusting the program
parameters allows users to explore shape variations with a precise control over
the magnitudes of edits. To infer editing programs which align with an input
edit request, we leverage the abilities of large-language models (LLMs).
However, while we find that LLMs excel at identifying initial edit operations,
they often fail to infer complete editing programs, and produce outputs that
violate shape semantics. To overcome this issue, we introduce Analytical Edit
Propagation (AEP), an algorithm which extends a seed edit with additional
operations until a complete editing program has been formed. Unlike prior
methods, AEP searches for analytical editing operations compatible with a range
of possible user edits through the integration of computer algebra systems for
geometric analysis. Experimentally we demonstrate ParSEL's effectiveness in
enabling controllable editing of 3D objects through natural language requests
over alternative system designs.
用自然语言编辑 3D 资产的能力为 3D 内容创作的民主化提供了一个引人注目的范式。然而,虽然自然语言通常能有效传达一般意图,但却不太适合指定精确操作。为了弥补这一不足,我们推出了 ParSEL 系统,它可以通过自然语言对高质量的 3D 资产进行可控编辑。给定一个分割的三维网格和一个编辑请求,ParSEL 会生成一个参数化的编辑程序。通过调整程序参数,用户可以探索形状的变化,并精确控制编辑的幅度。为了推断出与输入编辑请求相一致的编辑程序,我们利用了大型语言模型(LLM)的能力。然而,尽管我们发现 LLM 擅长识别初始编辑操作,但它们往往无法推断出完整的编辑程序,并产生违反形状语义的输出。为了克服这个问题,我们引入了分析编辑推广算法(AEP),这种算法通过附加操作来扩展种子编辑,直到形成完整的编辑程序。与传统方法不同的是,AEP 通过整合计算机代数系统的计量分析,寻找与一系列可能的用户编辑相兼容的分析编辑操作。通过实验,我们证明了 ParSEL 在通过自然语言请求对 3D 物体进行可控编辑方面的有效性。