从平面到空间:利用神经网络从二维平面图构建三维、二维和 1/2D 模型的四种方法比较

Jacob Sam, Karan Patel, Mike Saad
{"title":"从平面到空间:利用神经网络从二维平面图构建三维、二维和 1/2D 模型的四种方法比较","authors":"Jacob Sam, Karan Patel, Mike Saad","doi":"arxiv-2407.19970","DOIUrl":null,"url":null,"abstract":"In the field of architecture, the conversion of single images into 2 and 1/2D\nand 3D meshes is a promising technology that enhances design visualization and\nefficiency. This paper evaluates four innovative methods: \"One-2-3-45,\" \"CRM:\nSingle Image to 3D Textured Mesh with Convolutional Reconstruction Model,\"\n\"Instant Mesh,\" and \"Image-to-Mesh.\" These methods are at the forefront of this\ntechnology, focusing on their applicability in architectural design and\nvisualization. They streamline the creation of 3D architectural models,\nenabling rapid prototyping and detailed visualization from minimal initial\ninputs, such as photographs or simple sketches.One-2-3-45 leverages a\ndiffusion-based approach to generate multi-view reconstructions, ensuring high\ngeometric fidelity and texture quality. CRM utilizes a convolutional network to\nintegrate geometric priors into its architecture, producing detailed and\ntextured meshes quickly and efficiently. Instant Mesh combines the strengths of\nmulti-view diffusion and sparse-view models to offer speed and scalability,\nsuitable for diverse architectural projects. Image-to-Mesh leverages a\ngenerative adversarial network (GAN) to produce 3D meshes from single images,\nfocusing on maintaining high texture fidelity and geometric accuracy by\nincorporating image and depth map data into its training process. It uses a\nhybrid approach that combines voxel-based representations with surface\nreconstruction techniques to ensure detailed and realistic 3D models.This\ncomparative study highlights each method's contribution to reducing design\ncycle times, improving accuracy, and enabling flexible adaptations to various\narchitectural styles and requirements. By providing architects with powerful\ntools for rapid visualization and iteration, these advancements in 3D mesh\ngeneration are set to revolutionize architectural practices.","PeriodicalId":501174,"journal":{"name":"arXiv - CS - Graphics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks\",\"authors\":\"Jacob Sam, Karan Patel, Mike Saad\",\"doi\":\"arxiv-2407.19970\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the field of architecture, the conversion of single images into 2 and 1/2D\\nand 3D meshes is a promising technology that enhances design visualization and\\nefficiency. This paper evaluates four innovative methods: \\\"One-2-3-45,\\\" \\\"CRM:\\nSingle Image to 3D Textured Mesh with Convolutional Reconstruction Model,\\\"\\n\\\"Instant Mesh,\\\" and \\\"Image-to-Mesh.\\\" These methods are at the forefront of this\\ntechnology, focusing on their applicability in architectural design and\\nvisualization. They streamline the creation of 3D architectural models,\\nenabling rapid prototyping and detailed visualization from minimal initial\\ninputs, such as photographs or simple sketches.One-2-3-45 leverages a\\ndiffusion-based approach to generate multi-view reconstructions, ensuring high\\ngeometric fidelity and texture quality. CRM utilizes a convolutional network to\\nintegrate geometric priors into its architecture, producing detailed and\\ntextured meshes quickly and efficiently. Instant Mesh combines the strengths of\\nmulti-view diffusion and sparse-view models to offer speed and scalability,\\nsuitable for diverse architectural projects. Image-to-Mesh leverages a\\ngenerative adversarial network (GAN) to produce 3D meshes from single images,\\nfocusing on maintaining high texture fidelity and geometric accuracy by\\nincorporating image and depth map data into its training process. It uses a\\nhybrid approach that combines voxel-based representations with surface\\nreconstruction techniques to ensure detailed and realistic 3D models.This\\ncomparative study highlights each method's contribution to reducing design\\ncycle times, improving accuracy, and enabling flexible adaptations to various\\narchitectural styles and requirements. By providing architects with powerful\\ntools for rapid visualization and iteration, these advancements in 3D mesh\\ngeneration are set to revolutionize architectural practices.\",\"PeriodicalId\":501174,\"journal\":{\"name\":\"arXiv - CS - Graphics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Graphics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.19970\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.19970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在建筑领域,将单幅图像转换为二维、1/2 维和三维网格是一项很有前途的技术,可提高设计的可视化和效率。本文评估了四种创新方法:"One-2-3-45"、"CRM:利用卷积重建模型将单张图像转换为三维纹理网格"、"即时网格 "和 "图像到网格"。这些方法都处于该技术的前沿,重点关注其在建筑设计和可视化方面的适用性。One-2-3-45 利用基于扩散的方法生成多视角重建,确保了高几何保真度和纹理质量。CRM 利用卷积网络将几何先验整合到其架构中,快速高效地生成细节丰富、纹理清晰的网格。Instant Mesh 结合了多视图扩散和稀疏视图模型的优势,速度快,可扩展性强,适用于各种建筑项目。Image-to-Mesh 利用生成对抗网络 (GAN) 从单张图像生成三维网格,通过将图像和深度图数据纳入训练过程,重点保持高纹理保真度和几何精度。该比较研究强调了每种方法在缩短设计周期时间、提高精度以及灵活适应各种建筑风格和要求方面的贡献。通过为建筑师提供快速可视化和迭代的强大工具,这些三维网格生成技术的进步必将彻底改变建筑实践。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks
In the field of architecture, the conversion of single images into 2 and 1/2D and 3D meshes is a promising technology that enhances design visualization and efficiency. This paper evaluates four innovative methods: "One-2-3-45," "CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model," "Instant Mesh," and "Image-to-Mesh." These methods are at the forefront of this technology, focusing on their applicability in architectural design and visualization. They streamline the creation of 3D architectural models, enabling rapid prototyping and detailed visualization from minimal initial inputs, such as photographs or simple sketches.One-2-3-45 leverages a diffusion-based approach to generate multi-view reconstructions, ensuring high geometric fidelity and texture quality. CRM utilizes a convolutional network to integrate geometric priors into its architecture, producing detailed and textured meshes quickly and efficiently. Instant Mesh combines the strengths of multi-view diffusion and sparse-view models to offer speed and scalability, suitable for diverse architectural projects. Image-to-Mesh leverages a generative adversarial network (GAN) to produce 3D meshes from single images, focusing on maintaining high texture fidelity and geometric accuracy by incorporating image and depth map data into its training process. It uses a hybrid approach that combines voxel-based representations with surface reconstruction techniques to ensure detailed and realistic 3D models.This comparative study highlights each method's contribution to reducing design cycle times, improving accuracy, and enabling flexible adaptations to various architectural styles and requirements. By providing architects with powerful tools for rapid visualization and iteration, these advancements in 3D mesh generation are set to revolutionize architectural practices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Gaussian Garments: Reconstructing Simulation-Ready Clothing with Photorealistic Appearance from Multi-View Video Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos Multi-scale Cycle Tracking in Dynamic Planar Graphs
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1