{"title":"From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks","authors":"Jacob Sam, Karan Patel, Mike Saad","doi":"arxiv-2407.19970","DOIUrl":null,"url":null,"abstract":"In the field of architecture, the conversion of single images into 2 and 1/2D\nand 3D meshes is a promising technology that enhances design visualization and\nefficiency. This paper evaluates four innovative methods: \"One-2-3-45,\" \"CRM:\nSingle Image to 3D Textured Mesh with Convolutional Reconstruction Model,\"\n\"Instant Mesh,\" and \"Image-to-Mesh.\" These methods are at the forefront of this\ntechnology, focusing on their applicability in architectural design and\nvisualization. They streamline the creation of 3D architectural models,\nenabling rapid prototyping and detailed visualization from minimal initial\ninputs, such as photographs or simple sketches.One-2-3-45 leverages a\ndiffusion-based approach to generate multi-view reconstructions, ensuring high\ngeometric fidelity and texture quality. CRM utilizes a convolutional network to\nintegrate geometric priors into its architecture, producing detailed and\ntextured meshes quickly and efficiently. Instant Mesh combines the strengths of\nmulti-view diffusion and sparse-view models to offer speed and scalability,\nsuitable for diverse architectural projects. Image-to-Mesh leverages a\ngenerative adversarial network (GAN) to produce 3D meshes from single images,\nfocusing on maintaining high texture fidelity and geometric accuracy by\nincorporating image and depth map data into its training process. It uses a\nhybrid approach that combines voxel-based representations with surface\nreconstruction techniques to ensure detailed and realistic 3D models.This\ncomparative study highlights each method's contribution to reducing design\ncycle times, improving accuracy, and enabling flexible adaptations to various\narchitectural styles and requirements. By providing architects with powerful\ntools for rapid visualization and iteration, these advancements in 3D mesh\ngeneration are set to revolutionize architectural practices.","PeriodicalId":501174,"journal":{"name":"arXiv - CS - Graphics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.19970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of architecture, the conversion of single images into 2 and 1/2D
and 3D meshes is a promising technology that enhances design visualization and
efficiency. This paper evaluates four innovative methods: "One-2-3-45," "CRM:
Single Image to 3D Textured Mesh with Convolutional Reconstruction Model,"
"Instant Mesh," and "Image-to-Mesh." These methods are at the forefront of this
technology, focusing on their applicability in architectural design and
visualization. They streamline the creation of 3D architectural models,
enabling rapid prototyping and detailed visualization from minimal initial
inputs, such as photographs or simple sketches.One-2-3-45 leverages a
diffusion-based approach to generate multi-view reconstructions, ensuring high
geometric fidelity and texture quality. CRM utilizes a convolutional network to
integrate geometric priors into its architecture, producing detailed and
textured meshes quickly and efficiently. Instant Mesh combines the strengths of
multi-view diffusion and sparse-view models to offer speed and scalability,
suitable for diverse architectural projects. Image-to-Mesh leverages a
generative adversarial network (GAN) to produce 3D meshes from single images,
focusing on maintaining high texture fidelity and geometric accuracy by
incorporating image and depth map data into its training process. It uses a
hybrid approach that combines voxel-based representations with surface
reconstruction techniques to ensure detailed and realistic 3D models.This
comparative study highlights each method's contribution to reducing design
cycle times, improving accuracy, and enabling flexible adaptations to various
architectural styles and requirements. By providing architects with powerful
tools for rapid visualization and iteration, these advancements in 3D mesh
generation are set to revolutionize architectural practices.