Tengfei Wang, Zongqian Zhan, Rui Xia, Linxia Ji, Xin Wang
{"title":"MGFs:基于多视图图像的用于网格构建的屏蔽高斯场","authors":"Tengfei Wang, Zongqian Zhan, Rui Xia, Linxia Ji, Xin Wang","doi":"arxiv-2408.03060","DOIUrl":null,"url":null,"abstract":"Over the last few decades, image-based building surface reconstruction has\ngarnered substantial research interest and has been applied across various\nfields, such as heritage preservation, architectural planning, etc. Compared to\nthe traditional photogrammetric and NeRF-based solutions, recently, Gaussian\nfields-based methods have exhibited significant potential in generating surface\nmeshes due to their time-efficient training and detailed 3D information\npreservation. However, most gaussian fields-based methods are trained with all\nimage pixels, encompassing building and nonbuilding areas, which results in a\nsignificant noise for building meshes and degeneration in time efficiency. This\npaper proposes a novel framework, Masked Gaussian Fields (MGFs), designed to\ngenerate accurate surface reconstruction for building in a time-efficient way.\nThe framework first applies EfficientSAM and COLMAP to generate multi-level\nmasks of building and the corresponding masked point clouds. Subsequently, the\nmasked gaussian fields are trained by integrating two innovative losses: a\nmulti-level perceptual masked loss focused on constructing building regions and\na boundary loss aimed at enhancing the details of the boundaries between\ndifferent masks. Finally, we improve the tetrahedral surface mesh extraction\nmethod based on the masked gaussian spheres. Comprehensive experiments on UAV\nimages demonstrate that, compared to the traditional method and several\nNeRF-based and Gaussian-based SOTA solutions, our approach significantly\nimproves both the accuracy and efficiency of building surface reconstruction.\nNotably, as a byproduct, there is an additional gain in the novel view\nsynthesis of building.","PeriodicalId":501174,"journal":{"name":"arXiv - CS - Graphics","volume":"85 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images\",\"authors\":\"Tengfei Wang, Zongqian Zhan, Rui Xia, Linxia Ji, Xin Wang\",\"doi\":\"arxiv-2408.03060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the last few decades, image-based building surface reconstruction has\\ngarnered substantial research interest and has been applied across various\\nfields, such as heritage preservation, architectural planning, etc. Compared to\\nthe traditional photogrammetric and NeRF-based solutions, recently, Gaussian\\nfields-based methods have exhibited significant potential in generating surface\\nmeshes due to their time-efficient training and detailed 3D information\\npreservation. However, most gaussian fields-based methods are trained with all\\nimage pixels, encompassing building and nonbuilding areas, which results in a\\nsignificant noise for building meshes and degeneration in time efficiency. This\\npaper proposes a novel framework, Masked Gaussian Fields (MGFs), designed to\\ngenerate accurate surface reconstruction for building in a time-efficient way.\\nThe framework first applies EfficientSAM and COLMAP to generate multi-level\\nmasks of building and the corresponding masked point clouds. Subsequently, the\\nmasked gaussian fields are trained by integrating two innovative losses: a\\nmulti-level perceptual masked loss focused on constructing building regions and\\na boundary loss aimed at enhancing the details of the boundaries between\\ndifferent masks. Finally, we improve the tetrahedral surface mesh extraction\\nmethod based on the masked gaussian spheres. Comprehensive experiments on UAV\\nimages demonstrate that, compared to the traditional method and several\\nNeRF-based and Gaussian-based SOTA solutions, our approach significantly\\nimproves both the accuracy and efficiency of building surface reconstruction.\\nNotably, as a byproduct, there is an additional gain in the novel view\\nsynthesis of building.\",\"PeriodicalId\":501174,\"journal\":{\"name\":\"arXiv - CS - Graphics\",\"volume\":\"85 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Graphics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.03060\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.03060","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images
Over the last few decades, image-based building surface reconstruction has
garnered substantial research interest and has been applied across various
fields, such as heritage preservation, architectural planning, etc. Compared to
the traditional photogrammetric and NeRF-based solutions, recently, Gaussian
fields-based methods have exhibited significant potential in generating surface
meshes due to their time-efficient training and detailed 3D information
preservation. However, most gaussian fields-based methods are trained with all
image pixels, encompassing building and nonbuilding areas, which results in a
significant noise for building meshes and degeneration in time efficiency. This
paper proposes a novel framework, Masked Gaussian Fields (MGFs), designed to
generate accurate surface reconstruction for building in a time-efficient way.
The framework first applies EfficientSAM and COLMAP to generate multi-level
masks of building and the corresponding masked point clouds. Subsequently, the
masked gaussian fields are trained by integrating two innovative losses: a
multi-level perceptual masked loss focused on constructing building regions and
a boundary loss aimed at enhancing the details of the boundaries between
different masks. Finally, we improve the tetrahedral surface mesh extraction
method based on the masked gaussian spheres. Comprehensive experiments on UAV
images demonstrate that, compared to the traditional method and several
NeRF-based and Gaussian-based SOTA solutions, our approach significantly
improves both the accuracy and efficiency of building surface reconstruction.
Notably, as a byproduct, there is an additional gain in the novel view
synthesis of building.