{"title":"Small but mighty: Enhancing 3D point clouds semantic segmentation with U-Next framework","authors":"Ziyin Zeng, Qingyong Hu, Zhong Xie, Bijun Li, Jian Zhou, Yongyang Xu","doi":"10.1016/j.jag.2024.104309","DOIUrl":null,"url":null,"abstract":"We investigate the problem of 3D point clouds semantic segmentation. Recently, a large amount of research work has focused on local feature aggregation. However, the foundational framework of semantic segmentation of 3D point clouds has been neglected, where the majority of current methods default to the U-Net framework. In this study, we present U-Next, a small but mighty framework designed specifically for point cloud semantic segmentation. The key innovation of this framework is to capture multi-scale hierarchical features. Specifically, we construct the U-Next by stacking multiple U-Net <mml:math altimg=\"si10.svg\" display=\"inline\"><mml:msup><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math> sub-networks in a dense arrangement to diminish the semantic gap. Concurrently, it integrates feature maps across various scales to proficiently restore intricate fine-grained details. Additionally, a multi-level deep supervision mechanism is introduced for smoothing gradient propagation and facilitating network optimization. We conduct extensive experiments on benchmarks, including the indoor S3DIS dataset, the LiDAR-based outdoor Toronto3D dataset, and the urban-scale photogrammetry-based SensatUrban dataset, demonstrate the superiority of U-Next. The U-Next framework consistently exhibits significant performance enhancements across various benchmarks and baselines, demonstrating its considerable potential as a versatile point-based framework for future endeavors. The code has been released at <ce:inter-ref xlink:href=\"https://github.com/zeng-ziyin/U-Next\" xlink:type=\"simple\">https://github.com/zeng-ziyin/U-Next</ce:inter-ref>.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"252 1","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Earth Observation and Geoinformation","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1016/j.jag.2024.104309","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0
Abstract
We investigate the problem of 3D point clouds semantic segmentation. Recently, a large amount of research work has focused on local feature aggregation. However, the foundational framework of semantic segmentation of 3D point clouds has been neglected, where the majority of current methods default to the U-Net framework. In this study, we present U-Next, a small but mighty framework designed specifically for point cloud semantic segmentation. The key innovation of this framework is to capture multi-scale hierarchical features. Specifically, we construct the U-Next by stacking multiple U-Net L1 sub-networks in a dense arrangement to diminish the semantic gap. Concurrently, it integrates feature maps across various scales to proficiently restore intricate fine-grained details. Additionally, a multi-level deep supervision mechanism is introduced for smoothing gradient propagation and facilitating network optimization. We conduct extensive experiments on benchmarks, including the indoor S3DIS dataset, the LiDAR-based outdoor Toronto3D dataset, and the urban-scale photogrammetry-based SensatUrban dataset, demonstrate the superiority of U-Next. The U-Next framework consistently exhibits significant performance enhancements across various benchmarks and baselines, demonstrating its considerable potential as a versatile point-based framework for future endeavors. The code has been released at https://github.com/zeng-ziyin/U-Next.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.