首页 > 最新文献

International journal of applied earth observation and geoinformation : ITC journal最新文献

英文 中文
Mangrove-Net: A multi-stage attention fusion and class token network for mangrove species mapping using lidar point clouds 红树林网:基于激光雷达点云的多阶段注意力融合和分类令牌网络
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-20 DOI: 10.1016/j.jag.2026.105166
Dezhi Wang , Ziyang Wu , Penghua Qiu , Simeng Wang , Bo Wan , Zongzhu Chen , Sheng Nie , Yihui Zhang
Mapping mangrove species is crucial for mangrove biodiversity protection and ecological restoration. Existing mangrove classification methods mostly rely on the height and intensity metrics of rasterized lidar point clouds, which lead to the partial loss of fine three-dimensional (3D) structural information, and are difficult to distinguish among small inter-class differences, complex structures (dense canopy and aerial roots), and uneven mangrove stands. To directly capture the species community structure of mangroves from a 3D perspective, this study develops a point-wise deep learning species classification method specifically for mangroves—called Mangrove Multi-stage Attention Fusion and Class Token Network (Mangrove-Net), using UAV lidar point clouds and RGB imagery fusion data. Mangrove-Net uses a multi-stage attention fusion and class token mechanism to balance local geometric representation with global semantic perception, effectively capturing representative features among mangrove species. To address sample-imbalanced classification tasks, Mangrove-Net introduces a weighted cross-entropy loss function. We tested the proposed method on a typical mangrove forest dataset (from the Hainan Qinglangang Provincial Nature Reserve, China) and on the open dataset ModelNet40. The results indicated that Mangrove-Net showed superior performance compared to six state-of-the-art point-based deep learning methods (such as PointNet++, Point Transformer, PointMLP, and GPSFormer) in mapping fine-grained mangrove species, with a minimum improvement of 6.60% and 7.71% in overall accuracy (OA) and mean category accuracy (mAcc), respectively. Multi-scale visualization (sample, local, and study area) showed that the mangrove species maps generated by our proposed method had higher spatial continuity and ecological consistency. Evaluated on the ModelNet40 dataset, Mangrove-Net also surpassed those baseline methods, further demonstrating the effectiveness and robustness of the method. This study provides a reliable point-wise deep learning method specifically for mangrove species classification to support various tasks in fine-grained precision mangrove forestry.
红树林物种图谱对红树林生物多样性保护和生态恢复具有重要意义。现有的红树林分类方法大多依赖光栅化激光雷达点云的高度和强度度量,导致精细三维(3D)结构信息的部分丢失,难以区分小的类间差异、复杂的结构(茂密的树冠和气根)和不均匀的红树林林分。为了从三维角度直接捕捉红树林的物种群落结构,本研究利用无人机激光雷达点云和RGB图像融合数据,开发了一种专门针对红树林的点向深度学习物种分类方法——红树林多阶段注意力融合和类标记网络(Mangrove- net)。红树林网络利用多阶段注意力融合和类标记机制平衡局部几何表征和全局语义感知,有效捕获红树林物种间的代表性特征。为了解决样本不平衡的分类任务,红树林网络引入了加权交叉熵损失函数。我们在一个典型的红树林数据集(来自中国海南青廊港省级自然保护区)和开放数据集ModelNet40上测试了所提出的方法。结果表明,与PointNet++、Point Transformer、PointMLP和GPSFormer等6种最先进的基于点的深度学习方法相比,红树林网络在绘制细粒度红树林物种方面表现优异,总体精度(OA)和平均分类精度(mAcc)分别提高了6.60%和7.71%。多尺度可视化(样本、局部和研究区)结果表明,该方法生成的红树林物种图谱具有较高的空间连续性和生态一致性。在ModelNet40数据集上进行评估,红树林网络也优于这些基线方法,进一步证明了该方法的有效性和鲁棒性。本研究为红树林物种分类提供了一种可靠的逐点深度学习方法,以支持细粒度精确红树林林业的各种任务。
{"title":"Mangrove-Net: A multi-stage attention fusion and class token network for mangrove species mapping using lidar point clouds","authors":"Dezhi Wang ,&nbsp;Ziyang Wu ,&nbsp;Penghua Qiu ,&nbsp;Simeng Wang ,&nbsp;Bo Wan ,&nbsp;Zongzhu Chen ,&nbsp;Sheng Nie ,&nbsp;Yihui Zhang","doi":"10.1016/j.jag.2026.105166","DOIUrl":"10.1016/j.jag.2026.105166","url":null,"abstract":"<div><div>Mapping mangrove species is crucial for mangrove biodiversity protection and ecological restoration. Existing mangrove classification methods mostly rely on the height and intensity metrics of rasterized lidar point clouds, which lead to the partial loss of fine three-dimensional (3D) structural information, and are difficult to distinguish among small inter-class differences, complex structures (dense canopy and aerial roots), and uneven mangrove stands. To directly capture the species community structure of mangroves from a 3D perspective, this study develops a point-wise deep learning species classification method specifically for mangroves—called Mangrove Multi-stage Attention Fusion and Class Token Network (Mangrove-Net), using UAV lidar point clouds and RGB imagery fusion data. Mangrove-Net uses a multi-stage attention fusion and class token mechanism to balance local geometric representation with global semantic perception, effectively capturing representative features among mangrove species. To address sample-imbalanced classification tasks, Mangrove-Net introduces a weighted cross-entropy loss function. We tested the proposed method on a typical mangrove forest dataset (from the Hainan Qinglangang Provincial Nature Reserve, China) and on the open dataset ModelNet40. The results indicated that Mangrove-Net showed superior performance compared to six state-of-the-art point-based deep learning methods (such as PointNet++, Point Transformer, PointMLP, and GPSFormer) in mapping fine-grained mangrove species, with a minimum improvement of 6.60% and 7.71% in overall accuracy (OA) and mean category accuracy (mAcc), respectively. Multi-scale visualization (sample, local, and study area) showed that the mangrove species maps generated by our proposed method had higher spatial continuity and ecological consistency. Evaluated on the ModelNet40 dataset, Mangrove-Net also surpassed those baseline methods, further demonstrating the effectiveness and robustness of the method. This study provides a reliable point-wise deep learning method specifically for mangrove species classification to support various tasks in fine-grained precision mangrove forestry.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105166"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147278286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Widespread biophysical cooling effects due to post-fire greening 火灾后绿化引起的广泛的生物物理冷却效应
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-28 DOI: 10.1016/j.jag.2026.105211
Huipeng Xi , Qunming Wang , Yuelong Xiao , Ru Guo , Xiaohua Tong , Peter M. Atkinson
Wildfires and greening are two important biophysical processes that influence land surface-climate feedback patterns. However, the impact of post-fire greening, which primarily reflects canopy structural recovery, on land surface temperature (LST) remains uncertain, particularly at the daily scale, as this temporal resolution allows for a clearer observation of how fire seasonality and vegetation regrowth influence short-term land surface energy dynamics. In this research, using satellite sensor observations covering the globe from 2004 to 2019, we found that the median post-fire recovery time of leaf area index (LAI) was 479.5 days. Most forests exhibited faster canopy LAI recovery than low-stature herbaceous vegetation, likely due to differences in burn severity and affected plant structures. Fire seasonality also shaped LAI recovery patterns: dry-season and spring fires led to quicker regrowth, with the shortest recovery in temperate spring fires (173.7 days), while wet-season and summer fires showed delayed recovery, especially in cold zones where summer fires need 425.5 days to recovery. During post-fire greening, the annual cycle of LAI recovery caused an average cooling effect of −0.04 K d-1, due to the strong evapotranspiration-climate negative feedback. However, seasonal effects varied: summer fires in temperate and cold zones led to cooling, with the strongest warming observed after temperate winter fires (up to 0.104 K d⁻1). Furthermore, we observed a widespread decrease in carbon use efficiency during post-fire LAI recovery, which means that the recovery rate of ecosystem carbon sinks may not be synchronized with the rate of vegetation greening. By distinguishing between structural and functional recovery, we found that early evapotranspiration-driven cooling during structural recovery may not persist throughout ecosystem functional recovery. This study enhances our understanding of the global biophysical climate effects of post-fire greening in the context of the earth’s land surface-climate feedback, and reveals precise changes in the component parts of this feedback effect.
野火和绿化是影响地表-气候反馈模式的两个重要生物物理过程。然而,火灾后绿化(主要反映冠层结构恢复)对地表温度(LST)的影响仍然不确定,特别是在日尺度上,因为这种时间分辨率允许更清楚地观察火灾季节性和植被再生如何影响短期地表能量动态。利用2004 - 2019年覆盖全球的卫星传感器观测数据,研究发现,林火后叶面积指数(LAI)恢复时间中位数为479.5 d。大多数森林表现出比低高度草本植被更快的冠层LAI恢复,可能是由于烧伤严重程度和受影响植物结构的差异。火灾的季节性也影响了LAI的恢复模式:旱季和春季火灾的恢复速度更快,温带春季火灾的恢复时间最短(173.7 d),而雨季和夏季火灾的恢复时间较晚,特别是在寒冷地区,夏季火灾需要425.5 d才能恢复。在火灾后绿化期间,由于强烈的蒸散发-气候负反馈,LAI恢复的年循环平均产生−0.04 K d-1的降温效应。然而,季节效应各不相同:温带和寒冷地区的夏季火灾导致气温变冷,在温带冬季火灾后观测到的增温最强(高达0.104 K d毒血症)。此外,我们观察到在火灾后LAI恢复过程中碳利用效率普遍下降,这意味着生态系统碳汇的恢复速度可能与植被绿化速度不同步。通过区分结构恢复和功能恢复,我们发现结构恢复期间早期蒸散驱动的冷却可能不会持续整个生态系统功能恢复。本研究增强了我们对地表-气候反馈背景下全球火灾后绿化生物物理气候效应的认识,并揭示了该反馈效应各组成部分的精确变化。
{"title":"Widespread biophysical cooling effects due to post-fire greening","authors":"Huipeng Xi ,&nbsp;Qunming Wang ,&nbsp;Yuelong Xiao ,&nbsp;Ru Guo ,&nbsp;Xiaohua Tong ,&nbsp;Peter M. Atkinson","doi":"10.1016/j.jag.2026.105211","DOIUrl":"10.1016/j.jag.2026.105211","url":null,"abstract":"<div><div>Wildfires and greening are two important biophysical processes that influence land surface-climate feedback patterns. However, the impact of post-fire greening, which primarily reflects canopy structural recovery, on land surface temperature (LST) remains uncertain, particularly at the daily scale, as this temporal resolution allows for a clearer observation of how fire seasonality and vegetation regrowth influence short-term land surface energy dynamics. In this research, using satellite sensor observations covering the globe from 2004 to 2019, we found that the median post-fire recovery time of leaf area index (LAI) was 479.5 days. Most forests exhibited faster canopy LAI recovery than low-stature herbaceous vegetation, likely due to differences in burn severity and affected plant structures. Fire seasonality also shaped LAI recovery patterns: dry-season and spring fires led to quicker regrowth, with the shortest recovery in temperate spring fires (173.7 days), while wet-season and summer fires showed delayed recovery, especially in cold zones where summer fires need 425.5 days to recovery. During post-fire greening, the annual cycle of LAI recovery caused an average cooling effect of −0.04 K d<sup>-1</sup>, due to the strong evapotranspiration-climate negative feedback. However, seasonal effects varied: summer fires in temperate and cold zones led to cooling, with the strongest warming observed after temperate winter fires (up to 0.104 K d⁻<sup>1</sup>). Furthermore, we observed a widespread decrease in carbon use efficiency during post-fire LAI recovery, which means that the recovery rate of ecosystem carbon sinks may not be synchronized with the rate of vegetation greening. By distinguishing between structural and functional recovery, we found that early evapotranspiration-driven cooling during structural recovery may not persist throughout ecosystem functional recovery. This study enhances our understanding of the global biophysical climate effects of post-fire greening in the context of the earth’s land surface-climate feedback, and reveals precise changes in the component parts of this feedback effect.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105211"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constrained planar primitive segmentation for building component extraction from airborne LiDAR point clouds 基于约束平面基元分割的机载LiDAR点云建筑构件提取
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-26 DOI: 10.1016/j.jag.2026.105182
Hongxin Yang , Zhipeng Luo , Dedong Zhang , Jonathan Li
Accurately segmenting planar primitives from airborne LiDAR point clouds is crucial for urban planning applications and three-dimensional (3D) building reconstruction. However, existing approaches often exhibit limited extensibility and diminished robustness when segmenting buildings with diverse architectural styles. To address this issue, this paper proposes an enhanced constrained k-Plane Clustering (kPC) method that segments building point clouds into distinct planar primitives. The kPC algorithm formulates the segmentation task as a mixed-integer non-convex optimization problem incorporating three geometric constraints: point-to-plane distance minimization, cluster-center proximity enforcement, and directional regularization. This problem is solved via an alternating minimization strategy, which iteratively updates cluster assignments and plane parameters using Singular Value Decomposition (SVD) until convergence is reached. The proposed constrained kPC method provides two key advantages. First, it effectively mitigates the infinite extensibility of fitted planes. This issue is common in conventional kPC methods, and its mitigation directly addresses a major source of suboptimal segmentation performance. Second, the framework demonstrates robustness to variations in the optimization objective’s coefficients while maintaining consistent performance. Extensive experiments on both synthetic and real-world multi-style building datasets demonstrate that the proposed method achieves superior segmentation accuracy and outperforms state-of-the-art approaches in both qualitative and quantitative evaluations. Furthermore, the performance of the proposed method is robust to variations in its parameters, maintaining effectiveness across a wide range of values.
从机载激光雷达点云中准确分割平面基元对于城市规划应用和三维(3D)建筑重建至关重要。然而,现有的方法在分割具有不同建筑风格的建筑物时往往表现出有限的可扩展性和降低的健壮性。为了解决这一问题,本文提出了一种增强的约束k-平面聚类(kPC)方法,该方法将构建点云分割为不同的平面基元。kPC算法将分割任务描述为一个混合整数非凸优化问题,该问题包含三个几何约束:点到平面距离最小化、簇中心接近性强制执行和方向正则化。该算法采用交替最小化策略,利用奇异值分解(SVD)迭代更新聚类分配和平面参数,直至收敛。提出的约束kPC方法有两个主要优点。首先,它有效地减轻了拟合平面的无限可扩展性。这个问题在传统的kPC方法中很常见,它的缓解直接解决了次优分割性能的主要来源。其次,该框架对优化目标系数的变化具有鲁棒性,同时保持一致的性能。在合成和真实世界的多风格建筑数据集上进行的大量实验表明,所提出的方法具有优越的分割精度,并且在定性和定量评估方面都优于最先进的方法。此外,该方法的性能对其参数的变化具有鲁棒性,在广泛的值范围内保持有效性。
{"title":"Constrained planar primitive segmentation for building component extraction from airborne LiDAR point clouds","authors":"Hongxin Yang ,&nbsp;Zhipeng Luo ,&nbsp;Dedong Zhang ,&nbsp;Jonathan Li","doi":"10.1016/j.jag.2026.105182","DOIUrl":"10.1016/j.jag.2026.105182","url":null,"abstract":"<div><div>Accurately segmenting planar primitives from airborne LiDAR point clouds is crucial for urban planning applications and three-dimensional (3D) building reconstruction. However, existing approaches often exhibit limited extensibility and diminished robustness when segmenting buildings with diverse architectural styles. To address this issue, this paper proposes an enhanced constrained k-Plane Clustering (kPC) method that segments building point clouds into distinct planar primitives. The kPC algorithm formulates the segmentation task as a mixed-integer non-convex optimization problem incorporating three geometric constraints: point-to-plane distance minimization, cluster-center proximity enforcement, and directional regularization. This problem is solved via an alternating minimization strategy, which iteratively updates cluster assignments and plane parameters using Singular Value Decomposition (SVD) until convergence is reached. The proposed constrained kPC method provides two key advantages. First, it effectively mitigates the infinite extensibility of fitted planes. This issue is common in conventional kPC methods, and its mitigation directly addresses a major source of suboptimal segmentation performance. Second, the framework demonstrates robustness to variations in the optimization objective’s coefficients while maintaining consistent performance. Extensive experiments on both synthetic and real-world multi-style building datasets demonstrate that the proposed method achieves superior segmentation accuracy and outperforms state-of-the-art approaches in both qualitative and quantitative evaluations. Furthermore, the performance of the proposed method is robust to variations in its parameters, maintaining effectiveness across a wide range of values.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105182"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Individual sapling-level leaf chlorophyll content estimation in Ginkgo biloba with weed background: A novel deep-learning approach coupling APUNet-YOLO with PROSAIL-DNN 杂草背景下银杏幼苗叶片叶绿素含量估算:一种结合APUNet-YOLO和PROSAIL-DNN的深度学习方法
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-24 DOI: 10.1016/j.jag.2026.105194
Lin Su , Kai Zhou , Nicholas C. Coops , Justin Morgenroth , Liam A.K. Irwin , Donald White , Jimei Han , Xin Shen , Lin Cao
Accurate estimation of chlorophyll content helps understand the physiological health of saplings. Traditional field measurement methods limit large-scale applications, while remote sensing monitoring methods, though improving efficiency, still face challenges in data analysis and processing. A key challenge is extracting the canopy of individual trees and estimating chlorophyll content amid interference from weed vegetation. To overcome this challenge, this study develops a novel coupled deep learning approach. First, an Attention and Pyramid U-Net (APUNet) was proposed to improve the segmentation accuracy of Ginkgo seedlings crown boundaries in multispectral drone images. Second, the hyper-parameter combination of the You Only Look Once (YOLO) model was optimized to improve sapling detection accuracy under complex conditions with high weed coverage. Next, a sliding window method was developed to reduce duplicate detection rates for detecting all saplings in the entire image, and the intersection of APUNet and detection results was used for segmentation of individual saplings. Finally, a PROSAIL-Deep Neural Network (DNN) model for chlorophyll estimation was developed, and the impact of using APUNet YOLO to remove weed pixels on chlorophyll estimation was investigated, along with how over-coverage and under-coverage during the segmentation process affect the accuracy of chlorophyll estimation. The results show that, compared to U-Net, APUNet improved segmentation performance, with the IoU increasing from 0.697 to 0.736. After hyper-parameter optimization through grid search, the YOLOv8l model achieved an mAP50 of 0.968 and when this model was applied to full-image detection using a sliding window approach, its duplicate detection rate (ddr) decreased by 11.2% compared with the general method. In chlorophyll inversion, the chlorophyll estimation accuracy reached its highest after using APUNet-YOLO to reduce the interference from weed pixels, with an R2 of 0.752 and an RMSE of 7.143 μg/cm2. Meanwhile, the sensitivity analysis of the impact of segmentation errors on chlorophyll estimation accuracy shows that, compared to under-coverage segmentation, over-inclusion segmentation leads to a greater decline in chlorophyll estimation accuracy. This novel coupled model can effectively monitor the chlorophyll content of Ginkgo saplings in settings with complex backgrounds, providing a practical possibility for the realization of precision forestry.
叶绿素含量的准确估算有助于了解幼树的生理健康状况。传统的野外测量方法限制了大规模应用,而遥感监测方法虽然提高了效率,但在数据分析和处理方面仍然面临挑战。一个关键的挑战是在杂草植被的干扰下提取单个树木的冠层并估计叶绿素含量。为了克服这一挑战,本研究开发了一种新的耦合深度学习方法。首先,为了提高多光谱无人机图像中银杏树冠边界的分割精度,提出了一种关注金字塔U-Net (Attention and Pyramid U-Net, APUNet)算法。其次,对YOLO (You Only Look Once)模型的超参数组合进行优化,提高高杂草覆盖复杂条件下的树苗检测精度。其次,开发滑动窗口方法,降低重复检测率,对整个图像中的所有树苗进行检测,并利用APUNet与检测结果的交集对单个树苗进行分割。最后,建立了PROSAIL-Deep Neural Network (DNN)叶绿素估计模型,研究了APUNet YOLO去除杂草像元对叶绿素估计的影响,以及分割过程中的过覆盖和欠覆盖对叶绿素估计精度的影响。结果表明,与U-Net相比,APUNet提高了分割性能,IoU从0.697提高到0.736。通过网格搜索进行超参数优化后,YOLOv8l模型的mAP50值为0.968,将该模型应用于滑动窗口方法的全图像检测时,其重复检测率(ddr)比一般方法降低了11.2%。在叶绿素反演中,利用APUNet-YOLO减少杂草像元干扰后,叶绿素估算精度达到最高,R2为0.752,RMSE为7.143 μg/cm2。同时,对分割误差对叶绿素估计精度影响的敏感性分析表明,与覆盖不足的分割相比,过度包裹体分割导致叶绿素估计精度的下降幅度更大。该耦合模型能有效监测复杂背景环境下银杏树苗叶绿素含量,为实现精准林业提供了现实可能。
{"title":"Individual sapling-level leaf chlorophyll content estimation in Ginkgo biloba with weed background: A novel deep-learning approach coupling APUNet-YOLO with PROSAIL-DNN","authors":"Lin Su ,&nbsp;Kai Zhou ,&nbsp;Nicholas C. Coops ,&nbsp;Justin Morgenroth ,&nbsp;Liam A.K. Irwin ,&nbsp;Donald White ,&nbsp;Jimei Han ,&nbsp;Xin Shen ,&nbsp;Lin Cao","doi":"10.1016/j.jag.2026.105194","DOIUrl":"10.1016/j.jag.2026.105194","url":null,"abstract":"<div><div>Accurate estimation of chlorophyll content helps understand the physiological health of saplings. Traditional field measurement methods limit large-scale applications, while remote sensing monitoring methods, though improving efficiency, still face challenges in data analysis and processing. A key challenge is extracting the canopy of individual trees and estimating chlorophyll content amid interference from weed vegetation. To overcome this challenge, this study develops a novel coupled deep learning approach. First, an Attention and Pyramid U-Net (APUNet) was proposed to improve the segmentation accuracy of Ginkgo seedlings crown boundaries in multispectral drone images. Second, the hyper-parameter combination of the You Only Look Once (YOLO) model was optimized to improve sapling detection accuracy under complex conditions with high weed coverage. Next, a sliding window method was developed to reduce duplicate detection rates for detecting all saplings in the entire image, and the intersection of APUNet and detection results was used for segmentation of individual saplings. Finally, a PROSAIL-Deep Neural Network (DNN) model for chlorophyll estimation was developed, and the impact of using APUNet YOLO to remove weed pixels on chlorophyll estimation was investigated, along with how over-coverage and under-coverage during the segmentation process affect the accuracy of chlorophyll estimation. The results show that, compared to U-Net, APUNet improved segmentation performance, with the IoU increasing from 0.697 to 0.736. After hyper-parameter optimization through grid search, the YOLOv8l model achieved an mAP50 of 0.968 and when this model was applied to full-image detection using a sliding window approach, its duplicate detection rate (ddr) decreased by 11.2% compared with the general method. In chlorophyll inversion, the chlorophyll estimation accuracy reached its highest after using APUNet-YOLO to reduce the interference from weed pixels, with an R<sup>2</sup> of 0.752 and an RMSE of 7.143 μg/cm<sup>2</sup>. Meanwhile, the sensitivity analysis of the impact of segmentation errors on chlorophyll estimation accuracy shows that, compared to under-coverage segmentation, over-inclusion segmentation leads to a greater decline in chlorophyll estimation accuracy. This novel coupled model can effectively monitor the chlorophyll content of Ginkgo saplings in settings with complex backgrounds, providing a practical possibility for the realization of precision forestry.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105194"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAC-DETR: Re-parameterized Mambaout for robust object detection in high-resolution remote sensing MAC-DETR:用于高分辨率遥感鲁棒目标检测的重新参数化Mambaout
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-24 DOI: 10.1016/j.jag.2026.105169
Huanxu Li , Keke Xu , Xianglei Liu , Jianlong Wang , Guoqing Zhou
Accurate and efficient object detection in remote sensing imagery is fundamental to applications such as urban monitoring, infrastructure assessment, and disaster management. However, two persistent challenges remain: (1) small objects occupy only a few pixels in ultra-high-resolution images, making them difficult to detect reliably, and (2) the rapid growth of large-scale satellite data demands computationally efficient methods suitable for practical deployment. Existing lightweight convolutional networks provide real-time efficiency but often fail to capture sufficient discriminative information for small targets, while Transformer-based detectors improve representation power at the cost of high computational complexity. To address these challenges, we propose MAC-DETR, a CNN-Mambaout-Transformer hybrid detection framework designed for remote sensing imagery. MAC-DETR integrates three complementary modules: (1) Mambaout-RepLK, a Mambaout-style gated convolution block that reduces computational overhead while preserving expressive feature extraction, achieving + 2.5% mAP improvement in fine-grained categories on NWPU VHR-10 while reducing parameters by 23%; (2) CROSS-UP, an adaptive upsampling block that enhances multi-scale fusion and improves the detection of small objects—yielding up to + 2.6% mAP gain on multi-scale categories in NWPU VHR-10—without introducing extra complexity; and (3) ASSAIFI, a sparse attention module that strengthens feature interaction to further refine small-object representations. Experiments on four benchmark datasets demonstrate that MAC-DETR achieves 74.1% mAP on DOTA-v1.5, 95.4% on NWPU VHR-10, 94.4% on HRSC2016, and 95.4% on the infrared SIRSTv2 dataset, consistently outperforming both CNN– and Transformer-based baselines. Ablation studies show that the design reduces parameters by 23% and computation by 10–15%, offering a practical balance between accuracy and efficiency. These results highlight MAC-DETR’s effectiveness for large-scale, multi-modal Earth observation applications. The source code will be available at: https://github.com/Keykeykeykeykeykey/MAC-DETR.
准确、高效的遥感图像目标检测是城市监测、基础设施评估和灾害管理等应用的基础。然而,仍然存在两个持续的挑战:(1)小物体在超高分辨率图像中仅占几个像素,难以可靠地检测到它们;(2)大规模卫星数据的快速增长需要适合实际部署的计算效率高的方法。现有的轻量级卷积网络提供了实时效率,但通常无法捕获小目标的足够判别信息,而基于transformer的检测器以高计算复杂度为代价提高了表示能力。为了解决这些挑战,我们提出了MAC-DETR,一种针对遥感图像设计的CNN-Mambaout-Transformer混合检测框架。MAC-DETR集成了三个互补模块:(1)Mambaout-RepLK, mambaout风格的门控卷积块,减少计算开销,同时保留表达性特征提取,在NWPU VHR-10上实现+ 2.5%的细粒度分类mAP改进,同时减少23%的参数;(2) CROSS-UP,一种自适应上采样块,增强了多尺度融合,提高了对小目标的检测——在不引入额外复杂性的情况下,NWPU vhr -10在多尺度类别上产生了+ 2.6%的mAP增益;(3) ASSAIFI,一个稀疏关注模块,加强特征交互以进一步细化小对象表示。在4个基准数据集上的实验表明,MAC-DETR在DOTA-v1.5上的mAP值为74.1%,在NWPU VHR-10上的mAP值为95.4%,在HRSC2016上的mAP值为94.4%,在红外SIRSTv2数据集上的mAP值为95.4%,始终优于基于CNN和transformer的基线。烧蚀研究表明,该设计将参数减少23%,计算减少10-15%,在精度和效率之间实现了实际的平衡。这些结果突出了MAC-DETR在大规模、多模态地球观测应用中的有效性。源代码可从https://github.com/Keykeykeykeykeykey/MAC-DETR获得。
{"title":"MAC-DETR: Re-parameterized Mambaout for robust object detection in high-resolution remote sensing","authors":"Huanxu Li ,&nbsp;Keke Xu ,&nbsp;Xianglei Liu ,&nbsp;Jianlong Wang ,&nbsp;Guoqing Zhou","doi":"10.1016/j.jag.2026.105169","DOIUrl":"10.1016/j.jag.2026.105169","url":null,"abstract":"<div><div>Accurate and efficient object detection in remote sensing imagery is fundamental to applications such as urban monitoring, infrastructure assessment, and disaster management. However, two persistent challenges remain: (1) small objects occupy only a few pixels in ultra-high-resolution images, making them difficult to detect reliably, and (2) the rapid growth of large-scale satellite data demands computationally efficient methods suitable for practical deployment. Existing lightweight convolutional networks provide real-time efficiency but often fail to capture sufficient discriminative information for small targets, while Transformer-based detectors improve representation power at the cost of high computational complexity. To address these challenges, we propose MAC-DETR, a CNN-Mambaout-Transformer hybrid detection framework designed for remote sensing imagery. MAC-DETR integrates three complementary modules: (1) Mambaout-RepLK, a Mambaout-style gated convolution block that reduces computational overhead while preserving expressive feature extraction, achieving + 2.5% mAP improvement in fine-grained categories on NWPU VHR-10 while reducing parameters by 23%; (2) CROSS-UP, an adaptive upsampling block that enhances multi-scale fusion and improves the detection of small objects—yielding up to + 2.6% mAP gain on multi-scale categories in NWPU VHR-10—without introducing extra complexity; and (3) ASSAIFI, a sparse attention module that strengthens feature interaction to further refine small-object representations. Experiments on four benchmark datasets demonstrate that MAC-DETR achieves 74.1% mAP on DOTA-v1.5, 95.4% on NWPU VHR-10, 94.4% on HRSC2016, and 95.4% on the infrared SIRSTv2 dataset, consistently outperforming both CNN– and Transformer-based baselines. Ablation studies show that the design reduces parameters by 23% and computation by 10–15%, offering a practical balance between accuracy and efficiency. These results highlight MAC-DETR’s effectiveness for large-scale, multi-modal Earth observation applications. The source code will be available at: https://github.com/Keykeykeykeykeykey/MAC-DETR.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105169"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differential land deformation patterns and response mechanism to groundwater recovery in the Beijing-Tianjin-Hebei Plain revealed by reconstructing and clustering of TS-InSAR observations 基于TS-InSAR观测数据重建与聚类的京津冀平原地表差异变形模式及地下水恢复响应机制
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-21 DOI: 10.1016/j.jag.2026.105196
Dexin Meng , Beibei Chen , Shubo Zhang , Huili Gong , Chaofan Zhou , Sidi Chen , Kunchao Lei , Haigang Wang
The Beijing-Tianjin-Hebei (BTH) Plain in north China has experienced significant land subsidence due to long-term groundwater overexploitation. Although water diversion project and climate change have improved groundwater conditions overall, the spatiotemporal heterogeneity in land deformation responses to groundwater recovery remains unclear. This study reconstructs and identifies differentiated deformation patterns across the BTH Plain under the new hydrological background, revealing their multi-scale response mechanisms to groundwater change. After mosaicking Time-Series Interferometric Synthetic Aperture Radar (TS-InSAR) deformation fields, we reconstructed the seasonal and long-term components of deformation. A long-term deformation characteristic-constrained K-means clustering method was developed to classify distinct deformation evolution patterns. Coupling these patterns with groundwater-level (GWL) observations to further distinguish the annual lag and long-term delayed response of deformation to GWL changes. From 2016 to 2024, the BTH Plain exhibits clear spatial partitioning of deformation behaviors, categorized as 10.14% continuous subsidence, 18.86% reduced subsidence, 46.09% stable deformation, and 24.91% slight rebound. Uncertainties in the reconstructed deformation reflect short-term fluctuations driven by extreme hydrological or anthropogenic disturbances. Short-time Fourier transform analysis reveals periodic response of deformation to GWL change and annual lag: elastic deformation dominated areas lag GWL by ≤ 2 months, whereas clay-rich persistent subsiding areas lag 4–6 months. The exponential decay model further indicates that the recovery of subsidence funnels is governed by geological structures: funnels intersecting faults show the slowest subsidence decay along their edges, while those without faults display a more uniform edge-to-center recovery pattern.
京津冀平原由于地下水长期过度开采,地表沉降严重。尽管调水工程和气候变化总体上改善了地下水条件,但地下水恢复对陆地变形响应的时空异质性尚不清楚。本研究在新的水文背景下重建并识别了BTH平原的分异变形模式,揭示了其对地下水变化的多尺度响应机制。通过拼接时间序列干涉合成孔径雷达(TS-InSAR)的形变场,重建了形变的季节分量和长期分量。提出了一种基于长期变形特征约束的k均值聚类方法,对不同的变形演化模式进行分类。将这些模式与地下水位(GWL)观测相结合,进一步区分变形对地下水位变化的年滞后和长期延迟响应。2016 - 2024年,BTH平原变形行为空间分异明显,表现为10.14%的持续沉降、18.86%的减少沉降、46.09%的稳定变形和24.91%的轻微反弹。重建变形的不确定性反映了极端水文或人为扰动驱动的短期波动。短时傅里叶变换分析揭示了变形对GWL变化和年滞后的周期性响应:弹性变形为主的地区滞后GWL≤2个月,而富含粘土的持续沉降区滞后4-6个月。指数衰减模型进一步表明,沉降漏斗的恢复受地质构造的支配:与断层相交的漏斗沿边缘的沉降衰减最慢,而无断层的漏斗沿边缘到中心的沉降恢复模式更为均匀。
{"title":"Differential land deformation patterns and response mechanism to groundwater recovery in the Beijing-Tianjin-Hebei Plain revealed by reconstructing and clustering of TS-InSAR observations","authors":"Dexin Meng ,&nbsp;Beibei Chen ,&nbsp;Shubo Zhang ,&nbsp;Huili Gong ,&nbsp;Chaofan Zhou ,&nbsp;Sidi Chen ,&nbsp;Kunchao Lei ,&nbsp;Haigang Wang","doi":"10.1016/j.jag.2026.105196","DOIUrl":"10.1016/j.jag.2026.105196","url":null,"abstract":"<div><div>The Beijing-Tianjin-Hebei (BTH) Plain in north China has experienced significant land subsidence due to long-term groundwater overexploitation. Although water diversion project and climate change have improved groundwater conditions overall, the spatiotemporal heterogeneity in land deformation responses to groundwater recovery remains unclear. This study reconstructs and identifies differentiated deformation patterns across the BTH Plain under the new hydrological background, revealing their multi-scale response mechanisms to groundwater change. After mosaicking Time-Series Interferometric Synthetic Aperture Radar (TS-InSAR) deformation fields, we reconstructed the seasonal and long-term components of deformation. A long-term deformation characteristic-constrained K-means clustering method was developed to classify distinct deformation evolution patterns. Coupling these patterns with groundwater-level (GWL) observations to further distinguish the annual lag and long-term delayed response of deformation to GWL changes. From 2016 to 2024, the BTH Plain exhibits clear spatial partitioning of deformation behaviors, categorized as 10.14% continuous subsidence, 18.86% reduced subsidence, 46.09% stable deformation, and 24.91% slight rebound. Uncertainties in the reconstructed deformation reflect short-term fluctuations driven by extreme hydrological or anthropogenic disturbances. Short-time Fourier transform analysis reveals periodic response of deformation to GWL change and annual lag: elastic deformation dominated areas lag GWL by ≤ 2 months, whereas clay-rich persistent subsiding areas lag 4–6 months. The exponential decay model further indicates that the recovery of subsidence funnels is governed by geological structures: funnels intersecting faults show the slowest subsidence decay along their edges, while those without faults display a more uniform edge-to-center recovery pattern.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105196"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147278238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GeoAgentic-RAG: A Multi-Agent framework for autonomous geospatial reasoning and visual insight generation with LLM geoagent - rag:基于LLM的自主地理空间推理和视觉洞察生成的多代理框架
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-20 DOI: 10.1016/j.jag.2026.105195
Chao Liang , Yuanzheng Cui , Run Shi , Guixiang Zha , Xin Yin , Mingzhong Xiao , Dong Xu , Xuejun Duan , Bo Huang
Conventional Retrieval-Augmented Generation (RAG) systems have limited effectiveness in geospatial question answering because text-based similarity retrieval cannot adequately represent spatial semantics such as topology and spatial context. To overcome this limitation, we propose GeoAgentic-RAG, a multi-agent framework that enables multimodal large language models (MLLMs) to perform autonomous geospatial reasoning. The framework integrates natural language query parsing, semantic-spatial retrieval, and executable geospatial analysis within a unified, agent-based workflow. Multiple specialized agents collaboratively interpret user queries, retrieve relevant vector and raster datasets from a unified geospatial database, decompose analytical tasks, generate valid spatial logic, and produce interpretable analytical results. We evaluate GeoAgentic-RAG using a benchmark of geospatial retrieval, feature characterization, and spatial relational reasoning tasks in Nanjing and Guangzhou. The proposed framework achieves a pass rate of 85.3% and an answer correctness of 88.3%, outperforming conventional RAG methods and representative code-generation baselines. These results demonstrate that agent-based integration of retrieval and spatial analysis substantially improves the reliability of geospatial question answering and provides a practical framework for the next-generation intelligent GIS applications.
传统的检索-增强生成(RAG)系统在地理空间问题回答中的有效性有限,因为基于文本的相似性检索不能充分表示拓扑和空间上下文等空间语义。为了克服这一限制,我们提出了geoagent - rag,这是一个多智能体框架,使多模态大语言模型(mllm)能够执行自主地理空间推理。该框架将自然语言查询解析、语义空间检索和可执行的地理空间分析集成在一个统一的、基于代理的工作流中。多个专门的代理协作解释用户查询,从统一的地理空间数据库检索相关的矢量和栅格数据集,分解分析任务,生成有效的空间逻辑,并产生可解释的分析结果。我们以南京和广州的地理空间检索、特征表征和空间关系推理任务为基准,对geoagent - rag进行了评估。该框架实现了85.3%的通过率和88.3%的答案正确性,优于传统的RAG方法和代表性的代码生成基线。这些结果表明,基于agent的检索和空间分析的集成大大提高了地理空间问题回答的可靠性,并为下一代智能GIS应用提供了一个实用的框架。
{"title":"GeoAgentic-RAG: A Multi-Agent framework for autonomous geospatial reasoning and visual insight generation with LLM","authors":"Chao Liang ,&nbsp;Yuanzheng Cui ,&nbsp;Run Shi ,&nbsp;Guixiang Zha ,&nbsp;Xin Yin ,&nbsp;Mingzhong Xiao ,&nbsp;Dong Xu ,&nbsp;Xuejun Duan ,&nbsp;Bo Huang","doi":"10.1016/j.jag.2026.105195","DOIUrl":"10.1016/j.jag.2026.105195","url":null,"abstract":"<div><div>Conventional Retrieval-Augmented Generation (RAG) systems have limited effectiveness in geospatial question answering because text-based similarity retrieval cannot adequately represent spatial semantics such as topology and spatial context. To overcome this limitation, we propose GeoAgentic-RAG, a multi-agent framework that enables multimodal large language models (MLLMs) to perform autonomous geospatial reasoning. The framework integrates natural language query parsing, semantic-spatial retrieval, and executable geospatial analysis within a unified, agent-based workflow. Multiple specialized agents collaboratively interpret user queries, retrieve relevant vector and raster datasets from a unified geospatial database, decompose analytical tasks, generate valid spatial logic, and produce interpretable analytical results. We evaluate GeoAgentic-RAG using a benchmark of geospatial retrieval, feature characterization, and spatial relational reasoning tasks in Nanjing and Guangzhou. The proposed framework achieves a pass rate of 85.3% and an answer correctness of 88.3%, outperforming conventional RAG methods and representative code-generation baselines. These results demonstrate that agent-based integration of retrieval and spatial analysis substantially improves the reliability of geospatial question answering and provides a practical framework for the next-generation intelligent GIS applications.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105195"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147278242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ESCAPE: An ensemble-based self-calibrated autoencoder with physics-informed estimation of high-resolution soil moisture and surface roughness from ALOS-2/PALSAR-2 polarimetric observations ESCAPE:基于集成的自校准自编码器,可根据ALOS-2/PALSAR-2极化观测数据估算高分辨率土壤湿度和表面粗糙度
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-26 DOI: 10.1016/j.jag.2026.105206
Jaese Lee , Haemi Park , Jungho Im , Christian Koyama , Rogelio Ruzcko Tobias , Takeo Tadono
Soil moisture (SM) is a key variable governing land–atmosphere interactions and terrestrial hydrological processes. Recent satellite missions provide global SM observations, among which ALOS-2/PALSAR-2 offers full polarimetric L-band SAR measurements at spatial resolutions as fine as 6 m. Retrieving SM from SAR data requires disentangling scattering contributions from vegetation and surface roughness (e.g., root mean square height; Hrms). Conventional approaches typically rely on collocated in-situ measurements of SM and Hrms for calibration, but such data are often sparse. In addition, the simultaneous estimation of multiple scattering parameters from limited SAR observables can lead to ill-posed inversion and unstable retrievals. To address these challenges, we propose ESCAPE (Ensemble-based Self-Calibrated Autoencoder with Physics-informed Estimation), a self-calibrating framework that integrates polarimetric ALOS-2/PALSAR-2 observations within a 10-member physics-informed neural network (PINN) ensemble. ESCAPE embeds physically based scattering models into an autoencoder architecture and estimates SM and Hrms without using in-situ SM measurements as direct training targets. The ensemble strategy mitigates uncertainty arising from ill-posed physical constraints and gradient-based optimization. Spatial evaluation over Spain and Austria demonstrates robust performance, with a correlation coefficient R = 0.701, an unbiased root mean squared difference (ubRMSD) of 0.089 m3 m−3, and a bias of 0.006 m3 m−3. Temporal evaluation at an independent site in Japan yields R = 0.568, ubRMSD of 0.06 m3 m−3, and a bias of − 0.13 m3 m−3. Ensemble analysis further reveals that aggregating multiple PINN realizations improves robustness by reducing the influence of poorly converged members. Compared with conventional approaches trained directly on in-situ data, ESCAPE exhibits improved generalization across heterogeneous environments. These results highlight the potential of ESCAPE as a self-calibrated, satellite-only framework for high-resolution estimation of geophysical parameters, demonstrating the value of combining physical principles with ensemble-based deep learning for SAR remote sensing applications.
土壤水分是控制陆地-大气相互作用和陆地水文过程的关键变量。最近的卫星任务提供了全球SM观测,其中ALOS-2/PALSAR-2提供了精确到6米空间分辨率的全极化l波段SAR测量。从SAR数据中提取SM需要从植被和表面粗糙度(例如,均方根高度;Hrms)中分离散射贡献。传统的方法通常依赖于SM和Hrms的同步原位测量来进行校准,但这样的数据通常是稀疏的。此外,从有限的SAR观测数据中同时估计多个散射参数会导致反演不适定和反演不稳定。为了解决这些挑战,我们提出了ESCAPE(基于集成的自校准自编码器与物理信息估计),这是一个自校准框架,将偏振ALOS-2/PALSAR-2观测集成在一个10成员物理信息神经网络(PINN)集成中。ESCAPE将基于物理的散射模型嵌入到自动编码器架构中,并在不使用原位SM测量作为直接训练目标的情况下估计SM和Hrms。集成策略减轻了由病态物理约束和基于梯度的优化引起的不确定性。西班牙和奥地利的空间评价表现出稳健的表现,相关系数R = 0.701,无偏均方根差(ubRMSD)为0.089 m3 m - 3,偏差为0.006 m3 m - 3。在日本的一个独立站点进行的时间评估得出R = 0.568, ubRMSD为0.06 m3 m - 3,偏差为- 0.13 m3 m - 3。集成分析进一步表明,聚合多个PINN实现通过减少低收敛成员的影响来提高鲁棒性。与直接在原位数据上训练的传统方法相比,ESCAPE在异构环境中表现出更好的泛化能力。这些结果突出了ESCAPE作为一种自校准的、仅用于高分辨率地球物理参数估计的卫星框架的潜力,展示了将物理原理与基于集成的深度学习相结合的SAR遥感应用的价值。
{"title":"ESCAPE: An ensemble-based self-calibrated autoencoder with physics-informed estimation of high-resolution soil moisture and surface roughness from ALOS-2/PALSAR-2 polarimetric observations","authors":"Jaese Lee ,&nbsp;Haemi Park ,&nbsp;Jungho Im ,&nbsp;Christian Koyama ,&nbsp;Rogelio Ruzcko Tobias ,&nbsp;Takeo Tadono","doi":"10.1016/j.jag.2026.105206","DOIUrl":"10.1016/j.jag.2026.105206","url":null,"abstract":"<div><div>Soil moisture (SM) is a key variable governing land–atmosphere interactions and terrestrial hydrological processes. Recent satellite missions provide global SM observations, among which ALOS-2/PALSAR-2 offers full polarimetric L-band SAR measurements at spatial resolutions as fine as 6 m. Retrieving SM from SAR data requires disentangling scattering contributions from vegetation and surface roughness (e.g., root mean square height; <em>H<sub>rms</sub></em>). Conventional approaches typically rely on collocated in-situ measurements of SM and <em>H<sub>rms</sub></em> for calibration, but such data are often sparse. In addition, the simultaneous estimation of multiple scattering parameters from limited SAR observables can lead to ill-posed inversion and unstable retrievals. To address these challenges, we propose ESCAPE (Ensemble-based Self-Calibrated Autoencoder with Physics-informed Estimation), a self-calibrating framework that integrates polarimetric ALOS-2/PALSAR-2 observations within a 10-member physics-informed neural network (PINN) ensemble. ESCAPE embeds physically based scattering models into an autoencoder architecture and estimates SM and <em>H<sub>rms</sub></em> without using in-situ SM measurements as direct training targets. The ensemble strategy mitigates uncertainty arising from ill-posed physical constraints and gradient-based optimization. Spatial evaluation over Spain and Austria demonstrates robust performance, with a correlation coefficient R = 0.701, an unbiased root mean squared difference (ubRMSD) of 0.089 m<sup>3</sup> m<sup>−3</sup>, and a bias of 0.006 m<sup>3</sup> m<sup>−3</sup>. Temporal evaluation at an independent site in Japan yields R = 0.568, ubRMSD of 0.06 m<sup>3</sup> m<sup>−3</sup>, and a bias of − 0.13 m<sup>3</sup> m<sup>−3</sup>. Ensemble analysis further reveals that aggregating multiple PINN realizations improves robustness by reducing the influence of poorly converged members. Compared with conventional approaches trained directly on in-situ data, ESCAPE exhibits improved generalization across heterogeneous environments. These results highlight the potential of ESCAPE as a self-calibrated, satellite-only framework for high-resolution estimation of geophysical parameters, demonstrating the value of combining physical principles with ensemble-based deep learning for SAR remote sensing applications.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105206"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic alignment of semantic building models with digital city models through multi-level point cloud semantic segmentation and registration using PSO optimization: A case study of the TUM main campus 基于PSO优化的多层次点云语义分割和配准的语义建筑模型与数字城市模型的自动对齐:以TUM主校区为例
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-26 DOI: 10.1016/j.jag.2026.105191
Mansour Mehranfar , Alexander Braun , André Borrmann , Medhini Heeramaglore , Thomas H. Kolbe , Zhaiyu Chen , Xiao Xiang Zhu
Multi-scale Digital Twins (DTs) of the built environment provide valuable spatial and semantic insights into city assets, supporting urban management, sustainability, and mobility. However, aligning semantic building models with large-scale digital city models remains a significant challenge due to discrepancies in geospatial alignment, spatial resolution, and heterogeneity of data sources. This paper proposes a novel pipeline for the automatic alignment of semantic indoor building models with digital city models. The pipeline comprises two main steps: first, multi-level semantic segmentation of using Artificial Intelligence (AI) models, and second, point cloud registration using the Particle Swarm Optimization (PSO) algorithm, which estimates the transformation parameters required to align the corresponding models. The results of testing the proposed pipeline on real-world data from the Technical University of Munich (TUM) demonstrate the effectiveness of the proposed method in aligning semantic digital models across multiple scales.
建筑环境的多尺度数字孪生(DTs)为城市资产提供了宝贵的空间和语义洞察,支持城市管理、可持续性和流动性。然而,由于地理空间对齐、空间分辨率和数据源异质性的差异,将语义建筑模型与大规模数字城市模型对齐仍然是一个重大挑战。本文提出了一种新的语义室内建筑模型与数字城市模型自动对齐的管道。该流程包括两个主要步骤:首先,使用人工智能(AI)模型进行多级语义分割,其次,使用粒子群优化(PSO)算法进行点云配准,该算法估计对齐相应模型所需的转换参数。在慕尼黑工业大学(TUM)的真实数据上测试所提出的管道的结果表明,所提出的方法在跨多个尺度对齐语义数字模型方面是有效的。
{"title":"Automatic alignment of semantic building models with digital city models through multi-level point cloud semantic segmentation and registration using PSO optimization: A case study of the TUM main campus","authors":"Mansour Mehranfar ,&nbsp;Alexander Braun ,&nbsp;André Borrmann ,&nbsp;Medhini Heeramaglore ,&nbsp;Thomas H. Kolbe ,&nbsp;Zhaiyu Chen ,&nbsp;Xiao Xiang Zhu","doi":"10.1016/j.jag.2026.105191","DOIUrl":"10.1016/j.jag.2026.105191","url":null,"abstract":"<div><div>Multi-scale Digital Twins (DTs) of the built environment provide valuable spatial and semantic insights into city assets, supporting urban management, sustainability, and mobility. However, aligning semantic building models with large-scale digital city models remains a significant challenge due to discrepancies in geospatial alignment, spatial resolution, and heterogeneity of data sources. This paper proposes a novel pipeline for the automatic alignment of semantic indoor building models with digital city models. The pipeline comprises two main steps: first, multi-level semantic segmentation of using Artificial Intelligence (AI) models, and second, point cloud registration using the Particle Swarm Optimization (PSO) algorithm, which estimates the transformation parameters required to align the corresponding models. The results of testing the proposed pipeline on real-world data from the Technical University of Munich (TUM) demonstrate the effectiveness of the proposed method in aligning semantic digital models across multiple scales.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105191"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tree-level inversion of leaf chlorophyll content from UAV imagery using a 3D model-based visual factor correction method for adjacency scattering effect 基于邻接散射效应的三维模型视觉因子校正方法反演无人机影像中叶片叶绿素含量
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-03-01 Epub Date: 2026-02-24 DOI: 10.1016/j.jag.2026.105200
Shangbo Liu , Linyuan Li , Seping Dai , Yu Su , Xiaoyao Li , Yanqiong Li , Xiankai Lu
For tree-level inversion of leaf chlorophyll content (LCC) from UAV imagery, the adjacency scattering effect of individual trees can affect the reflectance of the specific tree crown, thereby increasing the uncertainty and difficulty of LCC inversion. In this study, a novel visual factor (VF) correction framework for such adjacency scattering effect is proposed, which can efficiently quantify the adjacency effects by geometric probabilities instead of computationally intensive 3D radiative transfer modeling (3D-RTM). This framework is realized by extracting the light intersection probability between the target tree crown and the adjacency scenes (Crown, Ground, and Sky) via ray tracing on LiDAR-derived 3D scenes. We constructed a VF-based adjacency effect correction equation to optimize the extracted vegetation indices (VIs) of tree crowns which were utilized for tree-level LCC inversion. The results indicate that most ratio-based vegetation indices that include near-infrared bands, such as NDVI, RVI, MCARI, CIrededge, and GNDVI, increased the accuracy of LCC after calibration (the mean R2 of univariate prediction was improved to 0.46), while DVI and EVI were insensitive to the adjacency effect and TVI showed a decrease in accuracy. Notably, VIs after adjacency effect correction demonstrated a remarkable improvement under the multivariate regression algorithm, with the R2 increasing by 0.4 compared to that before correction, achieving an accuracy of R2 = 0.67. The method overcame the uncertainty of tree-level signals in UAV imagery by eliminating the influence of the adjacency scattering and provided an efficient solution for the accurate inversion of parameters at the individual-tree scale, which is of great application value for the assessment of precise forestry management.
对于无人机影像树叶叶绿素含量(LCC)的树级反演,单株树的邻接散射效应会影响到特定树冠的反射率,从而增加了LCC反演的不确定性和难度。本文提出了一种新的邻接散射效应的视觉因子(VF)校正框架,该框架可以有效地通过几何概率来量化邻接效应,而不是计算密集型的三维辐射传输建模(3D- rtm)。该框架通过对lidar衍生的3D场景进行光线追踪,提取目标树冠与相邻场景(crown、Ground和Sky)之间的光相交概率来实现。构建了基于vf的邻接效应校正方程,对提取的树冠植被指数(VIs)进行优化,用于树级LCC反演。结果表明,包含近红外波段的植被指数NDVI、RVI、MCARI、cireddge和GNDVI在校正后可提高LCC的精度(单变量预测R2均值提高至0.46),而DVI和EVI对邻接效应不敏感,TVI精度下降。值得注意的是,在多元回归算法下,邻接效应校正后的VIs得到了显著改善,R2比校正前提高了0.4,达到R2 = 0.67的精度。该方法通过消除邻接散射的影响,克服了无人机影像中树级信号的不确定性,为单树尺度参数的精确反演提供了有效的解决方案,对林业精准经营评价具有重要的应用价值。
{"title":"Tree-level inversion of leaf chlorophyll content from UAV imagery using a 3D model-based visual factor correction method for adjacency scattering effect","authors":"Shangbo Liu ,&nbsp;Linyuan Li ,&nbsp;Seping Dai ,&nbsp;Yu Su ,&nbsp;Xiaoyao Li ,&nbsp;Yanqiong Li ,&nbsp;Xiankai Lu","doi":"10.1016/j.jag.2026.105200","DOIUrl":"10.1016/j.jag.2026.105200","url":null,"abstract":"<div><div>For tree-level inversion of leaf chlorophyll content (LCC) from UAV imagery, the adjacency scattering effect of individual trees can affect the reflectance of the specific tree crown, thereby increasing the uncertainty and difficulty of LCC inversion. In this study, a novel visual factor (VF) correction framework for such adjacency scattering effect is proposed, which can efficiently quantify the adjacency effects by geometric probabilities instead of computationally intensive 3D radiative transfer modeling (3D-RTM). This framework is realized by extracting the light intersection probability between the target tree crown and the adjacency scenes (Crown, Ground, and Sky) via ray tracing on LiDAR-derived 3D scenes. We constructed a VF-based adjacency effect correction equation to optimize the extracted vegetation indices (VIs) of tree crowns which were utilized for tree-level LCC inversion. The results indicate that most ratio-based vegetation indices that include near-infrared bands, such as NDVI, RVI, MCARI, CIrededge, and GNDVI, increased the accuracy of LCC after calibration (the mean R<sup>2</sup> of univariate prediction was improved to 0.46), while DVI and EVI were insensitive to the adjacency effect and TVI showed a decrease in accuracy. Notably, VIs after adjacency effect correction demonstrated a remarkable improvement under the multivariate regression algorithm, with the R<sup>2</sup> increasing by 0.4 compared to that before correction, achieving an accuracy of R<sup>2</sup> = 0.67. The method overcame the uncertainty of tree-level signals in UAV imagery by eliminating the influence of the adjacency scattering and provided an efficient solution for the accurate inversion of parameters at the individual-tree scale, which is of great application value for the assessment of precise forestry management.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"147 ","pages":"Article 105200"},"PeriodicalIF":8.6,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147360641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International journal of applied earth observation and geoinformation : ITC journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1