首页 > 最新文献

International journal of applied earth observation and geoinformation : ITC journal最新文献

英文 中文
A global analysis of SAR altimetry signals over different landcover types 不同地表覆盖类型SAR测高信号的全球分析
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-03 DOI: 10.1016/j.jag.2025.105000
Maximilian Eitel, Michael Schmitt
Satellite radar altimeters were originally designed for water applications, but their echoes over land surfaces remain less well understood. In this study we analyze how Sentinel-3 (S3) synthetic aperture radar (SAR) altimetry waveforms respond to different surface types and what physical characteristics are encoded in the signal. To probe this, we conduct classification experiments with a feature-enhanced one-dimensional convolutional neural network (1D-CNN) and analyze its performance. Since surface type information is relevant for climate, hydrology, and biodiversity applications, understanding these signal responses shows to what extent altimetric waveforms may provide consistent class-specific information despite their large elliptical footprint and heterogeneous landscapes. This study investigates the response of Sentinel-3 altimetry waveforms to different land cover types by employing a 1D-CNN to extract land cover information, complemented by a visual analysis of waveform patterns in relation to surface structures. Our results show that information about the underlying surface is embedded in the signals and can be extracted. They further reveal the sensitivity of Sentinel-3 altimetry to variations in land cover. By enhancing our 1D-CNN model with shape-based and contextual features, it effectively captures surface characteristics despite the large altimeter footprint. An ablation study highlights the complementary role of these features, as their removal negatively impacts performance. The best-performing 1D-CNN achieves a macro-averaged F1 (Macro-F1) score of 0.57 and an overall accuracy of 0.67, outperforming both a random forest and a dummy baseline. The classification includes six surface types: Tree, Shrub, Grass, Crop, Bare/Sparse Vegetation, and Water. Although some misclassification occurs, particularly in transition zones and among classes with similar vegetation structures and soil properties, the model provides valuable insights into systematic waveform behavior, highlighting the potential of SAR altimetry signals to capture broad surface characteristics.
卫星雷达高度计最初是为水上应用而设计的,但它们在陆地表面的回波仍然不太清楚。在本研究中,我们分析了Sentinel-3 (S3)合成孔径雷达(SAR)测高波形对不同地表类型的响应,以及信号中编码的物理特征。为了探讨这一点,我们使用特征增强的一维卷积神经网络(1D-CNN)进行分类实验,并分析其性能。由于地表类型信息与气候、水文和生物多样性应用相关,了解这些信号响应表明,尽管高海拔波形具有较大的椭圆足迹和异质性景观,但它们在多大程度上可以提供一致的类别特定信息。本研究通过使用1D-CNN提取土地覆盖信息,并辅以与地表结构相关的波形模式的可视化分析,研究了Sentinel-3高度计波形对不同土地覆盖类型的响应。我们的研究结果表明,下垫面的信息被嵌入到信号中,并且可以被提取出来。它们进一步揭示了哨兵3号测高对土地覆盖变化的敏感性。通过使用基于形状和上下文的特征增强我们的1D-CNN模型,尽管高度计占用空间很大,但它仍能有效地捕获表面特征。消融研究强调了这些特征的互补作用,因为它们的去除会对性能产生负面影响。表现最好的1D-CNN的宏观平均F1 (Macro-F1)得分为0.57,总体精度为0.67,优于随机森林和虚拟基线。分类包括六种表面类型:树木、灌木、草、作物、裸/稀疏植被和水。虽然会出现一些错误分类,特别是在过渡带和具有相似植被结构和土壤性质的类别之间,但该模型提供了对系统波形行为的有价值的见解,突出了SAR测高信号捕捉广泛地表特征的潜力。
{"title":"A global analysis of SAR altimetry signals over different landcover types","authors":"Maximilian Eitel,&nbsp;Michael Schmitt","doi":"10.1016/j.jag.2025.105000","DOIUrl":"10.1016/j.jag.2025.105000","url":null,"abstract":"<div><div>Satellite radar altimeters were originally designed for water applications, but their echoes over land surfaces remain less well understood. In this study we analyze how Sentinel-3 (S3) synthetic aperture radar (SAR) altimetry waveforms respond to different surface types and what physical characteristics are encoded in the signal. To probe this, we conduct classification experiments with a feature-enhanced one-dimensional convolutional neural network (1D-CNN) and analyze its performance. Since surface type information is relevant for climate, hydrology, and biodiversity applications, understanding these signal responses shows to what extent altimetric waveforms may provide consistent class-specific information despite their large elliptical footprint and heterogeneous landscapes. This study investigates the response of Sentinel-3 altimetry waveforms to different land cover types by employing a 1D-CNN to extract land cover information, complemented by a visual analysis of waveform patterns in relation to surface structures. Our results show that information about the underlying surface is embedded in the signals and can be extracted. They further reveal the sensitivity of Sentinel-3 altimetry to variations in land cover. By enhancing our 1D-CNN model with shape-based and contextual features, it effectively captures surface characteristics despite the large altimeter footprint. An ablation study highlights the complementary role of these features, as their removal negatively impacts performance. The best-performing 1D-CNN achieves a macro-averaged F1 (Macro-F1) score of 0.57 and an overall accuracy of 0.67, outperforming both a random forest and a dummy baseline. The classification includes six surface types: Tree, Shrub, Grass, Crop, Bare/Sparse Vegetation, and Water. Although some misclassification occurs, particularly in transition zones and among classes with similar vegetation structures and soil properties, the model provides valuable insights into systematic waveform behavior, highlighting the potential of SAR altimetry signals to capture broad surface characteristics.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105000"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An automated method for estimating the depth of melt ponds using ICESat-2 LiDAR point cloud data: application to surface melt of Arctic sea ice 利用ICESat-2激光雷达点云数据估算融化池深度的自动化方法:应用于北极海冰表面融化
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-16 DOI: 10.1016/j.jag.2025.105033
Xiaoyi Shen , Haili Li , Chang-Qing Ke
The depth of melt ponds upon Arctic sea ice is a critical indicator for understanding the state of sea ice and the progression of climate change. The advent of the ICESat-2 lidar altimeter offers a promising avenue to monitor melt ponds, and existing studies are capable of detecting melt pond location and depth. Here we propose a novel approach that incorporates ICESat-2 photon positioning and elevation uncertainties. By converting the photon elevation distribution along the satellite track into a two-dimensional image where horizontal and vertical axes represent track distance and elevation, we employ image classification techniques to distinguish the surface and bottom of melt ponds. This provides a fully automated method for identifying melt ponds, and then the estimation of their depths. Converting photon information into images effectively simplifies the computation. We tested this method on 100 randomly distributed melt pond samples of varying sizes and depths. Out of these, 97 melt ponds were successfully identified. A comparison with manually annotated data revealed an average absolute bias of 5 cm and a correlation coefficient of 0.86, outperforming other methods. This approach can detect large and deep melt ponds with widths greater than 17 m and depths ranging from 0.3 to 2 m, which can be further used to monitor the melt ponds in the whole Arctic. It facilitates the acquisition of more detailed melt pond depth information, which is crucial for quantifying the surface melt of Arctic sea ice.
北极海冰上融化池的深度是了解海冰状况和气候变化进程的关键指标。ICESat-2激光雷达高度计的出现为监测熔池提供了一条有前途的途径,现有的研究能够探测熔池的位置和深度。本文提出了一种结合ICESat-2光子定位和高程不确定性的新方法。通过将卫星轨道上的光子高程分布转换成二维图像(横轴和纵轴分别表示轨道距离和高程),利用图像分类技术区分熔池的表面和底部。这提供了一种完全自动化的方法来识别融化池,然后估计它们的深度。将光子信息转换成图像有效地简化了计算。我们在100个随机分布的不同大小和深度的熔池样本上测试了这种方法。其中,97个融池被成功识别。与人工标注的数据比较,平均绝对偏差为5 cm,相关系数为0.86,优于其他方法。该方法可以探测到宽度大于17 m、深度在0.3 ~ 2 m之间的大而深的融化池,可进一步用于整个北极地区的融化池监测。它有助于获得更详细的融池深度信息,这对于量化北极海冰的表面融化至关重要。
{"title":"An automated method for estimating the depth of melt ponds using ICESat-2 LiDAR point cloud data: application to surface melt of Arctic sea ice","authors":"Xiaoyi Shen ,&nbsp;Haili Li ,&nbsp;Chang-Qing Ke","doi":"10.1016/j.jag.2025.105033","DOIUrl":"10.1016/j.jag.2025.105033","url":null,"abstract":"<div><div>The depth of melt ponds upon Arctic sea ice is a critical indicator for understanding the state of sea ice and the progression of climate change. The advent of the ICESat-2 lidar altimeter offers a promising avenue to monitor melt ponds, and existing studies are capable of detecting melt pond location and depth. Here we propose a novel approach that incorporates ICESat-2 photon positioning and elevation uncertainties. By converting the photon elevation distribution along the satellite track into a two-dimensional image where horizontal and vertical axes represent track distance and elevation, we employ image classification techniques to distinguish the surface and bottom of melt ponds. This provides a fully automated method for identifying melt ponds, and then the estimation of their depths. Converting photon information into images effectively simplifies the computation. We tested this method on 100 randomly distributed melt pond samples of varying sizes and depths. Out of these, 97 melt ponds were successfully identified. A comparison with manually annotated data revealed an average absolute bias of 5 cm and a correlation coefficient of 0.86, outperforming other methods. This approach can detect large and deep melt ponds with widths greater than 17 m and depths ranging from 0.3 to 2 m, which can be further used to monitor the melt ponds in the whole Arctic. It facilitates the acquisition of more detailed melt pond depth information, which is crucial for quantifying the surface melt of Arctic sea ice.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105033"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Remote sensing of a gantry-equipped facility: Optimizing accuracy by integrating SfM photogrammetry and laserscan computer graphics through fixed base model geometry 配备龙门架的设施的遥感:通过固定的基础几何模型,通过集成SfM摄影测量和激光扫描计算机图形来优化精度
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2026-01-17 DOI: 10.1016/j.jag.2026.105098
Eise W. Nota , Brechtje A. van Amstel , Wiebe Nijland , Marcel C.G. van Maarseveen , Maarten G. Kleinhans
Gantry-equipped facilities are used in a wide range of research fields, for instance geomorphology. Such setups allow for frequent and comprehensive measurements from sensors mounted on gantries or in fixed overhead position. These measurements often require high spatial and relative accuracy because of small scale morphology. Our objective is to develop high-performance data processing methods for our 20 by 3 m laboratory facility that is used for physical scale experiments on coastal morphology. We compare new and existing methods through quantification of spatial and relative errors in DEMs. Our new data processing methods incorporate SfM photogrammetry and computer graphics using a “base model” of our facility under idealized conditions. With this base model, we are able to successfully align all sensor outputs, creating large and geometrically consistent timeseries of orthomosaics and DEMs. Furthermore, the base model geometry provides for a gridded laserscan processing method, incorporating fixed extrinsic, intrinsic and distortion parameters of the sensors along the gantry. Compared to SfM photogrammetrically constructed DEMs, this new method shows greatly improved relative accuracy (0.74 mm compared to 0.97–4.18 mm) and spatial accuracy (2.13 mm compared to 2.39–8.25 mm). Moreover, we show that variations of extrinsic parameters of laserscan equipment along the gantry survey are significant, while these parameters are often assumed to be constant. Our improved methods may open up new research questions in experimental geomorphology and can be applied to other facilities, research fields and industries.
配备龙门的设备广泛用于地貌学等研究领域。这样的设置允许频繁和全面的测量传感器安装在龙门架或固定的头顶位置。由于小尺度的形貌,这些测量通常需要很高的空间和相对精度。我们的目标是为我们的20 × 3米的实验室设施开发高性能的数据处理方法,用于海岸形态的物理尺度实验。我们通过量化dem的空间误差和相对误差,比较了新的和现有的方法。我们的新数据处理方法结合了SfM摄影测量和计算机图形学,使用我们设施在理想条件下的“基本模型”。有了这个基本模型,我们能够成功地对齐所有传感器输出,创建大的、几何上一致的正交和dem时间序列。此外,基本几何模型提供了一种网格化激光扫描处理方法,将传感器沿龙门的固定外在、内在和畸变参数结合起来。与SfM摄影测量构造的dem相比,该方法的相对精度(0.74 mm比0.97-4.18 mm)和空间精度(2.13 mm比2.39-8.25 mm)有了很大的提高。此外,我们还表明,激光扫描设备的外在参数在龙门测量过程中的变化是显著的,而这些参数通常被认为是恒定的。我们的改进方法可以为实验地貌学开辟新的研究问题,并可应用于其他设施、研究领域和行业。
{"title":"Remote sensing of a gantry-equipped facility: Optimizing accuracy by integrating SfM photogrammetry and laserscan computer graphics through fixed base model geometry","authors":"Eise W. Nota ,&nbsp;Brechtje A. van Amstel ,&nbsp;Wiebe Nijland ,&nbsp;Marcel C.G. van Maarseveen ,&nbsp;Maarten G. Kleinhans","doi":"10.1016/j.jag.2026.105098","DOIUrl":"10.1016/j.jag.2026.105098","url":null,"abstract":"<div><div>Gantry-equipped facilities are used in a wide range of research fields, for instance geomorphology. Such setups allow for frequent and comprehensive measurements from sensors mounted on gantries or in fixed overhead position. These measurements often require high spatial and relative accuracy because of small scale morphology. Our objective is to develop high-performance data processing methods for our 20 by 3 m laboratory facility that is used for physical scale experiments on coastal morphology. We compare new and existing methods through quantification of spatial and relative errors in DEMs. Our new data processing methods incorporate SfM photogrammetry and computer graphics using a “base model” of our facility under idealized conditions. With this base model, we are able to successfully align all sensor outputs, creating large and geometrically consistent timeseries of orthomosaics and DEMs. Furthermore, the base model geometry provides for a gridded laserscan processing method, incorporating fixed extrinsic, intrinsic and distortion parameters of the sensors along the gantry. Compared to SfM photogrammetrically constructed DEMs, this new method shows greatly improved relative accuracy (0.74 mm compared to 0.97–4.18 mm) and spatial accuracy (2.13 mm compared to 2.39–8.25 mm). Moreover, we show that variations of extrinsic parameters of laserscan equipment along the gantry survey are significant, while these parameters are often assumed to be constant. Our improved methods may open up new research questions in experimental geomorphology and can be applied to other facilities, research fields and industries.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105098"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moving vehicles tracking from satellite video data based on spatiotemporal high-order relation learning and reasoning 基于时空高阶关系学习与推理的卫星视频移动车辆跟踪
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-13 DOI: 10.1016/j.jag.2025.105015
Ziyuan Feng , Xianfeng Zhang , Bo Zhou , Miao Ren , Xiaobo Zhi
Tracking moving vehicles in satellite videos presents several challenges, including complex background interference and the difficulty of detecting small targets. Most existing multiple object tracking (MOT) methods utilize convolutional models to capture local semantics or self-attention mechanisms to address global semantics for moving target detection. However, these methods tend to struggle with small and visually similar targets, making them particularly vulnerable to complex background interference, which often results in a large number of false positives and missed detections. Furthermore, many current approaches rely on the Hungarian matching algorithm or other intricate, unlearnable association optimization methods to achieve effective tracking once relevant information is gathered. This reliance often yields suboptimal outputs from the network models. To tackle these issues, this article presents an end-to-end graph network based on spatiotemporal high-order relation learning and reasoning for vehicle tracking in satellite video. The representation module of spatial high-order relations is designed to capture the spatial high-order relations between moving vehicles and their local environments, as well as global key references. Meanwhile, the temporal semantic reasoning module focuses on analyzing the evolution of these spatial high-order relations over time, thereby constructing the spatiotemporal high-order connections among the targets of interest and ensuring the continuous and stable detection of moving vehicles. Ultimately, a graph network based on spatiotemporal high-order relation reasoning is developed to perform learnable associations of target information across video frames, achieving a globally optimal solution to the tracking problem. Comparative experiments on the SatVideoDT, CGSTL, and ShuangQing-1 satellite video datasets demonstrate that the proposed method effectively enables end-to-end tracking of moving vehicles, attaining state-of-the-art performance across most evaluation metrics. On the SatVideoDT dataset, the model achieves a Multiple Object Tracking Accuracy (MOTA) of 65.1% and an Identity F1 Score (IDF1) of 70.9%. The proposed network model holds significant promise for the automated interpretation of satellite video data. The code is available at https://github.com/zsspo/GHOST-R.
在卫星视频中跟踪移动车辆存在一些挑战,包括复杂的背景干扰和检测小目标的困难。现有的多目标跟踪(MOT)方法大多利用卷积模型捕获局部语义或自注意机制来处理运动目标检测的全局语义。然而,这些方法往往与小的和视觉上相似的目标作斗争,使它们特别容易受到复杂背景干扰,这往往导致大量的误报和漏检。此外,目前的许多方法依赖于匈牙利匹配算法或其他复杂的、不可学习的关联优化方法,一旦收集到相关信息,就可以实现有效的跟踪。这种依赖通常会从网络模型中产生次优输出。为了解决这些问题,本文提出了一种基于时空高阶关系学习和推理的端到端图网络,用于卫星视频中的车辆跟踪。空间高阶关系表示模块旨在捕捉运动车辆与其局部环境以及全局关键引用之间的空间高阶关系。同时,时间语义推理模块侧重于分析这些空间高阶关系随时间的演变,从而构建感兴趣目标之间的时空高阶联系,保证对运动车辆的连续稳定检测。最后,开发了基于时空高阶关系推理的图网络,实现了跨视频帧目标信息的可学习关联,实现了跟踪问题的全局最优解。在SatVideoDT、CGSTL和双清一号卫星视频数据集上的对比实验表明,所提出的方法有效地实现了移动车辆的端到端跟踪,在大多数评估指标上都达到了最先进的性能。在SatVideoDT数据集上,该模型实现了65.1%的多目标跟踪精度(MOTA)和70.9%的身份F1分数(IDF1)。所提出的网络模型对卫星视频数据的自动解释具有重要的前景。代码可在https://github.com/zsspo/GHOST-R上获得。
{"title":"Moving vehicles tracking from satellite video data based on spatiotemporal high-order relation learning and reasoning","authors":"Ziyuan Feng ,&nbsp;Xianfeng Zhang ,&nbsp;Bo Zhou ,&nbsp;Miao Ren ,&nbsp;Xiaobo Zhi","doi":"10.1016/j.jag.2025.105015","DOIUrl":"10.1016/j.jag.2025.105015","url":null,"abstract":"<div><div>Tracking moving vehicles in satellite videos presents several challenges, including complex background interference and the difficulty of detecting small targets. Most existing multiple object tracking (MOT) methods utilize convolutional models to capture local semantics or self-attention mechanisms to address global semantics for moving target detection. However, these methods tend to struggle with small and visually similar targets, making them particularly vulnerable to complex background interference, which often results in a large number of false positives and missed detections. Furthermore, many current approaches rely on the Hungarian matching algorithm or other intricate, unlearnable association optimization methods to achieve effective tracking once relevant information is gathered. This reliance often yields suboptimal outputs from the network models. To tackle these issues, this article presents an end-to-end graph network based on spatiotemporal high-order relation learning and reasoning for vehicle tracking in satellite video. The representation module of spatial high-order relations is designed to capture the spatial high-order relations between moving vehicles and their local environments, as well as global key references. Meanwhile, the temporal semantic reasoning module focuses on analyzing the evolution of these spatial high-order relations over time, thereby constructing the spatiotemporal high-order connections among the targets of interest and ensuring the continuous and stable detection of moving vehicles. Ultimately, a graph network based on spatiotemporal high-order relation reasoning is developed to perform learnable associations of target information across video frames, achieving a globally optimal solution to the tracking problem. Comparative experiments on the SatVideoDT, CGSTL, and ShuangQing-1 satellite video datasets demonstrate that the proposed method effectively enables end-to-end tracking of moving vehicles, attaining state-of-the-art performance across most evaluation metrics. On the SatVideoDT dataset, the model achieves a Multiple Object Tracking Accuracy (MOTA) of 65.1% and an Identity F1 Score (IDF1) of 70.9%. The proposed network model holds significant promise for the automated interpretation of satellite video data. The code is available at https://github.com/zsspo/GHOST-R.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105015"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145759385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI in soil moisture remote sensing 人工智能在土壤湿度遥感中的应用
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-11 DOI: 10.1016/j.jag.2025.105011
Carsten Montzka , Luca Brocca , Hao Chen , Narendra N. Das , Antara Dasgupta , Mehdi Rahmati , Thomas Jagdhuber
Soil moisture, a pivotal component of the hydrological cycle, exerts a profound influence on land surface exchange processes, but its spatial variability poses challenges for large-scale field observations, increasing reliance on satellite-based retrievals. However, spaceborne estimates face limitations due to model uncertainties and sensor-related constraints. Recent advances in artificial intelligence (AI) offer promising alternatives to traditional methods by enabling data-driven estimation of soil moisture without strong physical assumptions. Thus, a critical review of emerging AI-based soil moisture retrieval methods with respect to their advantages and disadvantages is vital to ensure the best utilization of such tools for soil moisture sensing, especially with novel sensors and data constantly being generated.
In this comprehensive review, we furnish the first structured overview of AI methods and their applications in soil moisture retrievals from remote sensing. AI is able to enhance soil moisture retrieval by learning complex (highly nonlinear) relationships between satellite observations and ground reference data, to support time series reconstruction by filling gaps in data sets, to estimate subsurface soil moisture conditions from surface signals and auxiliary inputs, to enable spatial scaling by translating soil moisture estimates across different resolutions using multi-resolution data, to predict temporal dynamics as a soil moisture forecast, and to contribute to broader assessments of the water cycle and beyond by integrating soil moisture with further hydrological variables. Future directions for each method are also identified to address the scientific challenges of soil moisture retrieval and help focus the research community on the key open questions in the new era of rapidly expanding AI applications.
土壤湿度是水循环的关键组成部分,对陆地表面交换过程产生深远影响,但其空间变异性对大规模野外观测构成挑战,增加了对卫星检索的依赖。然而,由于模型的不确定性和与传感器相关的限制,星载估算面临局限性。人工智能(AI)的最新进展为传统方法提供了有希望的替代方案,即在没有强物理假设的情况下,实现数据驱动的土壤湿度估计。因此,对新兴的基于人工智能的土壤水分检索方法的优缺点进行批判性回顾,对于确保最佳地利用这些工具进行土壤水分传感至关重要,特别是在不断产生新的传感器和数据的情况下。在这篇全面的综述中,我们提供了人工智能方法及其在土壤水分遥感反演中的应用的第一个结构化概述。人工智能能够通过学习卫星观测和地面参考数据之间复杂(高度非线性)的关系来增强土壤湿度检索,通过填补数据集中的空白来支持时间序列重建,通过地面信号和辅助输入来估计地下土壤湿度状况,通过使用多分辨率数据转换不同分辨率的土壤湿度估计来实现空间缩放,预测时间动态作为土壤湿度预测。并通过将土壤湿度与其他水文变量结合起来,为更广泛的水循环评估做出贡献。还确定了每种方法的未来方向,以解决土壤水分检索的科学挑战,并帮助研究界关注快速扩展人工智能应用的新时代的关键开放性问题。
{"title":"AI in soil moisture remote sensing","authors":"Carsten Montzka ,&nbsp;Luca Brocca ,&nbsp;Hao Chen ,&nbsp;Narendra N. Das ,&nbsp;Antara Dasgupta ,&nbsp;Mehdi Rahmati ,&nbsp;Thomas Jagdhuber","doi":"10.1016/j.jag.2025.105011","DOIUrl":"10.1016/j.jag.2025.105011","url":null,"abstract":"<div><div>Soil moisture, a pivotal component of the hydrological cycle, exerts a profound influence on land surface exchange processes, but its spatial variability poses challenges for large-scale field observations, increasing reliance on satellite-based retrievals. However, spaceborne estimates face limitations due to model uncertainties and sensor-related constraints. Recent advances in artificial intelligence (AI) offer promising alternatives to traditional methods by enabling data-driven estimation of soil moisture without strong physical assumptions. Thus, a critical review of emerging AI-based soil moisture retrieval methods with respect to their advantages and disadvantages is vital to ensure the best utilization of such tools for soil moisture sensing, especially with novel sensors and data constantly being generated.</div><div>In this comprehensive review, we furnish the first structured overview of AI methods and their applications in soil moisture retrievals from remote sensing. AI is able to enhance soil moisture retrieval by learning complex (highly nonlinear) relationships between satellite observations and ground reference data, to support time series reconstruction by filling gaps in data sets, to estimate subsurface soil moisture conditions from surface signals and auxiliary inputs, to enable spatial scaling by translating soil moisture estimates across different resolutions using multi-resolution data, to predict temporal dynamics as a soil moisture forecast, and to contribute to broader assessments of the water cycle and beyond by integrating soil moisture with further hydrological variables. Future directions for each method are also identified to address the scientific challenges of soil moisture retrieval and help focus the research community on the key open questions in the new era of rapidly expanding AI applications.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105011"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sim-to-real image to image translation for remote sensing fine-grained ship images using generative diffusion models 基于生成扩散模型的遥感细粒度船舶图像的模拟到实像到图像转换
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-11 DOI: 10.1016/j.jag.2025.104996
Zhiming Deng, Baixin Ai, Tianyu Zhang, Cheng Wei, Xibin Cao
Advances in artificial intelligence have enabled the automation of many remote sensing tasks; however, performance remains constrained by dataset quality, especially for fine-grained ship classification in remote sensing imagery, where publicly available datasets suffer from class imbalance and sample scarcity. To address these issues, we propose a novel simulation-to-real style-transfer pipeline for fine-grained ship imagery, comprising three modules: the Simulation Image Generation (SIG) module, the Conditional Image Generation (IG) module, and the Wake Inpainting (WI) module. In the SIG module, we construct an optical remote sensing imaging system capable of producing high-resolution simulated images containing fine-grained ship objects. To overcome the loss of detailed features inherent in global style-transfer methods, the IG module introduces the SPAM-ControlNet algorithm, which generates fine-grained ship images with accurate characteristics. In the WI module, we generate the inpainting region at the stern based on the ship wake model, then apply Stable Diffusion Inpainting to synthesize realistic wake patterns, thereby harmonizing the generated ship objects with the ocean background. This pipeline enables the synthesis of seamless, high-resolution remote sensing images populated with detailed ship objects. Building on this pipeline, we also release a hybrid dataset, FGSCR-SR-12, which combines real and synthetic images across 12 ship classes to mitigate long-tail distribution challenges caused by scarce classes. All code and the FGSCR-SR-12 dataset are publicly available at https://github.com/Slimyer/SPAM-Controlnet.
人工智能的进步使许多遥感任务实现了自动化;然而,性能仍然受到数据集质量的限制,特别是对于遥感图像中的细粒度船舶分类,其中公开可用的数据集遭受类别不平衡和样本稀缺性的影响。为了解决这些问题,我们提出了一种用于细粒度船舶图像的新型模拟到真实风格传输管道,包括三个模块:模拟图像生成(SIG)模块,条件图像生成(IG)模块和尾迹绘制(WI)模块。在SIG模块中,我们构建了一个光学遥感成像系统,能够生成包含细粒度船舶物体的高分辨率模拟图像。为了克服全局风格转移方法固有的细节特征的丢失,IG模块引入了SPAM-ControlNet算法,该算法生成具有精确特征的细粒度船舶图像。在WI模块中,我们基于船舶尾流模型在尾部生成着色区域,然后应用稳定扩散着色合成真实的尾流图案,从而使生成的船舶目标与海洋背景相协调。该管道可以合成无缝、高分辨率的遥感图像,其中包含详细的船舶物体。在此基础上,我们还发布了一个混合数据集fgscrr - sr -12,该数据集结合了12个船级的真实和合成图像,以减轻由于船级稀缺造成的长尾分布挑战。所有代码和FGSCR-SR-12数据集可在https://github.com/Slimyer/SPAM-Controlnet上公开获取。
{"title":"Sim-to-real image to image translation for remote sensing fine-grained ship images using generative diffusion models","authors":"Zhiming Deng,&nbsp;Baixin Ai,&nbsp;Tianyu Zhang,&nbsp;Cheng Wei,&nbsp;Xibin Cao","doi":"10.1016/j.jag.2025.104996","DOIUrl":"10.1016/j.jag.2025.104996","url":null,"abstract":"<div><div>Advances in artificial intelligence have enabled the automation of many remote sensing tasks; however, performance remains constrained by dataset quality, especially for fine-grained ship classification in remote sensing imagery, where publicly available datasets suffer from class imbalance and sample scarcity. To address these issues, we propose a novel simulation-to-real style-transfer pipeline for fine-grained ship imagery, comprising three modules: the Simulation Image Generation (SIG) module, the Conditional Image Generation (IG) module, and the Wake Inpainting (WI) module. In the SIG module, we construct an optical remote sensing imaging system capable of producing high-resolution simulated images containing fine-grained ship objects. To overcome the loss of detailed features inherent in global style-transfer methods, the IG module introduces the SPAM-ControlNet algorithm, which generates fine-grained ship images with accurate characteristics. In the WI module, we generate the inpainting region at the stern based on the ship wake model, then apply Stable Diffusion Inpainting to synthesize realistic wake patterns, thereby harmonizing the generated ship objects with the ocean background. This pipeline enables the synthesis of seamless, high-resolution remote sensing images populated with detailed ship objects. Building on this pipeline, we also release a hybrid dataset, FGSCR-SR-12, which combines real and synthetic images across 12 ship classes to mitigate long-tail distribution challenges caused by scarce classes. All code and the FGSCR-SR-12 dataset are publicly available at <span><span>https://github.com/Slimyer/SPAM-Controlnet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 104996"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards onboard thermal hotspots segmentation with raw multispectral satellite imagery 基于原始多光谱卫星图像的机载热热点分割研究
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2026-01-15 DOI: 10.1016/j.jag.2026.105095
Cristopher Castro Traba , David Rijlaarsdam , Jian Guo , Roberto Del Prete , Gabriele Meoni
The rapid spread and destructive nature of wildfires and volcanic activity have intensified the need for low latency detection systems. The growing intensity and frequency of globally distributed thermal hotspots have driven the development of satellite-based detection solutions. Conventional approaches rely on ground-based processing, which limits low latency capabilities due to revisit times over ground stations and data handling requirements. This work proposes the first onboard payload processing pipeline for segmentation of thermal hotspots in raw multispectral satellite imagery. The pipeline leverages the Near InfraRed (NIR) and Short-Wave InfraRed (SWIR) spectral bands, and the combination of onboard Artificial Intelligence (AI) and raw imagery significantly reduces the delay between image acquisition and event detection. Furthermore, we present Segmentation of Thermal Hotspots in Raw Sentinel-2 data (SegTHRawS), the first publicly available dataset for thermal hotspot segmentation in raw multispectral satellite imagery. The segmentation model employed is a Fully Convolutional Network (FCN) derived from U-Net, named ResUnet-S2, designed for fast on-device inference. This model achieved an Intersection over Union (IoU) of 0.988 and an F-1 score of 0.986 on SegTHRawS, with its detection and generalization capabilities validated using an external thermal hotspot segmentation dataset. The proposed pipeline was verified on CubeSat-compatible hardware, achieving an end-to-end execution, from image acquisition to event detection, in 1.45 s, faster than the image acquisition process, and consuming a peak power of 4.05 W. These results demonstrate the potential of onboard processing solutions for minimizing the detection latency of current approaches, particularly for thermal hotspot segmentation, using edge computing satellite hardware.
野火和火山活动的迅速蔓延和破坏性加剧了对低延迟探测系统的需求。全球分布的热热点的强度和频率不断增加,推动了基于卫星的探测解决方案的发展。传统方法依赖于地面处理,由于地面站的重访时间和数据处理要求,这限制了低延迟能力。这项工作提出了第一个机载有效载荷处理管道,用于分割原始多光谱卫星图像中的热热点。该管道利用了近红外(NIR)和短波红外(SWIR)光谱波段,并结合了机载人工智能(AI)和原始图像,大大减少了图像采集和事件检测之间的延迟。此外,我们提出了Sentinel-2原始数据中的热热点分割(SegTHRawS),这是第一个公开的多光谱卫星原始图像热热点分割数据集。所采用的分割模型是源自U-Net的全卷积网络(FCN),名为ResUnet-S2,旨在实现快速的设备上推理。该模型在SegTHRawS上实现了0.988的IoU和0.986的F-1分数,并使用外部热热点分割数据集验证了其检测和泛化能力。在与cubesat兼容的硬件上验证了所提出的管道,实现了从图像采集到事件检测的端到端执行,时间为1.45 s,比图像采集过程快,峰值功耗为4.05 W。这些结果表明,利用边缘计算卫星硬件,机载处理解决方案可以最大限度地减少当前方法的检测延迟,特别是在热热点分割方面。
{"title":"Towards onboard thermal hotspots segmentation with raw multispectral satellite imagery","authors":"Cristopher Castro Traba ,&nbsp;David Rijlaarsdam ,&nbsp;Jian Guo ,&nbsp;Roberto Del Prete ,&nbsp;Gabriele Meoni","doi":"10.1016/j.jag.2026.105095","DOIUrl":"10.1016/j.jag.2026.105095","url":null,"abstract":"<div><div>The rapid spread and destructive nature of wildfires and volcanic activity have intensified the need for low latency detection systems. The growing intensity and frequency of globally distributed thermal hotspots have driven the development of satellite-based detection solutions. Conventional approaches rely on ground-based processing, which limits low latency capabilities due to revisit times over ground stations and data handling requirements. This work proposes the first onboard payload processing pipeline for segmentation of thermal hotspots in raw multispectral satellite imagery. The pipeline leverages the Near InfraRed (NIR) and Short-Wave InfraRed (SWIR) spectral bands, and the combination of onboard Artificial Intelligence (AI) and raw imagery significantly reduces the delay between image acquisition and event detection. Furthermore, we present Segmentation of Thermal Hotspots in Raw Sentinel-2 data (SegTHRawS), the first publicly available dataset for thermal hotspot segmentation in raw multispectral satellite imagery. The segmentation model employed is a Fully Convolutional Network (FCN) derived from U-Net, named ResUnet-S2, designed for fast on-device inference. This model achieved an Intersection over Union (IoU) of 0.988 and an F-1 score of 0.986 on SegTHRawS, with its detection and generalization capabilities validated using an external thermal hotspot segmentation dataset. The proposed pipeline was verified on CubeSat-compatible hardware, achieving an end-to-end execution, from image acquisition to event detection, in 1.45 s, faster than the image acquisition process, and consuming a peak power of 4.05 W. These results demonstrate the potential of onboard processing solutions for minimizing the detection latency of current approaches, particularly for thermal hotspot segmentation, using edge computing satellite hardware.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105095"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geospatial machine learning model for limestone suitability assessment 石灰岩适宜性评价的地理空间机器学习模型
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2026-01-06 DOI: 10.1016/j.jag.2025.105055
Idowu Jane Bada , Michael Adeyinka Oladunjoye , Moruffdeen Adedapo Adabanija
Accurate assessment of limestone quality and investment potential requires advanced techniques due to spatial variability in thickness and geochemical composition which traditional exploration methods cannot capture efficiently. This study integrates Machine Learning (ML) and Geographic Information Systems (GIS) to optimize limestone exploration. The analysis used limestone and overburden thickness, X-ray Fluorescence (XRF), and Atomic Absorption Spectroscopy (AAS) data from 23 core samples with multiple ML classifiers: Decision Tree (DT), Logistic Regression (LR), XGBoost, Support Vector Machine (SVM), and Random Forest (RF). Predictive ability was evaluated using Accuracy, F1 Scores, Precision-Recall (P-R) curves, Area under P-R curves (AUC-PR), and Feature importance Analysis. Features were validated using a non-parametric approach. Predicted datasets of the selected classifier were subjected to limestone classification criteria and integrated into a GIS to generate predictive Limestone Suitability Index (LSI) map. DT and LR models showed 100 % accuracy, XGBoost performed poorly at 60 %, and SVM and RF had moderate performance (80 %). The F1-scores of 1.00 for LR and DT, 0.71 for SVM and RF, and 0.45 for XGBoost indicate prediction reliability differences. RF and SVM achieved balanced precision-recall (0.65–0.80), with RF attaining a higher AUC_PR (0.871) than SVM (0.643). The non-parametric validation of the features identified RF as most suitable. The LSI map based on RF outputs, categorized the area into high, medium, and low potential zones with high potential zones characterized by thick, CaO rich limestone beds (16.0–34.1 m, CaO ≥ 50 %, SiO2 ≤ 8 %). This made ML, specifically, RF an essential tool for limestone resource evaluation.
由于石灰石厚度和地球化学成分的空间变异性,传统的勘探方法无法有效地捕获,因此准确评估石灰石的质量和投资潜力需要先进的技术。该研究整合了机器学习(ML)和地理信息系统(GIS)来优化石灰岩勘探。分析使用石灰石和覆盖层厚度、x射线荧光(XRF)和原子吸收光谱(AAS)数据,来自23个岩心样本,使用多个ML分类器:决策树(DT)、逻辑回归(LR)、XGBoost、支持向量机(SVM)和随机森林(RF)。预测能力评估采用准确性、F1评分、精确召回率(P-R)曲线、P-R曲线下面积(AUC-PR)和特征重要性分析。使用非参数方法验证特征。所选分类器的预测数据集将受到石灰石分类标准的影响,并集成到GIS中以生成预测石灰石适宜性指数(LSI)地图。DT和LR模型的准确率为100%,XGBoost模型的准确率为60%,而SVM和RF模型的准确率为80%。LR和DT的f1得分为1.00,SVM和RF的f1得分为0.71,XGBoost的f1得分为0.45,表明预测可靠性存在差异。RF和SVM的precision-recall达到平衡(0.65 ~ 0.80),其中RF的AUC_PR(0.871)高于SVM(0.643)。特征的非参数验证确定RF是最合适的。基于RF输出的LSI地图将该区域划分为高、中、低电位区,其中高电位区以厚的、富含CaO的石灰岩层(16.0 ~ 34.1 m, CaO≥50%,SiO2≤8%)为特征。这使得ML,特别是RF成为石灰石资源评估的重要工具。
{"title":"Geospatial machine learning model for limestone suitability assessment","authors":"Idowu Jane Bada ,&nbsp;Michael Adeyinka Oladunjoye ,&nbsp;Moruffdeen Adedapo Adabanija","doi":"10.1016/j.jag.2025.105055","DOIUrl":"10.1016/j.jag.2025.105055","url":null,"abstract":"<div><div>Accurate assessment of limestone quality and investment potential requires advanced techniques due to spatial variability in thickness and geochemical composition which traditional exploration methods cannot capture efficiently. This study integrates Machine Learning (ML) and Geographic Information Systems (GIS) to optimize limestone exploration. The analysis used limestone and overburden thickness, X-ray Fluorescence (XRF), and Atomic Absorption Spectroscopy (AAS) data from 23 core samples with multiple ML classifiers: Decision Tree (DT), Logistic Regression (LR), XGBoost, Support Vector Machine (SVM), and Random Forest (RF). Predictive ability was evaluated using Accuracy, F1 Scores, Precision-Recall (P-R) curves, Area under P-R curves (AUC-PR), and Feature importance Analysis. Features were validated using a non-parametric approach. Predicted datasets of the selected classifier were subjected to limestone classification criteria and integrated into a GIS to generate predictive Limestone Suitability Index (LSI) map. DT and LR models showed 100 % accuracy, XGBoost performed poorly at 60 %, and SVM and RF had moderate performance (80 %). The F1-scores of 1.00 for LR and DT, 0.71 for SVM and RF, and 0.45 for XGBoost indicate prediction reliability differences. RF and SVM achieved balanced precision-recall (0.65–0.80), with RF attaining a higher AUC_PR (0.871) than SVM (0.643). The non-parametric validation of the features identified RF as most suitable. The LSI map based on RF outputs, categorized the area into high, medium, and low potential zones with high potential zones characterized by thick, CaO rich limestone beds (16.0–34.1 m, CaO ≥ 50 %, SiO2 ≤ 8 %). This made ML, specifically, RF an essential tool for limestone resource<!--> <!-->evaluation.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105055"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Few-Shot change detection in optical and SAR remote sensing images for disaster response 面向灾害响应的光学和SAR遥感影像少镜头变化检测
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2026-01-12 DOI: 10.1016/j.jag.2026.105100
Di Wang , Guorui Ma , Xiao Wang , Ronghao Yang , Yongxian Zhang
Few-shot change detection in optical and Synthetic Aperture Radar images is a critical task for disaster monitoring. offering significant application value in complex scenarios with extremely limited labeled samples. However, the randomness of disasters causes a notable data distribution shift between public datasets and real disaster scenarios. With only a few annotated image pairs, existing methods struggle to effectively fuse features from heterogeneous images, leading to severe performance degradation. To address this challenge, we propose a Dual-Stage Training framework for Change Detection (DSTCD), specifically designed for few-shot scenarios involving fewer than 20 labeled image pairs. DSTCD first undergoes source task pre-training on a heterogeneous image registration dataset. Subsequently, in the target task stage, it leverages task guided feature transfer module to transfer the structural and semantic features of image registration to the change detection task. This mechanism significantly enriches the feature representations under few-shot conditions, enabling accurate identification of affected areas. To validate its performance, we conducted comparative and ablation studies against eleven state-of-the-art methods on four public datasets covering both urban expansion and water expansion scenarios. Experimental results demonstrate that DSTCD achieves a significant performance lead. Its average F1-score surpasses the second-best method by 6.98% in urban expansion scenarios and by 13.09% in water expansion scenarios, proving its superior performance and strong multi-scenario adaptability. Furthermore, robustness analysis of varying training sample sizes and real-world disaster application validation further confirm the method’s practicality and robustness for data-scarce disaster monitoring tasks. The code of the proposed method will be made available at https://github.com/Lucky-DW/DSTCD.
光学和合成孔径雷达图像的少镜头变化检测是灾害监测的关键任务。在极其有限的标记样本的复杂场景中提供重要的应用价值。然而,灾害的随机性导致公共数据集和真实灾害场景之间的数据分布发生了明显的变化。由于只有少数带注释的图像对,现有方法难以有效地融合异构图像的特征,导致性能严重下降。为了解决这一挑战,我们提出了一个双阶段变化检测训练框架(DSTCD),专门为涉及少于20个标记图像对的少量场景设计。DSTCD首先在异构图像配准数据集上进行源任务预训练。随后,在目标任务阶段,利用任务导向特征转移模块将图像配准的结构特征和语义特征转移到变化检测任务中。这一机制大大丰富了少镜头条件下的特征表示,使受影响区域的准确识别成为可能。为了验证其性能,我们在四个公共数据集上对11种最先进的方法进行了比较和消融研究,包括城市扩张和水扩张情景。实验结果表明,DSTCD具有明显的性能领先优势。该方法在城市扩张情景下的平均f1得分比次优方法高6.98%,在水域扩张情景下的平均f1得分比次优方法高13.09%,证明了其优越的性能和较强的多情景适应性。此外,对不同训练样本大小的鲁棒性分析和实际灾难应用验证进一步证实了该方法对于数据稀缺的灾害监测任务的实用性和鲁棒性。建议方法的代码将在https://github.com/Lucky-DW/DSTCD上提供。
{"title":"Few-Shot change detection in optical and SAR remote sensing images for disaster response","authors":"Di Wang ,&nbsp;Guorui Ma ,&nbsp;Xiao Wang ,&nbsp;Ronghao Yang ,&nbsp;Yongxian Zhang","doi":"10.1016/j.jag.2026.105100","DOIUrl":"10.1016/j.jag.2026.105100","url":null,"abstract":"<div><div>Few-shot change detection in optical and Synthetic Aperture Radar images is a critical task for disaster monitoring. offering significant application value in complex scenarios with extremely limited labeled samples. However, the randomness of disasters causes a notable data distribution shift between public datasets and real disaster scenarios. With only a few annotated image pairs, existing methods struggle to effectively fuse features from heterogeneous images, leading to severe performance degradation. To address this challenge, we propose a Dual-Stage Training framework for Change Detection (DSTCD), specifically designed for few-shot scenarios involving fewer than 20 labeled image pairs. DSTCD first undergoes source task pre-training on a heterogeneous image registration dataset. Subsequently, in the target task stage, it leverages task guided feature transfer module to transfer the structural and semantic features of image registration to the change detection task. This mechanism significantly enriches the feature representations under few-shot conditions, enabling accurate identification of affected areas. To validate its performance, we conducted comparative and ablation studies against eleven state-of-the-art methods on four public datasets covering both urban expansion and water expansion scenarios. Experimental results demonstrate that DSTCD achieves a significant performance lead. Its average F1-score surpasses the second-best method by 6.98% in urban expansion scenarios and by 13.09% in water expansion scenarios, proving its superior performance and strong multi-scenario adaptability. Furthermore, robustness analysis of varying training sample sizes and real-world disaster application validation further confirm the method’s practicality and robustness for data-scarce disaster monitoring tasks. The code of the proposed method will be made available at <span><span>https://github.com/Lucky-DW/DSTCD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105100"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145957348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High–low frequency feature fusion network for pavement crack segmentation in complex environments 复杂环境下路面裂缝分割的高低频特征融合网络
IF 8.6 Q1 REMOTE SENSING Pub Date : 2026-02-01 Epub Date: 2025-12-18 DOI: 10.1016/j.jag.2025.105048
Huazhong Jin, Guofeng Wu, Yi Cen
For routine road maintenance, accurate and efficient crack detection holds significant research value and practical importance. With advances in computer vision, image processing-based crack detection has achieved promising results. However, in complex environments, factors such as illumination and weather introduce noise interference, especially in high-resolution crack images where fine fracture features show heightened complexity and diversity. This often leads to degraded detection accuracy in such challenging scenarios. To address this, we propose FE-SegNeXt, a convolutional neural network segmentation framework tailored for high-resolution crack detection in complex environments. The algorithm leverages distinct characteristics of high- and low-frequency features in crack images: average pooling separates these features, while a dedicated Frequency Collaborative Enhancement Module (FCEM) independently enhances high- and low-frequency components before fusing multi-band information. This design significantly improves the model’s ability to extract subtle cracks from high-resolution images under noisy conditions. Additionally, we introduce a Locally Enhanced Feed-Forward Network (LE-FFN) to amplify perception of weak crack signals in local regions, further refining fine-grained feature extraction. Experimental results on the public datasets Sun520, Rain365, and BJN260 demonstrate that the proposed method achieves F1-scores of 61.55%, 62.57%, and 60.31%, respectively, outperforming existing crack detection algorithms.
对于日常道路养护而言,准确、高效的裂缝检测具有重要的研究价值和实际意义。随着计算机视觉技术的发展,基于图像处理的裂纹检测已经取得了可喜的成果。然而,在复杂的环境中,光照和天气等因素会引入噪声干扰,特别是在高分辨率裂缝图像中,精细裂缝特征显示出更高的复杂性和多样性。在这种具有挑战性的情况下,这通常会导致检测精度下降。为了解决这个问题,我们提出了FE-SegNeXt,这是一种为复杂环境中的高分辨率裂纹检测量身定制的卷积神经网络分割框架。该算法利用了裂缝图像中高频和低频特征的不同特征:平均池化分离这些特征,而专用的频率协同增强模块(FCEM)在融合多波段信息之前独立增强高低频成分。这种设计显著提高了模型在噪声条件下从高分辨率图像中提取细微裂纹的能力。此外,我们引入了局部增强前馈网络(LE-FFN)来放大局部区域的弱裂纹信号感知,进一步细化细粒度特征提取。在公共数据集Sun520、Rain365和BJN260上的实验结果表明,该方法的f1得分分别为61.55%、62.57%和60.31%,优于现有的裂纹检测算法。
{"title":"High–low frequency feature fusion network for pavement crack segmentation in complex environments","authors":"Huazhong Jin,&nbsp;Guofeng Wu,&nbsp;Yi Cen","doi":"10.1016/j.jag.2025.105048","DOIUrl":"10.1016/j.jag.2025.105048","url":null,"abstract":"<div><div>For routine road maintenance, accurate and efficient crack detection holds significant research value and practical importance. With advances in computer vision, image processing-based crack detection has achieved promising results. However, in complex environments, factors such as illumination and weather introduce noise interference, especially in high-resolution crack images where fine fracture features show heightened complexity and diversity. This often leads to degraded detection accuracy in such challenging scenarios. To address this, we propose FE-SegNeXt, a convolutional neural network segmentation framework tailored for high-resolution crack detection in complex environments. The algorithm leverages distinct characteristics of high- and low-frequency features in crack images: average pooling separates these features, while a dedicated Frequency Collaborative Enhancement Module (FCEM) independently enhances high- and low-frequency components before fusing multi-band information. This design significantly improves the model’s ability to extract subtle cracks from high-resolution images under noisy conditions. Additionally, we introduce a Locally Enhanced Feed-Forward Network (LE-FFN) to amplify perception of weak crack signals in local regions, further refining fine-grained feature extraction. Experimental results on the public datasets Sun520, Rain365, and BJN260 demonstrate that the proposed method achieves F1-scores of 61.55%, 62.57%, and 60.31%, respectively, outperforming existing crack detection algorithms.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105048"},"PeriodicalIF":8.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International journal of applied earth observation and geoinformation : ITC journal
全部 Geobiology Appl. Clay Sci. Geochim. Cosmochim. Acta J. Hydrol. Org. Geochem. Carbon Balance Manage. Contrib. Mineral. Petrol. Int. J. Biometeorol. IZV-PHYS SOLID EART+ J. Atmos. Chem. Acta Oceanolog. Sin. Acta Geophys. ACTA GEOL POL ACTA PETROL SIN ACTA GEOL SIN-ENGL AAPG Bull. Acta Geochimica Adv. Atmos. Sci. Adv. Meteorol. Am. J. Phys. Anthropol. Am. J. Sci. Am. Mineral. Annu. Rev. Earth Planet. Sci. Appl. Geochem. Aquat. Geochem. Ann. Glaciol. Archaeol. Anthropol. Sci. ARCHAEOMETRY ARCT ANTARCT ALP RES Asia-Pac. J. Atmos. Sci. ATMOSPHERE-BASEL Atmos. Res. Aust. J. Earth Sci. Atmos. Chem. Phys. Atmos. Meas. Tech. Basin Res. Big Earth Data BIOGEOSCIENCES Geostand. Geoanal. Res. GEOLOGY Geosci. J. Geochem. J. Geochem. Trans. Geosci. Front. Geol. Ore Deposits Global Biogeochem. Cycles Gondwana Res. Geochem. Int. Geol. J. Geophys. Prospect. Geosci. Model Dev. GEOL BELG GROUNDWATER Hydrogeol. J. Hydrol. Earth Syst. Sci. Hydrol. Processes Int. J. Climatol. Int. J. Earth Sci. Int. Geol. Rev. Int. J. Disaster Risk Reduct. Int. J. Geomech. Int. J. Geog. Inf. Sci. Isl. Arc J. Afr. Earth. Sci. J. Adv. Model. Earth Syst. J APPL METEOROL CLIM J. Atmos. Oceanic Technol. J. Atmos. Sol. Terr. Phys. J. Clim. J. Earth Sci. J. Earth Syst. Sci. J. Environ. Eng. Geophys. J. Geog. Sci. Mineral. Mag. Miner. Deposita Mon. Weather Rev. Nat. Hazards Earth Syst. Sci. Nat. Clim. Change Nat. Geosci. Ocean Dyn. Ocean and Coastal Research npj Clim. Atmos. Sci. Ocean Modell. Ocean Sci. Ore Geol. Rev. OCEAN SCI J Paleontol. J. PALAEOGEOGR PALAEOCL PERIOD MINERAL PETROLOGY+ Phys. Chem. Miner. Polar Sci. Prog. Oceanogr. Quat. Sci. Rev. Q. J. Eng. Geol. Hydrogeol. RADIOCARBON Pure Appl. Geophys. Resour. Geol. Rev. Geophys. Sediment. Geol.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1