首页 > 最新文献

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing最新文献

英文 中文
In-Season Sugarcane Mapping in the U.S. and Brazil Using Time-Invariant Phenological Features 在美国和巴西使用时不变物候特征的当季甘蔗制图
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/JSTARS.2026.3657381
Hui Li;Liping Di;Ruixin Yang;John J. Qu;Daniel Q. Tong;Liying Guo;Eugene G. Yu;Ziao Liu;Bosen Shao;Gavin Middleton
As the world’s largest sugar crop by production volume, sugarcane is a major economic resource used for both food and industrial materials worldwide. Timely sugarcane cultivation mapping will provide spatial distribution information on sugarcane fields and support further decision-making. Traditional time-series mapping using machine learning and remote sensing faces challenges due to limited training data across distant producers and heterogeneous growth seasons. To address these, an automated phenology-based transfer learning workflow was developed, improving upon our previous study, to produce in-season sugarcane maps for the U.S. and Brazil only using historical U.S. training data. Time-series Sentinel-2 multispectral data were transformed to time-invariant phenological features via Linear Cosine Regression, integrating high-confidence labels from the U.S. Cropland Data Layer (CDL) to develop random forest classifiers. Classifications were further optimized using the Product of Absolute Difference and majority filters. The workflow produced four-monthly sugarcane maps (June–September 2023) for Lafourche County, Louisiana, U.S., achieving 0.938 overall accuracy and 0.934 F1 score, surpassing CDL2023’s 0.894, and reducing nonsugarcane regions’ average misclassification rate to 0.18%, validated by ground truth data. Similarly, five-monthly sugarcane maps (March–July 2022) for São Paulo, Brazil, attained 0.881 overall accuracy and 0.911 F1 score in May 2022, validated by the European Space Agency Global Crop Type Validation Data Set. This study demonstrates the potential of using time-invariant phenological features for in-season sugarcane mapping across geographically distant nations and different time periods.
作为世界上产量最大的糖作物,甘蔗是世界范围内用于食品和工业原料的主要经济资源。及时绘制甘蔗种植地图将提供甘蔗田的空间分布信息,为进一步决策提供支持。由于远程生产者的培训数据有限和生长季节不均匀,使用机器学习和遥感的传统时间序列制图面临挑战。为了解决这些问题,我们开发了一个基于物候的自动化迁移学习工作流程,改进了我们之前的研究,仅使用美国的历史训练数据为美国和巴西制作当季甘蔗地图。通过线性余弦回归将时间序列Sentinel-2多光谱数据转换为时不变物候特征,并整合来自美国农田数据层(CDL)的高置信度标签,开发随机森林分类器。采用绝对差积和多数滤波器进一步优化分类。该工作流为美国路易斯安那州Lafourche县制作了四个月的甘蔗地图(2023年6月至9月),总体准确率为0.938,F1得分为0.934,超过CDL2023的0.894,并将非甘蔗地区的平均误分类率降低至0.18%,经实地数据验证。同样,经过欧洲航天局全球作物类型验证数据集验证,巴西圣保罗的五个月甘蔗地图(2022年3月至7月)在2022年5月获得了0.881的总体精度和0.911的F1分数。这项研究证明了使用时不变物候特征在地理遥远的国家和不同的时间段进行当季甘蔗制图的潜力。
{"title":"In-Season Sugarcane Mapping in the U.S. and Brazil Using Time-Invariant Phenological Features","authors":"Hui Li;Liping Di;Ruixin Yang;John J. Qu;Daniel Q. Tong;Liying Guo;Eugene G. Yu;Ziao Liu;Bosen Shao;Gavin Middleton","doi":"10.1109/JSTARS.2026.3657381","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3657381","url":null,"abstract":"As the world’s largest sugar crop by production volume, sugarcane is a major economic resource used for both food and industrial materials worldwide. Timely sugarcane cultivation mapping will provide spatial distribution information on sugarcane fields and support further decision-making. Traditional time-series mapping using machine learning and remote sensing faces challenges due to limited training data across distant producers and heterogeneous growth seasons. To address these, an automated phenology-based transfer learning workflow was developed, improving upon our previous study, to produce in-season sugarcane maps for the U.S. and Brazil only using historical U.S. training data. Time-series Sentinel-2 multispectral data were transformed to time-invariant phenological features via Linear Cosine Regression, integrating high-confidence labels from the U.S. Cropland Data Layer (CDL) to develop random forest classifiers. Classifications were further optimized using the Product of Absolute Difference and majority filters. The workflow produced four-monthly sugarcane maps (June–September 2023) for Lafourche County, Louisiana, U.S., achieving 0.938 overall accuracy and 0.934 F1 score, surpassing CDL2023’s 0.894, and reducing nonsugarcane regions’ average misclassification rate to 0.18%, validated by ground truth data. Similarly, five-monthly sugarcane maps (March–July 2022) for São Paulo, Brazil, attained 0.881 overall accuracy and 0.911 F1 score in May 2022, validated by the European Space Agency Global Crop Type Validation Data Set. This study demonstrates the potential of using time-invariant phenological features for in-season sugarcane mapping across geographically distant nations and different time periods.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5814-5833"},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362946","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Model for Land Cover Ecological State Assessment of Southern Ukraine Based on Remote Data for Analyzing the Consequences of the Kakhovka Reservoir Shallowing 基于远程数据的乌克兰南部土地覆盖生态状态评价模型——基于Kakhovka水库浅化后果分析
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/JSTARS.2026.3657506
Bohdan Yailymov;Liudmyla Pidgorodetska;Liudmyla Kolos;Oleh Fedorov;Hanna Yailymova
To develop scenarios for postwar reconstruction of Ukraine, in particular the measures to stabilize the functioning of war-damaged territories, it is necessary to assess the land cover ecological state based on comprehensive system monitoring. Currently, there are no comprehensive quantitative assessments that combine multiparametric analysis of physical, hydrological properties of the surface soil layer and land cover parameters using high spatial resolution satellite data. The purpose of this article is to create a model for assessing land cover ecological state of southern Ukraine based on satellite data with high spatial resolution (10 m) to analyze the consequences of the Kakhovka Reservoir shallowing. To form an integrated ecological state index (ESI), the principal component analysis method is used for four main indicators: normalized difference vegetation index, land surface temperature, moisture deficit in the surface soil layer, and drought severity index . Sentinel, Landsat and MODIS satellite data, as well as machine learning methods, are used to calculate the values of the indicators. The ESI allows for a quantitative and qualitative assessment of the land cover state. Analysis of long-term ESI data showed a significant deterioration in the ecological state of the studied territory. The average index value decreased by 17.3% over 2019–2024, with the sharpest deterioration observed in 2022–2024 (a decrease of 15.5%). Qualitative analysis revealed that the area of favorable zones decreased by almost half (–49%), while the area of problem zones increased by 59%. If in 2019–2022 the distribution of zones remained relatively stable (the share of problem zones increased only from 33.9%to 34.8%), then in 2022–2024 there were sharp changes (an increase to 53.9%), likely caused by increased drought conditions and the consequences of the destruction of the Kakhovka HPP. The developed model with 10 m resolution enables ecological state assessment, land degradation detection and restoration planning, irrigation optimization in water-scarce conditions, evaluation of crop production potential and food security, monitoring of territories where ground surveys are impossible, and evidence-based support for land management and postwar recovery.
为了制定乌克兰战后重建方案,特别是稳定战争破坏领土功能的措施,有必要在综合系统监测的基础上评估土地覆盖生态状态。目前,还没有利用高空间分辨率卫星数据结合多参数分析表层土壤物理、水文特性和土地覆盖参数的综合定量评估。本文的目的是基于高空间分辨率(10米)的卫星数据创建一个评估乌克兰南部土地覆盖生态状态的模型,以分析Kakhovka水库变浅的后果。为形成综合生态状态指数(ESI),采用主成分分析法对归一化植被差指数、地表温度、表层土壤水分亏缺指数和干旱严重程度指数4个主要指标进行分析。使用Sentinel、Landsat和MODIS卫星数据以及机器学习方法来计算指标的值。ESI允许对土地覆盖状态进行定量和定性评估。长期ESI数据分析表明,研究区生态状况明显恶化。平均指数值比2019-2024年下降了17.3%,其中恶化最严重的是2022-2024年,下降了15.5%。定性分析显示,有利区域的面积减少了近一半(-49%),而问题区域的面积增加了59%。如果在2019-2022年区域分布保持相对稳定(问题区域的比例仅从33.9%增加到34.8%),那么在2022-2024年发生急剧变化(增加到53.9%),可能是由于干旱条件增加和卡霍夫卡HPP破坏的后果。开发的10米分辨率模型可用于生态状态评估、土地退化检测和恢复规划、缺水条件下的灌溉优化、作物生产潜力和粮食安全评估、无法进行地面调查的地区监测,以及为土地管理和战后恢复提供循证支持。
{"title":"A Model for Land Cover Ecological State Assessment of Southern Ukraine Based on Remote Data for Analyzing the Consequences of the Kakhovka Reservoir Shallowing","authors":"Bohdan Yailymov;Liudmyla Pidgorodetska;Liudmyla Kolos;Oleh Fedorov;Hanna Yailymova","doi":"10.1109/JSTARS.2026.3657506","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3657506","url":null,"abstract":"To develop scenarios for postwar reconstruction of Ukraine, in particular the measures to stabilize the functioning of war-damaged territories, it is necessary to assess the land cover ecological state based on comprehensive system monitoring. Currently, there are no comprehensive quantitative assessments that combine multiparametric analysis of physical, hydrological properties of the surface soil layer and land cover parameters using high spatial resolution satellite data. The purpose of this article is to create a model for assessing land cover ecological state of southern Ukraine based on satellite data with high spatial resolution (10 m) to analyze the consequences of the Kakhovka Reservoir shallowing. To form an integrated ecological state index (ESI), the principal component analysis method is used for four main indicators: normalized difference vegetation index, land surface temperature, moisture deficit in the surface soil layer, and drought severity index . Sentinel, Landsat and MODIS satellite data, as well as machine learning methods, are used to calculate the values of the indicators. The ESI allows for a quantitative and qualitative assessment of the land cover state. Analysis of long-term ESI data showed a significant deterioration in the ecological state of the studied territory. The average index value decreased by 17.3% over 2019–2024, with the sharpest deterioration observed in 2022–2024 (a decrease of 15.5%). Qualitative analysis revealed that the area of favorable zones decreased by almost half (–49%), while the area of problem zones increased by 59%. If in 2019–2022 the distribution of zones remained relatively stable (the share of problem zones increased only from 33.9%to 34.8%), then in 2022–2024 there were sharp changes (an increase to 53.9%), likely caused by increased drought conditions and the consequences of the destruction of the Kakhovka HPP. The developed model with 10 m resolution enables ecological state assessment, land degradation detection and restoration planning, irrigation optimization in water-scarce conditions, evaluation of crop production potential and food security, monitoring of territories where ground surveys are impossible, and evidence-based support for land management and postwar recovery.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5552-5567"},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362969","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PASG-Net:Spatial-Guided Frequency Compensation Polarized Attention Fusion Network for Hyperspectral and Multispectral Image Fusion PASG-Net:用于高光谱和多光谱图像融合的空间制导频率补偿极化注意融合网络
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/JSTARS.2026.3657496
Yan Mou;Zheng Chen;Jinjiang Li
Hyperspectral and multispectral image fusion aims to integrate the complementary characteristics of both modalities to reconstruct high spatial-resolution hyperspectral images (HR-HSI). In recent years, joint modeling in the spatial and frequency domains has become an effective strategy for enhancing fusion performance. However, existing methods still exhibit limitations in extracting spatial–frequency features and achieving complete and efficient integration of complementary information, which often leads to fused images that fail to maintain spatial–spectral consistency. To overcome these challenges, this article proposes a spatial-guided frequency compensation polarized attention fusion network (PASG-Net), achieving HR-HSI reconstruction by integrating spatial-domain and frequency-domain features. Specifically, the grouped spatial feature extraction module employs grouped dense residual learning to capture local features in the spatial domain. The spatial-guided frequency compensation module is designed based on the observation of “phase similarity and magnitude complementarity,” utilizing spatial priors to generate dynamic weights, achieving magnitude fusion and phase fine-tuning to capture comprehensive global frequency-domain features. The symmetric polarized cross-attention module introduces polarized linear cross-attention; explicit positive–negative polarity modeling is added to linear attention, effectively integrating complementary information from both domains while maintaining low computational complexity. Extensive experiments demonstrate that the proposed PASG-Net outperforms the current State-of-the-Art methods.
高光谱和多光谱图像融合的目的是融合两种模式的互补特性,重建高空间分辨率高光谱图像(HR-HSI)。近年来,空间和频域联合建模已成为提高融合性能的有效策略。然而,现有的方法在提取空间频率特征和实现互补信息的完整有效整合方面仍然存在局限性,这往往导致融合后的图像不能保持空间光谱一致性。为了克服这些挑战,本文提出了一种空间引导频率补偿极化注意融合网络(PASG-Net),通过整合空间域和频域特征实现HR-HSI重构。具体来说,分组空间特征提取模块采用分组密集残差学习来捕获空间域中的局部特征。空间制导频率补偿模块的设计基于“相位相似和幅度互补”的观察,利用空间先验产生动态权重,实现幅度融合和相位微调,以捕获全面的全球频域特征。对称极化交叉注意模块引入极化线性交叉注意;将显式正负极性建模添加到线性关注中,有效地集成了两个领域的互补信息,同时保持了较低的计算复杂度。大量的实验表明,所提出的PASG-Net优于目前最先进的方法。
{"title":"PASG-Net:Spatial-Guided Frequency Compensation Polarized Attention Fusion Network for Hyperspectral and Multispectral Image Fusion","authors":"Yan Mou;Zheng Chen;Jinjiang Li","doi":"10.1109/JSTARS.2026.3657496","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3657496","url":null,"abstract":"Hyperspectral and multispectral image fusion aims to integrate the complementary characteristics of both modalities to reconstruct high spatial-resolution hyperspectral images (HR-HSI). In recent years, joint modeling in the spatial and frequency domains has become an effective strategy for enhancing fusion performance. However, existing methods still exhibit limitations in extracting spatial–frequency features and achieving complete and efficient integration of complementary information, which often leads to fused images that fail to maintain spatial–spectral consistency. To overcome these challenges, this article proposes a spatial-guided frequency compensation polarized attention fusion network (PASG-Net), achieving HR-HSI reconstruction by integrating spatial-domain and frequency-domain features. Specifically, the grouped spatial feature extraction module employs grouped dense residual learning to capture local features in the spatial domain. The spatial-guided frequency compensation module is designed based on the observation of “phase similarity and magnitude complementarity,” utilizing spatial priors to generate dynamic weights, achieving magnitude fusion and phase fine-tuning to capture comprehensive global frequency-domain features. The symmetric polarized cross-attention module introduces polarized linear cross-attention; explicit positive–negative polarity modeling is added to linear attention, effectively integrating complementary information from both domains while maintaining low computational complexity. Extensive experiments demonstrate that the proposed PASG-Net outperforms the current State-of-the-Art methods.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5695-5709"},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362981","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterizing Short-Duration Rainfall Events on the Tibetan Plateau Based on Ground Observation, Satellite Remote Sensing, and Reanalysis Data 基于地面观测、卫星遥感和再分析资料的青藏高原短时降水特征
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/JSTARS.2026.3656905
Xiaoyan Ling;Yingying Chen;Kun Yang;Run Han;Lazhu>;Xu Zhou;Xin Li;Changhui Zhan;Yaozhi Jiang;Jiaxin Tian;Yan Wang;Ming Chen
Given the vast expanses of bare soil and sparse grassland in the central and western Tibetan Plateau (CWTP), the duration of precipitation serves as a critical control on key hydrological and geomorphological responses, such as runoff generation, soil erosion, and sediment transport. This study analyzes the spatial distribution of precipitation events with different durations (short-duration, medium-duration, and long-duration) across the Tibetan Plateau (TP). To explore the differences in precipitation duration between the central-western and eastern TP, we utilize hourly precipitation data from a newly established cross-sectional rainfall observation network (53 gauges) on the CWTP and the China Meteorological Administration observation (145 gauges) on the eastern TP for comparison. The main conclusions are as follows: First, short-duration (1–3 h) precipitation events contribute more than 50% of the total rainfall in the CWTP. In contrast, short-duration events contribute less than 30% in the eastern TP, where medium- and long-duration precipitation events dominate. Second, the reanalysis data ECMWF reanalysis 5 and the high-resolution atmospheric simulation data high asia refined analysis version 2 tend to systematically underestimate (overestimate) the contribution of short-duration (long-duration) precipitation events. Specifically, they exhibit mean biases of 23% and 19% for short-duration precipitation, and 34% and 20% for long-duration precipitation, respectively. Third, the satellite remote sensing precipitation data integrated multisatellite retrievals perform well in estimating the contribution of precipitation events, with biases mostly within 30% . By comparing the original and gauge-calibrated satellite datasets, the results show that calibration with coarse temporal resolutions (daily or monthly) does not necessarily improve the identification of short-duration precipitation events. Our results not only enhance the understanding of precipitation characteristics and processes over the TP, but also provide valuable guidance for hydrological modeling and the evaluation and improvement of satellite-based precipitation data.
鉴于青藏高原中西部广阔的裸露土壤和稀疏草地,降水持续时间对产流、土壤侵蚀和输沙等关键水文和地貌响应起着关键的控制作用。研究了青藏高原不同持续时间(短持续、中持续和长持续)降水事件的空间分布特征。为了探讨青藏高原中西部和东部降水持续时间的差异,我们利用青藏高原新建立的53个断面降水观测网和中国气象局青藏高原东部145个断面降水观测网的逐时降水数据进行比较。主要结论如下:第一,短历时(1 ~ 3 h)降水事件对库区总降水的贡献大于50%;相比之下,在青藏高原东部,短持续时间降水事件的贡献率不到30%,而中、长持续时间降水事件占主导地位。(2)再分析资料ECMWF再分析5和高分辨率大气模拟资料高亚洲精细化分析版本2有系统低估(高估)短时(长时)降水事件贡献的倾向。具体来说,它们对短时间降水的平均偏差分别为23%和19%,对长时间降水的平均偏差分别为34%和20%。③卫星遥感降水数据综合多星检索在估算降水事件贡献方面表现较好,偏差大多在30%以内。通过对原始卫星数据集和标准校准卫星数据集的比较,结果表明,粗时间分辨率(日或月)的校准不一定能提高对短持续时间降水事件的识别。研究结果不仅增强了对青藏高原降水特征和过程的认识,而且为水文建模和卫星降水数据的评价和改进提供了有价值的指导。
{"title":"Characterizing Short-Duration Rainfall Events on the Tibetan Plateau Based on Ground Observation, Satellite Remote Sensing, and Reanalysis Data","authors":"Xiaoyan Ling;Yingying Chen;Kun Yang;Run Han;Lazhu>;Xu Zhou;Xin Li;Changhui Zhan;Yaozhi Jiang;Jiaxin Tian;Yan Wang;Ming Chen","doi":"10.1109/JSTARS.2026.3656905","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656905","url":null,"abstract":"Given the vast expanses of bare soil and sparse grassland in the central and western Tibetan Plateau (CWTP), the duration of precipitation serves as a critical control on key hydrological and geomorphological responses, such as runoff generation, soil erosion, and sediment transport. This study analyzes the spatial distribution of precipitation events with different durations (short-duration, medium-duration, and long-duration) across the Tibetan Plateau (TP). To explore the differences in precipitation duration between the central-western and eastern TP, we utilize hourly precipitation data from a newly established cross-sectional rainfall observation network (53 gauges) on the CWTP and the China Meteorological Administration observation (145 gauges) on the eastern TP for comparison. The main conclusions are as follows: First, short-duration (1–3 h) precipitation events contribute more than 50% of the total rainfall in the CWTP. In contrast, short-duration events contribute less than 30% in the eastern TP, where medium- and long-duration precipitation events dominate. Second, the reanalysis data ECMWF reanalysis 5 and the high-resolution atmospheric simulation data high asia refined analysis version 2 tend to systematically underestimate (overestimate) the contribution of short-duration (long-duration) precipitation events. Specifically, they exhibit mean biases of 23% and 19% for short-duration precipitation, and 34% and 20% for long-duration precipitation, respectively. Third, the satellite remote sensing precipitation data integrated multisatellite retrievals perform well in estimating the contribution of precipitation events, with biases mostly within 30% . By comparing the original and gauge-calibrated satellite datasets, the results show that calibration with coarse temporal resolutions (daily or monthly) does not necessarily improve the identification of short-duration precipitation events. Our results not only enhance the understanding of precipitation characteristics and processes over the TP, but also provide valuable guidance for hydrological modeling and the evaluation and improvement of satellite-based precipitation data.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5680-5694"},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11361388","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Geo-Foundational Models for Flood Inundation Mapping: Benchmarking Models for Sentinel-1, Sentinel-2, and Planetscope 评估洪水淹没测绘的地理基础模型:Sentinel-1、Sentinel-2和Planetscope的基准模型
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/JSTARS.2026.3656855
Saurabh Kaushik;Lalit Maurya;Beth Tellman;ZhiJie Zhang
Geo-foundational models (GFMs) enable fast and reliable extraction of spatiotemporal information from satellite imagery, improving flood inundation mapping by leveraging location and time embeddings. Despite their potential, it remains unclear whether GFMs outperform traditional models such as U-Net. A systematic comparison across sensors and data availability scenarios is still lacking, which is essential to guide end-users in model selection. To address this, we evaluate three GFMs—Prithvi 2.0, Clay V1.5, dynamic one-for-all (DOFA)—and UViT (a Prithvi variant), against TransNorm, U-Net, DeepLabv3+, and Attention U-Net using PlanetScope (PS), Sentinel-1, and Sentinel-2. We observe competitive performance among all GFMs, with 2% –5% variation between the best and worst models across sensors. Clay outperforms others on PS (0.79 mIoU) and Sentinel-2 (0.72), while Prithvi leads on Sentinel-1 (0.57). In leave-one-region-out cross-validation across five regions, Clay shows slightly better performance across all sensors, with mIoU scores of 0.72, 0.66, and 0.51 for PanetScope, Sentinel-2, and Sentinel-1 respectively, compared to Prithvi (0.70, 0.64, 0.49) and DOFA (0.67, 0.64, 0.49). Across all 19 sites, cross-validation reveals a 4% improvement by Clay over U-Net. Visual inspection highlights Clay’s superior ability to retain fine details. Few-shot experiments show Clay achieves 0.64 mIoU on PS with just five training images, outperforming Prithvi (0.24) and DOFA (0.35). In terms of computational time, Clay is a better choice due to its smaller model size (26M parameters), making it approximately 3× faster than Prithvi (650M) and 2× faster than DOFA (410M). Our results suggest GFMs offer small to moderate improvements in flood mapping accuracy at lower computational cost and labeling effort.
地理基础模型(GFMs)能够从卫星图像中快速可靠地提取时空信息,通过利用位置和时间嵌入改进洪水淹没制图。尽管有潜力,但gfm是否优于U-Net等传统模式仍不清楚。目前仍然缺乏对传感器和数据可用性场景的系统比较,这对于指导最终用户选择模型至关重要。为了解决这个问题,我们使用PlanetScope (PS)、Sentinel-1和Sentinel-2评估了三种GFMs-Prithvi 2.0、Clay V1.5、动态一对一(DOFA)和UViT (Prithvi的一种变体)与TransNorm、U-Net、DeepLabv3+和Attention U-Net的对比。我们观察了所有GFMs的竞争表现,在不同传感器的最佳和最差模型之间存在2% -5%的差异。Clay在PS (0.79 mIoU)和Sentinel-2 (0.72 mIoU)上的表现优于其他网站,而Prithvi在Sentinel-1 (0.57 mIoU)上领先。在跨5个区域的留一个区域交叉验证中,Clay在所有传感器上的表现略好,PanetScope、Sentinel-2和Sentinel-1的mIoU得分分别为0.72、0.66和0.51,而Prithvi(0.70、0.64、0.49)和DOFA(0.67、0.64、0.49)。在所有19个站点中,交叉验证显示Clay比U-Net提高了4%。目视检查突出了克莱保留细节的卓越能力。少量实验表明Clay在PS上仅用5张训练图像就达到了0.64 mIoU,优于Prithvi(0.24)和DOFA(0.35)。在计算时间方面,Clay是一个更好的选择,因为它的模型尺寸较小(26M参数),比Prithvi (650M)快约3倍,比DOFA (410M)快2倍。我们的研究结果表明,GFMs在较低的计算成本和标记工作下,对洪水测绘精度提供了小到中等程度的改进。
{"title":"Assessing Geo-Foundational Models for Flood Inundation Mapping: Benchmarking Models for Sentinel-1, Sentinel-2, and Planetscope","authors":"Saurabh Kaushik;Lalit Maurya;Beth Tellman;ZhiJie Zhang","doi":"10.1109/JSTARS.2026.3656855","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656855","url":null,"abstract":"Geo-foundational models (GFMs) enable fast and reliable extraction of spatiotemporal information from satellite imagery, improving flood inundation mapping by leveraging location and time embeddings. Despite their potential, it remains unclear whether GFMs outperform traditional models such as U-Net. A systematic comparison across sensors and data availability scenarios is still lacking, which is essential to guide end-users in model selection. To address this, we evaluate three GFMs—Prithvi 2.0, Clay V1.5, dynamic one-for-all (DOFA)—and UViT (a Prithvi variant), against TransNorm, U-Net, DeepLabv3+, and Attention U-Net using PlanetScope (PS), Sentinel-1, and Sentinel-2. We observe competitive performance among all GFMs, with 2% –5% variation between the best and worst models across sensors. Clay outperforms others on PS (0.79 mIoU) and Sentinel-2 (0.72), while Prithvi leads on Sentinel-1 (0.57). In leave-one-region-out cross-validation across five regions, Clay shows slightly better performance across all sensors, with mIoU scores of 0.72, 0.66, and 0.51 for PanetScope, Sentinel-2, and Sentinel-1 respectively, compared to Prithvi (0.70, 0.64, 0.49) and DOFA (0.67, 0.64, 0.49). Across all 19 sites, cross-validation reveals a 4% improvement by Clay over U-Net. Visual inspection highlights Clay’s superior ability to retain fine details. Few-shot experiments show Clay achieves 0.64 mIoU on PS with just five training images, outperforming Prithvi (0.24) and DOFA (0.35). In terms of computational time, Clay is a better choice due to its smaller model size (26M parameters), making it approximately 3× faster than Prithvi (650M) and 2× faster than DOFA (410M). Our results suggest GFMs offer small to moderate improvements in flood mapping accuracy at lower computational cost and labeling effort.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5649-5665"},"PeriodicalIF":5.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362928","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of Building Subsidence Along Suzhou Metro Lines Using Decade-Scale Multitemporal InSAR 基于十年期多时相InSAR的苏州地铁沿线建筑沉降评价
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-22 DOI: 10.1109/JSTARS.2026.3656654
Lina Zhang;Xin Gao;Hongyu Liang;Lei Zhang;Xinyou Song;Jie Chen;Zheng Zhang;Yong Yan;Jianwan Ji
As a major city in eastern China, Suzhou faces significant challenges in metro construction due to its soft soil conditions, as excavations can induce differential settlement in nearby buildings. To assess such impacts, this article employs decade-scale (2009–2019) multitemporal InSAR using high-resolution TerraSAR-X imagery. A customized processing chain—integrating Persistent Scatterer and Small Baseline approaches with optimized phase unwrapping and thermal dilation correction—was applied to extract time-series deformation along Suzhou's metro network. The results show overall stability in central zones, with negligible subsidence within the historic moat area, yet reveal several localized subsidence bowls exceeding 15 mm/year in the suburbs. Time-series analysis correlates settlement of high-rise buildings (> 14 mm/year) with surface loading and soil consolidation. By integrating InSAR-derived displacements with building safety standards and metro engineering thresholds, a risk evaluation framework was developed. Its application to 49 279 structures identified 4 severely at-risk buildings (0.008%), 101 moderately at-risk buildings (0.205%), and 172 mildly at-risk buildings (0.349%). This article establishes a practical risk-assessment protocol and provides the first large-scale visualization of subsidence hazards along Suzhou's metro lines, offering valuable guidance for metro planning, operational safety, and infrastructure preservation.
苏州作为中国东部主要城市,由于其软土条件,在地铁建设中面临着巨大的挑战,因为开挖会引起附近建筑物的差异沉降。为了评估这种影响,本文使用了十年尺度(2009-2019)的多时相InSAR和高分辨率TerraSAR-X图像。采用定制化的处理链,结合持久散射和小基线方法,优化相位展开和热膨胀校正,用于提取苏州地铁网络的时间序列变形。结果表明,中心区域总体稳定,历史护城河区域的沉降可以忽略不计,但郊区有几个局部沉降碗超过15 mm/年。时间序列分析表明,高层建筑的沉降(1 ~ 14 mm/年)与地表荷载和土壤固结有关。通过将insar导出的位移与建筑安全标准和地铁工程阈值相结合,开发了一个风险评估框架。应用该方法对49279座建筑进行分析,确定了4座严重风险建筑(0.008%)、101座中度风险建筑(0.205%)和172座轻度风险建筑(0.349%)。本文建立了一种实用的风险评估方案,首次实现了苏州地铁沿线沉降危害的大规模可视化,为地铁规划、运营安全和基础设施保护提供了有价值的指导。
{"title":"Assessment of Building Subsidence Along Suzhou Metro Lines Using Decade-Scale Multitemporal InSAR","authors":"Lina Zhang;Xin Gao;Hongyu Liang;Lei Zhang;Xinyou Song;Jie Chen;Zheng Zhang;Yong Yan;Jianwan Ji","doi":"10.1109/JSTARS.2026.3656654","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656654","url":null,"abstract":"As a major city in eastern China, Suzhou faces significant challenges in metro construction due to its soft soil conditions, as excavations can induce differential settlement in nearby buildings. To assess such impacts, this article employs decade-scale (2009–2019) multitemporal InSAR using high-resolution TerraSAR-X imagery. A customized processing chain—integrating Persistent Scatterer and Small Baseline approaches with optimized phase unwrapping and thermal dilation correction—was applied to extract time-series deformation along Suzhou's metro network. The results show overall stability in central zones, with negligible subsidence within the historic moat area, yet reveal several localized subsidence bowls exceeding 15 mm/year in the suburbs. Time-series analysis correlates settlement of high-rise buildings (> 14 mm/year) with surface loading and soil consolidation. By integrating InSAR-derived displacements with building safety standards and metro engineering thresholds, a risk evaluation framework was developed. Its application to 49 279 structures identified 4 severely at-risk buildings (0.008%), 101 moderately at-risk buildings (0.205%), and 172 mildly at-risk buildings (0.349%). This article establishes a practical risk-assessment protocol and provides the first large-scale visualization of subsidence hazards along Suzhou's metro lines, offering valuable guidance for metro planning, operational safety, and infrastructure preservation.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5539-5551"},"PeriodicalIF":5.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11361027","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edge-Aware Superpixel Dual-Graph GCN for Topographically Heterogeneous Landslide Susceptibility Assessment 基于边缘感知的超像素双图GCN地形非均质滑坡易感性评价
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-22 DOI: 10.1109/JSTARS.2026.3656940
Changle Li;Yuewei Wang;Feifei Zhang;Hongwei Zhang
Pixel-based landslide susceptibility assessment (LSA) is prone to boundary blurring, salt and pepper artifacts, and unstable generalization in topographically heterogeneous mountains. To address these issues, we propose an edge-aware superpixel graph framework with a dual-graph graph convolutional network. Terrain enhanced simple linear iterative clustering with watershed initialization is employed to generate edge-aware superpixels that better follow geomorphic and engineered boundaries, thereby reducing label noise and boundary leakage. A fused graph is then constructed by combining a spatial adjacency graph with a feature similarity graph based on $k$-nearest neighbors, enabling information propagation among both neighboring and environmentally similar superpixels. Learning under severe class imbalance is stabilized through focal loss with class weighting, spatially blocked data splits, probability calibration, and threshold scanning. Experiments conducted over a remote sensing strip study area, defined by the footprint of a Landsat 9 scene in Shanxi, China, and integrating multisource conditioning factors, show that the proposed method exceeds the mean performance of the compared methods by 11.6% in area under curve (AUC) and 18.1% in $F1$ under identical data and spatial splits. Gains are most pronounced along narrow valley flanks, ridge gully transition zones, and engineered cut slopes, whereas changes are modest on broad, low relief interfluves. The resulting susceptibility maps exhibit clearer slope breaks, reduced speckle, and more contiguous high susceptibility belts, providing interpretable, object level outputs to support planning and hazard mitigation in rugged mountainous terrain.
基于像素的滑坡易感性评价(LSA)在地形异质性山区容易出现边界模糊、盐胡椒伪影和泛化不稳定等问题。为了解决这些问题,我们提出了一个具有双图卷积网络的边缘感知超像素图框架。采用分水岭初始化的地形增强简单线性迭代聚类,生成边缘感知的超像素,更好地遵循地貌和工程边界,从而减少标签噪声和边界泄漏。然后,将空间邻接图与基于k个近邻的特征相似图相结合,构建融合图,实现相邻和环境相似超像素之间的信息传播。严重类失衡下的学习通过类加权、空间阻塞数据分割、概率校准和阈值扫描等方法实现焦点损失的稳定。在以中国山西Landsat 9场景足迹为定义的遥感条带研究区域上进行的实验表明,在相同数据和空间分割的情况下,该方法的曲线下面积(AUC)和$F1$的平均性能分别超过了比较方法的11.6%和18.1%。在狭窄的山谷侧翼、山脊沟壑过渡区和工程切割斜坡上,增益最为明显,而在宽阔的低地势交汇处,变化不大。由此产生的易感性图显示出更清晰的坡折、更少的斑点和更连续的高易感性带,提供可解释的对象级输出,以支持崎岖山区地形的规划和减灾。
{"title":"Edge-Aware Superpixel Dual-Graph GCN for Topographically Heterogeneous Landslide Susceptibility Assessment","authors":"Changle Li;Yuewei Wang;Feifei Zhang;Hongwei Zhang","doi":"10.1109/JSTARS.2026.3656940","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656940","url":null,"abstract":"Pixel-based landslide susceptibility assessment (LSA) is prone to boundary blurring, salt and pepper artifacts, and unstable generalization in topographically heterogeneous mountains. To address these issues, we propose an edge-aware superpixel graph framework with a dual-graph graph convolutional network. Terrain enhanced simple linear iterative clustering with watershed initialization is employed to generate edge-aware superpixels that better follow geomorphic and engineered boundaries, thereby reducing label noise and boundary leakage. A fused graph is then constructed by combining a spatial adjacency graph with a feature similarity graph based on <inline-formula><tex-math>$k$</tex-math></inline-formula>-nearest neighbors, enabling information propagation among both neighboring and environmentally similar superpixels. Learning under severe class imbalance is stabilized through focal loss with class weighting, spatially blocked data splits, probability calibration, and threshold scanning. Experiments conducted over a remote sensing strip study area, defined by the footprint of a Landsat 9 scene in Shanxi, China, and integrating multisource conditioning factors, show that the proposed method exceeds the mean performance of the compared methods by 11.6% in area under curve (AUC) and 18.1% in <inline-formula><tex-math>$F1$</tex-math></inline-formula> under identical data and spatial splits. Gains are most pronounced along narrow valley flanks, ridge gully transition zones, and engineered cut slopes, whereas changes are modest on broad, low relief interfluves. The resulting susceptibility maps exhibit clearer slope breaks, reduced speckle, and more contiguous high susceptibility belts, providing interpretable, object level outputs to support planning and hazard mitigation in rugged mountainous terrain.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5634-5648"},"PeriodicalIF":5.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11361010","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hallucination-Resistant Change Detection in Multimodal Large Models for Autonomous Land Management Agents 自主土地管理主体多模态大模型的抗幻觉变化检测
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-21 DOI: 10.1109/JSTARS.2026.3656403
Luo Zuo;Jiayi Sun;Jie Li;Feixiang Liu;Jinglei Li;Guanchong Niu
Change detection (CD) in remote sensing is essential for tracking transformations in land use, infrastructure, and landslide mapping over time. Although traditional deep learning (DL) models have shown effectiveness in standard scenarios, they often struggle with robustness and generalization, resulting in unreliable predictions in novel or complex environments. While large language models (LLMs) have been proposed to address generalization ability in various tasks, their performance is limited due to the hallucination issue. To overcome these limitations, we propose a change vector analysis (CVA)-based hallucination-resistant change detection multimodal large language model framework named CVAHR-CD-LLM, which is designed to suppress hallucinations while maintaining strong generalization capabilities. Specifically, we expand existing CD datasets, including those for landslide mapping, using a semisupervised approach and fine-tune CVAHR-CD-LLM to enhance its generalization ability. In addition, to address the hallucination issues commonly associated with LLMs, we introduce a CVA-based coordinations iterative calibration loop. Furthermore, we incorporate a self-consistency mechanism, which aggregates multiple reasoning paths to ensure robust predictions and reduce hallucinations during the CD process. This mechanism applies iterative corrections and multistep inference to refine detected change coordinates, leading to enhanced accuracy and reliability. Our method demonstrates substantial improvements in prediction precision, advancing the potential for autonomous and accurate land management applications.
遥感中的变化检测(CD)对于跟踪土地利用、基础设施和滑坡制图的变化至关重要。尽管传统的深度学习(DL)模型在标准场景中显示出有效性,但它们经常在鲁棒性和泛化方面遇到困难,导致在新颖或复杂的环境中做出不可靠的预测。虽然大型语言模型(llm)已经被提出用于解决各种任务的泛化能力,但由于幻觉问题,它们的性能受到限制。为了克服这些限制,我们提出了一种基于变化向量分析(CVA)的抗幻觉变化检测多模态大语言模型框架CVAHR-CD-LLM,该框架旨在抑制幻觉的同时保持强大的泛化能力。具体而言,我们使用半监督方法和微调CVAHR-CD-LLM来扩展现有的CD数据集,包括滑坡制图数据集,以增强其泛化能力。此外,为了解决通常与llm相关的幻觉问题,我们引入了基于cva的协调迭代校准回路。此外,我们结合了一个自一致性机制,该机制聚合了多个推理路径,以确保在CD过程中进行稳健的预测并减少幻觉。该机制采用迭代修正和多步推理来细化检测到的变化坐标,从而提高了精度和可靠性。我们的方法在预测精度上有了实质性的提高,推进了自主和精确土地管理应用的潜力。
{"title":"Hallucination-Resistant Change Detection in Multimodal Large Models for Autonomous Land Management Agents","authors":"Luo Zuo;Jiayi Sun;Jie Li;Feixiang Liu;Jinglei Li;Guanchong Niu","doi":"10.1109/JSTARS.2026.3656403","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656403","url":null,"abstract":"Change detection (CD) in remote sensing is essential for tracking transformations in land use, infrastructure, and landslide mapping over time. Although traditional deep learning (DL) models have shown effectiveness in standard scenarios, they often struggle with robustness and generalization, resulting in unreliable predictions in novel or complex environments. While large language models (LLMs) have been proposed to address generalization ability in various tasks, their performance is limited due to the hallucination issue. To overcome these limitations, we propose a change vector analysis (CVA)-based hallucination-resistant change detection multimodal large language model framework named CVAHR-CD-LLM, which is designed to suppress hallucinations while maintaining strong generalization capabilities. Specifically, we expand existing CD datasets, including those for landslide mapping, using a semisupervised approach and fine-tune CVAHR-CD-LLM to enhance its generalization ability. In addition, to address the hallucination issues commonly associated with LLMs, we introduce a CVA-based coordinations iterative calibration loop. Furthermore, we incorporate a self-consistency mechanism, which aggregates multiple reasoning paths to ensure robust predictions and reduce hallucinations during the CD process. This mechanism applies iterative corrections and multistep inference to refine detected change coordinates, leading to enhanced accuracy and reliability. Our method demonstrates substantial improvements in prediction precision, advancing the potential for autonomous and accurate land management applications.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6313-6327"},"PeriodicalIF":5.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359709","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-Domain Masked Representation Learning for Semantic Segmentation of Remote Sensing Images 基于双域掩码表示学习的遥感图像语义分割
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-20 DOI: 10.1109/JSTARS.2026.3655583
Yujia Fu;Mingyang Wang;Danfeng Hong;Gemine Vivone
Self-supervisedlearning (SSL) has emerged as a promising paradigm for remote sensing semantic segmentation, enabling the exploitation of large-scale unlabeled data to learn meaningful representations. However, most existing methods focus solely on the spatial domain, overlooking rich frequency information that is particularly critical in remote sensing images, where fine-grained textures and repetitive structural patterns are prevalent. To address this limitation, we propose a novel dual-domain masked representation (DDMR) learning framework. Specifically, the spatial masking branch simulates partial occlusions and encourages spatial context reasoning by randomly masking regions in the spatial domain. Meanwhile, randomized frequency masking increases input diversity during training and improves generalization. In addition, feature representations are further decoupled into amplitude and phase components in the frequency branch, and an amplitude-phase loss is introduced to encourage fine-grained, frequency-aware learning. By jointly leveraging spatial and frequency masked representation learning, DDMR enhances the robustness and discriminative power of learned features. Extensive experiments on two remote sensing datasets demonstrate that our method consistently outperforms state-of-the-art self-supervised approaches, validating its effectiveness for self-supervised semantic segmentation in complex remote sensing scenarios.
自监督学习(SSL)已经成为遥感语义分割的一个很有前途的范例,可以利用大规模未标记数据来学习有意义的表示。然而,大多数现有方法仅关注空间域,忽略了在遥感图像中特别重要的丰富频率信息,其中细粒度纹理和重复结构模式普遍存在。为了解决这一限制,我们提出了一种新的双域掩码表示(DDMR)学习框架。具体来说,空间掩蔽分支模拟部分遮挡,并通过在空间域中随机掩蔽区域来鼓励空间上下文推理。同时,随机频率掩蔽增加了训练过程中的输入分集,提高了泛化能力。此外,特征表示进一步解耦为频率分支中的幅度和相位分量,并引入幅度相位损失以鼓励细粒度的频率感知学习。通过联合利用空间和频率掩蔽表征学习,DDMR增强了学习特征的鲁棒性和判别能力。在两个遥感数据集上进行的大量实验表明,我们的方法始终优于最先进的自监督方法,验证了其在复杂遥感场景下自监督语义分割的有效性。
{"title":"Dual-Domain Masked Representation Learning for Semantic Segmentation of Remote Sensing Images","authors":"Yujia Fu;Mingyang Wang;Danfeng Hong;Gemine Vivone","doi":"10.1109/JSTARS.2026.3655583","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3655583","url":null,"abstract":"Self-supervisedlearning (SSL) has emerged as a promising paradigm for remote sensing semantic segmentation, enabling the exploitation of large-scale unlabeled data to learn meaningful representations. However, most existing methods focus solely on the spatial domain, overlooking rich frequency information that is particularly critical in remote sensing images, where fine-grained textures and repetitive structural patterns are prevalent. To address this limitation, we propose a novel dual-domain masked representation (DDMR) learning framework. Specifically, the spatial masking branch simulates partial occlusions and encourages spatial context reasoning by randomly masking regions in the spatial domain. Meanwhile, randomized frequency masking increases input diversity during training and improves generalization. In addition, feature representations are further decoupled into amplitude and phase components in the frequency branch, and an amplitude-phase loss is introduced to encourage fine-grained, frequency-aware learning. By jointly leveraging spatial and frequency masked representation learning, DDMR enhances the robustness and discriminative power of learned features. Extensive experiments on two remote sensing datasets demonstrate that our method consistently outperforms state-of-the-art self-supervised approaches, validating its effectiveness for self-supervised semantic segmentation in complex remote sensing scenarios.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"5507-5519"},"PeriodicalIF":5.3,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AMPUNet: Hierarchical Attention Map Pyramid for Semantic Segmentation of Remote Sensing Images AMPUNet:用于遥感图像语义分割的分层注意图金字塔
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-20 DOI: 10.1109/JSTARS.2026.3656191
Yang Yang;Wei Ao;Shunyi Zheng;Zhao Liu;Yunni Wu
Accurate semantic segmentation of high-resolution remote sensing imagery is essential for applications ranging from urban planning to environmental monitoring. However, this task remains fundamentally challenging due to the complex spatial patterns, extreme scale variations, and fine-grained details inherent in geographical scenes. While attention mechanisms, particularly global and sparse attention, have shown promise in capturing long-range dependencies, existing approaches often suffer from three interconnected limitations: prohibitive computational complexity, misalignment when integrating multiscale representations, and loss of semantic information during the decoder’s upsampling stages. This article introduces AMPUNet, a novel framework designed to overcome these limitations through the construction of a hierarchical, coarse-to-fine attention map pyramid. Our core innovation lies in explicitly propagating and refining attention maps across network layers rather than operating solely on feature maps. Specifically, we design: first, a hybrid sparse attention framework combining a block attention module and a column attention module to model global context efficiently, second, a dimension correspondence module to achieve tensor-level granularity alignment for multiscale attention maps, and third, an attention map merging module with a CAW strategy, which directly transfers high-level semantic information from deep to shallow layers, mitigating information degradation. Extensive experiments on the ISPRS Vaihingen, Potsdam, and LoveDA benchmarks demonstrate that AMPUNet achieves superior performance, with mean intersection of union scores of 75.43% on Vaihingen, 78.03% on Potsdam, and 50.94% on LoveDA, while maintaining competitive inference efficiency. Our findings confirm that structuring attention into a learnable pyramid is a highly effective paradigm for remote sensing semantic segmentation, successfully balancing precise detail preservation with robust global understanding.
高分辨率遥感图像的准确语义分割对于从城市规划到环境监测的各种应用至关重要。然而,由于复杂的空间格局、极端的尺度变化和地理场景固有的细粒度细节,这项任务仍然具有根本性的挑战性。虽然注意机制,特别是全局和稀疏注意,在捕获远程依赖关系方面表现出了希望,但现有的方法经常受到三个相互关联的限制:令人望而却步的计算复杂性,集成多尺度表示时的不一致,以及解码器上采样阶段语义信息的丢失。本文介绍了AMPUNet,这是一个新的框架,旨在通过构建一个分层的、从粗到精的注意力地图金字塔来克服这些限制。我们的核心创新在于跨网络层明确地传播和精炼注意力图,而不是仅仅在特征图上操作。具体而言,我们设计了一种结合块注意模块和列注意模块的混合稀疏注意框架,以有效地建模全局上下文;第二,设计了一个维度对应模块,以实现多尺度注意图的张量级粒度对齐;第三,设计了一个具有CAW策略的注意图合并模块,该模块直接将高级语义信息从深层传递到浅层,减轻了信息退化。在ISPRS Vaihingen、Potsdam和LoveDA基准上进行的大量实验表明,AMPUNet在保持竞争性推理效率的同时,在Vaihingen、Potsdam和LoveDA基准上取得了更优的性能,union分数的平均交集在Vaihingen、Potsdam和LoveDA上分别为75.43%、78.03%和50.94%。我们的研究结果证实,将注意力结构成一个可学习的金字塔是遥感语义分割的一个非常有效的范例,成功地平衡了精确的细节保存和强大的全局理解。
{"title":"AMPUNet: Hierarchical Attention Map Pyramid for Semantic Segmentation of Remote Sensing Images","authors":"Yang Yang;Wei Ao;Shunyi Zheng;Zhao Liu;Yunni Wu","doi":"10.1109/JSTARS.2026.3656191","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3656191","url":null,"abstract":"Accurate semantic segmentation of high-resolution remote sensing imagery is essential for applications ranging from urban planning to environmental monitoring. However, this task remains fundamentally challenging due to the complex spatial patterns, extreme scale variations, and fine-grained details inherent in geographical scenes. While attention mechanisms, particularly global and sparse attention, have shown promise in capturing long-range dependencies, existing approaches often suffer from three interconnected limitations: prohibitive computational complexity, misalignment when integrating multiscale representations, and loss of semantic information during the decoder’s upsampling stages. This article introduces AMPUNet, a novel framework designed to overcome these limitations through the construction of a hierarchical, coarse-to-fine attention map pyramid. Our core innovation lies in explicitly propagating and refining attention maps across network layers rather than operating solely on feature maps. Specifically, we design: first, a hybrid sparse attention framework combining a block attention module and a column attention module to model global context efficiently, second, a dimension correspondence module to achieve tensor-level granularity alignment for multiscale attention maps, and third, an attention map merging module with a CAW strategy, which directly transfers high-level semantic information from deep to shallow layers, mitigating information degradation. Extensive experiments on the ISPRS Vaihingen, Potsdam, and LoveDA benchmarks demonstrate that AMPUNet achieves superior performance, with mean intersection of union scores of 75.43% on Vaihingen, 78.03% on Potsdam, and 50.94% on LoveDA, while maintaining competitive inference efficiency. Our findings confirm that structuring attention into a learnable pyramid is a highly effective paradigm for remote sensing semantic segmentation, successfully balancing precise detail preservation with robust global understanding.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6328-6340"},"PeriodicalIF":5.3,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359465","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1