首页 > 最新文献

International Journal of Applied Earth Observation and Geoinformation最新文献

英文 中文
RPDNet: Street-level road pavement damage detection with a real-time anchor-free network RPDNet:使用实时无锚网络进行街道级路面损伤检测
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-23 DOI: 10.1016/j.jag.2025.105070
Jian Kang, Haiyan Guan, Dedong Zhang, Lingfei Ma, Lanying Wang, Yongtao Yu, Linlin Xu, Jonathan Li
Accurately and timely detecting road pavement damage helps monitor road deterioration extent, thereby guiding maintenance projects and ensuring traffic safety. Nevertheless, due to textural similarity and nested distribution between neighboring pavement damages, as well as the damages with the diversity sizes, irregular shapes, multiple categories, current methods have the limitation in the high-quality detection from road street-level images. To tackle these challenges, this paper develops a novel real-time anchor-free network with a one-stage processing architecture, named RPDNet, for precisely and accurately detecting pavement damages from streel-level road images. First, stacked with a layer-by-layer encoding structure boosted by a deformable fully-attentive module as the backbone extractor, the RPDNet can capture more fine-grained information and generate multiscale strong task-aware semantics, favoring significantly the discrimination noteworthy textural and geometric features. Then, by adopting a multi-level efficient aggregation neck, the RPDNet can promote informative spatial details and integrate the different-level damage encoding features, contributing to the light-weight and optimization of the whole architecture. Afterward, designed with a dual-large kernel module, embedded in a decoupled detection head with anchor-free guidance, the RPDNet can project the ranging dependency of salient and task-oriented pavement damage objects by adaptively aggregating information across large kernels in spatial-domain. Qualitative and quantitative evaluations confirmed that the RPDNet provided a promiseful solution for detecting pavement damages in industrial applications under complex street-level road conditions. Furthermore, comparative analysis with the latest anchor-based and anchor-free alternatives also proved the superiority and generalization of the RPDNet in pavement damage detection tasks. The assessment results displayed that the RPDNet obtained an average mAP@0.5, mAP@0.5:0.95, precision, and recall of 69.16%, 44.86%, 72.59%, and 60.41%, respectively, on two dataset. Additionally, we constructed a large-size multi-city road pavement damage image dataset to support urban road health monitoring.
准确、及时地检测道路路面损坏情况,有助于监测道路恶化程度,从而指导养护工程,保障交通安全。然而,由于相邻路面损伤之间的纹理相似性和嵌套分布,以及损伤大小多样、形状不规则、类别多的特点,现有方法在高质量的道路街道图像检测中存在一定的局限性。为了应对这些挑战,本文开发了一种新的实时无锚网络,该网络具有单阶段处理架构,名为RPDNet,用于精确地从街道级道路图像中检测路面损伤。首先,RPDNet采用可变形的全关注模块作为主干提取器,采用逐层编码结构叠加,可以捕获更细粒度的信息,生成多尺度强任务感知语义,显著有利于纹理和几何特征的识别。然后,RPDNet通过采用多级高效聚合颈,促进信息丰富的空间细节,整合不同级别的损伤编码特征,有利于整个体系结构的轻量化和优化。随后,RPDNet设计了双大核模块,嵌入无锚制导的解耦检测头中,通过在空间域上自适应地聚合大核信息,可以投影显著性和面向任务的路面损伤目标的距离依赖关系。定性和定量评估证实,RPDNet为在复杂街道道路条件下的工业应用中检测路面损伤提供了一个有希望的解决方案。此外,通过与最新的基于锚点和无锚点方案的对比分析,也证明了RPDNet在路面损伤检测任务中的优越性和通用性。评估结果表明,RPDNet在两个数据集上的平均准确率分别为mAP@0.5、mAP@0.5:0.95,准确率为69.16%、44.86%、72.59%、召回率为60.41%。此外,我们还构建了大型多城市道路路面损伤图像数据集,以支持城市道路健康监测。
{"title":"RPDNet: Street-level road pavement damage detection with a real-time anchor-free network","authors":"Jian Kang, Haiyan Guan, Dedong Zhang, Lingfei Ma, Lanying Wang, Yongtao Yu, Linlin Xu, Jonathan Li","doi":"10.1016/j.jag.2025.105070","DOIUrl":"https://doi.org/10.1016/j.jag.2025.105070","url":null,"abstract":"Accurately and timely detecting road pavement damage helps monitor road deterioration extent, thereby guiding maintenance projects and ensuring traffic safety. Nevertheless, due to textural similarity and nested distribution between neighboring pavement damages, as well as the damages with the diversity sizes, irregular shapes, multiple categories, current methods have the limitation in the high-quality detection from road street-level images. To tackle these challenges, this paper develops a novel real-time anchor-free network with a one-stage processing architecture, named RPDNet, for precisely and accurately detecting pavement damages from streel-level road images. First, stacked with a layer-by-layer encoding structure boosted by a deformable fully-attentive module as the backbone extractor, the RPDNet can capture more fine-grained information and generate multiscale strong task-aware semantics, favoring significantly the discrimination noteworthy textural and geometric features. Then, by adopting a multi-level efficient aggregation neck, the RPDNet can promote informative spatial details and integrate the different-level damage encoding features, contributing to the light-weight and optimization of the whole architecture. Afterward, designed with a dual-large kernel module, embedded in a decoupled detection head with anchor-free guidance, the RPDNet can project the ranging dependency of salient and task-oriented pavement damage objects by adaptively aggregating information across large kernels in spatial-domain. Qualitative and quantitative evaluations confirmed that the RPDNet provided a promiseful solution for detecting pavement damages in industrial applications under complex street-level road conditions. Furthermore, comparative analysis with the latest anchor-based and anchor-free alternatives also proved the superiority and generalization of the RPDNet in pavement damage detection tasks. The assessment results displayed that the RPDNet obtained an average <ce:italic>mAP@0.5</ce:italic>, <ce:italic>mAP@0.5:0.95</ce:italic>, <ce:italic>precision</ce:italic>, and <ce:italic>recall</ce:italic> of 69.16%, 44.86%, 72.59%, and 60.41%, respectively, on two dataset. Additionally, we constructed a large-size multi-city road pavement damage image dataset to support urban road health monitoring.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"288 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recognition of salt-marsh fairy circles in conventional optical satellite imagery: A generalizable framework with multiple machine learning models and imbalanced Bayesian probability updating 传统光学卫星图像中盐沼仙女圈的识别:多机器学习模型和非平衡贝叶斯概率更新的可推广框架
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-23 DOI: 10.1016/j.jag.2026.105101
Jianru Yang, Hao Zheng, Weiwei Sun, Yuekai Hu, Weiguo Zhang, Chunpeng Chen, Yunxuan Zhou, Heqin Cheng, Weiming Xie, Kai Tan
Salt-marsh Fairy circles (FC) are enigmatic, quasi-circular structures linked to interacting biogeophysical processes, yet they remain difficult to detect and quantify at scale from conventional RGB imagery. Limited labeled data, transient and variable FC appearance, and severe class-imbalance make single-model machine learning (ML) unreliable for quantitative monitoring. We propose a framework for automatic FC recognition and enumeration on 3-band imagery. A zero-shot foundation model (SAM) segments images into instance-level blocks. Novel distribution-pattern and geometric features, class-equalized losses, weighted resampling, and augmentation are applied within deep-learning (U-Net, Attention-U-Net, Swin-Unet) and ensemble-learning (Random Forest, XGBoost) models. The key innovation is an imbalance-aware Bayesian method that fuses pixel-wise probabilities across models; a counting algorithm then tallies FC instances. We evaluate eight pan-sharpened scenes covering four sites along China’s coast. No individual ML model or standard Bayesian fusion is fully satisfactory. The imbalance-aware Bayesian method improves over the best single model: tight scheme: κ rises from 0.69 to 0.76, F1-score from 70.9% to 75.8% (Class 1) and from 63.5% to 68.2% (Class 2), and AUC from 84.8% to 93.1% and from 78.5% to 84.8%; loose scheme: κ increases from 0.74 to 0.79, AUC from 85.1% to 90.3%, F1-score from 74.3% to 78.6%. The counting algorithm achieves RMSE 1.62 and MAPE 0.33% over 1,135 instances, outperforming DBSCAN. A 22-month case study on Chongming Island captures marsh expansion and dieback dynamics through shifts between FC classes. Our framework delivers reliable FC recognition and enumeration on a small dataset with severe class-imbalance, generalizing across salt-marsh types.
盐沼仙女圈(FC)是一种神秘的准圆形结构,与相互作用的生物地球物理过程有关,但它们仍然难以从传统的RGB图像中进行大规模检测和量化。有限的标记数据,瞬时和可变的FC外观,以及严重的类别不平衡使得单模型机器学习(ML)在定量监测中不可靠。提出了一种基于三波段图像的FC自动识别与枚举框架。零射击基础模型(SAM)将图像分割成实例级块。在深度学习(U-Net、Attention-U-Net、swan - unet)和集成学习(Random Forest、XGBoost)模型中应用了新的分布模式和几何特征、类均衡损失、加权重采样和增强。关键的创新是一种不平衡感知贝叶斯方法,它融合了模型之间逐像素的概率;然后计数算法计算FC实例。我们评估了覆盖中国沿海四个地点的八个泛锐化场景。没有一个单独的ML模型或标准贝叶斯融合是完全令人满意的。不平衡感知贝叶斯方法比最佳单一模型有所改进:紧方案:κ从0.69上升到0.76,f1评分从70.9%上升到75.8%(第1类)和63.5%上升到68.2%(第2类),AUC从84.8%上升到93.1%和78.5%上升到84.8%;宽松方案:κ从0.74增加到0.79,AUC从85.1%增加到90.3%,f1评分从74.3%增加到78.6%。计数算法在1135个实例中实现RMSE 1.62和MAPE 0.33%,优于DBSCAN。一项为期22个月的崇明岛案例研究捕捉到了沼泽扩展和枯死的动态变化。我们的框架在具有严重类不平衡的小数据集上提供可靠的FC识别和枚举,并在盐沼类型中进行推广。
{"title":"Recognition of salt-marsh fairy circles in conventional optical satellite imagery: A generalizable framework with multiple machine learning models and imbalanced Bayesian probability updating","authors":"Jianru Yang, Hao Zheng, Weiwei Sun, Yuekai Hu, Weiguo Zhang, Chunpeng Chen, Yunxuan Zhou, Heqin Cheng, Weiming Xie, Kai Tan","doi":"10.1016/j.jag.2026.105101","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105101","url":null,"abstract":"Salt-marsh Fairy circles (FC) are enigmatic, quasi-circular structures linked to interacting biogeophysical processes, yet they remain difficult to detect and quantify at scale from conventional RGB imagery. Limited labeled data, transient and variable FC appearance, and severe class-imbalance make single-model machine learning (ML) unreliable for quantitative monitoring. We propose a framework for automatic FC recognition and enumeration on 3-band imagery. A zero-shot foundation model (SAM) segments images into instance-level blocks. Novel distribution-pattern and geometric features, class-equalized losses, weighted resampling, and augmentation are applied within deep-learning (U-Net, Attention-U-Net, Swin-Unet) and ensemble-learning (Random Forest, XGBoost) models. The key innovation is an imbalance-aware Bayesian method that fuses pixel-wise probabilities across models; a counting algorithm then tallies FC instances. We evaluate eight pan-sharpened scenes covering four sites along China’s coast. No individual ML model or standard Bayesian fusion is fully satisfactory. The imbalance-aware Bayesian method improves over the best single model: tight scheme: κ rises from 0.69 to 0.76, F1-score from 70.9% to 75.8% (Class 1) and from 63.5% to 68.2% (Class 2), and AUC from 84.8% to 93.1% and from 78.5% to 84.8%; loose scheme: κ increases from 0.74 to 0.79, AUC from 85.1% to 90.3%, F1-score from 74.3% to 78.6%. The counting algorithm achieves RMSE 1.62 and MAPE 0.33% over 1,135 instances, outperforming DBSCAN. A 22-month case study on Chongming Island captures marsh expansion and dieback dynamics through shifts between FC classes. Our framework delivers reliable FC recognition and enumeration on a small dataset with severe class-imbalance, generalizing across salt-marsh types.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"14 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DGL-RSIS: Decoupling global spatial context and local class semantics for training-free remote sensing image segmentation DGL-RSIS:解耦全局空间上下文和局部类语义的无训练遥感图像分割
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-23 DOI: 10.1016/j.jag.2026.105113
Boyi Li, Ce Zhang, Richard M. Timmerman, Wenxuan Bao
The emergence of vision language models (VLMs) bridges the gap between vision and language, enabling multimodal understanding beyond traditional visual-only deep learning models. However, transferring VLMs from the natural image domain to remote sensing (RS) segmentation remains challenging due to the large domain gap and the diversity of RS inputs across tasks, particularly in open-vocabulary semantic segmentation (OVSS) and referring expression segmentation (RES). Here, we propose a training-free unified framework, termed DGL-RSIS, which decouples visual and textual representations and performs visual-language alignment at both local semantic and global contextual levels. Specifically, a Global–Local Decoupling (GLD) module decomposes textual inputs into local semantic tokens and global contextual tokens, while image inputs are partitioned into class-agnostic mask proposals. Then, a Local Visual–Textual Alignment (LVTA) module adaptively extracts context-aware visual features from the mask proposals and enriches textual features through knowledge-guided prompt engineering, achieving OVSS from a local perspective. Furthermore, a Global Visual–Textual Alignment (GVTA) module employs a global-enhanced Grad-CAM mechanism to capture contextual cues for referring expressions, followed by a mask selection module that integrates pixel-level activations into mask-level segmentation outputs, thereby achieving RES from a global perspective. Experiments on the iSAID (OVSS) and RRSIS-D (RES) benchmarks demonstrate that DGL-RSIS outperforms existing training-free approaches. Ablation studies further validate the effectiveness of each module. To the best of our knowledge, this is the first unified training-free framework for RS image segmentation, which effectively transfers the semantic capability of VLMs trained on natural images to the RS domain without additional training.
视觉语言模型(vlm)的出现弥合了视觉和语言之间的差距,使多模态理解超越了传统的仅视觉深度学习模型。然而,将vlm从自然图像域转移到遥感(RS)分割中仍然是一个挑战,因为领域差距大,而且不同任务之间RS输入的多样性,特别是在开放词汇语义分割(OVSS)和参考表达分割(RES)中。在这里,我们提出了一个无需训练的统一框架,称为DGL-RSIS,它将视觉和文本表示解耦,并在局部语义和全局上下文级别执行视觉语言对齐。具体来说,全局-局部解耦(GLD)模块将文本输入分解为局部语义令牌和全局上下文令牌,而图像输入则被划分为与类无关的掩码建议。然后,局部视觉文本对齐(LVTA)模块自适应地从掩模方案中提取上下文感知的视觉特征,并通过知识引导提示工程丰富文本特征,实现局部视角的OVSS。此外,全局视觉文本对齐(GVTA)模块采用全局增强的Grad-CAM机制来捕获上下文线索以引用表达式,随后是一个掩码选择模块,将像素级激活集成到掩码级分割输出中,从而从全局角度实现RES。在iSAID (OVSS)和RRSIS-D (RES)基准测试上的实验表明,DGL-RSIS优于现有的无训练方法。烧蚀研究进一步验证了各模块的有效性。据我们所知,这是第一个统一的RS图像分割框架,它有效地将在自然图像上训练的vlm的语义能力转移到RS域,而无需额外的训练。
{"title":"DGL-RSIS: Decoupling global spatial context and local class semantics for training-free remote sensing image segmentation","authors":"Boyi Li, Ce Zhang, Richard M. Timmerman, Wenxuan Bao","doi":"10.1016/j.jag.2026.105113","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105113","url":null,"abstract":"The emergence of vision language models (VLMs) bridges the gap between vision and language, enabling multimodal understanding beyond traditional visual-only deep learning models. However, transferring VLMs from the natural image domain to remote sensing (RS) segmentation remains challenging due to the large domain gap and the diversity of RS inputs across tasks, particularly in open-vocabulary semantic segmentation (OVSS) and referring expression segmentation (RES). Here, we propose a training-free unified framework, termed DGL-RSIS, which decouples visual and textual representations and performs visual-language alignment at both local semantic and global contextual levels. Specifically, a Global–Local Decoupling (GLD) module decomposes textual inputs into local semantic tokens and global contextual tokens, while image inputs are partitioned into class-agnostic mask proposals. Then, a Local Visual–Textual Alignment (LVTA) module adaptively extracts context-aware visual features from the mask proposals and enriches textual features through knowledge-guided prompt engineering, achieving OVSS from a local perspective. Furthermore, a Global Visual–Textual Alignment (GVTA) module employs a global-enhanced Grad-CAM mechanism to capture contextual cues for referring expressions, followed by a mask selection module that integrates pixel-level activations into mask-level segmentation outputs, thereby achieving RES from a global perspective. Experiments on the iSAID (OVSS) and RRSIS-D (RES) benchmarks demonstrate that DGL-RSIS outperforms existing training-free approaches. Ablation studies further validate the effectiveness of each module. To the best of our knowledge, this is the first unified training-free framework for RS image segmentation, which effectively transfers the semantic capability of VLMs trained on natural images to the RS domain without additional training.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"30 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long-term plastic greenhouse mapping based on automatic sample generation and multi-temporal noise correction: A case study of Huang-Huai-Hai Plain 基于采样自动生成和多时相噪声校正的长期塑料大棚制图——以黄淮海平原为例
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-22 DOI: 10.1016/j.jag.2026.105123
Xiaoping Zhang, Bo Cheng, Peng Huang, Chenbin Liang, Min Zhao, Guizhou Wang, Qinxue He, Yaocan Gan
Plastic greenhouses (PGs), as a typical form of facility agriculture, play a crucial role in stabilizing agricultural production and increasing crop yields, but their rapid expansion has raised environmental concerns. Accurate long-term PGs monitoring is therefore essential for scientific agricultural regulation and environmental sustainability. However, most existing studies have focused on local regions or single-year mapping, and long-term PGs mapping remains limited. Moreover, acquiring multi-year high-quality training samples and developing effective classification algorithms remain major challenges for reliable PGs extraction. To address these issues, we propose a novel PGs mapping framework that integrates automatic sample generation with multi-temporal noise correction (MTNC), and utilizes Landsat time-series images to efficiently and accurately map multi-year PGs distribution in the Huang-Huai-Hai Plain. Specifically, high-quality training samples were automatically generated from multi-source land use/land cover and PGs products through spatial rules and sample migration, followed by preliminary classification with Random Forest. The initial predictions were then refined through the MTNC strategy, and the optimized labels were subsequently employed to train a segmentation network for robust PGs extraction. Accuracy assessments on two independent validation datasets demonstrate that the final PGs maps achieve overall accuracies above 90% and Kappa coefficients greater than 0.8 across all years. And cross-comparisons with existing PGs products at multiple spatial resolutions show a high level of spatial consistency (R2 = 0.91 with PGs-10 and 0.74 with PGs-3), further confirming the reliability of the proposed framework and the high quality of the final products.
塑料大棚作为设施农业的一种典型形式,在稳定农业生产和提高作物产量方面发挥着至关重要的作用,但其迅速扩张也引起了人们对环境的担忧。因此,精确的长期pg监测对于科学的农业调控和环境可持续性至关重要。然而,大多数现有研究都集中在局部区域或一年的制图,长期的pg制图仍然有限。此外,获取多年的高质量训练样本和开发有效的分类算法仍然是可靠提取pg的主要挑战。为了解决这些问题,我们提出了一种新的PGs制图框架,该框架将自动样本生成与多时相噪声校正(MTNC)相结合,并利用Landsat时间序列图像高效准确地绘制了黄淮海平原多年PGs分布。具体而言,通过空间规则和样本迁移,从多源土地利用/土地覆盖和pg产品中自动生成高质量的训练样本,然后使用Random Forest进行初步分类。然后通过MTNC策略对初始预测进行改进,随后使用优化的标签来训练分割网络,以进行稳健的pg提取。对两个独立验证数据集的精度评估表明,最终的pg地图在所有年份的总体精度都在90%以上,Kappa系数大于0.8。与现有多空间分辨率pg产品的交叉比较显示出较高的空间一致性(PGs-10 R2 = 0.91, PGs-3 R2 = 0.74),进一步证实了所提出框架的可靠性和最终产品的高质量。
{"title":"Long-term plastic greenhouse mapping based on automatic sample generation and multi-temporal noise correction: A case study of Huang-Huai-Hai Plain","authors":"Xiaoping Zhang, Bo Cheng, Peng Huang, Chenbin Liang, Min Zhao, Guizhou Wang, Qinxue He, Yaocan Gan","doi":"10.1016/j.jag.2026.105123","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105123","url":null,"abstract":"Plastic greenhouses (PGs), as a typical form of facility agriculture, play a crucial role in stabilizing agricultural production and increasing crop yields, but their rapid expansion has raised environmental concerns. Accurate long-term PGs monitoring is therefore essential for scientific agricultural regulation and environmental sustainability. However, most existing studies have focused on local regions or single-year mapping, and long-term PGs mapping remains limited. Moreover, acquiring multi-year high-quality training samples and developing effective classification algorithms remain major challenges for reliable PGs extraction. To address these issues, we propose a novel PGs mapping framework that integrates automatic sample generation with multi-temporal noise correction (MTNC), and utilizes Landsat time-series images to efficiently and accurately map multi-year PGs distribution in the Huang-Huai-Hai Plain. Specifically, high-quality training samples were automatically generated from multi-source land use/land cover and PGs products through spatial rules and sample migration, followed by preliminary classification with Random Forest. The initial predictions were then refined through the MTNC strategy, and the optimized labels were subsequently employed to train a segmentation network for robust PGs extraction. Accuracy assessments on two independent validation datasets demonstrate that the final PGs maps achieve overall accuracies above 90% and Kappa coefficients greater than 0.8 across all years. And cross-comparisons with existing PGs products at multiple spatial resolutions show a high level of spatial consistency (<mml:math altimg=\"si1.svg\" display=\"inline\"><mml:msup><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math> = 0.91 with PGs-10 and 0.74 with PGs-3), further confirming the reliability of the proposed framework and the high quality of the final products.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"67 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relative sea-level rise and inundation risks in the Bohai Rim: Dominant role of vertical land motion 环渤海地区相对海平面上升与淹没风险:垂直陆地运动的主导作用
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-21 DOI: 10.1016/j.jag.2026.105115
Zhiqiang Gong, Jianzhong Wu, Jie Dong, Qianye Lan, Shangjing Lai, Jinxin Lin, Mingsheng Liao
Coastal regions worldwide are increasingly threatened by Relative Sea Level Rise (RSLR), which results from the combined effects of Sea Level Rise (SLR) and Vertical Land Motion (VLM). Satellite Interferometric Synthetic Aperture Radar (InSAR) can measure millimeter-scale VLM. However, the spatiotemporal variability of VLM remains poorly quantified in RSLR estimation and projection and flood inundation assessment. This study constructed an RSLR dataset for the Bohai Rim by integrating high-resolution InSAR-derived VLM with SLR. We develop a dynamic flood inundation model by incorporating hydrological connectivity and flow path attenuation factors to improve flood risk assessment. The results show significant spatial variability of VLM and concentration on muddy and sandy coastlines. VLM dominates the spatial patterns and magnitude of RSLR. The maximum inundation extent and depth reach 17,756 km2 and 5.4 m, respectively, under the RSLR-SSP5-8.5 scenario by 2100, which threatens 10.4 million residents. The uncertainties in inundation projection can be reduced by considering the drivers and nonlinear evolution of VLM. The flood protection infrastructures can reduce inundation largely. Our findings highlight the crucial importance of incorporating VLM into coastal risk assessments and provide insights for RSLR adaptation strategies.
相对海平面上升(RSLR)是海平面上升和陆地垂直运动共同作用的结果,对全球沿海地区的威胁日益严重。卫星干涉合成孔径雷达(InSAR)可以测量毫米尺度的VLM。然而,在RSLR估算和预测以及洪水淹没评估中,VLM的时空变异性仍然缺乏量化。将insar衍生的高分辨率VLM与SLR相结合,构建了环渤海地区的RSLR数据集。为了改进洪水风险评估,我们将水文连通性和流道衰减因素结合起来,建立了一个动态洪水淹没模型。结果表明,在泥泞和沙质海岸线上,VLM及其浓度具有显著的空间变异性。VLM主导着RSLR的空间格局和幅度。在RSLR-SSP5-8.5情景下,到2100年,最大淹没范围和深度分别达到17756 km2和5.4 m,威胁到1040万居民。考虑VLM的驱动因素和非线性演化,可以降低淹没预测的不确定性。防洪基础设施可以在很大程度上减少洪水泛滥。我们的研究结果强调了将VLM纳入沿海风险评估的重要性,并为RSLR适应策略提供了见解。
{"title":"Relative sea-level rise and inundation risks in the Bohai Rim: Dominant role of vertical land motion","authors":"Zhiqiang Gong, Jianzhong Wu, Jie Dong, Qianye Lan, Shangjing Lai, Jinxin Lin, Mingsheng Liao","doi":"10.1016/j.jag.2026.105115","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105115","url":null,"abstract":"Coastal regions worldwide are increasingly threatened by Relative Sea Level Rise (RSLR), which results from the combined effects of Sea Level Rise (SLR) and Vertical Land Motion (VLM). Satellite Interferometric Synthetic Aperture Radar (InSAR) can measure millimeter-scale VLM. However, the spatiotemporal variability of VLM remains poorly quantified in RSLR estimation and projection and flood inundation assessment. This study constructed an RSLR dataset for the Bohai Rim by integrating high-resolution InSAR-derived VLM with SLR. We develop a dynamic flood inundation model by incorporating hydrological connectivity and flow path attenuation factors to improve flood risk assessment. The results show significant spatial variability of VLM and concentration on muddy and sandy coastlines. VLM dominates the spatial patterns and magnitude of RSLR. The maximum inundation extent and depth reach 17,756 km<ce:sup loc=\"post\">2</ce:sup> and 5.4 m, respectively, under the RSLR-SSP5-8.5 scenario by 2100, which threatens 10.4 million residents. The uncertainties in inundation projection can be reduced by considering the drivers and nonlinear evolution of VLM. The flood protection infrastructures can reduce inundation largely. Our findings highlight the crucial importance of incorporating VLM into coastal risk assessments and provide insights for RSLR adaptation strategies.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"21 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reverse degradation for remote sensing pan-sharpening 遥感泛锐化的反向退化
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-21 DOI: 10.1016/j.jag.2026.105085
Jiang He, Xiao Xiang Zhu
Accurate pan-sharpening of multispectral images is essential for high-resolution remote sensing, yet supervised methods are limited by the need for paired training data and poor generalization. Existing unsupervised approaches often neglect the physical consistency between degradation and fusion and lack sufficient constraints, resulting in suboptimal performance in complex scenarios. We propose RevFus, a novel two-stage pan-sharpening framework. In the first stage, an invertible neural network models the degradation process and reverses it for fusion with cycle-consistency self-learning, ensuring a physically grounded mapping. In the second stage, structural detail compensation and spatial–spectral contrastive learning alleviate detail loss and enhance spectral–spatial fidelity. To further understand the network’s decision-making, we design a quantitative and systematic measure of model interpretability, the Interpretability Efficacy Coefficient (IEC). IEC integrates multiple statistics derived from SHapley Additive exPlanations (SHAP) values into a single unified score and try to evaluate how effectively a model balances spatial detail enhancement with spectral preservation. Experiments on three datasets demonstrate that RevFus outperforms state-of-the-art unsupervised and traditional methods, delivering superior spectral fidelity, enhanced spatial detail, and high model interpretability, thereby validating the effectiveness of the interpretable deep learning framework for robust, high-quality pan-sharpening.
多光谱图像的精确泛锐化对于高分辨率遥感至关重要,但监督方法受成对训练数据的需求和泛化能力的限制。现有的无监督方法往往忽略了退化和融合之间的物理一致性,缺乏足够的约束,导致在复杂场景下的性能不理想。我们提出RevFus,一个新的两阶段泛锐化框架。在第一阶段,一个可逆神经网络对退化过程进行建模,并将其与循环一致性自学习进行融合,以确保物理接地映射。在第二阶段,采用结构细节补偿和空间-光谱对比学习来减轻细节损失,提高频谱-空间保真度。为了进一步理解网络的决策,我们设计了一个定量的、系统的模型可解释性度量——可解释性效能系数(interpretability Efficacy Coefficient, IEC)。IEC将来自SHapley加性解释(SHAP)值的多个统计数据集成到一个统一的分数中,并试图评估模型如何有效地平衡空间细节增强与光谱保存。在三个数据集上的实验表明,RevFus优于最先进的无监督和传统方法,提供了卓越的光谱保真度、增强的空间细节和高模型可解释性,从而验证了可解释深度学习框架在鲁棒性、高质量泛锐化方面的有效性。
{"title":"Reverse degradation for remote sensing pan-sharpening","authors":"Jiang He, Xiao Xiang Zhu","doi":"10.1016/j.jag.2026.105085","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105085","url":null,"abstract":"Accurate pan-sharpening of multispectral images is essential for high-resolution remote sensing, yet supervised methods are limited by the need for paired training data and poor generalization. Existing unsupervised approaches often neglect the physical consistency between degradation and fusion and lack sufficient constraints, resulting in suboptimal performance in complex scenarios. We propose RevFus, a novel two-stage pan-sharpening framework. In the first stage, an invertible neural network models the degradation process and reverses it for fusion with cycle-consistency self-learning, ensuring a physically grounded mapping. In the second stage, structural detail compensation and spatial–spectral contrastive learning alleviate detail loss and enhance spectral–spatial fidelity. To further understand the network’s decision-making, we design a quantitative and systematic measure of model interpretability, the Interpretability Efficacy Coefficient (IEC). IEC integrates multiple statistics derived from SHapley Additive exPlanations (SHAP) values into a single unified score and try to evaluate how effectively a model balances spatial detail enhancement with spectral preservation. Experiments on three datasets demonstrate that RevFus outperforms state-of-the-art unsupervised and traditional methods, delivering superior spectral fidelity, enhanced spatial detail, and high model interpretability, thereby validating the effectiveness of the interpretable deep learning framework for robust, high-quality pan-sharpening.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"395 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Sentinel-2 landslide change detection by integrating multispectral, deformation, and topographic information 通过整合多光谱、变形和地形信息,增强Sentinel-2滑坡变化检测
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-21 DOI: 10.1016/j.jag.2026.105116
Bo Liu, Deren Li, Xiongwu Xiao, Zhenfeng Shao, Yingbing Li, Haobin Zhang, Yunong Chen, Zhenbei Zhang, Siyuan Wang, Boshen Chang
Landslides are among the most frequent natural hazards worldwide, posing severe risks to human lives and assets. Sentinel-2 imagery, a widely available remote sensing resource, has become an essential tool for detecting landslide changes. Nevertheless, its moderate spatial resolution often causes the loss of subtle surface information, which limits the precision of landslide recognition. To overcome this limitation, this study develops a Dynamic Multi-branch Landslide Change Detection Network (DMLCDNet) that integrates three types of multimodal data: multispectral, deformation, and topographic information. The proposed model achieves efficient landslide extraction through four key stages: preliminary change detection, dynamic feature extraction, feature fusion optimization, and spatial-semantic restoration. Experiments were conducted on three representative landslide events—Jiuzhaigou, Luding, and Longyan. The experimental findings indicate that DMLCDNet achieves improvements of 3.61% in F1-score and 6.91% in IoU over all baseline models, while sustaining efficient inference performance. In addition, evaluations on two other landslide cases confirm the model’s robust generalization ability. The study further emphasizes that the optimal selection of input data should follow three guiding principles: rich information, high discriminability, and strong relevance. Overall, the proposed method provides an effective paradigm for multisource feature fusion in Sentinel-2–based landslide change detection, offering substantial theoretical significance and practical value. The source code and dataset are publicly available at https://github.com/Trifurs/DMLCDNet.
山体滑坡是世界上最常见的自然灾害之一,对人类生命和财产构成严重威胁。Sentinel-2遥感影像作为一种广泛应用的遥感资源,已成为滑坡变化探测的重要工具。然而,由于其空间分辨率不高,往往会导致细微的地表信息丢失,从而限制了滑坡识别的精度。为了克服这一限制,本研究开发了一个动态多分支滑坡变化检测网络(DMLCDNet),该网络集成了三种类型的多模态数据:多光谱、变形和地形信息。该模型通过初步变化检测、动态特征提取、特征融合优化和空间语义恢复四个关键阶段实现了高效的滑坡提取。对九寨沟、泸定、龙岩三个具有代表性的滑坡事件进行了试验研究。实验结果表明,与所有基线模型相比,DMLCDNet在保持高效推理性能的同时,f1得分提高了3.61%,IoU提高了6.91%。此外,对另外两个滑坡实例的评价也证实了该模型的鲁棒泛化能力。本研究进一步强调了输入数据的最佳选择应遵循三个指导原则:信息丰富、高判别性和强相关性。综上所述,该方法为基于sentinel -2的滑坡变化检测中多源特征融合提供了一种有效的范式,具有重要的理论意义和实用价值。源代码和数据集可在https://github.com/Trifurs/DMLCDNet上公开获取。
{"title":"Enhancing Sentinel-2 landslide change detection by integrating multispectral, deformation, and topographic information","authors":"Bo Liu, Deren Li, Xiongwu Xiao, Zhenfeng Shao, Yingbing Li, Haobin Zhang, Yunong Chen, Zhenbei Zhang, Siyuan Wang, Boshen Chang","doi":"10.1016/j.jag.2026.105116","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105116","url":null,"abstract":"Landslides are among the most frequent natural hazards worldwide, posing severe risks to human lives and assets. Sentinel-2 imagery, a widely available remote sensing resource, has become an essential tool for detecting landslide changes. Nevertheless, its moderate spatial resolution often causes the loss of subtle surface information, which limits the precision of landslide recognition. To overcome this limitation, this study develops a Dynamic Multi-branch Landslide Change Detection Network (DMLCDNet) that integrates three types of multimodal data: multispectral, deformation, and topographic information. The proposed model achieves efficient landslide extraction through four key stages: preliminary change detection, dynamic feature extraction, feature fusion optimization, and spatial-semantic restoration. Experiments were conducted on three representative landslide events—Jiuzhaigou, Luding, and Longyan. The experimental findings indicate that DMLCDNet achieves improvements of 3.61% in F1-score and 6.91% in IoU over all baseline models, while sustaining efficient inference performance. In addition, evaluations on two other landslide cases confirm the model’s robust generalization ability. The study further emphasizes that the optimal selection of input data should follow three guiding principles: rich information, high discriminability, and strong relevance. Overall, the proposed method provides an effective paradigm for multisource feature fusion in Sentinel-2–based landslide change detection, offering substantial theoretical significance and practical value. The source code and dataset are publicly available at https://github.com/Trifurs/DMLCDNet.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"7 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic segmentation of single SAR imagery leveraging historical Sentinel-1 and Sentinel-2 data 利用Sentinel-1和Sentinel-2历史数据对单个SAR图像进行语义分割
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-21 DOI: 10.1016/j.jag.2026.105114
Xiaoyan Lu, Qihao Weng
Optical and Synthetic Aperture Radar (SAR) images are widely used in land cover mapping, environmental monitoring, and disaster management. While optical sensors are often hindered by adverse weather conditions such as clouds and rain, SAR provides all-weather capability but suffers from lower interpretability. To overcome this limitation, historical optical data can be leveraged to improve SAR interpretation and enhance mapping accuracy. This study proposed a degradable multimodal fusion (DEMFuse) framework that leverages historical optical data to enhance the interpretation of single SAR imagery and implemented it via semantic segmentation. The DEMFuse is composed of a SAR-to-optical image generator and a degradable fusion model. The former was built upon an ImageNet-pretrained transformer that employed a non-local generative modeling for translating SAR images into optical images to recover optical visual structural information. The latter was proposed to achieve progressively enhanced fusion performance by continuously assimilating useful information from synthesized optical data based on SAR data, thus reducing the impact of artifacts in the synthesized data, and ensuring the effective fusion of the optical and SAR data. To demonstrate the effectiveness of the proposed DEMFuse framework, a globally distributed SAR-to-optical image translation dataset and a land-cover semantic segmentation dataset were constructed by using 18,071 Sentinel-1 SAR and Sentinel-2 optical images, together referred to as “IT-SS-18 K” (image translation-segmentation-18 K). Additionally, a SAR-based flood rapid mapping dataset was employed to validate the effectiveness of the proposed framework in disaster scenarios. Experiments on IT-SS-18 K demonstrated that the proposed DEMFuse framework significantly reduced the relative accuracy gap by over 30 % between SAR and optical data for the segmentation of water, built areas, and trees. In the flood disaster scenario, the proposed framework yielded a 1.76 % improvement compared to the single SAR segmentation. This finding suggests that DEMFuse can be employed to enhance SAR interpretation in scenarios where optical data is missing, and improve the efficiency and accuracy of disaster management and environmental monitoring. The datasets will be made available at https://github.com/RCAIG/DEMFuse.
光学和合成孔径雷达(SAR)图像广泛应用于土地覆盖制图、环境监测和灾害管理等领域。虽然光学传感器经常受到恶劣天气条件(如云和雨)的阻碍,但SAR提供全天候能力,但可解释性较低。为了克服这一限制,可以利用历史光学数据来改进SAR解释并提高制图精度。本研究提出了一种可降解的多模态融合(DEMFuse)框架,该框架利用历史光学数据增强对单个SAR图像的解释,并通过语义分割实现。DEMFuse由SAR-to-optical图像发生器和可降解融合模型组成。前者建立在imagenet预训练的转换器上,该转换器采用非局部生成建模将SAR图像转换为光学图像,以恢复光学视觉结构信息。基于SAR数据不断吸收合成光学数据中的有用信息,从而减少合成数据中伪影的影响,保证光学数据与SAR数据的有效融合,从而逐步增强融合性能。为了验证所提出的DEMFuse框架的有效性,利用18071张Sentinel-1 SAR和Sentinel-2光学图像构建了全球分布式SAR到光学图像转换数据集和土地覆盖语义分割数据集,统称为“it - ss - 18k”(图像平移-分割- 18k)。此外,利用基于sar的洪水快速制图数据集验证了该框架在灾害场景中的有效性。在it - ss - 18k上的实验表明,所提出的DEMFuse框架显著降低了SAR和光学数据在分割水体、建筑区域和树木方面的相对精度差距,降幅超过30%。在洪水灾害场景中,与单一SAR分割相比,所提出的框架产生了1.76%的改进。这一发现表明,DEMFuse可以在光学数据缺失的情况下增强SAR解释,提高灾害管理和环境监测的效率和准确性。这些数据集将在https://github.com/RCAIG/DEMFuse上提供。
{"title":"Semantic segmentation of single SAR imagery leveraging historical Sentinel-1 and Sentinel-2 data","authors":"Xiaoyan Lu, Qihao Weng","doi":"10.1016/j.jag.2026.105114","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105114","url":null,"abstract":"Optical and Synthetic Aperture Radar (SAR) images are widely used in land cover mapping, environmental monitoring, and disaster management. While optical sensors are often hindered by adverse weather conditions such as clouds and rain, SAR provides all-weather capability but suffers from lower interpretability. To overcome this limitation, historical optical data can be leveraged to improve SAR interpretation and enhance mapping accuracy. This study proposed a degradable multimodal fusion (DEMFuse) framework that leverages historical optical data to enhance the interpretation of single SAR imagery and implemented it via semantic segmentation. The DEMFuse is composed of a SAR-to-optical image generator and a degradable fusion model. The former was built upon an ImageNet-pretrained transformer that employed a non-local generative modeling for translating SAR images into optical images to recover optical visual structural information. The latter was proposed to achieve progressively enhanced fusion performance by continuously assimilating useful information from synthesized optical data based on SAR data, thus reducing the impact of artifacts in the synthesized data, and ensuring the effective fusion of the optical and SAR data. To demonstrate the effectiveness of the proposed DEMFuse framework, a globally distributed SAR-to-optical image translation dataset and a land-cover semantic segmentation dataset were constructed by using 18,071 Sentinel-1 SAR and Sentinel-2 optical images, together referred to as “IT-SS-18 K” (image translation-segmentation-18 K). Additionally, a SAR-based flood rapid mapping dataset was employed to validate the effectiveness of the proposed framework in disaster scenarios. Experiments on IT-SS-18 K demonstrated that the proposed DEMFuse framework significantly reduced the relative accuracy gap by over 30 % between SAR and optical data for the segmentation of water, built areas, and trees. In the flood disaster scenario, the proposed framework yielded a 1.76 % improvement compared to the single SAR segmentation. This finding suggests that DEMFuse can be employed to enhance SAR interpretation in scenarios where optical data is missing, and improve the efficiency and accuracy of disaster management and environmental monitoring. The datasets will be made available at <ce:inter-ref xlink:href=\"https://github.com/RCAIG/DEMFuse\" xlink:type=\"simple\">https://github.com/RCAIG/DEMFuse</ce:inter-ref>.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"40 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of satellite remote sensing for pollution control and carbon reduction in China 中国污染控制与碳减排卫星遥感研究综述
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-20 DOI: 10.1016/j.jag.2026.105109
Shaohua Zhao, Yipeng Yang, Lin Ma, Zhijie Bai, Youcan Feng, Quanhai Liu, Qiao Wang, Yunhan Chen, Fei Wang, Jiahua Teng, Linbo Zhao, Yuhao Xie, Yazhen Dai, Le Yu, Yanqin Zhou, Yujiu Xiong
Mitigating pollutants and greenhouse gas (GHG) emissions is a core strategic imperative for China, necessitating integrated technological solutions. Satellite remote sensing (SRS) has emerged as a pivotal tool for environmental monitoring, enhancing our understanding of pollutants and GHG emissions from space and supports policy-making. Here we summarize the recent research advances regarding the role of SRS in monitoring pollutants and GHG emissions in China, focusing on three key applications: 1) pollutant emission tracking, 2) carbon emission quantification, and 3) synergistic monitoring of pollution-carbon interactions. Current analysis identifies substantial progress achieved over the past decades, but critical limitations remain. From a scientific perspective, both ground observation networks and professional sensors are lacking for improving retrieval accuracy. Moreover, artificial intelligence (AI)-based data-mining theory is inadequate. From a practical perspective, high-resolution satellite provides insufficient coverage for nationwide carbon monitoring and data deficiencies hinders intelligent monitoring systems. To address these challenges, we provide the following strategic priorities: 1) establishing integrated Space-Air-Ground observation systems, 2) accelerating the deployment of next-generation monitoring satellites and developing related retrieval algorithms, and 3) developing AI-based monitoring systems through multisource data integration. This review enhances the understanding of SRS applications and provides directions for future research aimed at mitigating pollutant and GHG emissions, thereby supporting China’s dual carbon goals and sustainable development worldwide.
减少污染物和温室气体(GHG)排放是中国的核心战略任务,需要综合技术解决方案。卫星遥感(SRS)已成为环境监测的关键工具,增强了我们对空间污染物和温室气体排放的了解,并支持决策。本文总结了SRS在中国污染物和温室气体排放监测中的研究进展,重点介绍了SRS在污染物排放跟踪、碳排放量化和污染-碳协同监测三个方面的应用。目前的分析表明,过去几十年取得了实质性进展,但仍然存在严重的局限性。从科学的角度来看,无论是地面观测网络还是专业的传感器,都缺乏提高检索精度的能力。此外,基于人工智能(AI)的数据挖掘理论是不充分的。从实际应用的角度看,高分辨率卫星对全国碳监测的覆盖范围不够,数据的不足阻碍了智能监测系统的发展。为了应对这些挑战,我们提出了以下战略重点:1)建立综合空间-空-地观测系统;2)加快下一代监测卫星的部署并开发相关检索算法;3)通过多源数据集成开发基于人工智能的监测系统。本文的综述增进了对SRS应用的理解,并为未来旨在减少污染物和温室气体排放的研究提供了方向,从而支持中国的双碳目标和全球的可持续发展。
{"title":"A review of satellite remote sensing for pollution control and carbon reduction in China","authors":"Shaohua Zhao, Yipeng Yang, Lin Ma, Zhijie Bai, Youcan Feng, Quanhai Liu, Qiao Wang, Yunhan Chen, Fei Wang, Jiahua Teng, Linbo Zhao, Yuhao Xie, Yazhen Dai, Le Yu, Yanqin Zhou, Yujiu Xiong","doi":"10.1016/j.jag.2026.105109","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105109","url":null,"abstract":"Mitigating pollutants and greenhouse gas (GHG) emissions is a core strategic imperative for China, necessitating integrated technological solutions. Satellite remote sensing (SRS) has emerged as a pivotal tool for environmental monitoring, enhancing our understanding of pollutants and GHG emissions from space and supports policy-making. Here we summarize the recent research advances regarding the role of SRS in monitoring pollutants and GHG emissions in China, focusing on three key applications: 1) pollutant emission tracking, 2) carbon emission quantification, and 3) synergistic monitoring of pollution-carbon interactions. Current analysis identifies substantial progress achieved over the past decades, but critical limitations remain. From a scientific perspective, both ground observation networks and professional sensors are lacking for improving retrieval accuracy. Moreover, artificial intelligence (AI)-based data-mining theory is inadequate. From a practical perspective, high-resolution satellite provides insufficient coverage for nationwide carbon monitoring and data deficiencies hinders intelligent monitoring systems. To address these challenges, we provide the following strategic priorities: 1) establishing integrated Space-Air-Ground observation systems, 2) accelerating the deployment of next-generation monitoring satellites and developing related retrieval algorithms, and 3) developing AI-based monitoring systems through multisource data integration. This review enhances the understanding of SRS applications and provides directions for future research aimed at mitigating pollutant and GHG emissions, thereby supporting China’s dual carbon goals and sustainable development worldwide.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"18 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new optimization framework for the geometric positioning accuracy of optical images 一种光学图像几何定位精度优化框架
IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2026-01-20 DOI: 10.1016/j.jag.2026.105111
Ming Li, Dazhao Fan, Yang Dong, Song Ji, Dongzi Li, Jiaqi Yang, Aosheng Wang, Baosheng Zhang
Obtaining accurate geometric positioning information for images is crucial for the application of remote sensing images in stereo mapping, map production, and target positioning. However, for optical remote sensing satellite images and archived historical images, the geometric positioning methods vary, and different control data formats are needed. This limits the timeliness of applying optical remote sensing images and has become a bottleneck for the unified and standardized processing of such images. In response, this paper presents a framework for optimizing the geometric positioning accuracy of optical remote sensing images. A lightweight vector control library (LVCL) is constructed to serve as the control data for the geometric positions of the optical images, and a rough positioning method based on a mutual feedback constraint-matching strategy and multiple associated images is proposed to obtain approximate positions of the target images quickly. Moreover, optimal iteration and segmented connection based vector matching methods are proposed to accurately match the target image to images in the LVCL, thereby providing the target image’s geographical information based on the control data. Experiments conducted across multiple regions show that this framework can effectively match remote sensing images with the geographic coordinates in the LVCL: the data storage volume is only 1/10 to 1/3893 of that of traditional and existing lightweight methods, demonstrating prominent lightweight advantages; among the images from 13 cities, the correct rough positioning matching results for 12 cities rank among the top 10, with 7 achieving the first place; the number of vector construction nodes in 62 regions is better than that of the comparison strategy; the vector matching results are correct and the number of matching points is the largest; for the 13 regions, the average positioning deviation decreases from 156.590 m to 4.098 m, and the maximum deviation decreases from 582.142 m to 10.588 m, with systematic errors significantly corrected. Thus, the proposed framework is expected to become a new paradigm for obtaining the geometric positions of optical remote sensing images.
获取准确的影像几何定位信息是遥感影像在立体制图、地图制作、目标定位等领域应用的关键。然而,对于光学遥感卫星图像和存档历史图像,几何定位方法不同,需要不同的控制数据格式。这限制了光学遥感影像应用的时效性,成为光学遥感影像统一、规范处理的瓶颈。为此,本文提出了一种优化光学遥感影像几何定位精度的框架。构建轻量级矢量控制库(LVCL)作为光学图像几何位置的控制数据,提出一种基于互反馈约束匹配策略和多幅关联图像的粗糙定位方法,快速获得目标图像的近似位置。提出了基于最优迭代和分段连接的矢量匹配方法,将目标图像与LVCL中的图像进行精确匹配,从而在控制数据的基础上提供目标图像的地理信息。跨多个区域的实验表明,该框架能够有效地将遥感图像与LVCL中的地理坐标进行匹配,数据存储量仅为传统和现有轻量化方法的1/10 ~ 1/3893,轻量化优势突出;13个城市的图像中,12个城市的正确粗略定位匹配结果进入前10名,其中7个城市获得第一名;62个区域的向量构建节点数优于对比策略;向量匹配结果正确,匹配点数量最大;13个区域的平均定位偏差从156.590 m减小到4.098 m,最大偏差从582.142 m减小到10.588 m,系统误差得到了显著的修正。因此,该框架有望成为光学遥感图像几何位置获取的新范式。
{"title":"A new optimization framework for the geometric positioning accuracy of optical images","authors":"Ming Li, Dazhao Fan, Yang Dong, Song Ji, Dongzi Li, Jiaqi Yang, Aosheng Wang, Baosheng Zhang","doi":"10.1016/j.jag.2026.105111","DOIUrl":"https://doi.org/10.1016/j.jag.2026.105111","url":null,"abstract":"Obtaining accurate geometric positioning information for images is crucial for the application of remote sensing images in stereo mapping, map production, and target positioning. However, for optical remote sensing satellite images and archived historical images, the geometric positioning methods vary, and different control data formats are needed. This limits the timeliness of applying optical remote sensing images and has become a bottleneck for the unified and standardized processing of such images. In response, this paper presents a framework for optimizing the geometric positioning accuracy of optical remote sensing images. A lightweight vector control library (LVCL) is constructed to serve as the control data for the geometric positions of the optical images, and a rough positioning method based on a mutual feedback constraint-matching strategy and multiple associated images is proposed to obtain approximate positions of the target images quickly. Moreover, optimal iteration and segmented connection based vector matching methods are proposed to accurately match the target image to images in the LVCL, thereby providing the target image’s geographical information based on the control data. Experiments conducted across multiple regions show that this framework can effectively match remote sensing images with the geographic coordinates in the LVCL: the data storage volume is only 1/10 to 1/3893 of that of traditional and existing lightweight methods, demonstrating prominent lightweight advantages; among the images from 13 cities, the correct rough positioning matching results for 12 cities rank among the top 10, with 7 achieving the first place; the number of vector construction nodes in 62 regions is better than that of the comparison strategy; the vector matching results are correct and the number of matching points is the largest; for the 13 regions, the average positioning deviation decreases from 156.590 m to 4.098 m, and the maximum deviation decreases from 582.142 m to 10.588 m, with systematic errors significantly corrected. Thus, the proposed framework is expected to become a new paradigm for obtaining the geometric positions of optical remote sensing images.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"100 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Applied Earth Observation and Geoinformation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1