首页 > 最新文献

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing最新文献

英文 中文
MCDF-Net: Dynamic Adaptive Network Based on Modal Competition and Dual Encoder Feature Fusion for Remote Sensing Image Target Detection MCDF-Net:基于模态竞争和双编码器特征融合的遥感图像目标检测动态自适应网络
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-29 DOI: 10.1109/JSTARS.2026.3659193
Yuanjie Zhi;Yushuo Qi;Zhi Yang;Wenkui Hao;Mingyang Ma;Shaohui Mei
Remote sensing image object detection, leveraging the complementary characteristics of infrared and RGB imaging, represents an effective approach for achieving all-weather detection. However, in complex environments, the quality of information provided by different modalities can undergo dynamic variations, necessitating dynamic adjustment of the weights assigned to each modality. Therefore, a dynamic adaptive network based on modal competition and dual-encoder feature fusion is proposed to implement precise modeling of dynamic complementary relationships and adaptive extraction of discriminative features through hierarchical feature dynamic interaction and an adaptive salient modal competition mechanism. Specifically, a hierarchical feature attention fusion module (HFAM) is designed under dual parallel feature encoding branches to enable the fusion of global context and local details, in which the cross-channel attention module is adopted to enhance channel responses through reconstruction via channel feature correlation matrices, and the difference fusion attention module (DFAM) concurrently calibrates spatial biases through pixel-level difference modeling. Moreover, an information entropy-guided adaptive modal competition mechanism is proposed to filter high-confidence queries by quantifying feature point uncertainty, thereby providing useful prior information for the decoder and adaptively determining the salient modality for targets to balance modal contributions. Experimental results over two benchmark datasets, i.e., DroneVehicle and VEDAI datasets, demonstrate that the proposed method clearly outperform state-of-the-art algorithms by effectively handling highly dynamic feature variations.
遥感图像目标检测利用红外和RGB成像的互补特性,是实现全天候探测的有效途径。然而,在复杂环境中,不同模态提供的信息质量会发生动态变化,因此需要动态调整分配给每种模态的权重。为此,提出了一种基于模态竞争和双编码器特征融合的动态自适应网络,通过层次化特征动态交互和自适应显著模态竞争机制实现动态互补关系的精确建模和判别特征的自适应提取。具体而言,在双并行特征编码分支下设计了分层特征注意融合模块(HFAM),实现了全局上下文和局部细节的融合,其中跨通道注意模块通过通道特征相关矩阵重构增强通道响应,差分融合注意模块通过像素级差分建模并发校准空间偏差。此外,提出了一种信息熵导向的自适应模态竞争机制,通过量化特征点不确定性来过滤高置信度查询,从而为解码器提供有用的先验信息,并自适应确定目标的显著模态以平衡模态贡献。在两个基准数据集(即DroneVehicle和VEDAI数据集)上的实验结果表明,该方法通过有效地处理高度动态的特征变化,明显优于最先进的算法。
{"title":"MCDF-Net: Dynamic Adaptive Network Based on Modal Competition and Dual Encoder Feature Fusion for Remote Sensing Image Target Detection","authors":"Yuanjie Zhi;Yushuo Qi;Zhi Yang;Wenkui Hao;Mingyang Ma;Shaohui Mei","doi":"10.1109/JSTARS.2026.3659193","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3659193","url":null,"abstract":"Remote sensing image object detection, leveraging the complementary characteristics of infrared and RGB imaging, represents an effective approach for achieving all-weather detection. However, in complex environments, the quality of information provided by different modalities can undergo dynamic variations, necessitating dynamic adjustment of the weights assigned to each modality. Therefore, a dynamic adaptive network based on modal competition and dual-encoder feature fusion is proposed to implement precise modeling of dynamic complementary relationships and adaptive extraction of discriminative features through hierarchical feature dynamic interaction and an adaptive salient modal competition mechanism. Specifically, a hierarchical feature attention fusion module (HFAM) is designed under dual parallel feature encoding branches to enable the fusion of global context and local details, in which the cross-channel attention module is adopted to enhance channel responses through reconstruction via channel feature correlation matrices, and the difference fusion attention module (DFAM) concurrently calibrates spatial biases through pixel-level difference modeling. Moreover, an information entropy-guided adaptive modal competition mechanism is proposed to filter high-confidence queries by quantifying feature point uncertainty, thereby providing useful prior information for the decoder and adaptively determining the salient modality for targets to balance modal contributions. Experimental results over two benchmark datasets, i.e., DroneVehicle and VEDAI datasets, demonstrate that the proposed method clearly outperform state-of-the-art algorithms by effectively handling highly dynamic feature variations.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6581-6596"},"PeriodicalIF":5.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367775","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assimilation of SWOT Altimetry Data for Riverine Flood Reanalysis: From Synthetic to Real Data 河流洪水再分析中SWOT高程数据的同化:从综合数据到真实数据
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-29 DOI: 10.1109/JSTARS.2026.3659808
Quentin Bonassies;Thanh Huy Nguyen;Ludovic Cassan;Andrea Piacentini;Sophie Ricci;Charlotte Emery;Christophe Fatras;Santiago Peña Luque;Raquel Rodriguez Suquet
Floods are one of the most common and devastating natural disasters worldwide. The contribution of remote sensing is important for reducing the impact of flooding both during the event itself and for improving hydrodynamic models by reducing their associated uncertainties. This article presents the innovative capabilities of the Surface Water and Ocean Topography (SWOT) mission, especially its river node products, to enhance the accuracy of riverine flood reanalysis, performed on a 50-km stretch of the Garonne River. The challenge addressed here is quantifying how SWOT river observations, alone and in combination with in-situ gauges, can improve hydraulic parameter estimation and river water level prediction in flood reanalysis. The experiments incorporate various data assimilation strategies, based on the ensemble Kalman filter, which allows for sequential updates of model parameters based on available observations. The experimental results show that while SWOT data alone offers some improvements, combining it with in-situ water level measurements provides the most accurate representation of flood dynamics, both at gauge stations and along the river. The study also investigates the impact of different SWOT revisit frequencies on the model’s performance, revealing that assimilating more frequent SWOT observations leads to more reliable flood reanalyses. In the real event, it was demonstrated that the assimilation of SWOT and in-situ data accurately reproduces the water level dynamics, offering promising prospects for future flood monitoring systems. Results show that in the OSSE framework, assimilation reduced water level errors by an order of magnitude, while in the real 2024 event the errors were reduced to below 17 cm, demonstrating the reliability of the approach. This study underscores the complementary role of Earth Observation data in enhancing flood dynamics representation in the riverbed and the floodplains.
洪水是世界范围内最常见和最具破坏性的自然灾害之一。遥感的贡献对于减少洪水发生期间的影响以及通过减少与之相关的不确定性来改进水动力模式都是重要的。本文介绍了地表水和海洋地形(SWOT)任务的创新能力,特别是其河流节点产品,以提高在加龙河50公里长的河段上进行的河流洪水再分析的准确性。这里要解决的挑战是如何量化SWOT河流观测,单独和结合现场测量,可以改善洪水再分析中的水力参数估计和河流水位预测。实验结合了基于集合卡尔曼滤波的各种数据同化策略,该策略允许基于可用观测值的模型参数的顺序更新。实验结果表明,虽然SWOT数据单独提供了一些改进,但将其与原位水位测量相结合,可以最准确地表示水位站和沿河的洪水动态。研究还考察了不同SWOT重访频率对模型性能的影响,揭示了吸收更频繁的SWOT观察结果会导致更可靠的洪水再分析。在实际事件中,SWOT和现场数据的同化能够准确再现水位动态,为未来的洪水监测系统提供了良好的前景。结果表明,在OSSE框架下,同化将水位误差降低了一个数量级,而在2024实际事件中,误差降至17 cm以下,证明了该方法的可靠性。本研究强调了地球观测数据在增强河床和洪泛平原洪水动态表征方面的补充作用。
{"title":"Assimilation of SWOT Altimetry Data for Riverine Flood Reanalysis: From Synthetic to Real Data","authors":"Quentin Bonassies;Thanh Huy Nguyen;Ludovic Cassan;Andrea Piacentini;Sophie Ricci;Charlotte Emery;Christophe Fatras;Santiago Peña Luque;Raquel Rodriguez Suquet","doi":"10.1109/JSTARS.2026.3659808","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3659808","url":null,"abstract":"Floods are one of the most common and devastating natural disasters worldwide. The contribution of remote sensing is important for reducing the impact of flooding both during the event itself and for improving hydrodynamic models by reducing their associated uncertainties. This article presents the innovative capabilities of the Surface Water and Ocean Topography (SWOT) mission, especially its river node products, to enhance the accuracy of riverine flood reanalysis, performed on a 50-km stretch of the Garonne River. The challenge addressed here is quantifying how SWOT river observations, alone and in combination with in-situ gauges, can improve hydraulic parameter estimation and river water level prediction in flood reanalysis. The experiments incorporate various data assimilation strategies, based on the ensemble Kalman filter, which allows for sequential updates of model parameters based on available observations. The experimental results show that while SWOT data alone offers some improvements, combining it with in-situ water level measurements provides the most accurate representation of flood dynamics, both at gauge stations and along the river. The study also investigates the impact of different SWOT revisit frequencies on the model’s performance, revealing that assimilating more frequent SWOT observations leads to more reliable flood reanalyses. In the real event, it was demonstrated that the assimilation of SWOT and in-situ data accurately reproduces the water level dynamics, offering promising prospects for future flood monitoring systems. Results show that in the OSSE framework, assimilation reduced water level errors by an order of magnitude, while in the real 2024 event the errors were reduced to below 17 cm, demonstrating the reliability of the approach. This study underscores the complementary role of Earth Observation data in enhancing flood dynamics representation in the riverbed and the floodplains.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"7024-7041"},"PeriodicalIF":5.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11368722","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Real-Time MPSoC-Based Back Projection Accelerator for High-Accuracy Large-Size SAR Imaging Using Truncated Sinc Reconstruction and Mixed Precision Design 基于截断正弦重建和混合精度设计的实时mpsoc高精度大尺寸SAR反投影加速器
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-29 DOI: 10.1109/JSTARS.2026.3658956
Xinyu Hu;Yinshen Wang;Jiabao Guo;Yao Cheng;Qiancheng Yan;Jiangyu Yao;Xiaolan Qiu
The back projection (BP) algorithm has become an important method for achieving high-resolution synthetic aperture radar imaging due to its model-free assumptions, high imaging accuracy, and strong trajectory adaptability. However, its high computational complexity severely limits real-time performance and system scalability. To address the challenge of large-size, high-accuracy, real-time imaging on resource-constrained system-on-chip, this article proposes an efficient BP acceleration architecture based on truncated sinc interpolation reconstruction that effectively eliminates the limitation of off-chip memory bandwidth on system performance, and significantly reduces on-chip memory and logic resource consumption. A mixed precision strategy is proposed, reducing the lookup table consumption by 46.84% compared to the traditional floating-point implementation, while maintaining nearly the same imaging accuracy. The proposed system achieves real-time SAR imaging on both uncrewed and manned aerial vehicles: on a low-speed uncrewed vehicle, it completes a $text{4096} times text{3840}$ image in 2.5968 s, and on a high-speed manned vehicle, it completes an $text{8192} times text{4096}$ image in 15.3557 s, which meets real-time processing requirements. The peak signal-to-noise ratio of the imaging result improves nearly seven times compared to most existing FPGA implementations with lower resource consumption, while achieving faster processing speeds. Experimental results demonstrate that the proposed scheme, by significantly improving resource utilization efficiency and imaging speed, achieves real-time processing capabilities for high-accuracy and large-size SAR imaging tasks, thereby exhibiting excellent practicality and scalability.
BP算法具有无模型假设、成像精度高、弹道适应性强等优点,已成为实现高分辨率合成孔径雷达成像的重要方法。然而,它的高计算复杂度严重限制了系统的实时性和可扩展性。为了解决在资源受限的片上系统上实现大尺寸、高精度、实时成像的挑战,本文提出了一种基于截断自插值重构的高效BP加速架构,有效消除了片外存储器带宽对系统性能的限制,显著降低了片上存储器和逻辑资源的消耗。提出了一种混合精度策略,与传统浮点实现相比,查找表消耗减少46.84%,同时保持几乎相同的成像精度。该系统实现了无人飞行器和有人飞行器的实时SAR成像,在低速无人飞行器上,在2.5968秒内完成一张$text{4096} times text{3840}$图像,在高速有人飞行器上,在15.3557秒内完成一张$text{8192} times text{4096}$图像,满足实时性处理要求。与大多数现有FPGA实现相比,成像结果的峰值信噪比提高了近7倍,资源消耗更低,同时实现了更快的处理速度。实验结果表明,该方案显著提高了资源利用效率和成像速度,实现了高精度、大尺寸SAR成像任务的实时处理能力,具有良好的实用性和可扩展性。
{"title":"A Real-Time MPSoC-Based Back Projection Accelerator for High-Accuracy Large-Size SAR Imaging Using Truncated Sinc Reconstruction and Mixed Precision Design","authors":"Xinyu Hu;Yinshen Wang;Jiabao Guo;Yao Cheng;Qiancheng Yan;Jiangyu Yao;Xiaolan Qiu","doi":"10.1109/JSTARS.2026.3658956","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3658956","url":null,"abstract":"The back projection (BP) algorithm has become an important method for achieving high-resolution synthetic aperture radar imaging due to its model-free assumptions, high imaging accuracy, and strong trajectory adaptability. However, its high computational complexity severely limits real-time performance and system scalability. To address the challenge of large-size, high-accuracy, real-time imaging on resource-constrained system-on-chip, this article proposes an efficient BP acceleration architecture based on truncated sinc interpolation reconstruction that effectively eliminates the limitation of off-chip memory bandwidth on system performance, and significantly reduces on-chip memory and logic resource consumption. A mixed precision strategy is proposed, reducing the lookup table consumption by 46.84% compared to the traditional floating-point implementation, while maintaining nearly the same imaging accuracy. The proposed system achieves real-time SAR imaging on both uncrewed and manned aerial vehicles: on a low-speed uncrewed vehicle, it completes a <inline-formula><tex-math>$text{4096} times text{3840}$</tex-math></inline-formula> image in 2.5968 s, and on a high-speed manned vehicle, it completes an <inline-formula><tex-math>$text{8192} times text{4096}$</tex-math></inline-formula> image in 15.3557 s, which meets real-time processing requirements. The peak signal-to-noise ratio of the imaging result improves nearly seven times compared to most existing FPGA implementations with lower resource consumption, while achieving faster processing speeds. Experimental results demonstrate that the proposed scheme, by significantly improving resource utilization efficiency and imaging speed, achieves real-time processing capabilities for high-accuracy and large-size SAR imaging tasks, thereby exhibiting excellent practicality and scalability.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6597-6613"},"PeriodicalIF":5.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367768","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReSaP: Reasoning-Enhanced and Scale-Aware Prompting for Referring Remote Sensing Image Segmentation 基于推理增强和尺度感知的参考遥感图像分割提示
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-29 DOI: 10.1109/JSTARS.2026.3659080
Ning Lv;Jisheng Dang;Teng Wang;Bimei Wang;Yichu Liu;Hong Peng;Haowen Yan;Bin Hu
Recent research has actively explored diverse mechanisms to unlock pixel-level segmentation capabilities in multimodal large language models (MLLMs), aiming to bridge the gap between high-level semantic reasoning and fine-grained visual perception. However, directly transferring these general-domain frameworks to referring remote sensing image segmentation (RRSIS) faces significant hurdles. These challenges primarily stem from the weak pixel-level discrimination capability of MLLMs in complex geospatial scenes and the severe granularity mismatch caused by drastic scale variations in remote sensing targets. To overcome these limitations, this article proposes ReSaP, a reasoning-enhanced and scale-aware prompting framework. ReSaP incorporates two core components to effectively adapt MLLMs for pixel-wise tasks. First, we introduce a pixel-aware group relative policy optimization (GRPO) training scheme. By utilizing a reinforcement learning framework with a hybrid reward mechanism that integrates bipartite matching for localization and classification accuracy for verification, this scheme explicitly enhances the MLLM’s fine-grained pixel discrimination and localization precision. Second, we propose the scale-aware prompting strategy for inference. This mechanism employs a density-adaptive grid sampling approach to dynamically adjust the prompt configuration based on target dimensions, effectively harmonizing prompt granularity with object scale. Extensive experiments on the RRSIS-D and RIS-LAD benchmarks demonstrate that ReSaP significantly outperforms existing state-of-the-art methods, validating its superior performance and robustness across both satellite and unmanned aerial vehicle observation perspectives.
最近的研究积极探索多种机制来解锁多模态大语言模型(mllm)中的像素级分割能力,旨在弥合高级语义推理和细粒度视觉感知之间的差距。然而,将这些通用领域框架直接转化为参考遥感图像分割(RRSIS)面临着很大的障碍。这些挑战主要源于mllm在复杂地理空间场景下的像元级识别能力较弱,以及遥感目标尺度变化大导致粒度失配严重。为了克服这些限制,本文提出了ReSaP,这是一个推理增强和规模感知的提示框架。ReSaP包含两个核心组件,可以有效地使mlm适应像素级任务。首先,我们引入了一种像素感知的群体相对策略优化(GRPO)训练方案。该方案利用混合奖励机制的强化学习框架,结合二部匹配定位和分类精度验证,明显提高了mlm的细粒度像素识别和定位精度。其次,提出了推理的尺度感知提示策略。该机制采用密度自适应网格采样方法,根据目标尺寸动态调整提示配置,有效协调提示粒度与对象尺度。在rssis - d和RIS-LAD基准测试上进行的大量实验表明,ReSaP显著优于现有的最先进的方法,验证了其在卫星和无人机观测视角下的优越性能和鲁棒性。
{"title":"ReSaP: Reasoning-Enhanced and Scale-Aware Prompting for Referring Remote Sensing Image Segmentation","authors":"Ning Lv;Jisheng Dang;Teng Wang;Bimei Wang;Yichu Liu;Hong Peng;Haowen Yan;Bin Hu","doi":"10.1109/JSTARS.2026.3659080","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3659080","url":null,"abstract":"Recent research has actively explored diverse mechanisms to unlock pixel-level segmentation capabilities in multimodal large language models (MLLMs), aiming to bridge the gap between high-level semantic reasoning and fine-grained visual perception. However, directly transferring these general-domain frameworks to referring remote sensing image segmentation (RRSIS) faces significant hurdles. These challenges primarily stem from the weak pixel-level discrimination capability of MLLMs in complex geospatial scenes and the severe granularity mismatch caused by drastic scale variations in remote sensing targets. To overcome these limitations, this article proposes ReSaP, a reasoning-enhanced and scale-aware prompting framework. ReSaP incorporates two core components to effectively adapt MLLMs for pixel-wise tasks. First, we introduce a pixel-aware group relative policy optimization (GRPO) training scheme. By utilizing a reinforcement learning framework with a hybrid reward mechanism that integrates bipartite matching for localization and classification accuracy for verification, this scheme explicitly enhances the MLLM’s fine-grained pixel discrimination and localization precision. Second, we propose the scale-aware prompting strategy for inference. This mechanism employs a density-adaptive grid sampling approach to dynamically adjust the prompt configuration based on target dimensions, effectively harmonizing prompt granularity with object scale. Extensive experiments on the RRSIS-D and RIS-LAD benchmarks demonstrate that ReSaP significantly outperforms existing state-of-the-art methods, validating its superior performance and robustness across both satellite and unmanned aerial vehicle observation perspectives.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"8372-8383"},"PeriodicalIF":5.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367764","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Top-Down Coarse-to-Fine Cascade Network for High-Precision Cluster Infrared Small Target Detection 高精度聚类红外小目标检测的自顶向下粗精级联网络
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-29 DOI: 10.1109/JSTARS.2026.3659652
Tuntun Wang;Jincheng Zhou;Lang Wu;Shuai Yuan;Yuxin Jing
Infrared small-target detection (IRSTD) holds a critical role in low-visibility and long-distance imaging scenarios, such as UAV tracking and maritime surveillance. However, cluster-IRSTD (CIRSTD) faces more prominent challenges: adjacent targets are prone to feature coupling, dim targets are easily submerged by background clutter, and cluster shapes vary dynamically. Owing to the constraint of independent single-target modeling, current deep-learning methods struggle to effectively handle dense cluster scenarios. Inspired by the human top-down visual attention mechanism, this paper proposes a coarse-to-fine cascaded detection network. First, an adaptive regional attention mechanism is tailored specifically for clusters, and a coarse cluster extraction module is further designed to extract the overall features of clusters. Subsequently, the Inner Fine Distinction module seamlessly integrates the Gaussian and Scharr filters from model-driven approaches into the deep-learning framework, aiming to amplify the saliency of dim targets. It effectively solves the problems of dim target missed detection and adjacent target coupling in clusters. By synergistically integrating holistic cluster information and enhancing target saliency, the proposed Coarse-to-Fine Cascade IRSTD (C2IRSTD) significantly mitigates missed detections within clusters and reduces false alarms outside clusters. The experiments conducted on the DenseSIRST dataset have strongly demonstrated the superior performance of C2IRSTD in highly challenging dense-cluster scenarios. Meanwhile, its leading performance on the SIRST3 dataset in sparse scenarios fully highlights its excellent generalization ability.
红外小目标探测(IRSTD)在低能见度和远距离成像场景中发挥着关键作用,例如无人机跟踪和海上监视。然而,簇- irstd (CIRSTD)算法面临着邻近目标容易发生特征耦合、弱小目标容易被背景杂波淹没以及簇形状动态变化等突出的挑战。由于独立的单目标建模的约束,目前的深度学习方法难以有效地处理密集的集群场景。受人类自上而下的视觉注意机制的启发,本文提出了一种由粗到细的级联检测网络。首先,针对聚类定制自适应区域注意机制,并进一步设计粗聚类提取模块,提取聚类的整体特征;随后,内部精细区分模块将模型驱动方法中的高斯滤波器和沙尔滤波器无缝集成到深度学习框架中,旨在放大模糊目标的显著性。它有效地解决了集群中弱小目标漏检和相邻目标耦合问题。通过协同整合整体集群信息和增强目标显著性,本文提出的粗-细级联IRSTD (C2IRSTD)显著减轻了集群内的漏检,减少了集群外的误报。在DenseSIRST数据集上进行的实验有力地证明了C2IRSTD在高挑战性的密集集群场景下的优越性能。同时,在稀疏场景下,其在SIRST3数据集上的领先性能充分体现了其出色的泛化能力。
{"title":"Top-Down Coarse-to-Fine Cascade Network for High-Precision Cluster Infrared Small Target Detection","authors":"Tuntun Wang;Jincheng Zhou;Lang Wu;Shuai Yuan;Yuxin Jing","doi":"10.1109/JSTARS.2026.3659652","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3659652","url":null,"abstract":"Infrared small-target detection (IRSTD) holds a critical role in low-visibility and long-distance imaging scenarios, such as UAV tracking and maritime surveillance. However, cluster-IRSTD (CIRSTD) faces more prominent challenges: adjacent targets are prone to feature coupling, dim targets are easily submerged by background clutter, and cluster shapes vary dynamically. Owing to the constraint of independent single-target modeling, current deep-learning methods struggle to effectively handle dense cluster scenarios. Inspired by the human top-down visual attention mechanism, this paper proposes a coarse-to-fine cascaded detection network. First, an adaptive regional attention mechanism is tailored specifically for clusters, and a coarse cluster extraction module is further designed to extract the overall features of clusters. Subsequently, the Inner Fine Distinction module seamlessly integrates the Gaussian and Scharr filters from model-driven approaches into the deep-learning framework, aiming to amplify the saliency of dim targets. It effectively solves the problems of dim target missed detection and adjacent target coupling in clusters. By synergistically integrating holistic cluster information and enhancing target saliency, the proposed Coarse-to-Fine Cascade IRSTD (C2IRSTD) significantly mitigates missed detections within clusters and reduces false alarms outside clusters. The experiments conducted on the DenseSIRST dataset have strongly demonstrated the superior performance of C2IRSTD in highly challenging dense-cluster scenarios. Meanwhile, its leading performance on the SIRST3 dataset in sparse scenarios fully highlights its excellent generalization ability.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6566-6580"},"PeriodicalIF":5.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11368750","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Field Parcel Scale Algorithm for Mapping Potato Distribution Using Multitemporal Sentinel-2 Images 基于Sentinel-2多时相影像的马铃薯分布图地包比例尺算法
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/JSTARS.2026.3654208
Hasituya;Feng Quan;Chen Zhongxin;Battsetseg Tuvdendorj;Altantuya Dorjsuren;Yan Zhiyuan
Potato is an important staple crop both in China and worldwide, playing a critical role in ensuring global food security. Accurate mapping of the potato distribution is essential for detecting planting areas, estimating crop yields, and optimizing planting structures, thereby supporting sustainable agricultural development. However, remote sensing techniques for mapping potato distribution are still in their infancy, as most attention has been focused on the three major crops—maize, wheat, and rice. Consequently, this article proposed a cropland field-parcel-scale methodology for mapping potato distribution in Siziwang Banner, Inner Mongolia Autonomous Region, China. This methodology integrates edge detection, image segmentation, and machine-learning algorithm, leveraging multitemporal Sentinel-2 imagery to achieve accurately and effectively map the potato distribution. The results of detected edge from the four 10-m resolution Sentinel-2 bands (blue, green, red, and near infrared band) revealed that Canny edge detection can provide more sufficient information for edge extraction than Sobel edge detection. The extracted edges of the Canny edge detection algorithm are more closed and complete than the others, which is extremely important for accurate image segmentation. A comprehensive and robust edge map was generated by applying a weighted aggregation method to the edges initially extracted from each of the four spectral bands. Subsequently, the watershed segmentation algorithm was applied to these aggregated edges to delineate field parcels and index thresholds used to differentiate the cultivated field parcels and noncultivated field parcels. The methodology achieved an overall accuracy of 85% and an intersection-over-union ratio of 0.82. Finally, a random forest classifier was employed to map potato distribution by integrating spectral and index features at the field-parcel scale, achieving an overall mapping accuracy of 80% . The producer’s accuracy and user’s accuracy for potato mapping were 93.3% and 81.6%, respectively. As such, this article delivers a significant methodological support for mapping the fourth staple crop. The framework established here serves as a critical baseline for advancing crop type mapping, precise parcel extraction, and yield estimation, thereby supporting more strategic decision-making in potato cultivation and harvest.
马铃薯是中国乃至世界重要的主粮作物,在保障全球粮食安全方面发挥着至关重要的作用。准确的马铃薯分布图对于确定种植面积、估算作物产量、优化种植结构,从而支持农业可持续发展至关重要。然而,测绘马铃薯分布的遥感技术仍处于初级阶段,因为大多数注意力集中在三种主要作物——玉米、小麦和水稻上。为此,本文提出了一种农田-地块-比例尺的方法来绘制内蒙古四子王旗马铃薯分布图。该方法集成了边缘检测、图像分割和机器学习算法,利用多时相Sentinel-2图像实现准确有效的马铃薯分布图。对Sentinel-2 10 m分辨率4个波段(蓝、绿、红和近红外波段)的边缘检测结果表明,Canny边缘检测比Sobel边缘检测能够提供更充分的边缘提取信息。Canny边缘检测算法提取的边缘比其他算法提取的边缘更加封闭和完整,这对准确分割图像至关重要。利用加权聚集方法对4个光谱波段中提取的边缘进行加权聚集,生成全面、鲁棒的边缘图。然后,对这些聚集的边缘应用分水岭分割算法来划分田块,并使用指数阈值来区分耕地和非耕地地块。该方法获得了85%的总体精度和0.82的相交-超合并率。最后,利用随机森林分类器在地包尺度上整合光谱特征和指数特征来绘制马铃薯分布图,总体制图精度达到80%。马铃薯制图生产者和使用者的准确率分别为93.3%和81.6%。因此,本文为绘制第四种主要作物的地图提供了重要的方法支持。本文建立的框架可作为推进作物类型制图、精确包裹提取和产量估算的关键基线,从而支持马铃薯种植和收获方面的更多战略决策。
{"title":"A Field Parcel Scale Algorithm for Mapping Potato Distribution Using Multitemporal Sentinel-2 Images","authors":"Hasituya;Feng Quan;Chen Zhongxin;Battsetseg Tuvdendorj;Altantuya Dorjsuren;Yan Zhiyuan","doi":"10.1109/JSTARS.2026.3654208","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3654208","url":null,"abstract":"Potato is an important staple crop both in China and worldwide, playing a critical role in ensuring global food security. Accurate mapping of the potato distribution is essential for detecting planting areas, estimating crop yields, and optimizing planting structures, thereby supporting sustainable agricultural development. However, remote sensing techniques for mapping potato distribution are still in their infancy, as most attention has been focused on the three major crops—maize, wheat, and rice. Consequently, this article proposed a cropland field-parcel-scale methodology for mapping potato distribution in Siziwang Banner, Inner Mongolia Autonomous Region, China. This methodology integrates edge detection, image segmentation, and machine-learning algorithm, leveraging multitemporal Sentinel-2 imagery to achieve accurately and effectively map the potato distribution. The results of detected edge from the four 10-m resolution Sentinel-2 bands (blue, green, red, and near infrared band) revealed that Canny edge detection can provide more sufficient information for edge extraction than Sobel edge detection. The extracted edges of the Canny edge detection algorithm are more closed and complete than the others, which is extremely important for accurate image segmentation. A comprehensive and robust edge map was generated by applying a weighted aggregation method to the edges initially extracted from each of the four spectral bands. Subsequently, the watershed segmentation algorithm was applied to these aggregated edges to delineate field parcels and index thresholds used to differentiate the cultivated field parcels and noncultivated field parcels. The methodology achieved an overall accuracy of 85% and an intersection-over-union ratio of 0.82. Finally, a random forest classifier was employed to map potato distribution by integrating spectral and index features at the field-parcel scale, achieving an overall mapping accuracy of 80% . The producer’s accuracy and user’s accuracy for potato mapping were 93.3% and 81.6%, respectively. As such, this article delivers a significant methodological support for mapping the fourth staple crop. The framework established here serves as a critical baseline for advancing crop type mapping, precise parcel extraction, and yield estimation, thereby supporting more strategic decision-making in potato cultivation and harvest.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6534-6545"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11364529","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MD2F-Mamba: Multidirectional Depthwise Convolution and Dual-Branch Mamba Feature Fusion Networks for Hyperspectral Image Classification md2f -曼巴:用于高光谱图像分类的多向深度卷积和双分支曼巴特征融合网络
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/JSTARS.2026.3657648
Xiaoqing Wan;Dongtao Mo;Yupeng He;Feng Chen;Zhize Li
Hyperspectral image (HSI) classification necessitates adept modeling of both intricate local variations and long-range spectral–spatial dependencies, while maintaining computational efficiency. Conventional methods frequently prioritize on either local or global features, neglecting directional information, or employ simplistic fusion techniques, which leads to inadequate feature representations and reduced discriminative ability. To address these challenges, this article presents MD2F-Mamba, a novel dual-branch architecture that integrates a multidirectional depthwise convolution module to capture spatial features from multiple orientations—namely, square, horizontal, and vertical convolutions, enriching local representations. The architecture comprises a local branch, featuring a multiscale local feature enhancement module with positional encoding, which effectively captures diverse spatial–spectral patterns. Simultaneously, the global branch utilizes a hierarchical state-space Mamba for global feature extraction that models multiscale long-range dependencies with linear complexity. A cosine similarity feature fusion module adaptively merges local and global features, optimizing discriminability by reducing redundancy. Experimental results on the Pavia University, Houston2013, WHU-Hi-LongKou, and WHU-Hi-HanChuan datasets demonstrate that MD2F-Mamba achieves competitive classification results while maintaining a minimal parameter count compared with several state-of-the-art methods, underscoring its innovative efficiency and robustness in HSI classification.
高光谱图像(HSI)分类需要在保持计算效率的同时,对复杂的局部变化和远程光谱空间依赖关系进行熟练的建模。传统的方法经常优先考虑局部或全局特征,忽略了方向信息,或者采用简单的融合技术,导致特征表示不足,降低了判别能力。为了应对这些挑战,本文介绍了MD2F-Mamba,这是一种新颖的双分支架构,它集成了一个多向深度卷积模块,可以从多个方向(即正方形、水平和垂直卷积)捕获空间特征,丰富了局部表示。该体系结构包括一个局部分支,该分支采用位置编码的多尺度局部特征增强模块,可有效捕获不同的空间光谱模式。同时,全局分支利用分层状态空间曼巴进行全局特征提取,以线性复杂性建模多尺度远程依赖关系。余弦相似特征融合模块自适应地融合局部和全局特征,通过减少冗余优化可判别性。在Pavia University, Houston2013, WHU-Hi-LongKou和WHU-Hi-HanChuan数据集上的实验结果表明,与几种最先进的方法相比,MD2F-Mamba在保持最小参数计数的同时获得了具有竞争力的分类结果,强调了其在HSI分类中的创新效率和鲁棒性。
{"title":"MD2F-Mamba: Multidirectional Depthwise Convolution and Dual-Branch Mamba Feature Fusion Networks for Hyperspectral Image Classification","authors":"Xiaoqing Wan;Dongtao Mo;Yupeng He;Feng Chen;Zhize Li","doi":"10.1109/JSTARS.2026.3657648","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3657648","url":null,"abstract":"Hyperspectral image (HSI) classification necessitates adept modeling of both intricate local variations and long-range spectral–spatial dependencies, while maintaining computational efficiency. Conventional methods frequently prioritize on either local or global features, neglecting directional information, or employ simplistic fusion techniques, which leads to inadequate feature representations and reduced discriminative ability. To address these challenges, this article presents MD<sup>2</sup>F-Mamba, a novel dual-branch architecture that integrates a multidirectional depthwise convolution module to capture spatial features from multiple orientations—namely, square, horizontal, and vertical convolutions, enriching local representations. The architecture comprises a local branch, featuring a multiscale local feature enhancement module with positional encoding, which effectively captures diverse spatial–spectral patterns. Simultaneously, the global branch utilizes a hierarchical state-space Mamba for global feature extraction that models multiscale long-range dependencies with linear complexity. A cosine similarity feature fusion module adaptively merges local and global features, optimizing discriminability by reducing redundancy. Experimental results on the Pavia University, Houston2013, WHU-Hi-LongKou, and WHU-Hi-HanChuan datasets demonstrate that MD<sup>2</sup>F-Mamba achieves competitive classification results while maintaining a minimal parameter count compared with several state-of-the-art methods, underscoring its innovative efficiency and robustness in HSI classification.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6214-6238"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11363424","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Feature Tracking and Trajectory Selection Based Rotation Axis Estimation Method for Small Bodies Using Optical Remote Sensing Images From the Approach Phase 基于特征跟踪和轨迹选择的光学遥感图像小物体旋转轴估计方法
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/JSTARS.2026.3658924
Yifan Wang;Huan Xie;Xiongfeng Yan;Jie Chen;Yaqiong Wang;Taoze Ying;Ming Yang;Xiaohua Tong
Determining the rotation axis of small bodies during the approach phase is essential for both mission operations and scientific investigations. Estimating the axis from the motion trajectories of image features has proven effective, but challenges remain due to limited image availability, weak surface textures, and uncertain observation geometries. In particular, tracking errors, unreliable trajectories, and dependence on accurately known rotation periods reduce the robustness and efficiency of existing methods. To address these challenges, this study proposes a rotation-axis estimation method for small bodies during the approach phase, based on image feature tracking and trajectory selection. The method employs sparse optical flow to extract feature trajectories and removes unstable tracks using image masks and bidirectional flow. An adaptive trajectory selection and shape classification are then performed based on the statistical distribution of fitted parameters using the histogram. Finally, a geometry-based optimization model identifies the correct rotation axis solution via a genetic algorithm, without requiring prior knowledge of the rotation period. The proposed algorithm was tested on over 400 simulated cases considering varying sun phase angles, approach angles, image numbers per rotation period, and small body shapes. The results demonstrate that the proposed method significantly outperforms the existing algorithms. The proposed algorithm achieved estimation errors below 3° in 89% of the cases and below 5° in 92% of the cases, and the running time of all the cases was less than 3 min. Validation using in-orbit data from the OSIRIS-REx mission confirmed that the proposed algorithm can estimate the rotation axis of asteroid Bennu with an error of only 2.69°. The results validate the proposed algorithm's effectiveness and efficiency, proving its potential for small body exploration missions.
在接近阶段确定小天体的旋转轴对于任务操作和科学调查都是至关重要的。从图像特征的运动轨迹估计轴已被证明是有效的,但由于图像可用性有限、表面纹理弱和观测几何形状不确定,挑战仍然存在。特别是,跟踪误差、不可靠的轨迹以及对精确已知旋转周期的依赖降低了现有方法的鲁棒性和效率。为了解决这些问题,本研究提出了一种基于图像特征跟踪和轨迹选择的小物体接近阶段旋转轴估计方法。该方法采用稀疏光流提取特征轨迹,利用图像蒙版和双向流去除不稳定轨迹。然后根据直方图拟合参数的统计分布进行自适应轨迹选择和形状分类。最后,基于几何的优化模型通过遗传算法识别正确的旋转轴解,而不需要事先知道旋转周期。该算法在400多个模拟案例中进行了测试,考虑了不同的太阳相位角、接近角、每个旋转周期的图像数量和小体型。结果表明,该方法明显优于现有算法。该算法在89%的情况下实现了小于3°的估计误差,92%的情况下实现了小于5°的估计误差,并且所有情况的运行时间都小于3 min。OSIRIS-REx任务的在轨数据验证证实,该算法可以估计小行星Bennu的旋转轴,误差仅为2.69°。实验结果验证了该算法的有效性和高效性,证明了该算法在小体探测任务中的应用潜力。
{"title":"A Feature Tracking and Trajectory Selection Based Rotation Axis Estimation Method for Small Bodies Using Optical Remote Sensing Images From the Approach Phase","authors":"Yifan Wang;Huan Xie;Xiongfeng Yan;Jie Chen;Yaqiong Wang;Taoze Ying;Ming Yang;Xiaohua Tong","doi":"10.1109/JSTARS.2026.3658924","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3658924","url":null,"abstract":"Determining the rotation axis of small bodies during the approach phase is essential for both mission operations and scientific investigations. Estimating the axis from the motion trajectories of image features has proven effective, but challenges remain due to limited image availability, weak surface textures, and uncertain observation geometries. In particular, tracking errors, unreliable trajectories, and dependence on accurately known rotation periods reduce the robustness and efficiency of existing methods. To address these challenges, this study proposes a rotation-axis estimation method for small bodies during the approach phase, based on image feature tracking and trajectory selection. The method employs sparse optical flow to extract feature trajectories and removes unstable tracks using image masks and bidirectional flow. An adaptive trajectory selection and shape classification are then performed based on the statistical distribution of fitted parameters using the histogram. Finally, a geometry-based optimization model identifies the correct rotation axis solution via a genetic algorithm, without requiring prior knowledge of the rotation period. The proposed algorithm was tested on over 400 simulated cases considering varying sun phase angles, approach angles, image numbers per rotation period, and small body shapes. The results demonstrate that the proposed method significantly outperforms the existing algorithms. The proposed algorithm achieved estimation errors below 3° in 89% of the cases and below 5° in 92% of the cases, and the running time of all the cases was less than 3 min. Validation using in-orbit data from the OSIRIS-REx mission confirmed that the proposed algorithm can estimate the rotation axis of asteroid Bennu with an error of only 2.69°. The results validate the proposed algorithm's effectiveness and efficiency, proving its potential for small body exploration missions.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6477-6496"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367265","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying Surface Downward Shortwave Radiation and Its Direct and Diffuse Components Using Fengyun-4A AGRI Observations 利用风云- 4a AGRI观测资料定量地表向下短波辐射及其直接和漫射分量
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/JSTARS.2026.3658505
Xinpei Han;Xiaotong Zhang;Lingfeng Lu;Lingchen Bu;Run Jia;Bo Jiang;Yunjun Yao
Surface downward shortwave radiation (Rs) is fundamental for modeling surface energy budgets and biogeochemical cycles. Although much effort on Rs estimation has been conducted, retrievals of its direct and diffuse components remain limited. This study developed a framework integrating machine learning with physical modeling to retrieve Rs, and its direct (Rdirect) and diffuse (Rdiffuse) components at 4-km spatial resolution over China using satellite observations from the Fengyun-4A AGRI. The proposed model derives the instantaneous estimates of Rs and its direct and diffuse components using traditional physical models. These initial estimates, along with cloud, water, and ERA5 Rs data, served as a feature set to obtain accurate radiation estimates based on the random forest model. The model-estimated daily mean Rs was validated against ground measurements from Climate Data Center of the Chinese Meteorological Administration (CDC/CMA), yielding an R of 0.98, a mean bias error (MBE) of –0.01 W/m2, and an root mean square error (RMSE) of 17.13 W/m2. For the daily mean Rdirect (Rdiffuse), validation against National Ecosystem Science Data Center stations yielded an R of 0.98 (0.98), an MBE of 12.59 (–37.01) W/m2, and an RMSE of 24.11 (42.84) W/m2, respectively. The generated Rs and its direct and diffuse components were also compared with existing products. The spatial distribution of the derived estimates is consistent with other products, but with relatively higher spatial resolution and precision at the selected sites. The proposed method has the advantage of using new-generation geostationary satellites by combining the strengths of physical models and machine learning to advance radiation estimation research.
地表向下短波辐射(Rs)是模拟地表能量收支和生物地球化学循环的基础。虽然在Rs估计方面已经做了很多工作,但其直接和扩散分量的检索仍然有限。本研究开发了一个将机器学习与物理建模相结合的框架,利用“云运- 4a”AGRI的卫星观测数据,在中国4公里空间分辨率下检索Rs及其直接(Rdirect)和漫射(Rdiffuse)分量。该模型利用传统的物理模型推导出Rs及其直接和扩散分量的瞬时估计。这些初始估计值与云、水和era5rs数据一起作为一个特征集,根据随机森林模型获得准确的辐射估计值。模型估计的日平均Rs与中国气象局气候数据中心(CDC/CMA)的地面测量数据进行了验证,R为0.98,平均偏置误差(MBE)为-0.01 W/m2,均方根误差(RMSE)为17.13 W/m2。对于日平均Rdirect (Rdiffuse),基于国家生态系统科学数据中心站点验证的R为0.98 (0.98),MBE为12.59 (-37.01)W/m2, RMSE为24.11 (42.84)W/m2。生成的Rs及其直接和扩散组分也与现有产品进行了比较。所得估算值的空间分布与其他产品基本一致,但所选站点的空间分辨率和精度相对较高。该方法利用新一代地球静止卫星,结合物理模型和机器学习的优势,推进辐射估计研究。
{"title":"Quantifying Surface Downward Shortwave Radiation and Its Direct and Diffuse Components Using Fengyun-4A AGRI Observations","authors":"Xinpei Han;Xiaotong Zhang;Lingfeng Lu;Lingchen Bu;Run Jia;Bo Jiang;Yunjun Yao","doi":"10.1109/JSTARS.2026.3658505","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3658505","url":null,"abstract":"Surface downward shortwave radiation (<italic>R</i><sub>s</sub>) is fundamental for modeling surface energy budgets and biogeochemical cycles. Although much effort on <italic>R</i><sub>s</sub> estimation has been conducted, retrievals of its direct and diffuse components remain limited. This study developed a framework integrating machine learning with physical modeling to retrieve <italic>R</i><sub>s</sub>, and its direct (<italic>R</i><sub>direct</sub>) and diffuse (<italic>R</i><sub>diffuse</sub>) components at 4-km spatial resolution over China using satellite observations from the Fengyun-4A AGRI. The proposed model derives the instantaneous estimates of <italic>R</i><sub>s</sub> and its direct and diffuse components using traditional physical models. These initial estimates, along with cloud, water, and ERA5 <italic>R</i><sub>s</sub> data, served as a feature set to obtain accurate radiation estimates based on the random forest model. The model-estimated daily mean <italic>R</i><sub>s</sub> was validated against ground measurements from Climate Data Center of the Chinese Meteorological Administration (CDC/CMA), yielding an <italic>R</i> of 0.98, a mean bias error (MBE) of –0.01 W/m<sup>2</sup>, and an root mean square error (RMSE) of 17.13 W/m<sup>2</sup>. For the daily mean <italic>R</i><sub>direct</sub> (<italic>R</i><sub>diffuse</sub>), validation against National Ecosystem Science Data Center stations yielded an <italic>R</i> of 0.98 (0.98), an MBE of 12.59 (–37.01) W/m<sup>2</sup>, and an RMSE of 24.11 (42.84) W/m<sup>2</sup>, respectively. The generated <italic>R</i><sub>s</sub> and its direct and diffuse components were also compared with existing products. The spatial distribution of the derived estimates is consistent with other products, but with relatively higher spatial resolution and precision at the selected sites. The proposed method has the advantage of using new-generation geostationary satellites by combining the strengths of physical models and machine learning to advance radiation estimation research.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"6359-6374"},"PeriodicalIF":5.3,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11367295","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MovSeg: Efficient Adaptation of Vision–Language Models for Multispectral Open- Vocabulary Segmentation 面向多光谱开放词汇分割的高效自适应视觉语言模型
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-27 DOI: 10.1109/JSTARS.2026.3658442
Yingrui Ji;Chenhao Wang;Jiansheng Chen;Jingbo Chen;Anzhi Yue;Yu Meng;Chenhong Sui
Open-vocabulary segmentation (OVS) allows models to segment any categories based on text prompts, overcoming the limitations of traditional closed-set methods. Open-vocabulary segmentation (OVS) in remote sensing is limited by its reliance on standard red, green, and blue (RGB) images. This prevents models from using valuable information from other spectral bands, such as near-infrared (NIR). This article proposes MovSeg, a framework for multispectral open-vocabulary segmentation. Our method efficiently integrates four-band (RGB+NIR) data into a pretrained vision-language model. MovSeg introduces two components: a multispectral input adaptation (MIA) module and a spatial–channel adaptive tuning (SCAT) strategy. The MIA module adapts the input layer to process four-band data. It uses a weight-copying strategy for initialization and a window efficient channel attention mechanism to reweight spectral channels. The SCAT strategy is a parameter-efficient strategy that uses a hybrid of low-rank adaptation and adapters to fine-tune deep features with low computational cost. Experiments on three multispectral datasets show that MovSeg outperforms existing general-domain and remote sensing OVS methods. The model achieves significant gains on NIR-sensitive classes, confirming its ability to exploit the extra spectral data. The codes will be coming soon.
开放词汇分词(OVS)允许模型根据文本提示对任何类别进行分词,克服了传统闭集方法的局限性。遥感中的开放词汇分割(OVS)受其依赖标准红、绿、蓝(RGB)图像的限制。这阻止了模型使用来自其他光谱波段的有价值的信息,例如近红外(NIR)。本文提出了一个多谱开放词汇分词框架MovSeg。我们的方法有效地将四波段(RGB+NIR)数据集成到预训练的视觉语言模型中。MovSeg引入了两个组件:一个多频谱输入自适应(MIA)模块和一个空间信道自适应调谐(SCAT)策略。MIA模块采用输入层处理四波段数据。它使用权重复制策略进行初始化,并使用窗口有效信道关注机制对频谱信道进行重加权。SCAT策略是一种参数高效的策略,它混合使用低秩自适应和适配器来微调深度特征,并且计算成本低。在3个多光谱数据集上的实验表明,MovSeg方法优于现有的通用域和遥感OVS方法。该模型在nir敏感类别上取得了显著的进步,证实了其利用额外光谱数据的能力。代码将很快发布。
{"title":"MovSeg: Efficient Adaptation of Vision–Language Models for Multispectral Open- Vocabulary Segmentation","authors":"Yingrui Ji;Chenhao Wang;Jiansheng Chen;Jingbo Chen;Anzhi Yue;Yu Meng;Chenhong Sui","doi":"10.1109/JSTARS.2026.3658442","DOIUrl":"https://doi.org/10.1109/JSTARS.2026.3658442","url":null,"abstract":"Open-vocabulary segmentation (OVS) allows models to segment any categories based on text prompts, overcoming the limitations of traditional closed-set methods. Open-vocabulary segmentation (OVS) in remote sensing is limited by its reliance on standard red, green, and blue (RGB) images. This prevents models from using valuable information from other spectral bands, such as near-infrared (NIR). This article proposes MovSeg, a framework for multispectral open-vocabulary segmentation. Our method efficiently integrates four-band (RGB+NIR) data into a pretrained vision-language model. MovSeg introduces two components: a multispectral input adaptation (MIA) module and a spatial–channel adaptive tuning (SCAT) strategy. The MIA module adapts the input layer to process four-band data. It uses a weight-copying strategy for initialization and a window efficient channel attention mechanism to reweight spectral channels. The SCAT strategy is a parameter-efficient strategy that uses a hybrid of low-rank adaptation and adapters to fine-tune deep features with low computational cost. Experiments on three multispectral datasets show that MovSeg outperforms existing general-domain and remote sensing OVS methods. The model achieves significant gains on NIR-sensitive classes, confirming its ability to exploit the extra spectral data. The codes will be coming soon.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"8044-8055"},"PeriodicalIF":5.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11365526","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1