首页 > 最新文献

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing最新文献

英文 中文
Local Pixel-Contrast and Global Gaussian Multiprototype Bidirectional Alignment for Unsupervised Domain Adaptation of Semantic Segmentation in Remote Sensing Imagery 基于局部像素对比和全局高斯多原型双向对齐的遥感图像语义分割无监督域自适应
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-23 DOI: 10.1109/JSTARS.2025.3647100
Yongkang Hu;Yupei Wang;Liang Chen
In unsupervised domain adaptation for semantic segmentation of remote sensing imagery, identical land-cover classes across different domains often exhibit substantial variations in appearance, scale, and class distribution, which seriously hinder cross-domain generalization. Moreover, even within a single domain, land-cover classes present highly complex and diverse intraclass distributions that cannot be effectively captured by a single class representation, further increasing the challenge of generalization. To this end, we propose a collaborative framework integrating local pixel-level contrast and global Gaussian multiprototype bidirectional alignment. At the local level, we introduce probability-masked contrastive learning, which adaptively increases the sampling probability of minority classes to mitigate the class imbalance issue. Meanwhile, pixel contrastive learning is incorporated to enhance the robustness to cross-domain variations in appearance and texture. At the global level, we employ a Gaussian mixture model to represent each source-domain class with multiple Gaussian prototypes rather than a single one, thereby yielding richer and more fine-grained class representations. Building on this, a bidirectional alignment strategy is proposed. Concretely, the forward alignment serves multiprototypes as semantic anchors that progressively guide target-domain features to align with the source-domain class distributions, reducing the intraclass variance. Meanwhile, the reverse alignment dynamically refines the prototypes to increase anchor accuracy, further enhancing the stability and discriminability of cross-domain alignment. Experimental results on the widely used ISPRS and LoveDA datasets demonstrate the superiority of our proposed method over state-of-the-art approaches.
在遥感图像语义分割的无监督域自适应中,相同的土地覆盖类在不同域的外观、规模和分布往往存在较大差异,严重阻碍了跨域泛化。此外,即使在单一领域内,土地覆盖类别也呈现出高度复杂和多样化的类别内分布,无法通过单一类别表示有效地捕获,这进一步增加了泛化的挑战。为此,我们提出了一种集成局部像素级对比度和全局高斯多原型双向对齐的协作框架。在局部层面,我们引入了概率掩蔽对比学习,自适应地增加少数类的抽样概率,以缓解类不平衡问题。同时,结合像素对比学习增强了对外观和纹理跨域变化的鲁棒性。在全局层面上,我们使用高斯混合模型用多个高斯原型而不是单个原型来表示每个源域类,从而产生更丰富和更细粒度的类表示。在此基础上,提出了一种双向对齐策略。具体而言,前向对齐将多原型作为语义锚点,逐步引导目标域特征与源域类分布对齐,从而减少类内方差。同时,反向对准对原型进行动态细化,提高锚定精度,进一步增强了跨域对准的稳定性和可分辨性。在广泛使用的ISPRS和LoveDA数据集上的实验结果表明,我们提出的方法优于目前最先进的方法。
{"title":"Local Pixel-Contrast and Global Gaussian Multiprototype Bidirectional Alignment for Unsupervised Domain Adaptation of Semantic Segmentation in Remote Sensing Imagery","authors":"Yongkang Hu;Yupei Wang;Liang Chen","doi":"10.1109/JSTARS.2025.3647100","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3647100","url":null,"abstract":"In unsupervised domain adaptation for semantic segmentation of remote sensing imagery, identical land-cover classes across different domains often exhibit substantial variations in appearance, scale, and class distribution, which seriously hinder cross-domain generalization. Moreover, even within a single domain, land-cover classes present highly complex and diverse intraclass distributions that cannot be effectively captured by a single class representation, further increasing the challenge of generalization. To this end, we propose a collaborative framework integrating local pixel-level contrast and global Gaussian multiprototype bidirectional alignment. At the local level, we introduce probability-masked contrastive learning, which adaptively increases the sampling probability of minority classes to mitigate the class imbalance issue. Meanwhile, pixel contrastive learning is incorporated to enhance the robustness to cross-domain variations in appearance and texture. At the global level, we employ a Gaussian mixture model to represent each source-domain class with multiple Gaussian prototypes rather than a single one, thereby yielding richer and more fine-grained class representations. Building on this, a bidirectional alignment strategy is proposed. Concretely, the forward alignment serves multiprototypes as semantic anchors that progressively guide target-domain features to align with the source-domain class distributions, reducing the intraclass variance. Meanwhile, the reverse alignment dynamically refines the prototypes to increase anchor accuracy, further enhancing the stability and discriminability of cross-domain alignment. Experimental results on the widely used ISPRS and LoveDA datasets demonstrate the superiority of our proposed method over state-of-the-art approaches.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"3573-3588"},"PeriodicalIF":5.3,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11313335","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SAM2-MDESD: An SAM2-Assisted Multilevel Dual Encoder–Single Decoder Method for Optical Remote Sensing Image Change Detection SAM2-MDESD:一种sam2辅助的多电平双编码器-单解码器光学遥感图像变化检测方法
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-23 DOI: 10.1109/JSTARS.2025.3647047
Nan Mo;Gege Ma;Dengfeng Xie;Guangyun Zhang
Given the limitations of traditional feature coding in capturing multiscale information and precise segmentation, existing deep learning-based change detection (CD) methods often suffer from problems such as high missed detection ratio and poor boundary segmentation accuracy when dealing with changed regions with considerable scale differences and irregular geometric shapes. Recently, the emergence of vision foundation models has presented new opportunities for enhancing the feature representation and position encoding capabilities for remote sensing images. To address these challenges, this study proposes a novel segment anything model 2 (SAM2) assisted multilevel dual encoder–single decoder (SAM2-MDESD) CD method. First, an innovative dual-stream Hiera encoder is designed. This encoder leverages SAM2’s powerful feature encoding to extract multilevel features and positional information from bitemporal images effectively, thereby improving the performance for multiscale and irregular changes remarkably. Second, a grouped-channel self-attention enhancement module is designed to increase the saliency of changed regions in multilevel difference feature maps. This module contributes to the improved integrity of changed areas. Finally, a multilevel difference feature map joint prediction loss function is proposed, which leverages hierarchical constraints across features to suppress false detection caused by shallow features, thereby further improving the precision of changed regions. Experimental results on the CLCD and WHU-CD datasets demonstrate that the proposed method achieves F1-scores of 80.02% and 93.76%, respectively, outperforming existing state-of-the-art CD methods. Notably, SAM2-MDESD demonstrates remarkable advantages in reducing the missed detection ratio and improving segmentation accuracy for multiscale and irregularly shaped changes.
传统的特征编码在捕获多尺度信息和精确分割方面存在局限性,现有的基于深度学习的变化检测(CD)方法在处理尺度差异较大、几何形状不规则的变化区域时,往往存在检测漏检率高、边界分割精度差等问题。近年来,视觉基础模型的出现为增强遥感图像的特征表示和位置编码能力提供了新的契机。为了解决这些挑战,本研究提出了一种新的分段任何模型2 (SAM2)辅助多级双编码器-单解码器(SAM2- mdesd) CD方法。首先,设计了一种新颖的双流Hiera编码器。该编码器利用SAM2强大的特征编码功能,有效地从双时图像中提取多层次特征和位置信息,从而显著提高了对多尺度和不规则变化的处理性能。其次,设计分组通道自关注增强模块,提高多级差分特征图中变化区域的显著性;该模块有助于提高变化区域的完整性。最后,提出了多级差分特征映射联合预测损失函数,利用特征间的分层约束抑制浅层特征引起的误检,进一步提高了变化区域的精度。在CLCD和WHU-CD数据集上的实验结果表明,该方法的f1得分分别为80.02%和93.76%,优于现有的最先进的CD方法。值得注意的是,SAM2-MDESD在减少多尺度和不规则形状变化的漏检率和提高分割精度方面具有显著优势。
{"title":"SAM2-MDESD: An SAM2-Assisted Multilevel Dual Encoder–Single Decoder Method for Optical Remote Sensing Image Change Detection","authors":"Nan Mo;Gege Ma;Dengfeng Xie;Guangyun Zhang","doi":"10.1109/JSTARS.2025.3647047","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3647047","url":null,"abstract":"Given the limitations of traditional feature coding in capturing multiscale information and precise segmentation, existing deep learning-based change detection (CD) methods often suffer from problems such as high missed detection ratio and poor boundary segmentation accuracy when dealing with changed regions with considerable scale differences and irregular geometric shapes. Recently, the emergence of vision foundation models has presented new opportunities for enhancing the feature representation and position encoding capabilities for remote sensing images. To address these challenges, this study proposes a novel segment anything model 2 (SAM2) assisted multilevel dual encoder–single decoder (SAM2-MDESD) CD method. First, an innovative dual-stream Hiera encoder is designed. This encoder leverages SAM2’s powerful feature encoding to extract multilevel features and positional information from bitemporal images effectively, thereby improving the performance for multiscale and irregular changes remarkably. Second, a grouped-channel self-attention enhancement module is designed to increase the saliency of changed regions in multilevel difference feature maps. This module contributes to the improved integrity of changed areas. Finally, a multilevel difference feature map joint prediction loss function is proposed, which leverages hierarchical constraints across features to suppress false detection caused by shallow features, thereby further improving the precision of changed regions. Experimental results on the CLCD and WHU-CD datasets demonstrate that the proposed method achieves F1-scores of 80.02% and 93.76%, respectively, outperforming existing state-of-the-art CD methods. Notably, SAM2-MDESD demonstrates remarkable advantages in reducing the missed detection ratio and improving segmentation accuracy for multiscale and irregularly shaped changes.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2588-2604"},"PeriodicalIF":5.3,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11313499","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiscale Attention Transformer for Martian Dust Devil Detection in Remote Sensing Imagery 遥感影像中火星尘暴探测的多尺度注意力转换器
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-22 DOI: 10.1109/JSTARS.2025.3646776
Gan Liu;Linlin Shi;Jialong Lai;Feifei Cui;Xiaoping Zhang;Yi Xu
Detecting Martian dust devils remains a challenging task due to the scarcity of high-quality annotated data, significant variations in scale, blurred boundaries, and complex surface textures. To address these difficulties, we construct a cross-regional, manually annotated benchmark dataset named MDD-Human and propose a novel transformer-based detection network, Mars dust devil detection transformer (MDT). The model adopts FasterNet as its backbone to ensure a balance between computational efficiency and feature extraction capability. A key innovation lies in the multiscale attention fusion module, which incorporates hierarchical fusion strategies and hybrid attention mechanisms to effectively enhance the representation of dust devil features under diverse Martian terrains. In addition, we introduce a shape-aware localization loss function, shape-augmented minimum point distance IoU, which improves geometric sensitivity by integrating corner distance constraints and structural shape priors. Experimental results on the MDD-Human dataset demonstrate that MDT achieves 92.7% Precision, 90.8% Recall, 92.4% mAP@50, and 91.8% F1-score, outperforming several classical and state-of-the-art detectors. Further tests on unseen THEMIS and CRISM datasets confirm the model’s strong cross-source generalization, highlighting its robustness and applicability in diverse Martian imaging scenarios.
探测火星尘卷风仍然是一项具有挑战性的任务,因为缺乏高质量的注释数据,规模变化显著,边界模糊,表面纹理复杂。为了解决这些问题,我们构建了一个跨区域、人工标注的基准数据集MDD-Human,并提出了一种新的基于变压器的检测网络——火星尘暴检测变压器(MDT)。该模型采用FasterNet作为主干,保证了计算效率和特征提取能力之间的平衡。一个关键的创新在于多尺度注意力融合模块,该模块结合了分层融合策略和混合注意力机制,有效增强了不同火星地形下尘暴特征的表征。此外,我们引入了形状感知的定位损失函数——形状增强最小点距IoU,通过整合角距约束和结构形状先验来提高几何灵敏度。在MDD-Human数据集上的实验结果表明,MDT达到了92.7%的精度,90.8%的召回率,92.4%的mAP@50和91.8%的f1得分,优于几种经典和最先进的检测器。对未见过的THEMIS和CRISM数据集的进一步测试证实了该模型强大的跨源泛化能力,突出了其在各种火星成像场景中的鲁棒性和适用性。
{"title":"A Multiscale Attention Transformer for Martian Dust Devil Detection in Remote Sensing Imagery","authors":"Gan Liu;Linlin Shi;Jialong Lai;Feifei Cui;Xiaoping Zhang;Yi Xu","doi":"10.1109/JSTARS.2025.3646776","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646776","url":null,"abstract":"Detecting Martian dust devils remains a challenging task due to the scarcity of high-quality annotated data, significant variations in scale, blurred boundaries, and complex surface textures. To address these difficulties, we construct a cross-regional, manually annotated benchmark dataset named MDD-Human and propose a novel transformer-based detection network, Mars dust devil detection transformer (MDT). The model adopts FasterNet as its backbone to ensure a balance between computational efficiency and feature extraction capability. A key innovation lies in the multiscale attention fusion module, which incorporates hierarchical fusion strategies and hybrid attention mechanisms to effectively enhance the representation of dust devil features under diverse Martian terrains. In addition, we introduce a shape-aware localization loss function, shape-augmented minimum point distance IoU, which improves geometric sensitivity by integrating corner distance constraints and structural shape priors. Experimental results on the MDD-Human dataset demonstrate that MDT achieves 92.7% Precision, 90.8% Recall, 92.4% mAP@50, and 91.8% F1-score, outperforming several classical and state-of-the-art detectors. Further tests on unseen THEMIS and CRISM datasets confirm the model’s strong cross-source generalization, highlighting its robustness and applicability in diverse Martian imaging scenarios.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2696-2712"},"PeriodicalIF":5.3,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11309725","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Net Longwave Radiation and Turbulent Fluxes Using Remote-Sensing-Derived Net Shortwave Radiation for Different Land Cover Types 利用遥感提取的净短波辐射预测不同土地覆盖类型的净长波辐射和湍流通量
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-22 DOI: 10.1109/JSTARS.2025.3646720
Jingwen Wang;Lei Lu;Xiaoming Zhou;Guanghui Huang;Zihan Chen
Accurate estimation of surface net longwave radiation (LW_net) and turbulent fluxes (Shle) is crucial for understanding the mechanisms of surface energy balance (SEB) and modeling SEB-based approach for estimating land surface temperature (LST) under cloudy sky. Parameterized or empirical regression methods based on remotely sensed data are effective ways to acquire LW_net and Shle. Current methods are always limited in global application due to the scarcity of observation sites and the absence of remote-sensing observations under cloudy conditions. To overcome these issues, this study developed multiple linear regression models (MLR) based on the data from 62 global sites covering 12 International Geosphere–Biosphere Program (IGBP) land cover types to estimate LW_net and Shle, in which net shortwave radiation (SW_net), normalized difference vegetation index (NDVI), normalized difference moisture index (NDMI), and digital elevation model (DEM) were used as variables. Model performance was evaluated with an independent dataset at both overall and seasonal scales. Then, the models were applied to remote-sensing products to estimate all-weather LW_net and Shle at a spatial resolution of 500 m, and the estimates were assessed against in situ data from five sites located in semiarid and arid regions in Northwest China. The results showed that the introduction of NDMI significantly improved prediction accuracy for most land cover types. The root-mean-square error (RMSE) of predicted LW_net ranged from 18.57 to 29.13 W/m2 with a mean RMSE of 25.65 W/m2, and the error of Shle ranged from 51.45 to 103.79 W/m2 with a mean RMSE of 65.16 W/m2. Application of the models to remote-sensing products, in which SW_net was provided by FY-4B surface shortwave radiation product, showed that the RMSE of estimated LW_net was 27.13 W/m2 in summer, 25.46 W/m2 in autumn, and 17.14 W/m2 in winter. For Shle, the average RMSE was 65.07 W/m2 in summer, 39.25 W/m2 in autumn, and 19.38 W/m2 in winter. Although the accuracy declined slightly over complex vegetation types and in the summer, the models exhibited robust applicability across different land cover types and seasons. This study provides an efficient, generalized method for estimating LW_net and Shle, which is promising to be used for studies on energy balance at regional and global scales and retrieval of LST under cloudy skies using SEB-based method.
准确估算地表净长波辐射(LW_net)和湍流通量(Shle)对于理解地表能量平衡(SEB)机制和模拟基于SEB的多云天气下地表温度估算方法至关重要。基于遥感数据的参数化或经验回归方法是获取LW_net和Shle的有效方法。目前的方法在全球范围内的应用受到限制,这主要是由于观测点稀缺和缺乏多云条件下的遥感观测。为了克服这些问题,本研究基于全球62个站点覆盖12种国际地圈-生物圈计划(IGBP)土地覆盖类型的数据,建立了以净短波辐射(SW_net)、归一化植被指数(NDVI)、归一化差异湿度指数(NDMI)和数字高程模型(DEM)为变量的LW_net和Shle多元线性回归模型(MLR)。在整体和季节尺度上使用独立数据集评估模型性能。然后,将该模型应用于遥感产品,估算了500 m空间分辨率下的全天候LW_net和Shle,并与西北半干旱和干旱区5个站点的现场数据进行了评估。结果表明,引入NDMI显著提高了大部分土地覆被类型的预测精度。预测LW_net的均方根误差(RMSE)范围为18.57 ~ 29.13 W/m2,平均RMSE为25.65 W/m2; Shle的误差范围为51.45 ~ 103.79 W/m2,平均RMSE为65.16 W/m2。将模型应用于FY-4B地面短波辐射产品提供SW_net的遥感产品,结果表明,估算的LW_net的RMSE在夏季为27.13 W/m2,在秋季为25.46 W/m2,在冬季为17.14 W/m2。夏季平均RMSE为65.07 W/m2,秋季为39.25 W/m2,冬季为19.38 W/m2。虽然在复杂植被类型和夏季,模型的精度略有下降,但在不同的土地覆盖类型和季节,模型具有较强的适用性。该研究提供了一种高效、通用的LW_net和Shle估算方法,有望用于区域和全球尺度的能量平衡研究以及基于seb方法的多云天气下地表温度反演。
{"title":"Prediction of Net Longwave Radiation and Turbulent Fluxes Using Remote-Sensing-Derived Net Shortwave Radiation for Different Land Cover Types","authors":"Jingwen Wang;Lei Lu;Xiaoming Zhou;Guanghui Huang;Zihan Chen","doi":"10.1109/JSTARS.2025.3646720","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646720","url":null,"abstract":"Accurate estimation of surface net longwave radiation (LW_net) and turbulent fluxes (Shle) is crucial for understanding the mechanisms of surface energy balance (SEB) and modeling SEB-based approach for estimating land surface temperature (LST) under cloudy sky. Parameterized or empirical regression methods based on remotely sensed data are effective ways to acquire LW_net and Shle. Current methods are always limited in global application due to the scarcity of observation sites and the absence of remote-sensing observations under cloudy conditions. To overcome these issues, this study developed multiple linear regression models (MLR) based on the data from 62 global sites covering 12 International Geosphere–Biosphere Program (IGBP) land cover types to estimate LW_net and Shle, in which net shortwave radiation (SW_net), normalized difference vegetation index (NDVI), normalized difference moisture index (NDMI), and digital elevation model (DEM) were used as variables. Model performance was evaluated with an independent dataset at both overall and seasonal scales. Then, the models were applied to remote-sensing products to estimate all-weather LW_net and Shle at a spatial resolution of 500 m, and the estimates were assessed against in situ data from five sites located in semiarid and arid regions in Northwest China. The results showed that the introduction of NDMI significantly improved prediction accuracy for most land cover types. The root-mean-square error (RMSE) of predicted LW_net ranged from 18.57 to 29.13 W/m<sup>2</sup> with a mean RMSE of 25.65 W/m<sup>2</sup>, and the error of Shle ranged from 51.45 to 103.79 W/m<sup>2</sup> with a mean RMSE of 65.16 W/m<sup>2</sup>. Application of the models to remote-sensing products, in which SW_net was provided by FY-4B surface shortwave radiation product, showed that the RMSE of estimated LW_net was 27.13 W/m<sup>2</sup> in summer, 25.46 W/m<sup>2</sup> in autumn, and 17.14 W/m<sup>2</sup> in winter. For Shle, the average RMSE was 65.07 W/m<sup>2</sup> in summer, 39.25 W/m<sup>2</sup> in autumn, and 19.38 W/m<sup>2</sup> in winter. Although the accuracy declined slightly over complex vegetation types and in the summer, the models exhibited robust applicability across different land cover types and seasons. This study provides an efficient, generalized method for estimating LW_net and Shle, which is promising to be used for studies on energy balance at regional and global scales and retrieval of LST under cloudy skies using SEB-based method.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2864-2878"},"PeriodicalIF":5.3,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11309741","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Modified Trajectory Error Modeling Method Integrating Moving Object Velocity and Trajectory Geometry 一种结合运动物体速度和轨迹几何的修正轨迹误差建模方法
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-19 DOI: 10.1109/JSTARS.2025.3646151
Yanmin Jin;Zhixian Luo;Xinyi Zheng;Xiaohua Tong;Yongjiu Feng;Huan Xie;Xiong Xu
Trajectory error modeling plays a significant role in the effective application of trajectory data in various fields. Classical trajectory error modeling methods construct the trajectory error band based on the velocity of a moving object or the trajectory's geometric features. Since both the velocity and trajectory geometric features influence the trajectory uncertainty, a modified method is proposed that integrates the velocity of the moving object and trajectory geometric features to construct the uncertain region. This method is developed based on the previous geometry-based model, the broad adaptive error ellipse model. First, two groups of Minkowski coefficients were derived based on the global geometric feature and the local velocities of moving objects, respectively. Then, the optimal set of Minkowski coefficients was obtained by integrating the above two sets of Minkowski coefficients. Last, the trajectory uncertainty was modeled on the basis of the optimal Minkowski coefficients and the measurement error of the sampled points. The proposed modified method was verified by using three trajectory datasets. The results prove that the modified method can provide an error band that can enclose the real trajectory with a relatively small area in most cases.
弹道误差建模在各个领域有效应用弹道数据中起着重要作用。经典的轨迹误差建模方法是根据运动物体的速度或轨迹的几何特征来构造轨迹误差带。针对速度和轨迹几何特征对轨迹不确定性的影响,提出了一种结合运动物体速度和轨迹几何特征构建不确定区域的改进方法。该方法是在先前基于几何的广义自适应误差椭圆模型的基础上发展起来的。首先,根据运动物体的全局几何特征和局部速度分别导出两组闵可夫斯基系数;然后,对上述两组闵可夫斯基系数进行积分,得到最优闵可夫斯基系数集。最后,根据最优闵可夫斯基系数和采样点的测量误差对轨迹不确定性进行建模。利用三个轨迹数据集对改进方法进行了验证。结果表明,改进后的方法在大多数情况下都能提供一个较小面积的误差带,将真实轨迹包裹起来。
{"title":"A Modified Trajectory Error Modeling Method Integrating Moving Object Velocity and Trajectory Geometry","authors":"Yanmin Jin;Zhixian Luo;Xinyi Zheng;Xiaohua Tong;Yongjiu Feng;Huan Xie;Xiong Xu","doi":"10.1109/JSTARS.2025.3646151","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646151","url":null,"abstract":"Trajectory error modeling plays a significant role in the effective application of trajectory data in various fields. Classical trajectory error modeling methods construct the trajectory error band based on the velocity of a moving object or the trajectory's geometric features. Since both the velocity and trajectory geometric features influence the trajectory uncertainty, a modified method is proposed that integrates the velocity of the moving object and trajectory geometric features to construct the uncertain region. This method is developed based on the previous geometry-based model, the broad adaptive error ellipse model. First, two groups of Minkowski coefficients were derived based on the global geometric feature and the local velocities of moving objects, respectively. Then, the optimal set of Minkowski coefficients was obtained by integrating the above two sets of Minkowski coefficients. Last, the trajectory uncertainty was modeled on the basis of the optimal Minkowski coefficients and the measurement error of the sampled points. The proposed modified method was verified by using three trajectory datasets. The results prove that the modified method can provide an error band that can enclose the real trajectory with a relatively small area in most cases.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2621-2640"},"PeriodicalIF":5.3,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11304715","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Simplified Model for Simulating Complex Signals in GNSS-Reflectometry over Land 陆地gnss反射测量中复杂信号模拟的简化模型
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-19 DOI: 10.1109/JSTARS.2025.3646137
Jilun Peng;Estel Cardellach;Weiqiang Li
The availability of complex signals in global navigation satellite system-reflectometry (GNSS-R) has gained growing attention across a range of applications, due to its capacity to preserve phase information and provide high along-track resolution. HydroGNSS will employ a high-rate complex signal mode, known as the “coherent channel,” to capture both the in-phase and quadrature components of signals reflected at the specular point. This work presents a simulation framework for analyzing the coherent channel in GNSS-R at high temporal resolution. Given the computational resources and limitations of fully detailed simulations, a simplified model is proposed, which incorporates a well established reflectivity model, and new empirical and theoretical functions. The surface reflectivity model estimates it from parameters, such as soil roughness, moisture, vegetation cover, and water fraction, while the innovative modeling block, here called the complex field model, derives the coherent and diffuse field amplitudes from reflectivity, and it assigns electromagnetic phases that behave, statistically, as in actual data. The validation is conducted at different levels, first using actual amplitude and reflectivity measurements as input, then starting from auxiliary surface information, yielding correlations with the coherence coefficient of 0.93, 0.74, and 0.61, respectively. This validation approach facilitates the differentiation of the errors introduced by each of the modules. The results support the feasibility of the proposed framework as a practical and quick tool to investigate complex signals under varying reflected surface conditions. Higher accuracy will require a tighter integration of the surface reflectivity model and the complex field mode.
全球导航卫星系统反射测量(GNSS-R)中复杂信号的可用性由于其保持相位信息和提供高沿轨分辨率的能力,在一系列应用中得到了越来越多的关注。HydroGNSS将采用一种被称为“相干信道”的高速率复杂信号模式,以捕获反射在镜面点上的信号的同相分量和正交分量。本文提出了一个高时间分辨率GNSS-R中相干信道分析的仿真框架。考虑到计算资源和完全详细模拟的局限性,提出了一个简化模型,该模型包含了一个完善的反射率模型,以及新的经验和理论函数。地表反射率模型根据土壤粗糙度、湿度、植被覆盖度和含水率等参数估算地表反射率,而创新的建模模块(这里称为复杂场模型)从反射率中导出相干和漫射场振幅,并分配与实际数据统计一致的电磁相位。在不同的水平上进行验证,首先使用实际振幅和反射率测量作为输入,然后从辅助表面信息开始,得到相干系数分别为0.93,0.74和0.61的相关性。这种验证方法有助于区分每个模块引入的错误。结果支持了该框架作为研究不同反射表面条件下复杂信号的实用和快速工具的可行性。更高的精度要求将地表反射率模型和复杂场模式更紧密地结合起来。
{"title":"A Simplified Model for Simulating Complex Signals in GNSS-Reflectometry over Land","authors":"Jilun Peng;Estel Cardellach;Weiqiang Li","doi":"10.1109/JSTARS.2025.3646137","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646137","url":null,"abstract":"The availability of complex signals in global navigation satellite system-reflectometry (GNSS-R) has gained growing attention across a range of applications, due to its capacity to preserve phase information and provide high along-track resolution. HydroGNSS will employ a high-rate complex signal mode, known as the “coherent channel,” to capture both the in-phase and quadrature components of signals reflected at the specular point. This work presents a simulation framework for analyzing the coherent channel in GNSS-R at high temporal resolution. Given the computational resources and limitations of fully detailed simulations, a simplified model is proposed, which incorporates a well established reflectivity model, and new empirical and theoretical functions. The surface reflectivity model estimates it from parameters, such as soil roughness, moisture, vegetation cover, and water fraction, while the innovative modeling block, here called the complex field model, derives the coherent and diffuse field amplitudes from reflectivity, and it assigns electromagnetic phases that behave, statistically, as in actual data. The validation is conducted at different levels, first using actual amplitude and reflectivity measurements as input, then starting from auxiliary surface information, yielding correlations with the coherence coefficient of 0.93, 0.74, and 0.61, respectively. This validation approach facilitates the differentiation of the errors introduced by each of the modules. The results support the feasibility of the proposed framework as a practical and quick tool to investigate complex signals under varying reflected surface conditions. Higher accuracy will require a tighter integration of the surface reflectivity model and the complex field mode.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2474-2484"},"PeriodicalIF":5.3,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11304548","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High Spatiotemporal Resolution Soil Moisture Retrieval Approach Leveraging Deep Regression Networks and Multisource Remote Sensing Data 基于深度回归网络和多源遥感数据的高时空分辨率土壤水分反演方法
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-18 DOI: 10.1109/JSTARS.2025.3646044
Xiaofei Kuang;Liping Wan;Shiyu Xiang;Pengliang Wei;Jiao Guo;Hanwen Yu
High spatiotemporal resolution soil moisture (SM) products can provide high-frequency SM information at the farmland scale for agricultural management processes. Existing SM retrieval methods for large-scale agricultural regions struggle to achieve both high temporal and spatial resolution simultaneously. In order to generate high spatiotemporal resolution SM in large-scale agricultural areas, this article has developed an SM retrieval model by taking advantage of multisource heterogeneous remote sensing data including active and passive microwaves and advanced data regression methods. First, multisource heterogeneous data with different spatiotemporal resolutions are fused to construct driving variables. Then, the multisource heterogeneous driving variables are coupled with the proposed transformer regression network to drive the SM retrieval task and generate high spatiotemporal resolution SM. We analyzed the advantages of the combination of active and passive microwaves. Compared with the SM derived from single microwave data, the SM derived from the combination of active and passive microwaves can reflect more detailed spatial and temporal information. The analysis of the retrieval performance of different retrieval methods shows that SM retrieved using multisource heterogeneous data and transformer regression has better accuracy, with a coefficient of determination of 0.9220. Moreover, the retrieved SM is not only highly consistent with the situation reflected by the U.S. Drought Monitor data, but also has a higher spatial accuracy when compared with the existing typical data products with a time resolution of one day. This article provides a feasible option for generating large-scale SM data with high spatiotemporal resolution for better agricultural water resource management.
高时空分辨率土壤湿度产品可为农业经营过程提供农田尺度的高频土壤湿度信息。现有的大尺度农区SM检索方法难以同时实现高时空分辨率。为了在大尺度农区生成高时空分辨率的遥感信息,利用主被动微波等多源异构遥感数据,结合先进的数据回归方法,建立了一种遥感信息检索模型。首先,融合不同时空分辨率的多源异构数据构建驱动变量;然后,将多源异构驱动变量与所提出的变压器回归网络相结合,驱动SM检索任务,生成高时空分辨率的SM。分析了主动式微波与被动式微波相结合的优点。与单一微波数据得到的SM相比,主动式微波和被动式微波组合得到的SM能反映更详细的时空信息。对不同检索方法的检索性能分析表明,使用多源异构数据和变压器回归检索的SM具有更好的准确性,其决定系数为0.9220。此外,反演的SM不仅与美国干旱监测数据反映的情况高度一致,而且与现有典型的1天时间分辨率数据产品相比,具有更高的空间精度。本文为农业水资源管理提供了一种高时空分辨率的大规模SM数据生成方法。
{"title":"A High Spatiotemporal Resolution Soil Moisture Retrieval Approach Leveraging Deep Regression Networks and Multisource Remote Sensing Data","authors":"Xiaofei Kuang;Liping Wan;Shiyu Xiang;Pengliang Wei;Jiao Guo;Hanwen Yu","doi":"10.1109/JSTARS.2025.3646044","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646044","url":null,"abstract":"High spatiotemporal resolution soil moisture (SM) products can provide high-frequency SM information at the farmland scale for agricultural management processes. Existing SM retrieval methods for large-scale agricultural regions struggle to achieve both high temporal and spatial resolution simultaneously. In order to generate high spatiotemporal resolution SM in large-scale agricultural areas, this article has developed an SM retrieval model by taking advantage of multisource heterogeneous remote sensing data including active and passive microwaves and advanced data regression methods. First, multisource heterogeneous data with different spatiotemporal resolutions are fused to construct driving variables. Then, the multisource heterogeneous driving variables are coupled with the proposed transformer regression network to drive the SM retrieval task and generate high spatiotemporal resolution SM. We analyzed the advantages of the combination of active and passive microwaves. Compared with the SM derived from single microwave data, the SM derived from the combination of active and passive microwaves can reflect more detailed spatial and temporal information. The analysis of the retrieval performance of different retrieval methods shows that SM retrieved using multisource heterogeneous data and transformer regression has better accuracy, with a coefficient of determination of 0.9220. Moreover, the retrieved SM is not only highly consistent with the situation reflected by the U.S. Drought Monitor data, but also has a higher spatial accuracy when compared with the existing typical data products with a time resolution of one day. This article provides a feasible option for generating large-scale SM data with high spatiotemporal resolution for better agricultural water resource management.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2557-2574"},"PeriodicalIF":5.3,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11303755","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty Characterization of ICESat-2, Passive Microwave, and Reanalysis Snow Depth Datasets Using Site Data in the Northern Hemisphere 使用北半球站点数据的ICESat-2、被动微波和再分析雪深数据集的不确定性表征
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-18 DOI: 10.1109/JSTARS.2025.3645917
Dongdong Feng;Tao Che;Liyun Dai;Liang Gao;Fei Wu;Yanxing Hu;Yang Zhang;Guigang Wang;Yueling Shi
Significant uncertainties persist in current snow depth (SD) datasets due to variations in sensor characteristics and retrieval algorithms. This study systematically evaluated five SD datasets derived from ice, cloud, and land elevation satellite-2 (ICESat-2), passive microwave products (AMSR2 and GlobSnow), and reanalysis products (ERA5 and modern-era retrospective analysis for research and applications, version 2) across 13 representative snow-covered regions in the Northern Hemisphere. A novel dynamic upscaling approach was developed by integrating simple averaging with regression kriging for ICESat-2 SD data. Spatial matching of multisource datasets was then conducted using 3342 SD observation sites to comprehensively evaluate the uncertainties among the five datasets. The results indicate that the retrieval errors of each dataset are positively correlated with the mean SD across different regions. During snow accumulation and melt periods, ICESat-2 demonstrates significant advantages in nonforested areas with SDs ranging from 5 to 45 cm. Both passive microwave and reanalysis SD products demonstrate reliable performance during stable snow periods. However, products often miss snow during melt seasons despite ground confirmation. The causes of this phenomenon differ between the two datasets: passive microwave retrievals are primarily dominated by the physical properties of liquid water, whereas reanalysis products face limitations due to model structure and insufficient input data. In conclusion, the integration of station and ICESat-2 SD in nonforested regions may provide new possibilities for validating products.
由于传感器特性和检索算法的差异,当前雪深(SD)数据集存在显著的不确定性。本研究系统评估了来自北半球13个代表性积雪覆盖地区的5个SD数据集,这些数据集来自冰、云和陆地高程卫星2号(ICESat-2)、无源微波产品(AMSR2和GlobSnow)和再分析产品(ERA5和现代研究与应用回顾性分析,版本2)。将ICESat-2 SD数据的简单平均与回归克里格相结合,提出了一种新的动态升级方法。然后利用3342个SD观测点对多源数据集进行空间匹配,综合评价5个数据集之间的不确定性。结果表明,各数据集的检索误差与不同区域的平均SD呈正相关。在积雪和融化期间,ICESat-2在非森林地区显示出显著的优势,SDs范围为5至45 cm。无源微波和再分析SD产品在稳定雪期表现出可靠的性能。然而,产品经常错过融雪季节,尽管地面确认。造成这种现象的原因在两个数据集之间有所不同:被动微波检索主要受液态水的物理性质支配,而再分析产品由于模型结构和输入数据不足而面临限制。综上所述,在非森林地区整合站点和ICESat-2 SD可能为验证产品提供新的可能性。
{"title":"Uncertainty Characterization of ICESat-2, Passive Microwave, and Reanalysis Snow Depth Datasets Using Site Data in the Northern Hemisphere","authors":"Dongdong Feng;Tao Che;Liyun Dai;Liang Gao;Fei Wu;Yanxing Hu;Yang Zhang;Guigang Wang;Yueling Shi","doi":"10.1109/JSTARS.2025.3645917","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3645917","url":null,"abstract":"Significant uncertainties persist in current snow depth (SD) datasets due to variations in sensor characteristics and retrieval algorithms. This study systematically evaluated five SD datasets derived from ice, cloud, and land elevation satellite-2 (ICESat-2), passive microwave products (AMSR2 and GlobSnow), and reanalysis products (ERA5 and modern-era retrospective analysis for research and applications, version 2) across 13 representative snow-covered regions in the Northern Hemisphere. A novel dynamic upscaling approach was developed by integrating simple averaging with regression kriging for ICESat-2 SD data. Spatial matching of multisource datasets was then conducted using 3342 SD observation sites to comprehensively evaluate the uncertainties among the five datasets. The results indicate that the retrieval errors of each dataset are positively correlated with the mean SD across different regions. During snow accumulation and melt periods, ICESat-2 demonstrates significant advantages in nonforested areas with SDs ranging from 5 to 45 cm. Both passive microwave and reanalysis SD products demonstrate reliable performance during stable snow periods. However, products often miss snow during melt seasons despite ground confirmation. The causes of this phenomenon differ between the two datasets: passive microwave retrievals are primarily dominated by the physical properties of liquid water, whereas reanalysis products face limitations due to model structure and insufficient input data. In conclusion, the integration of station and ICESat-2 SD in nonforested regions may provide new possibilities for validating products.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2525-2542"},"PeriodicalIF":5.3,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11303762","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UAV-DETR: Few-Parameter DETR for Small Object Detection in High-Altitude UAV Images UAV-DETR:用于高空无人机图像小目标检测的少参数DETR
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-18 DOI: 10.1109/JSTARS.2025.3645731
Ningsheng Liao;Yuning Zhang;Zhongliang Yu;Jiangshuai Huang;Mi Zhu;Bo Peng
DEtection TRansformer (DETR) has received significant recognition for its ability to streamline the design process of object detectors through the concept of set prediction. However, its exceptional performance comes at the cost of a high parameter count and significant computational requirements. Moreover, its ability to detect small objects is compromised, making it less suitable for analyzing high-altitude UncrewedAerial Vehicle (UAV) images. This article proposes UAV-DETR, a DETR architecture specifically designed for detecting UAV images captured at high altitudes, balancing parameter count and precision. UAV-DETR is built in two steps: First, inverted residual structures are used to preserve low-dimensional image features, followed by a carefully designed cascaded linear attention mechanism to mitigate parameter redundancy. Through observation and analysis of the attention diffusion issue in the encoder, a cross-channel dynamic sampling mechanism is proposed, which effectively expands the model’s receptive field while maintaining accuracy. In addition, the loss function is redesigned by incorporating the Wasserstein distance, which is insensitive to bounding boxes, to accelerate model convergence. Extensive experimental results on two major benchmarks, i.e., VisDrone and UAVDT, validate the simplicity and efficiency of our model. Specifically, on the VisDrone2021 public test set, UAV-DETR exhibits superior performance with only 14 million parameters compared to YOLOv8$_{m}$, reducing the model’s parameter count and complexity by 44$%$ and 10$%$, respectively, while achieving a 16.6$%$ improvement in accuracy, without any data augmentation or postprocessing procedures.
探测变压器(DEtection TRansformer, DETR)因其通过集合预测的概念简化目标检测器设计过程的能力而受到广泛认可。然而,其卓越的性能是以高参数计数和大量计算需求为代价的。此外,它探测小物体的能力受到损害,使得它不太适合分析高空无人驾驶飞行器(UAV)图像。本文提出了UAV-DETR,这是一种专门用于检测高空捕获的无人机图像,平衡参数计数和精度的DETR架构。UAV-DETR的构建分两个步骤:首先,使用反向残差结构来保持低维图像特征,然后精心设计级联线性注意机制来减轻参数冗余。通过对编码器中注意扩散问题的观察和分析,提出了一种跨通道动态采样机制,在保持模型精度的同时有效扩展了模型的接受野。此外,通过引入对边界框不敏感的Wasserstein距离来重新设计损失函数,加快了模型的收敛速度。在两个主要基准(即VisDrone和UAVDT)上的大量实验结果验证了我们模型的简单性和效率。具体来说,在VisDrone2021公共测试集上,与YOLOv8$_{m}$相比,UAV-DETR仅具有1400万个参数,表现出优越的性能,分别将模型的参数计数和复杂性减少了44% $%$和10% $%$,同时在没有任何数据增强或后处理程序的情况下实现了16.6% $%$的精度提高。
{"title":"UAV-DETR: Few-Parameter DETR for Small Object Detection in High-Altitude UAV Images","authors":"Ningsheng Liao;Yuning Zhang;Zhongliang Yu;Jiangshuai Huang;Mi Zhu;Bo Peng","doi":"10.1109/JSTARS.2025.3645731","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3645731","url":null,"abstract":"<bold>DE</b>tection <bold>TR</b>ansformer (DETR) has received significant recognition for its ability to streamline the design process of object detectors through the concept of set prediction. However, its exceptional performance comes at the cost of a high parameter count and significant computational requirements. Moreover, its ability to detect small objects is compromised, making it less suitable for analyzing high-altitude <bold>U</b>ncrewed<bold>A</b>erial <bold>V</b>ehicle (UAV) images. This article proposes UAV-DETR, a DETR architecture specifically designed for detecting UAV images captured at high altitudes, balancing parameter count and precision. UAV-DETR is built in two steps: First, inverted residual structures are used to preserve low-dimensional image features, followed by a carefully designed cascaded linear attention mechanism to mitigate parameter redundancy. Through observation and analysis of the attention diffusion issue in the encoder, a cross-channel dynamic sampling mechanism is proposed, which effectively expands the model’s receptive field while maintaining accuracy. In addition, the loss function is redesigned by incorporating the Wasserstein distance, which is insensitive to bounding boxes, to accelerate model convergence. Extensive experimental results on two major benchmarks, i.e., VisDrone and UAVDT, validate the simplicity and efficiency of our model. Specifically, on the VisDrone2021 public test set, UAV-DETR exhibits superior performance with only 14 million parameters compared to YOLOv8<inline-formula><tex-math>$_{m}$</tex-math></inline-formula>, reducing the model’s parameter count and complexity by 44<inline-formula><tex-math>$%$</tex-math></inline-formula> and 10<inline-formula><tex-math>$%$</tex-math></inline-formula>, respectively, while achieving a 16.6<inline-formula><tex-math>$%$</tex-math></inline-formula> improvement in accuracy, without any data augmentation or postprocessing procedures.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2575-2587"},"PeriodicalIF":5.3,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11303730","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ultralightweight Multidomain Feature Extraction Network With Cross Spatial–Spectral Attention for Hyperspectral Image Classification 一种跨空间光谱关注的超轻量多域特征提取网络用于高光谱图像分类
IF 5.3 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-18 DOI: 10.1109/JSTARS.2025.3646025
Qinggang Wu;Chao Ma;Mengkun He;Zedong Wu;Qinge Wu
The combination of convolutional neural networks and attention mechanisms effectively enhances feature representation capabilities in hyperspectral image (HSI) classification. However, most existing methods face great challenges in terms of parameter numbers and computational overhead, which hinders their applications when computing and storage resources are limited. To address these issues, we propose an ultralightweight multidomain feature extraction network with cross spatial–spectral attention (ULMN-CS2A) for HSI classification, which primarily consists of three modules, i.e., the collaborative frequency-spatial–spectral (CFSS) feature extraction module, Gaussian neighboring pixel ReLU (GNReLU) activation, and cross spatial–spectral attention (CSSA). First, the ultralightweight CFSS module is designed to replace traditional lightweight convolutional layers by independently extracting features from the frequency, spatial, and spectral domains. Second, the GNReLU module enhances the network's nonlinear fitting ability and improves interlayer information transmission by aggregating neighboring pixels with Gaussian weights. Third, the lightweight CSSA module captures the paired pixel-level spatial–spectral relationships and enhances the global context representation ability by simultaneously learning their interactions. Extensive experiments demonstrate that the proposed ULMN-CS2A method shows strong competitiveness compared to state-of-the-art lightweight methods in terms of model parameters, FLOPs, and classification performance under small sampling rates. Meanwhile, ULMN-CS2A-MSP achieves an excellent classification result of 82.31% in terms of open-overall accuracy on Salinas Valley dataset for open-set HSI classification task.
卷积神经网络与注意机制的结合有效地提高了高光谱图像分类中的特征表示能力。然而,现有的大多数方法在参数数量和计算开销方面都面临着巨大的挑战,这阻碍了它们在计算和存储资源有限的情况下的应用。为了解决这些问题,我们提出了一种用于HSI分类的超轻量化多域特征提取网络(ULMN-CS2A),该网络主要由三个模块组成,即协同频率-空间-频谱(CFSS)特征提取模块、高斯相邻像素ReLU (GNReLU)激活模块和交叉空间-频谱注意(CSSA)模块。首先,超轻量化CFSS模块通过从频率、空间和频谱域独立提取特征来取代传统的轻量级卷积层。其次,GNReLU模块通过高斯权值对相邻像素进行聚合,增强了网络的非线性拟合能力,改善了层间信息传输;第三,轻量级CSSA模块捕获配对的像素级空间-光谱关系,并通过同时学习它们之间的相互作用来增强全局上下文表示能力。大量实验表明,与最先进的轻量级方法相比,所提出的ULMN-CS2A方法在模型参数、FLOPs和小采样率下的分类性能方面具有很强的竞争力。同时,ULMN-CS2A-MSP在Salinas Valley数据集上对开放集HSI分类任务的开放总体准确率达到了82.31%的优异分类结果。
{"title":"An Ultralightweight Multidomain Feature Extraction Network With Cross Spatial–Spectral Attention for Hyperspectral Image Classification","authors":"Qinggang Wu;Chao Ma;Mengkun He;Zedong Wu;Qinge Wu","doi":"10.1109/JSTARS.2025.3646025","DOIUrl":"https://doi.org/10.1109/JSTARS.2025.3646025","url":null,"abstract":"The combination of convolutional neural networks and attention mechanisms effectively enhances feature representation capabilities in hyperspectral image (HSI) classification. However, most existing methods face great challenges in terms of parameter numbers and computational overhead, which hinders their applications when computing and storage resources are limited. To address these issues, we propose an ultralightweight multidomain feature extraction network with cross spatial–spectral attention (ULMN-CS2A) for HSI classification, which primarily consists of three modules, i.e., the collaborative frequency-spatial–spectral (CFSS) feature extraction module, Gaussian neighboring pixel ReLU (GNReLU) activation, and cross spatial–spectral attention (CSSA). First, the ultralightweight CFSS module is designed to replace traditional lightweight convolutional layers by independently extracting features from the frequency, spatial, and spectral domains. Second, the GNReLU module enhances the network's nonlinear fitting ability and improves interlayer information transmission by aggregating neighboring pixels with Gaussian weights. Third, the lightweight CSSA module captures the paired pixel-level spatial–spectral relationships and enhances the global context representation ability by simultaneously learning their interactions. Extensive experiments demonstrate that the proposed ULMN-CS2A method shows strong competitiveness compared to state-of-the-art lightweight methods in terms of model parameters, FLOPs, and classification performance under small sampling rates. Meanwhile, ULMN-CS2A-MSP achieves an excellent classification result of 82.31% in terms of open-overall accuracy on Salinas Valley dataset for open-set HSI classification task.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"19 ","pages":"2829-2849"},"PeriodicalIF":5.3,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11303540","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1