首页 > 最新文献

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society最新文献

英文 中文
S3LBI: Spectral–Spatial Segmentation-Based Local Bicubic Interpolation for Single Hyperspectral Image Super-Resolution 基于光谱-空间分割的单幅高光谱图像超分辨率局部双三次插值
Yubo Ma;Wei He;Siyu Cai;Qingke Zou
Single hyperspectral image (HSI) super-resolution (SR), which is limited by the lack of exterior information, has always been a challenging task. A lot of effort has gone into fully mining spectral information or adopting pretrained models to enhance spatial resolution. However, few SR approaches take into account structural features from the perspective of multidimensional segmentation of the image. Therefore, a novel spectral–spatial segmentation-based local bicubic interpolation (S3LBI) is proposed to implement segmented and blocked interpolation according to the characteristics of HSI. Specifically, the bands of an HSI are clustered into several spectral segments. Then, super-pixel segmentation is carried out in each spectral segment. After that, the bicubic interpolations are separately conducted on different spectral–spatial segments. Experiments demonstrate the superiority of our S3LBI over the compared HSI SR approaches.
单幅高光谱图像(HSI)的超分辨率一直是一项具有挑战性的任务,但受外部信息缺乏的限制。在充分挖掘光谱信息或采用预训练模型来提高空间分辨率方面已经付出了大量的努力。然而,很少有SR方法从图像的多维分割角度考虑结构特征。因此,根据HSI的特点,提出了一种基于频谱空间分割的局部双三次插值方法(S3LBI)来实现分割和块插值。具体地说,HSI的波段被聚集成几个光谱段。然后,对每个光谱段进行超像素分割。然后分别对不同的光谱空间段进行双三次插值。实验证明了我们的S3LBI优于比较的HSI SR方法。
{"title":"S3LBI: Spectral–Spatial Segmentation-Based Local Bicubic Interpolation for Single Hyperspectral Image Super-Resolution","authors":"Yubo Ma;Wei He;Siyu Cai;Qingke Zou","doi":"10.1109/LGRS.2025.3601230","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601230","url":null,"abstract":"Single hyperspectral image (HSI) super-resolution (SR), which is limited by the lack of exterior information, has always been a challenging task. A lot of effort has gone into fully mining spectral information or adopting pretrained models to enhance spatial resolution. However, few SR approaches take into account structural features from the perspective of multidimensional segmentation of the image. Therefore, a novel spectral–spatial segmentation-based local bicubic interpolation (S3LBI) is proposed to implement segmented and blocked interpolation according to the characteristics of HSI. Specifically, the bands of an HSI are clustered into several spectral segments. Then, super-pixel segmentation is carried out in each spectral segment. After that, the bicubic interpolations are separately conducted on different spectral–spatial segments. Experiments demonstrate the superiority of our S3LBI over the compared HSI SR approaches.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal-Guided Transformer Architecture for Remote Sensing Salient Object Detection 遥感显著目标检测的多模态引导变压器结构
Bei Cheng;Zao Liu;Huxiao Tang;Qingwang Wang;Wenhao Chen;Tao Chen;Tao Shen
The latest remote sensing image saliency detectors primarily rely on RGB information alone. However, spatial and geometric information embedded in depth images is robust to variations in lighting and color. Integrating depth information with RGB images can enhance the spatial structure of objects. In light of this, we innovatively propose a remote sensing image saliency detection model that fuses RGB and depth information, named the multimodal-guided transformer architecture (MGTA). Specifically, we first introduce the strongly correlated complementary fusion (SCCF) module to explore cross-modal consistency and similarity, maintaining consistency across different modalities while uncovering multidimensional common information. In addition, the global–local context information interaction (GLCII) module is designed to extract global semantic information and local detail information, effectively utilizing contextual information while reducing the number of parameters. Finally, a cascaded feature-guided decoder (CFGD) is employed to gradually fuse hierarchical decoding features, effectively integrating multilevel data and accurately locating target positions. Extensive experiments demonstrate that our proposed model outperforms 14 state-of-the-art methods. The code and results of our method are available at https://github.com/Zackisliuzao/MGTANet
最新的遥感图像显著性检测器主要依赖于RGB信息。然而,嵌入在深度图像中的空间和几何信息对光照和颜色的变化具有鲁棒性。将深度信息与RGB图像相结合可以增强物体的空间结构。鉴于此,我们创新性地提出了一种融合RGB和深度信息的遥感图像显著性检测模型,命名为多模态引导变压器架构(multimodal-guided transformer architecture, MGTA)。具体来说,我们首先引入了强相关互补融合(SCCF)模块来探索跨模态一致性和相似性,在发现多维公共信息的同时保持不同模态之间的一致性。此外,设计了全局-局部上下文信息交互(GLCII)模块,提取全局语义信息和局部细节信息,在减少参数数量的同时有效利用上下文信息。最后,采用级联特征引导解码器(CFGD)逐步融合分层解码特征,有效整合多层数据,准确定位目标位置。大量的实验表明,我们提出的模型优于14种最先进的方法。我们的方法的代码和结果可在https://github.com/Zackisliuzao/MGTANet上获得
{"title":"Multimodal-Guided Transformer Architecture for Remote Sensing Salient Object Detection","authors":"Bei Cheng;Zao Liu;Huxiao Tang;Qingwang Wang;Wenhao Chen;Tao Chen;Tao Shen","doi":"10.1109/LGRS.2025.3601083","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601083","url":null,"abstract":"The latest remote sensing image saliency detectors primarily rely on RGB information alone. However, spatial and geometric information embedded in depth images is robust to variations in lighting and color. Integrating depth information with RGB images can enhance the spatial structure of objects. In light of this, we innovatively propose a remote sensing image saliency detection model that fuses RGB and depth information, named the multimodal-guided transformer architecture (MGTA). Specifically, we first introduce the strongly correlated complementary fusion (SCCF) module to explore cross-modal consistency and similarity, maintaining consistency across different modalities while uncovering multidimensional common information. In addition, the global–local context information interaction (GLCII) module is designed to extract global semantic information and local detail information, effectively utilizing contextual information while reducing the number of parameters. Finally, a cascaded feature-guided decoder (CFGD) is employed to gradually fuse hierarchical decoding features, effectively integrating multilevel data and accurately locating target positions. Extensive experiments demonstrate that our proposed model outperforms 14 state-of-the-art methods. The code and results of our method are available at <uri>https://github.com/Zackisliuzao/MGTANet</uri>","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel Characterization Based on 3-D TransUnet-CBAM With Multiloss Function 基于多损耗函数的三维TransUnet-CBAM通道表征
Binpeng Yan;Jiaqi Zhao;Mutian Li;Rui Pan
The channel system is intimately linked to the formation of oil and gas reservoirs. In petroliferous basins, channel deposits frequently serve as both storage spaces and fluid conduits. Consequently, the accurate identification of channels in 3-D seismic data is, therefore, critical for reservoir prediction. Traditional seismic attribute-based methods can outline channel boundaries, but noise and stratigraphic complexity introduce discontinuities that reduce accuracy and require extensive manual correction. Deep learning-based methods outperform conventional methods in terms of efficiency and precision. However, the similar seismic signatures of channels and continuous karst caves in seismic profiles can still mislead the existing models. To address this challenge, we proposed an improved variant of the 3-D TransUnet model for 3-D seismic data recognition. The model incorporates channel and spatial attention mechanisms into the skip connections of the TransUnet architecture, effectively enhancing its feature representation capability and recognition accuracy. In addition, a multiloss function is introduced to improve the delineation and continuity of the channel while increasing the model’s robustness against nonchannel interference features. Experiments on synthetic and field seismic data confirm superior boundary delineation, continuity, and noise resistance compared with baseline methods.
河道系统与油气藏的形成有着密切的联系。在含油气盆地中,河道沉积往往既是储集空间又是流体通道。因此,在三维地震资料中准确识别通道对储层预测至关重要。传统的基于地震属性的方法可以勾勒出通道边界,但噪声和地层复杂性会引入不连续面,从而降低精度,需要大量的人工校正。基于深度学习的方法在效率和精度方面优于传统方法。然而,地震剖面中通道和连续溶洞的相似地震特征仍然会对现有模型产生误导。为了解决这一挑战,我们提出了一种改进的3-D TransUnet模型,用于3-D地震数据识别。该模型将通道和空间注意机制融入到TransUnet架构的跳跃连接中,有效提高了TransUnet架构的特征表示能力和识别精度。此外,引入多损失函数来改善信道的描绘和连续性,同时提高模型对非信道干扰特征的鲁棒性。合成和现场地震数据实验证实,与基线方法相比,该方法具有更好的边界圈定、连续性和抗噪性。
{"title":"Channel Characterization Based on 3-D TransUnet-CBAM With Multiloss Function","authors":"Binpeng Yan;Jiaqi Zhao;Mutian Li;Rui Pan","doi":"10.1109/LGRS.2025.3601200","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601200","url":null,"abstract":"The channel system is intimately linked to the formation of oil and gas reservoirs. In petroliferous basins, channel deposits frequently serve as both storage spaces and fluid conduits. Consequently, the accurate identification of channels in 3-D seismic data is, therefore, critical for reservoir prediction. Traditional seismic attribute-based methods can outline channel boundaries, but noise and stratigraphic complexity introduce discontinuities that reduce accuracy and require extensive manual correction. Deep learning-based methods outperform conventional methods in terms of efficiency and precision. However, the similar seismic signatures of channels and continuous karst caves in seismic profiles can still mislead the existing models. To address this challenge, we proposed an improved variant of the 3-D TransUnet model for 3-D seismic data recognition. The model incorporates channel and spatial attention mechanisms into the skip connections of the TransUnet architecture, effectively enhancing its feature representation capability and recognition accuracy. In addition, a multiloss function is introduced to improve the delineation and continuity of the channel while increasing the model’s robustness against nonchannel interference features. Experiments on synthetic and field seismic data confirm superior boundary delineation, continuity, and noise resistance compared with baseline methods.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Removal of the Feedback Loop in CCSDS 123.0-B-2 During Hardware Implementation CCSDS 123.0-B-2硬件实现过程中反馈回路的去除
Liang Jia;Qi Wang;Lei Zhang;Chengpeng Song;Peng Zhang
The Consultative Committee for Space Data Systems (CCSDS) proposed the CCSDS 123.0-B-2 standard for compressing large volumes of data acquired by multispectral and hyperspectral sensors. However, data dependencies in the CCSDS 123.0-B-2 predictor lead to feedback loops during the weight update process. This poses challenges for fully pipelined hardware implementation of the predictor and severely limits the achievable data throughput. Therefore, it is critical to improve throughput while keeping the degradation in compression performance within an acceptable range. This work demonstrates that by appropriately reducing the frequency of weight updates, the data dependencies in the predictor can be mitigated, thus shortening the critical path in hardware implementation and eliminating feedback loops. Experimental results show that under the band interleaved by line (BIL) data format, the proposed method achieves a throughput of 348.4 MSamples/s using only 3995 look-up tables (LUTs).
空间数据系统协商委员会(CCSDS)提出了CCSDS 123.0-B-2标准,用于压缩多光谱和高光谱传感器获取的大量数据。然而,CCSDS 123.0-B-2预测器中的数据依赖性导致权重更新过程中的反馈循环。这对预测器的完全流水线硬件实现提出了挑战,并严重限制了可实现的数据吞吐量。因此,在提高吞吐量的同时将压缩性能的下降保持在可接受的范围内是至关重要的。这项工作表明,通过适当减少权重更新的频率,可以减轻预测器中的数据依赖性,从而缩短硬件实现中的关键路径并消除反馈回路。实验结果表明,在行交错带数据格式下,该方法仅使用3995个查找表(lut)即可实现348.4 MSamples/s的吞吐量。
{"title":"Removal of the Feedback Loop in CCSDS 123.0-B-2 During Hardware Implementation","authors":"Liang Jia;Qi Wang;Lei Zhang;Chengpeng Song;Peng Zhang","doi":"10.1109/LGRS.2025.3601194","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601194","url":null,"abstract":"The Consultative Committee for Space Data Systems (CCSDS) proposed the CCSDS 123.0-B-2 standard for compressing large volumes of data acquired by multispectral and hyperspectral sensors. However, data dependencies in the CCSDS 123.0-B-2 predictor lead to feedback loops during the weight update process. This poses challenges for fully pipelined hardware implementation of the predictor and severely limits the achievable data throughput. Therefore, it is critical to improve throughput while keeping the degradation in compression performance within an acceptable range. This work demonstrates that by appropriately reducing the frequency of weight updates, the data dependencies in the predictor can be mitigated, thus shortening the critical path in hardware implementation and eliminating feedback loops. Experimental results show that under the band interleaved by line (BIL) data format, the proposed method achieves a throughput of 348.4 MSamples/s using only 3995 look-up tables (LUTs).","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144934415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polynomial Fitting Emitter Localization Method Based on Multisubaperture Phase Stitching 基于多子孔径相位拼接的多项式拟合发射器定位方法
Jiayu Sun;Hao Huan;Ran Tao;Yue Wang
In passive localization, the synthetic aperture positioning (SAP) method enables high-precision positioning under low signal-to-noise ratio (SNR) conditions. However, higher order phase errors induced by platform self-localization errors degrade image focusing and reduce localization accuracy. In this letter, a polynomial fitting approach based on designing optimal prewhitening filters using autoregressive (AR) models and employing iteratively reweighted least squares (IRLS) is applied to the unwrapped phase to eliminate higher order error components. In addition, a multiple subaperture phase stitching method is proposed to mitigate phase susceptibility to noise interference and error accumulation during phase unwrapping. The effectiveness of the proposed method is validated through both simulations and UAV experiments. Results demonstrate that meter-level localization accuracy can be achieved for the emitter target.
在被动定位中,合成孔径定位(SAP)方法能够在低信噪比条件下实现高精度定位。然而,由平台自定位误差引起的高阶相位误差降低了图像聚焦,降低了定位精度。在这篇文章中,基于自回归(AR)模型设计最优预白化滤波器并采用迭代加权最小二乘(IRLS)的多项式拟合方法应用于解包裹阶段以消除高阶误差分量。此外,提出了一种多子孔径相位拼接方法,以减轻相位对噪声干扰的敏感性和相位展开过程中的误差积累。仿真和无人机实验验证了该方法的有效性。结果表明,该方法可实现对目标的米级定位精度。
{"title":"Polynomial Fitting Emitter Localization Method Based on Multisubaperture Phase Stitching","authors":"Jiayu Sun;Hao Huan;Ran Tao;Yue Wang","doi":"10.1109/LGRS.2025.3601138","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601138","url":null,"abstract":"In passive localization, the synthetic aperture positioning (SAP) method enables high-precision positioning under low signal-to-noise ratio (SNR) conditions. However, higher order phase errors induced by platform self-localization errors degrade image focusing and reduce localization accuracy. In this letter, a polynomial fitting approach based on designing optimal prewhitening filters using autoregressive (AR) models and employing iteratively reweighted least squares (IRLS) is applied to the unwrapped phase to eliminate higher order error components. In addition, a multiple subaperture phase stitching method is proposed to mitigate phase susceptibility to noise interference and error accumulation during phase unwrapping. The effectiveness of the proposed method is validated through both simulations and UAV experiments. Results demonstrate that meter-level localization accuracy can be achieved for the emitter target.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145021344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Slum Mapping Through U-Net CNN and Multimodal Remote Sensing Data: A Case Study of Makassar City 利用U-Net CNN和多模态遥感数据增强贫民窟制图:以望加锡市为例
Yohanes Fridolin Hestrio;Eduard Thomas Prakoso;Kiki Winda Veronica;Ika Siwi Supriyani;Destri Yanti Hutapea;Siti Desty Wahyuningsih;Nico Cendiana;Steward Augusto;Krisna Malik Sukarno;Olivia Maftukhaturrizqoh;Rubini Jusuf;Orbita Roswintiarti;Wisnu Jatmiko
Urban slums present critical challenges for sustainable development, particularly in rapidly urbanizing cities like Makassar, Indonesia. This study develops an automated slum mapping approach that integrates high-resolution SPOT-6/7 satellite imagery (1.5-m spatial resolution) with multimodal geospatial data using a U-Net convolutional neural network. Our methodology combines spectral and textural features from satellite imagery with nighttime light emissions, infrastructure proximity analysis, land use classifications, and socioeconomic indicators. The integrated approach achieves an overall accuracy of 97.1%–98.3% across both the datasets. However, slum-specific classification remains challenging with producer’s accuracy of 55.8%–59.1% and user’s accuracy of 22.9%–35.7%, yielding F1-scores of 0.33–0.43 for slum detection. Despite these limitations, the approach demonstrates significant enhancements over traditional census-based methods through automated processing, improved spatial resolution (1.5 m versus administrative units), and increased temporal frequency (annual versus decadal updates). The framework provides actionable insights for urban planning and social assistance targeting while establishing a foundation for automated slum monitoring system iterative improvement.
城市贫民窟对可持续发展构成严峻挑战,特别是在印度尼西亚望加锡等快速城市化的城市。本研究开发了一种自动化贫民窟测绘方法,该方法使用U-Net卷积神经网络将高分辨率SPOT-6/7卫星图像(1.5米空间分辨率)与多模态地理空间数据集成在一起。我们的方法将卫星图像的光谱和纹理特征与夜间光发射、基础设施接近性分析、土地利用分类和社会经济指标相结合。综合方法在两个数据集上的总体准确率为97.1%-98.3%。然而,针对贫民窟的分类仍然具有挑战性,生产者的准确率为55.8%-59.1%,用户的准确率为22.9%-35.7%,贫民窟检测的f1得分为0.33-0.43。尽管存在这些局限性,但该方法通过自动化处理、提高空间分辨率(相对于行政单位1.5米)和增加时间频率(年度更新相对于十年更新),比传统的基于人口普查的方法有了显著的增强。该框架为城市规划和社会援助目标提供了可操作的见解,同时为贫民窟自动监测系统的迭代改进奠定了基础。
{"title":"Enhanced Slum Mapping Through U-Net CNN and Multimodal Remote Sensing Data: A Case Study of Makassar City","authors":"Yohanes Fridolin Hestrio;Eduard Thomas Prakoso;Kiki Winda Veronica;Ika Siwi Supriyani;Destri Yanti Hutapea;Siti Desty Wahyuningsih;Nico Cendiana;Steward Augusto;Krisna Malik Sukarno;Olivia Maftukhaturrizqoh;Rubini Jusuf;Orbita Roswintiarti;Wisnu Jatmiko","doi":"10.1109/LGRS.2025.3601167","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601167","url":null,"abstract":"Urban slums present critical challenges for sustainable development, particularly in rapidly urbanizing cities like Makassar, Indonesia. This study develops an automated slum mapping approach that integrates high-resolution SPOT-6/7 satellite imagery (1.5-m spatial resolution) with multimodal geospatial data using a U-Net convolutional neural network. Our methodology combines spectral and textural features from satellite imagery with nighttime light emissions, infrastructure proximity analysis, land use classifications, and socioeconomic indicators. The integrated approach achieves an overall accuracy of 97.1%–98.3% across both the datasets. However, slum-specific classification remains challenging with producer’s accuracy of 55.8%–59.1% and user’s accuracy of 22.9%–35.7%, yielding F1-scores of 0.33–0.43 for slum detection. Despite these limitations, the approach demonstrates significant enhancements over traditional census-based methods through automated processing, improved spatial resolution (1.5 m versus administrative units), and increased temporal frequency (annual versus decadal updates). The framework provides actionable insights for urban planning and social assistance targeting while establishing a foundation for automated slum monitoring system iterative improvement.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144990019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SpADANet: A Spatially Aware Domain Adaptation Network for Hurricane Damage Assessment SpADANet:用于飓风损害评估的空间感知域自适应网络
Pratyush V. Talreja;Surya S. Durbha
Hurricanes cause significant damage to communities, necessitating rapid and accurate damage assessment to support timely disaster response. However, image-based deep learning models for hurricane-induced damage assessment face substantial challenges due to domain shifts across different hurricane events, and the restricted availability of labeled data for each disaster further complicates this task. In this study, we propose a novel domain-adaptive deep learning framework that mitigates the domain gap while requiring minimal labeled samples from the target domain. Our approach integrates a self-supervised learning (SSL) pretext task to enhance feature robustness and leverages a novel bilateral local Moran’s I (BLMI) module to improve spatial feature aggregation for damage localization. We evaluate our method using aerial datasets from Hurricanes Harvey, Matthew, and Michael. The experimental results demonstrate that our model achieves more than 5% improvement in damage classification accuracy over baseline methods. These findings highlight the potential of our approach for scalable and efficient hurricane damage assessment in real-world disaster scenarios.
飓风对社区造成重大破坏,需要快速准确的损害评估,以支持及时的灾害应对。然而,基于图像的飓风损伤评估深度学习模型面临着巨大的挑战,因为不同飓风事件之间的域转移,并且每个灾难标记数据的有限可用性进一步使这项任务复杂化。在本研究中,我们提出了一种新的领域自适应深度学习框架,该框架可以减轻领域差距,同时需要来自目标领域的最小标记样本。我们的方法集成了一个自监督学习(SSL)借口任务来增强特征鲁棒性,并利用一个新的双边局部Moran 's I (BLMI)模块来改进用于损伤定位的空间特征聚合。我们使用哈维、马修和迈克尔飓风的航空数据集来评估我们的方法。实验结果表明,该模型的损伤分类精度比基线方法提高了5%以上。这些发现突出了我们的方法在现实世界灾害情景中可扩展和有效的飓风损害评估的潜力。
{"title":"SpADANet: A Spatially Aware Domain Adaptation Network for Hurricane Damage Assessment","authors":"Pratyush V. Talreja;Surya S. Durbha","doi":"10.1109/LGRS.2025.3601507","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601507","url":null,"abstract":"Hurricanes cause significant damage to communities, necessitating rapid and accurate damage assessment to support timely disaster response. However, image-based deep learning models for hurricane-induced damage assessment face substantial challenges due to domain shifts across different hurricane events, and the restricted availability of labeled data for each disaster further complicates this task. In this study, we propose a novel domain-adaptive deep learning framework that mitigates the domain gap while requiring minimal labeled samples from the target domain. Our approach integrates a self-supervised learning (SSL) pretext task to enhance feature robustness and leverages a novel bilateral local Moran’s I (BLMI) module to improve spatial feature aggregation for damage localization. We evaluate our method using aerial datasets from Hurricanes Harvey, Matthew, and Michael. The experimental results demonstrate that our model achieves more than 5% improvement in damage classification accuracy over baseline methods. These findings highlight the potential of our approach for scalable and efficient hurricane damage assessment in real-world disaster scenarios.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CSA-RSIC: Cross-Modal Semantic Alignment for Remote Sensing Image Captioning 遥感图像标注的跨模态语义对齐
Kangda Cheng;Jinlong Liu;Rui Mao;Zhilu Wu;Erik Cambria
Remote sensing image captioning (RSIC) is an important task in environmental monitoring and disaster assessment. However, existing methods are constrained by redundant feature interference, insufficient multiscale feature integration, and cross-modal semantic gaps, leading to limited performance in scenarios requiring fine-grained descriptions and semantic integrity, such as disaster assessment and emergency response. In this letter, we propose a cross-modal semantic alignment model for RSIC (CSA-RSIC), addressing these challenges with three innovations. First, we designed an adaptive feature selection module (AFSM) that generates channel weights through dual pooling. The AFSM dynamically weights the most informative features at each scale to improve caption accuracy. Second, we propose a cross-scale feature aggregation module (CFAM) that constructs a hierarchical feature pyramid by aligning multiscale resolutions and performs attention-guided fusion with enhanced weighting via AFSM, ensuring the effective integration of fine-grained and global semantic information. Finally, a novel loss function that combines contrastive learning and consistency loss is proposed to enhance the semantic alignment between visual and textual features. Experiments on three datasets show the advancement of CSA-RSIC over strong baselines, indicating its effectiveness in enhancing both semantic completeness and accuracy.
遥感图像字幕处理是环境监测和灾害评估中的一项重要任务。然而,现有方法受到冗余特征干扰、多尺度特征集成不足和跨模态语义缺口的限制,导致在需要细粒度描述和语义完整性的场景下,如灾害评估和应急响应,性能有限。在这封信中,我们提出了一个RSIC的跨模态语义对齐模型(CSA-RSIC),通过三个创新来解决这些挑战。首先,我们设计了一个自适应特征选择模块(AFSM),该模块通过双池化生成信道权重。AFSM动态地对每个尺度上信息量最大的特征进行加权,以提高标题的准确性。其次,我们提出了一种跨尺度特征聚合模块(CFAM),该模块通过对齐多尺度分辨率构建分层特征金字塔,并通过AFSM进行注意引导融合和增强加权,确保了细粒度和全局语义信息的有效集成。最后,提出了一种结合对比学习和一致性损失的损失函数,以增强视觉特征和文本特征之间的语义一致性。在三个数据集上的实验表明,CSA-RSIC在强基线上的进步表明其在提高语义完整性和准确性方面都是有效的。
{"title":"CSA-RSIC: Cross-Modal Semantic Alignment for Remote Sensing Image Captioning","authors":"Kangda Cheng;Jinlong Liu;Rui Mao;Zhilu Wu;Erik Cambria","doi":"10.1109/LGRS.2025.3601114","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601114","url":null,"abstract":"Remote sensing image captioning (RSIC) is an important task in environmental monitoring and disaster assessment. However, existing methods are constrained by redundant feature interference, insufficient multiscale feature integration, and cross-modal semantic gaps, leading to limited performance in scenarios requiring fine-grained descriptions and semantic integrity, such as disaster assessment and emergency response. In this letter, we propose a cross-modal semantic alignment model for RSIC (CSA-RSIC), addressing these challenges with three innovations. First, we designed an adaptive feature selection module (AFSM) that generates channel weights through dual pooling. The AFSM dynamically weights the most informative features at each scale to improve caption accuracy. Second, we propose a cross-scale feature aggregation module (CFAM) that constructs a hierarchical feature pyramid by aligning multiscale resolutions and performs attention-guided fusion with enhanced weighting via AFSM, ensuring the effective integration of fine-grained and global semantic information. Finally, a novel loss function that combines contrastive learning and consistency loss is proposed to enhance the semantic alignment between visual and textual features. Experiments on three datasets show the advancement of CSA-RSIC over strong baselines, indicating its effectiveness in enhancing both semantic completeness and accuracy.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sea-Undistort: A Dataset for Through-Water Image Restoration in High-Resolution Airborne Bathymetric Mapping sea - undistortion:高分辨率航空测深制图中通过水图像恢复的数据集
Maximilian Kromer;Panagiotis Agrafiotis;Begüm Demir
Accurate image-based bathymetric mapping in shallow waters remains challenging due to the complex optical distortions, such as wave-induced patterns, scattering, and sunglint, introduced by the dynamic water surface, the water column properties, and solar illumination. In this work, we introduce Sea-Undistort, a comprehensive synthetic dataset of 1200 paired $512times 512$ through-water scenes rendered in Blender. Each pair comprises a distortion-free and a distorted view, featuring realistic water effects, such as sun glint, waves, and scattering over diverse seabeds. Accompanied by per-image metadata, such as camera parameters, sun position, and average depth, Sea-Undistort enables supervised training that is otherwise infeasible in real environments. We use Sea-Undistort to benchmark two state-of-the-art image restoration methods alongside an enhanced lightweight diffusion-based framework with an early fusion sun-glint mask. When applied to real aerial data, the enhanced diffusion model delivers more complete digital surface models (DSMs) of the seabed, especially in deeper areas, reduces bathymetric errors, suppresses glint and scattering, and crisply restores fine seabed details. Dataset, weights, and code are publicly available at https://www.magicbathy.eu/Sea-Undistort.html.
由于动态水面、水柱特性和太阳光照等因素导致的复杂光学畸变,如波浪诱导模式、散射和太阳晖射等,在浅水区进行精确的基于图像的水深测绘仍然具有挑战性。在这项工作中,我们介绍了sea - undistortion,这是一个综合的合成数据集,由1200对$512乘以512$通过Blender渲染的水场景组成。每一对都包含一个无扭曲和扭曲的视图,具有逼真的水效果,如阳光闪烁,波浪和散射在不同的海床上。伴随着每个图像的元数据,如相机参数、太阳位置和平均深度,sea - undistortion使监督训练成为可能,否则在真实环境中是不可行的。我们使用sea - undistortion对两种最先进的图像恢复方法进行基准测试,以及增强的轻量级扩散框架和早期融合太阳闪烁面罩。当应用于实际航空数据时,增强的扩散模型提供了更完整的海底数字表面模型(DSMs),特别是在较深的区域,减少了水深误差,抑制了闪烁和散射,并清晰地恢复了海底的精细细节。数据集、权重和代码可在https://www.magicbathy.eu/Sea-Undistort.html上公开获取。
{"title":"Sea-Undistort: A Dataset for Through-Water Image Restoration in High-Resolution Airborne Bathymetric Mapping","authors":"Maximilian Kromer;Panagiotis Agrafiotis;Begüm Demir","doi":"10.1109/LGRS.2025.3601239","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601239","url":null,"abstract":"Accurate image-based bathymetric mapping in shallow waters remains challenging due to the complex optical distortions, such as wave-induced patterns, scattering, and sunglint, introduced by the dynamic water surface, the water column properties, and solar illumination. In this work, we introduce Sea-Undistort, a comprehensive synthetic dataset of 1200 paired <inline-formula> <tex-math>$512times 512$ </tex-math></inline-formula> through-water scenes rendered in Blender. Each pair comprises a distortion-free and a distorted view, featuring realistic water effects, such as sun glint, waves, and scattering over diverse seabeds. Accompanied by per-image metadata, such as camera parameters, sun position, and average depth, Sea-Undistort enables supervised training that is otherwise infeasible in real environments. We use Sea-Undistort to benchmark two state-of-the-art image restoration methods alongside an enhanced lightweight diffusion-based framework with an early fusion sun-glint mask. When applied to real aerial data, the enhanced diffusion model delivers more complete digital surface models (DSMs) of the seabed, especially in deeper areas, reduces bathymetric errors, suppresses glint and scattering, and crisply restores fine seabed details. Dataset, weights, and code are publicly available at <uri>https://www.magicbathy.eu/Sea-Undistort.html</uri>.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11132387","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Land Surface Emissivity Retrieval From Landsat 9 Data in Combination With Land Cover Data and Spectral Library 结合土地覆盖数据和光谱库的Landsat 9数据地表发射率反演
Qi Zhang;Yonggang Qian;Kun Li;Qiyao Li;Jianmin Wang;Dacheng Li
Land surface emissivity (LSE) is crucial for retrieving land surface temperature (LST) from Landsat 9 TIRS-2 thermal infrared (TIR) data. However, the single-band LSE product (band 10) provided officially is insufficient for the split-window (SW) algorithm requiring dual-band emissivity inputs. This letter proposes a land cover and channel transformed-LSE (LCCT-LSE) method to estimate band 11 LSE and enables LST retrieval using the SW algorithm on Google Earth Engine. Cross-validation with MOD21 LSE products showed that the LCCT-LSE method achieved a mean absolute error (MAE) of 0.004 and a root mean square error (RMSE) of 0.005, outperforming the classification-based method, NDVI threshold method, and vegetation cover vegetation cover-based method (VCM) methods. In situ validation showed SW-retrieved LST attains MAE/RMSE of 1.27/2.13 K, with consistent accuracy across diverse land covers (water: 0.86 K, soil: 1.58 K, desert: 1.71 K, sand: 1.80 K, and vegetation: 0.87 K). A comparison with the official Landsat 9 LST product indicated that the bias of retrieved LST is within 1 K for all land cover classes (cropland, forest, grassland, shrubland, water, barren, and impervious) in Beijing. These results demonstrated that the LCCT-LSE method is capable of estimating the LSE in Landsat 9 band 11 with a reliable and accurate result. This study provides a new insight for LST retrieval from Landsat 9 data.
地表发射率(LSE)是Landsat 9 -2热红外数据反演地表温度(LST)的关键参数。然而,官方提供的单波段LSE产品(波段10)不足以满足需要双波段发射率输入的分窗(SW)算法。本文提出了一种土地覆盖和信道变换LSE (LCCT-LSE)方法来估计11波段LSE,并在谷歌Earth Engine上使用SW算法实现LST检索。与MOD21 LSE产品的交叉验证表明,LCCT-LSE方法的平均绝对误差(MAE)为0.004,均方根误差(RMSE)为0.005,优于基于分类的方法、NDVI阈值方法和植被覆盖度基于植被覆盖度的方法(VCM)。原位验证表明,sw反演的LST的MAE/RMSE为1.27/2.13 K,在不同的土地覆盖范围(水:0.86 K,土壤:1.58 K,沙漠:1.71 K,沙子:1.80 K,植被:0.87 K)具有一致的精度。与官方Landsat 9地表温度产品的比较表明,北京所有土地覆盖类型(农田、森林、草地、灌丛、水、贫瘠和不透水)的反演地表温度偏差在1 K以内。这些结果表明,LCCT-LSE方法能够估计Landsat 9波段11的LSE,结果可靠、准确。该研究为Landsat 9数据的地表温度反演提供了新的思路。
{"title":"Land Surface Emissivity Retrieval From Landsat 9 Data in Combination With Land Cover Data and Spectral Library","authors":"Qi Zhang;Yonggang Qian;Kun Li;Qiyao Li;Jianmin Wang;Dacheng Li","doi":"10.1109/LGRS.2025.3601391","DOIUrl":"https://doi.org/10.1109/LGRS.2025.3601391","url":null,"abstract":"Land surface emissivity (LSE) is crucial for retrieving land surface temperature (LST) from Landsat 9 TIRS-2 thermal infrared (TIR) data. However, the single-band LSE product (band 10) provided officially is insufficient for the split-window (SW) algorithm requiring dual-band emissivity inputs. This letter proposes a land cover and channel transformed-LSE (LCCT-LSE) method to estimate band 11 LSE and enables LST retrieval using the SW algorithm on Google Earth Engine. Cross-validation with MOD21 LSE products showed that the LCCT-LSE method achieved a mean absolute error (MAE) of 0.004 and a root mean square error (RMSE) of 0.005, outperforming the classification-based method, NDVI threshold method, and vegetation cover vegetation cover-based method (VCM) methods. In situ validation showed SW-retrieved LST attains MAE/RMSE of 1.27/2.13 K, with consistent accuracy across diverse land covers (water: 0.86 K, soil: 1.58 K, desert: 1.71 K, sand: 1.80 K, and vegetation: 0.87 K). A comparison with the official Landsat 9 LST product indicated that the bias of retrieved LST is within 1 K for all land cover classes (cropland, forest, grassland, shrubland, water, barren, and impervious) in Beijing. These results demonstrated that the LCCT-LSE method is capable of estimating the LSE in Landsat 9 band 11 with a reliable and accurate result. This study provides a new insight for LST retrieval from Landsat 9 data.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145011321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1