首页 > 最新文献

ISPRS Journal of Photogrammetry and Remote Sensing最新文献

英文 中文
A novel transformer-based CO2 retrieval framework incorporating prior constraint and hierarchical features injection: assessment of transferability for Tansat-2 结合先验约束和分层特征注入的基于变压器的CO2检索框架:Tansat-2可转移性评估
IF 12.7 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-02-02 DOI: 10.1016/j.isprsjprs.2026.01.039
Lingfeng Zhang, Lu Zhang, Xingying Zhang, Tiantao Cheng, Xifeng Cao, Tongwen Li, Dongdong Liu, Yang Zhang, Yuhan Jiang, Ruohua Hu, Haiyang Dou, Lin Chen
{"title":"A novel transformer-based CO2 retrieval framework incorporating prior constraint and hierarchical features injection: assessment of transferability for Tansat-2","authors":"Lingfeng Zhang, Lu Zhang, Xingying Zhang, Tiantao Cheng, Xifeng Cao, Tongwen Li, Dongdong Liu, Yang Zhang, Yuhan Jiang, Ruohua Hu, Haiyang Dou, Lin Chen","doi":"10.1016/j.isprsjprs.2026.01.039","DOIUrl":"https://doi.org/10.1016/j.isprsjprs.2026.01.039","url":null,"abstract":"","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"23 1","pages":""},"PeriodicalIF":12.7,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative assessment of AI-based and classical DSAS approaches in multi-temporal shoreline prediction: A case study of Ras El-Bar coast, Egypt 基于人工智能和经典DSAS方法在多时间线预测中的比较评估:以埃及Ras El-Bar海岸为例
IF 12.7 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-02-01 DOI: 10.1016/j.isprsjprs.2026.01.040
Hesham M. El-Asmar, Mahmoud Sh. Felfla
{"title":"Comparative assessment of AI-based and classical DSAS approaches in multi-temporal shoreline prediction: A case study of Ras El-Bar coast, Egypt","authors":"Hesham M. El-Asmar, Mahmoud Sh. Felfla","doi":"10.1016/j.isprsjprs.2026.01.040","DOIUrl":"https://doi.org/10.1016/j.isprsjprs.2026.01.040","url":null,"abstract":"","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"38 1","pages":""},"PeriodicalIF":12.7,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146095922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-object tracking of vehicles and anomalous states in remote sensing videos: Joint learning of historical trajectory guidance and ID prediction 遥感视频中车辆多目标跟踪与异常状态:历史轨迹制导与ID预测的联合学习
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-31 DOI: 10.1016/j.isprsjprs.2026.01.038
Bin Wang , Yuan Zhou , Haigang Sui , Guorui Ma , Peng Cheng , Di Wang
Research on multi-object tracking (MOT) of vehicles based on remote sensing video data has achieved breakthrough progress. However, MOT of vehicles in complex scenarios and their anomalous states after being subjected to strong deformation interference remains a huge challenge. This is of great significance for military defense, traffic flow management, vehicle damage assessment, etc. To address this problem, this study proposes an end-to-end MOT method that integrates a joint learning paradigm of historical trajectory guidance and identity (ID) prediction, aiming to bridge the gap between vehicle detection and continuous tracking after anomalous states occurrence. The proposed network framework primarily consists of a Frame Feature Aggregation Module (FFAM) that enhances spatial consistency of objects across consecutive video frames, a Historical Tracklets Flow Encoder (HTFE) that employs Mamba blocks to guide object embedding within potential motion flows based on historical frames, and a Semantic-Consistent Clustering Module (SCM) constructed via sparse attention computation to capture global semantic information. The discriminative features extracted by these modules are fused by a Dual-branch Modulation Fusion Unit (DMFU) to maximize the performance of the model. This study also constructs a new dataset for MOT of vehicles and anomalous states in videos, termed the VAS-MOT dataset. Extensive validation experiments conducted on this dataset demonstrate that the method achieves the highest level of performance, with HOTA and MOTA reaching 68.2% and 71.5%, respectively. Additional validation on the open-source dataset IRTS-AG confirms the strong robustness of the proposed method, showing excellent performance in long-term tracking of small vehicles in infrared videos under complex scenarios, where HOTA and MOTA reached 70.9% and 91.6%, respectively. The proposed method provides valuable insights for capturing moving objects and their anomalous states, laying a foundation for further damage assessment.
基于遥感视频数据的车辆多目标跟踪(MOT)研究取得了突破性进展。然而,复杂场景下车辆的MOT及其在受到强变形干扰后的异常状态仍然是一个巨大的挑战。这对军事防御、交通流管理、车辆损伤评估等具有重要意义。为了解决这一问题,本研究提出了一种端到端的MOT方法,该方法集成了历史轨迹引导和身份(ID)预测的联合学习范式,旨在弥合异常状态发生后车辆检测与持续跟踪之间的差距。所提出的网络框架主要由帧特征聚合模块(FFAM)组成,该模块增强了对象在连续视频帧中的空间一致性;历史轨迹流编码器(HTFE)采用曼巴块来指导基于历史帧的潜在运动流中的对象嵌入;以及通过稀疏注意力计算构建的语义一致聚类模块(SCM)来捕获全局语义信息。通过双支路调制融合单元(Dual-branch Modulation Fusion Unit, DMFU)对这些模块提取的判别特征进行融合,使模型的性能最大化。本研究还构建了一个新的车辆MOT和视频异常状态数据集,称为VAS-MOT数据集。在该数据集上进行的大量验证实验表明,该方法达到了最高的性能水平,HOTA和MOTA分别达到了68.2%和71.5%。在开源数据集IRTS-AG上的进一步验证证实了该方法具有较强的鲁棒性,在复杂场景下红外视频中对小型车辆的长期跟踪表现优异,HOTA和MOTA分别达到70.9%和91.6%。该方法为捕获运动物体及其异常状态提供了有价值的见解,为进一步的损伤评估奠定了基础。
{"title":"Multi-object tracking of vehicles and anomalous states in remote sensing videos: Joint learning of historical trajectory guidance and ID prediction","authors":"Bin Wang ,&nbsp;Yuan Zhou ,&nbsp;Haigang Sui ,&nbsp;Guorui Ma ,&nbsp;Peng Cheng ,&nbsp;Di Wang","doi":"10.1016/j.isprsjprs.2026.01.038","DOIUrl":"10.1016/j.isprsjprs.2026.01.038","url":null,"abstract":"<div><div>Research on multi-object tracking (MOT) of vehicles based on remote sensing video data has achieved breakthrough progress. However, MOT of vehicles in complex scenarios and their anomalous states after being subjected to strong deformation interference remains a huge challenge. This is of great significance for military defense, traffic flow management, vehicle damage assessment, etc. To address this problem, this study proposes an end-to-end MOT method that integrates a joint learning paradigm of historical trajectory guidance and identity (ID) prediction, aiming to bridge the gap between vehicle detection and continuous tracking after anomalous states occurrence. The proposed network framework primarily consists of a Frame Feature Aggregation Module (FFAM) that enhances spatial consistency of objects across consecutive video frames, a Historical Tracklets Flow Encoder (HTFE) that employs Mamba blocks to guide object embedding within potential motion flows based on historical frames, and a Semantic-Consistent Clustering Module (SCM) constructed via sparse attention computation to capture global semantic information. The discriminative features extracted by these modules are fused by a Dual-branch Modulation Fusion Unit (DMFU) to maximize the performance of the model. This study also constructs a new dataset for MOT of vehicles and anomalous states in videos, termed the VAS-MOT dataset. Extensive validation experiments conducted on this dataset demonstrate that the method achieves the highest level of performance, with HOTA and MOTA reaching 68.2% and 71.5%, respectively. Additional validation on the open-source dataset IRTS-AG confirms the strong robustness of the proposed method, showing excellent performance in long-term tracking of small vehicles in infrared videos under complex scenarios, where HOTA and MOTA reached 70.9% and 91.6%, respectively. The proposed method provides valuable insights for capturing moving objects and their anomalous states, laying a foundation for further damage assessment.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 383-406"},"PeriodicalIF":12.2,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A geometric Cross-Propagation-Calibration method for SAR constellation based on the graph theory 基于图论的SAR星座几何交叉传播定标方法
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-31 DOI: 10.1016/j.isprsjprs.2026.01.007
Yitong Luo , Xiaolan Qiu , Bei Lin , Zekun Jiao , Wei Wang , Chibiao Ding
The networking capability of SAR constellations can effectively reduce the average revisit period, which has become a new trend in SAR Earth observation. However, the system electronic delay of several or even dozens of SAR satellites in a constellation must be calibrated and monitored for a long time to ensure high geometric accuracy of the product. In this paper, a geometric cross-propagation-calibration method for SAR constellations is proposed, which can calibrate the slant ranges of the SAR satellites in a constellation without any calibrators. The proposed method constructs a graph from all reference and uncalibrated SAR images involved in a cross-calibration task. For each uncalibrated image, the cumulative calibration error along paths originating from the reference images is estimated, enabling the identification of a path that minimizes this error. Cross-calibration is then performed sequentially along this optimal path. A closed-form expression is derived to estimate the cumulative calibration error along any path, which also reveals the underlying mechanism of error propagation in cross-calibration. Experiments based on real data show that the proposed method enables two China’s microsatellites, Qilu-1 and Xingrui-9, to achieve geometric accuracy of less than 5 m after calibration.
SAR星座的组网能力可以有效地缩短平均重访周期,成为SAR对地观测的新趋势。然而,一个星座内的几颗甚至几十颗SAR卫星的系统电子延迟必须经过长时间的校准和监测,才能保证产品的高几何精度。本文提出了一种SAR星座几何交叉传播定标方法,该方法可以在不使用任何定标器的情况下对星座内SAR卫星的倾斜距离进行定标。该方法将交叉校准任务中涉及的所有参考和未校准SAR图像构建成一个图形。对于每个未校准的图像,沿着源自参考图像的路径估计累积校准误差,从而能够识别最小化该误差的路径。然后沿着这条最优路径依次进行交叉校准。推导出沿任意路径估计累计校准误差的封闭表达式,揭示了交叉校准误差传播的潜在机制。基于实际数据的实验表明,所提出的方法使中国两颗微卫星“齐鲁一号”和“星瑞九号”标定后的几何精度达到小于5 m。
{"title":"A geometric Cross-Propagation-Calibration method for SAR constellation based on the graph theory","authors":"Yitong Luo ,&nbsp;Xiaolan Qiu ,&nbsp;Bei Lin ,&nbsp;Zekun Jiao ,&nbsp;Wei Wang ,&nbsp;Chibiao Ding","doi":"10.1016/j.isprsjprs.2026.01.007","DOIUrl":"10.1016/j.isprsjprs.2026.01.007","url":null,"abstract":"<div><div>The networking capability of SAR constellations can effectively reduce the average revisit period, which has become a new trend in SAR Earth observation. However, the system electronic delay of several or even dozens of SAR satellites in a constellation must be calibrated and monitored for a long time to ensure high geometric accuracy of the product. In this paper, a geometric cross-propagation-calibration method for SAR constellations is proposed, which can calibrate the slant ranges of the SAR satellites in a constellation without any calibrators. The proposed method constructs a graph from all reference and uncalibrated SAR images involved in a cross-calibration task. For each uncalibrated image, the cumulative calibration error along paths originating from the reference images is estimated, enabling the identification of a path that minimizes this error. Cross-calibration is then performed sequentially along this optimal path. A closed-form expression is derived to estimate the cumulative calibration error along any path, which also reveals the underlying mechanism of error propagation in cross-calibration. Experiments based on real data show that the proposed method enables two China’s microsatellites, Qilu-1 and Xingrui-9, to achieve geometric accuracy of less than 5 m after calibration.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 346-359"},"PeriodicalIF":12.2,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monitoring snow cover dynamics at 30-m resolution in higher latitude regions using Harmonized Landsat Sentinel-2 利用Harmonized Landsat Sentinel-2监测高纬度地区30米分辨率积雪动态
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-31 DOI: 10.1016/j.isprsjprs.2026.01.032
Mitchell T. Bonney, Yu Zhang
Snow is an essential climate variable that is important for hydrology, climate, soil temperature and permafrost, vegetation, animal habitat, and socioeconomics. Wide-area snow cover dynamics (SCD), including the start and end of snow cover, are generally monitored by satellites with coarse spatial resolutions (250–1000 m) and high temporal (daily) resolutions. Higher spatial resolution (HSR) monitoring (10–30 m) has been limited to small areas because of computational constraints and infrequent cloud-free observations. Here, we develop a new method to map wide-area HSR SCD (snow start date, end date, length, periods, status) by leveraging the recently released Harmonized Landsat Sentinel-2 (HLS) v2.0, which has a 2–3-day revisit at 30-m resolution. The method is built around SpatialTemporal Asset Catalogs (STACs) and open-source Python tools. We utilize tiled datacubes, snow classification, and a model involving implausibility checking, cleaning, and finding peaks in data with gaps due to orbit frequencies and clouds. We demonstrate SCD mapping and validation across Canada’s Hudson Bay Lowlands (HBL) and an area in northern Alaska for each snow-year from 2018 to 2019 to 2023–2024 and multi-year composites (2018–2024). We also provide timing uncertainties and a quality metric for all pixels. Performance is best for snow end date, having strong relationships with both visually interpreted SCD from primarily very high-resolution imagery and measured local-scale snow depth. The combination of lower cloud cover and lower solar zenith angles during melt periods leads to lower uncertainties for snow end date compared to start date and length. Performance is better for all metrics at higher latitudes (e.g., northern Alaska), where satellite observations are more frequent due to increased orbit overlap. Although we have only completed validation for the HBL, Canada-wide products using this methodology are available publicly as STACs on the CCMEO Data Cube and will continue to be updated. Addition validation across Canada and methodology improvements are ongoing.
雪是一个重要的气候变量,对水文、气候、土壤温度和永久冻土、植被、动物栖息地和社会经济都很重要。广域积雪动态(SCD),包括积雪的开始和结束,通常由卫星监测,具有粗空间分辨率(250-1000 m)和高时间分辨率(日)。由于计算限制和不频繁的无云观测,高空间分辨率(HSR)监测(10-30米)仅限于小区域。在这里,我们利用最近发布的Harmonized Landsat Sentinel-2 (HLS) v2.0开发了一种新的方法来绘制广域高铁SCD(雪开始日期,结束日期,长度,周期,状态),该v2.0具有2 - 3天的30米分辨率重访。该方法是围绕时空资产目录(STACs)和开源Python工具构建的。我们使用了平铺数据库、雪分类和一个模型,该模型涉及不可信检查、清理和查找由于轨道频率和云而存在间隙的数据中的峰值。我们展示了加拿大哈德逊湾低地(HBL)和阿拉斯加北部地区的SCD制图和验证,包括2018年至2019年、2023年至2024年的每个雪年和多年合成(2018年至2024年)。我们还为所有像素提供了时间不确定性和质量度量。性能最好的是降雪结束日期,与主要是非常高分辨率图像的视觉解释SCD和测量的局地尺度雪深有很强的关系。融雪期较低的云量和较低的太阳天顶角相结合,导致雪结束日期的不确定性低于开始日期和长度。在高纬度地区(例如阿拉斯加北部),由于轨道重叠增加,卫星观测更加频繁,因此所有指标的性能都更好。虽然我们只完成了HBL的验证,但使用这种方法的加拿大范围内的产品在CCMEO数据立方上作为stac公开可用,并将继续更新。加拿大各地的附加验证和方法改进正在进行中。
{"title":"Monitoring snow cover dynamics at 30-m resolution in higher latitude regions using Harmonized Landsat Sentinel-2","authors":"Mitchell T. Bonney,&nbsp;Yu Zhang","doi":"10.1016/j.isprsjprs.2026.01.032","DOIUrl":"10.1016/j.isprsjprs.2026.01.032","url":null,"abstract":"<div><div>Snow is an essential climate variable that is important for hydrology, climate, soil temperature and permafrost, vegetation, animal habitat, and socioeconomics. Wide-area snow cover dynamics (SCD), including the start and end of snow cover, are generally monitored by satellites with coarse spatial resolutions (250–1000 m) and high temporal (daily) resolutions. Higher spatial resolution (HSR) monitoring (10–30 m) has been limited to small areas because of computational constraints and infrequent cloud-free observations. Here, we develop a new method to map wide-area HSR SCD (snow start date, end date, length, periods, status) by leveraging the recently released Harmonized Landsat Sentinel-2 (HLS) v2.0, which has a 2–3-day revisit at 30-m resolution. The method is built around SpatialTemporal Asset Catalogs (STACs) and open-source Python tools. We utilize tiled datacubes, snow classification, and a model involving implausibility checking, cleaning, and finding peaks in data with gaps due to orbit frequencies and clouds. We demonstrate SCD mapping and validation across Canada’s Hudson Bay Lowlands (HBL) and an area in northern Alaska for each snow-year from 2018 to 2019 to 2023–2024 and multi-year composites (2018–2024). We also provide timing uncertainties and a quality metric for all pixels. Performance is best for snow end date, having strong relationships with both visually interpreted SCD from primarily very high-resolution imagery and measured local-scale snow depth. The combination of lower cloud cover and lower solar zenith angles during melt periods leads to lower uncertainties for snow end date compared to start date and length. Performance is better for all metrics at higher latitudes (e.g., northern Alaska), where satellite observations are more frequent due to increased orbit overlap. Although we have only completed validation for the HBL, Canada-wide products using this methodology are available publicly as STACs on the CCMEO Data Cube and will continue to be updated. Addition validation across Canada and methodology improvements are ongoing.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 360-382"},"PeriodicalIF":12.2,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Set-CVGL: A new perspective on cross-view geo-localization with unordered ground-view image sets Set-CVGL:一种基于无序地视图像集的跨视点地理定位新视角
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-30 DOI: 10.1016/j.isprsjprs.2026.01.037
Qiong Wu , Panwang Xia , Lei Yu , Yi Liu , Mingtao Xiong , Liheng Zhong , Jingdong Chen , Ming Yang , Yongjun Zhang , Yi Wan
Cross-view geo-localization (CVGL) has been widely applied in fields such as robotic navigation and geographic information coupling. Existing approaches primarily use single images or fixed-view image sequences as queries, which limits perspective diversity. In contrast, when humans determine their location visually, they typically move around to gather multiple perspectives. This behavior suggests that integrating diverse visual cues can improve geo-localization reliability. Therefore, we propose a novel task: Cross-View Image Set Geo-Localization (Set-CVGL), which gathers multiple images with diverse perspectives as a query set for localization. To support this task, we introduce SetVL-480K, a benchmark comprising 480,000 ground images captured worldwide and their corresponding satellite images, with each satellite image corresponds to an average of 40 ground images from varied perspectives and locations. Furthermore, we propose FlexGeo, a flexible method designed for Set-CVGL that can also adapt to single-image and image-sequence inputs. FlexGeo includes two key modules: the Similarity-guided Feature Fuser (SFF), which adaptively fuses image features without prior content dependency, and the Individual-level Attributes Learner (IAL), leveraging geo-attributes of each image for comprehensive scene perception. FlexGeo consistently outperforms existing methods on SetVL-480K and four public datasets (VIGOR, University-1652, SeqGeo, and KITTI-CVL), achieving a 2.34× improvement in localization accuracy on SetVL-480K. The codes and dataset will be available at https://github.com/Mabel0403/Set-CVGL.
交叉视角地理定位技术在机器人导航、地理信息耦合等领域有着广泛的应用。现有的方法主要使用单个图像或固定视图图像序列作为查询,这限制了视角的多样性。相比之下,当人类在视觉上确定自己的位置时,他们通常会四处走动,以收集多个视角。这种行为表明,整合不同的视觉线索可以提高地理定位的可靠性。因此,我们提出了一种新的任务:交叉视图图像集地理定位(Set- cvgl),该任务将具有不同视角的多幅图像作为定位的查询集。为了支持这项任务,我们引入了SetVL-480K,这是一个基准,包括48万张全球地面图像及其相应的卫星图像,每张卫星图像平均对应40张不同角度和位置的地面图像。此外,我们提出了FlexGeo,这是一种为Set-CVGL设计的灵活方法,也可以适应单图像和图像序列输入。FlexGeo包括两个关键模块:相似性引导的特征融合器(SFF),它自适应地融合图像特征,而不依赖于先前的内容;以及个人层面的属性学习器(IAL),利用每张图像的地理属性进行全面的场景感知。FlexGeo在SetVL-480K和四个公共数据集(VIGOR、University-1652、SeqGeo和KITTI-CVL)上的定位精度持续优于现有方法,在SetVL-480K上的定位精度提高了2.34倍。代码和数据集可在https://github.com/Mabel0403/Set-CVGL上获得。
{"title":"Set-CVGL: A new perspective on cross-view geo-localization with unordered ground-view image sets","authors":"Qiong Wu ,&nbsp;Panwang Xia ,&nbsp;Lei Yu ,&nbsp;Yi Liu ,&nbsp;Mingtao Xiong ,&nbsp;Liheng Zhong ,&nbsp;Jingdong Chen ,&nbsp;Ming Yang ,&nbsp;Yongjun Zhang ,&nbsp;Yi Wan","doi":"10.1016/j.isprsjprs.2026.01.037","DOIUrl":"10.1016/j.isprsjprs.2026.01.037","url":null,"abstract":"<div><div>Cross-view geo-localization (CVGL) has been widely applied in fields such as robotic navigation and geographic information coupling. Existing approaches primarily use single images or fixed-view image sequences as queries, which limits perspective diversity. In contrast, when humans determine their location visually, they typically move around to gather multiple perspectives. This behavior suggests that integrating diverse visual cues can improve geo-localization reliability. Therefore, we propose a novel task: Cross-View Image Set Geo-Localization (Set-CVGL), which gathers multiple images with diverse perspectives as <strong>a query set</strong> for localization. To support this task, we introduce SetVL-480K, a benchmark comprising 480,000 ground images captured worldwide and their corresponding satellite images, with each satellite image corresponds to an average of 40 ground images from varied perspectives and locations. Furthermore, we propose FlexGeo, a flexible method designed for Set-CVGL that can also adapt to single-image and image-sequence inputs. FlexGeo includes two key modules: the Similarity-guided Feature Fuser (SFF), which adaptively fuses image features without prior content dependency, and the Individual-level Attributes Learner (IAL), leveraging geo-attributes of each image for comprehensive scene perception. FlexGeo consistently outperforms existing methods on SetVL-480K and four public datasets (VIGOR, University-1652, SeqGeo, and KITTI-CVL), achieving a 2.34<span><math><mo>×</mo></math></span> improvement in localization accuracy on SetVL-480K. The codes and dataset will be available at <span><span>https://github.com/Mabel0403/Set-CVGL</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 328-345"},"PeriodicalIF":12.2,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An advanced decoupled polarimetric calibration method for the LuTan-1 hybrid- and quadrature-polarimetric modes LuTan-1混合偏振模式和正交偏振模式的一种高级解耦偏振定标方法
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-30 DOI: 10.1016/j.isprsjprs.2026.01.035
Lizhi Liu, Lijie Huang, Yiding Wang, Pingping Lu, Bo Li, Liang Li, Robert Wang, Yirong Wu
During solar maximum, low-frequency spaceborne Polarimetric Synthetic Aperture Radar (PolSAR) systems suffer ionosphere-induced distortions that couple with system-induced polarimetric distortions. High-precision decoupled polarimetric calibration is therefore essential for obtaining high-fidelity PolSAR data. Existing point-target calibration methods lack a general approach for unbiased estimation of polarimetric distortion across multiple polarimetric modes and calibrator combinations, particularly under spatiotemporally varying ionospheric conditions. To address this, we derive the necessary conditions for unbiased estimation and propose a General Polarimetric Calibration Method (GPCM) applicable to various configurations. In addition, Enhanced Multi-Look Autofocus (EMLA), a modified STEC inversion method, is introduced for precise inversion of Slant Total Electron Content (STEC), enabling estimation of the spatiotemporally varying Faraday rotation angle for system distortion decoupling and PolSAR data compensation. GPCM applied to LuTan-1 HP and QP data results in HH/VV amplitude and phase imbalances of 0.0433  dB (STD: 0.017) and − 0.60° (STD: 1.02°), respectively, measured on trihedral corner reflectors. Calibration results also indicate that QP mode isolation exceeds 39 dB, while estimated axial ratios for HP mode are lower than 0.115 dB. Under comparable conditions, the results of GPCM are consistent with the Freeman analytical method. Furthermore, EMLA outperforms existing STEC inversion methods (COA, MLA, and GIM-based mapping), achieving a mean absolute difference of 1.95 TECU compared with in-situ measurements while demonstrating applicability to general scenes. Overall, the effectiveness of GPCM and EMLA in the LuTan-1 calibration mission is confirmed, indicating their potential for future PolSAR calibration tasks. The primary calibrated experimental dataset is publicly available at https://radars.ac.cn/web/data/getData?dataType=HPSAREADEN&pageType=en.
在太阳活动极大期,低频星载极化合成孔径雷达(PolSAR)系统遭受电离层诱导的畸变,这种畸变与系统诱导的极化畸变耦合。因此,高精度解耦极化校准对于获得高保真的PolSAR数据至关重要。现有的点目标校准方法缺乏一种通用的方法来无偏估计跨多个极化模式和校准器组合的极化失真,特别是在时空变化的电离层条件下。为了解决这个问题,我们推导了无偏估计的必要条件,并提出了一种适用于各种配置的通用偏振校准方法(GPCM)。此外,提出了一种改进的STEC反演方法——Enhanced Multi-Look Autofocus (EMLA),用于精确反演倾斜总电子含量(STEC),从而估算出法拉第旋转角的时空变化,从而实现系统畸变解耦和PolSAR数据补偿。采用GPCM对鲁坦1号的HP和QP数据进行处理,在三面角反射镜上测得的HH/VV振幅和相位不平衡分别为0.0433 dB (STD: 0.017)和- 0.60°(STD: 1.02°)。校准结果还表明,QP模式隔离度超过39 dB,而HP模式的估计轴向比低于0.115 dB。在可比条件下,GPCM的计算结果与Freeman分析方法一致。此外,EMLA优于现有的STEC反演方法(基于COA、MLA和基于gimm的制图),与原位测量相比,平均绝对差为1.95 TECU,同时证明了对一般场景的适用性。总体而言,GPCM和EMLA在LuTan-1校准任务中的有效性得到了证实,表明它们在未来PolSAR校准任务中的潜力。主要校准的实验数据集可在https://radars.ac.cn/web/data/getData?dataType=HPSAREADEN&pageType=en上公开获取。
{"title":"An advanced decoupled polarimetric calibration method for the LuTan-1 hybrid- and quadrature-polarimetric modes","authors":"Lizhi Liu,&nbsp;Lijie Huang,&nbsp;Yiding Wang,&nbsp;Pingping Lu,&nbsp;Bo Li,&nbsp;Liang Li,&nbsp;Robert Wang,&nbsp;Yirong Wu","doi":"10.1016/j.isprsjprs.2026.01.035","DOIUrl":"10.1016/j.isprsjprs.2026.01.035","url":null,"abstract":"<div><div>During solar maximum, low-frequency spaceborne Polarimetric Synthetic Aperture Radar (PolSAR) systems suffer ionosphere-induced distortions that couple with system-induced polarimetric distortions. High-precision decoupled polarimetric calibration is therefore essential for obtaining high-fidelity PolSAR data. Existing point-target calibration methods lack a general approach for unbiased estimation of polarimetric distortion across multiple polarimetric modes and calibrator combinations, particularly under spatiotemporally varying ionospheric conditions. To address this, we derive the necessary conditions for unbiased estimation and propose a General Polarimetric Calibration Method (GPCM) applicable to various configurations. In addition, Enhanced Multi-Look Autofocus (EMLA), a modified STEC inversion method, is introduced for precise inversion of Slant Total Electron Content (STEC), enabling estimation of the spatiotemporally varying Faraday rotation angle for system distortion decoupling and PolSAR data compensation. GPCM applied to LuTan-1 HP and QP data results in HH/VV amplitude and phase imbalances of 0.0433  dB (STD: 0.017) and − 0.60° (STD: 1.02°), respectively, measured on trihedral corner reflectors. Calibration results also indicate that QP mode isolation exceeds 39 dB, while estimated axial ratios for HP mode are lower than 0.115 dB. Under comparable conditions, the results of GPCM are consistent with the Freeman analytical method. Furthermore, EMLA outperforms existing STEC inversion methods (COA, MLA, and GIM-based mapping), achieving a mean absolute difference of 1.95 TECU compared with in-situ measurements while demonstrating applicability to general scenes. Overall, the effectiveness of GPCM and EMLA in the LuTan-1 calibration mission is confirmed, indicating their potential for future PolSAR calibration tasks. The primary calibrated experimental dataset is publicly available at <span><span>https://radars.ac.cn/web/data/getData?dataType=HPSAREADEN&amp;pageType=en</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 310-327"},"PeriodicalIF":12.2,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RegScorer: Learning to select the best transformation of point cloud registration RegScorer:学习选择点云注册的最佳转换
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-27 DOI: 10.1016/j.isprsjprs.2026.01.034
Xiaochen Yang , Haiping Wang , Yuan Liu , Bisheng Yang , Zhen Dong
We propose RegScorer, a model learning to identify the optimal transformation to register unaligned point clouds. Existing registration advancements can generate a set of candidate transformations, which are then evaluated using conventional metrics such as Inlier Ratio (IR), Mean Squared Error (MSE) or Chamfer Distance (CD). The candidate achieving the best score is selected as the final result. However, we argue that these metrics often fail to select the correct transformation, especially in challenging scenarios involving symmetric objects, repetitive structures, or low-overlap regions. This leads to significant degradation in registration performance, a problem that has long been overlooked. The core issue lies in their limited focus on local geometric consistency and inability to capture two key conflict cases of misalignment: (1) point pairs that are spatially close after alignment but have conflicting features, and (2) point pairs with high feature similarity but large spatial distances after alignment. To address this, we propose RegScorer, which models both the spatial and feature relationships of all point pairs. This allows RegScorer to learn to capture the above conflict cases and provides a more reliable score for transformation quality. On the 3DLoMatch and ScanNet datasets, RegScorer demonstrate 19.3% and 14.1% improvements in registration recall, leading to 4.7% and 5.1% accuracy gains in multiview registration. Moreover, when generalized to symmetric and low-texture outdoor scenes, RegScorer achieves a 25% increase in transformation recall over IR metric, highlighting its robustness and generalizability. The pre-trained model and the complete code repository can be accessed at https://github.com/WHU-USI3DV/RegScorer.
我们提出了RegScorer,一个模型学习来识别最优的转换,以配准不对齐的点云。现有的配准进展可以生成一组候选变换,然后使用传统的指标(如Inlier Ratio (IR)、均方误差(MSE)或倒角距离(CD))对其进行评估。成绩最好的候选人被选为最终成绩。然而,我们认为这些指标经常不能选择正确的转换,特别是在涉及对称对象、重复结构或低重叠区域的具有挑战性的场景中。这将导致注册性能的显著下降,这是一个长期被忽视的问题。核心问题在于它们对局部几何一致性的关注有限,无法捕捉到两种关键的不对齐冲突情况:(1)对齐后空间接近但特征冲突的点对;(2)对齐后特征相似度高但空间距离大的点对。为了解决这个问题,我们提出了RegScorer,它对所有点对的空间和特征关系进行建模。这允许RegScorer学习捕获上述冲突案例,并为转换质量提供更可靠的评分。在3DLoMatch和ScanNet数据集上,RegScorer的注册召回率分别提高了19.3%和14.1%,导致多视图注册的准确率分别提高了4.7%和5.1%。此外,当推广到对称和低纹理户外场景时,RegScorer的变换召回率比IR指标提高了25%,突出了其鲁棒性和泛化性。预训练的模型和完整的代码存储库可以在https://github.com/WHU-USI3DV/RegScorer上访问。
{"title":"RegScorer: Learning to select the best transformation of point cloud registration","authors":"Xiaochen Yang ,&nbsp;Haiping Wang ,&nbsp;Yuan Liu ,&nbsp;Bisheng Yang ,&nbsp;Zhen Dong","doi":"10.1016/j.isprsjprs.2026.01.034","DOIUrl":"10.1016/j.isprsjprs.2026.01.034","url":null,"abstract":"<div><div>We propose RegScorer, a model learning to identify the optimal transformation to register unaligned point clouds. Existing registration advancements can generate a set of candidate transformations, which are then evaluated using conventional metrics such as Inlier Ratio (IR), Mean Squared Error (MSE) or Chamfer Distance (CD). The candidate achieving the best score is selected as the final result. However, we argue that these metrics often fail to select the correct transformation, especially in challenging scenarios involving symmetric objects, repetitive structures, or low-overlap regions. This leads to significant degradation in registration performance, a problem that has long been overlooked. The core issue lies in their limited focus on local geometric consistency and inability to capture two key conflict cases of misalignment: (1) point pairs that are spatially close after alignment but have conflicting features, and (2) point pairs with high feature similarity but large spatial distances after alignment. To address this, we propose RegScorer, which models both the spatial and feature relationships of all point pairs. This allows RegScorer to learn to capture the above conflict cases and provides a more reliable score for transformation quality. On the 3DLoMatch and ScanNet datasets, RegScorer demonstrate <strong>19.3</strong>% and <strong>14.1</strong>% improvements in registration recall, leading to <strong>4.7</strong>% and <strong>5.1</strong>% accuracy gains in multiview registration. Moreover, when generalized to symmetric and low-texture outdoor scenes, RegScorer achieves a <strong>25</strong>% increase in transformation recall over IR metric, highlighting its robustness and generalizability. The pre-trained model and the complete code repository can be accessed at <span><span>https://github.com/WHU-USI3DV/RegScorer</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 266-277"},"PeriodicalIF":12.2,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multispectral airborne laser scanning for tree species classification: A benchmark of machine learning and deep learning algorithms 树种分类的多光谱机载激光扫描:机器学习和深度学习算法的基准
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-27 DOI: 10.1016/j.isprsjprs.2026.01.031
Josef Taher , Eric Hyyppä , Matti Hyyppä , Klaara Salolahti , Xiaowei Yu , Leena Matikainen , Antero Kukko , Matti Lehtomäki , Harri Kaartinen , Sopitta Thurachen , Paula Litkey , Ville Luoma , Markus Holopainen , Gefei Kong , Hongchao Fan , Petri Rönnholm , Matti Vaaja , Antti Polvivaara , Samuli Junttila , Mikko Vastaranta , Juha Hyyppä
<div><div>Climate-smart and biodiversity-preserving forestry demands precise information on forest resources, extending to the individual tree level. Multispectral airborne laser scanning (ALS) has shown promise in automated point cloud processing, but challenges remain in leveraging deep learning techniques and identifying rare tree species in class-imbalanced datasets. This study addresses these gaps by conducting a comprehensive benchmark of deep learning and traditional shallow machine learning methods for tree species classification. For the study, we collected high-density multispectral ALS data (<span><math><mrow><mo>></mo><mn>1000</mn></mrow></math></span> <span><math><mrow><mi>pts</mi><mo>/</mo><msup><mrow><mi>m</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span>) at three wavelengths using the FGI-developed HeliALS system, complemented by existing Optech Titan data (35 <span><math><mrow><mi>pts</mi><mo>/</mo><msup><mrow><mi>m</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span>), to evaluate the species classification accuracy of various algorithms in a peri-urban study area located in southern Finland. We established a field reference dataset of 6326 segments across nine species using a newly developed browser-based crowdsourcing tool, which facilitated efficient data annotation. The ALS data, including a training dataset of 1065 segments, was shared with the scientific community to foster collaborative research and diverse algorithmic contributions. Based on 5261 test segments, our findings demonstrate that point-based deep learning methods, particularly a point transformer model, outperformed traditional machine learning and image-based deep learning approaches on high-density multispectral point clouds. For the high-density ALS dataset, a point transformer model provided the best performance reaching an overall (macro-average) accuracy of 87.9% (74.5%) with a training set of 1065 segments and 92.0% (85.1%) with a larger training set of 5000 segments. With 1065 training segments, the best image-based deep learning method, DetailView, reached an overall (macro-average) accuracy of 84.3% (63.9%), whereas a shallow random forest (RF) classifier achieved an overall (macro-average) accuracy of 83.2% (61.3%). For the sparser ALS dataset, an RF model topped the list with an overall (macro-average) accuracy of 79.9% (57.6%), closely followed by the point transformer at 79.6% (56.0%). Importantly, the overall classification accuracy of the point transformer model on the HeliALS data increased from 73.0% with no spectral information to 84.7% with single-channel reflectance, and to 87.9% with spectral information of all the three channels. Furthermore, we studied the scaling of the classification accuracy as a function of point density and training set size using 5-fold cross-validation of our dataset. Based on our findings, multispectral information is especially beneficial for sparse point clouds with 1–50 <span><math>
气候智慧型和生物多样性保护林业需要森林资源的精确信息,并延伸到单个树木的水平。多光谱机载激光扫描(ALS)在自动化点云处理中显示出前景,但在利用深度学习技术和识别类别不平衡数据集中的稀有树种方面仍然存在挑战。本研究通过对树种分类的深度学习和传统浅机器学习方法进行全面的基准测试,解决了这些差距。在这项研究中,我们使用fgis开发的helals系统在三个波长下收集高密度多光谱ALS数据(>1000 pts/m2),并辅以现有的Optech Titan数据(35 pts/m2),以评估芬兰南部城郊研究区内各种算法的物种分类精度。利用新开发的基于浏览器的众包工具,建立了9个物种6326个片段的野外参考数据集,方便了数据标注。ALS数据,包括1065个片段的训练数据集,与科学界共享,以促进合作研究和多样化的算法贡献。基于5261个测试片段,我们的研究结果表明,在高密度多光谱点云上,基于点的深度学习方法,特别是点转换器模型,优于传统的机器学习和基于图像的深度学习方法。对于高密度ALS数据集,点转换模型提供了最好的性能,在1065个片段的训练集上达到了87.9%(74.5%)的总体(宏观平均)准确率,在5000个片段的更大的训练集上达到了92.0%(85.1%)。在1065个训练片段中,基于图像的最佳深度学习方法DetailView的总体(宏观平均)准确率为84.3%(63.9%),而浅随机森林(RF)分类器的总体(宏观平均)准确率为83.2%(61.3%)。对于稀疏的ALS数据集,RF模型以79.9%(57.6%)的总体(宏观平均)准确率位居榜首,紧随其后的是点变压器,准确率为79.6%(56.0%)。重要的是,在没有光谱信息的情况下,点变压器模型在helals数据上的总体分类精度从73.0%提高到单通道反射率下的84.7%,在三个通道都有光谱信息的情况下提高到87.9%。此外,我们使用数据集的5倍交叉验证研究了分类精度作为点密度和训练集大小的函数的缩放。基于我们的研究结果,多光谱信息对1-50 pts/m2的稀疏点云特别有利。此外,我们观察到分类误差与训练集大小m呈幂律关系,并且随着训练集大小的增加,点变压器的分类误差降低的速度明显快于RF。
{"title":"Multispectral airborne laser scanning for tree species classification: A benchmark of machine learning and deep learning algorithms","authors":"Josef Taher ,&nbsp;Eric Hyyppä ,&nbsp;Matti Hyyppä ,&nbsp;Klaara Salolahti ,&nbsp;Xiaowei Yu ,&nbsp;Leena Matikainen ,&nbsp;Antero Kukko ,&nbsp;Matti Lehtomäki ,&nbsp;Harri Kaartinen ,&nbsp;Sopitta Thurachen ,&nbsp;Paula Litkey ,&nbsp;Ville Luoma ,&nbsp;Markus Holopainen ,&nbsp;Gefei Kong ,&nbsp;Hongchao Fan ,&nbsp;Petri Rönnholm ,&nbsp;Matti Vaaja ,&nbsp;Antti Polvivaara ,&nbsp;Samuli Junttila ,&nbsp;Mikko Vastaranta ,&nbsp;Juha Hyyppä","doi":"10.1016/j.isprsjprs.2026.01.031","DOIUrl":"10.1016/j.isprsjprs.2026.01.031","url":null,"abstract":"&lt;div&gt;&lt;div&gt;Climate-smart and biodiversity-preserving forestry demands precise information on forest resources, extending to the individual tree level. Multispectral airborne laser scanning (ALS) has shown promise in automated point cloud processing, but challenges remain in leveraging deep learning techniques and identifying rare tree species in class-imbalanced datasets. This study addresses these gaps by conducting a comprehensive benchmark of deep learning and traditional shallow machine learning methods for tree species classification. For the study, we collected high-density multispectral ALS data (&lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mo&gt;&gt;&lt;/mo&gt;&lt;mn&gt;1000&lt;/mn&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt; &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;pts&lt;/mi&gt;&lt;mo&gt;/&lt;/mo&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mn&gt;2&lt;/mn&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;) at three wavelengths using the FGI-developed HeliALS system, complemented by existing Optech Titan data (35 &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mi&gt;pts&lt;/mi&gt;&lt;mo&gt;/&lt;/mo&gt;&lt;msup&gt;&lt;mrow&gt;&lt;mi&gt;m&lt;/mi&gt;&lt;/mrow&gt;&lt;mrow&gt;&lt;mn&gt;2&lt;/mn&gt;&lt;/mrow&gt;&lt;/msup&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt;), to evaluate the species classification accuracy of various algorithms in a peri-urban study area located in southern Finland. We established a field reference dataset of 6326 segments across nine species using a newly developed browser-based crowdsourcing tool, which facilitated efficient data annotation. The ALS data, including a training dataset of 1065 segments, was shared with the scientific community to foster collaborative research and diverse algorithmic contributions. Based on 5261 test segments, our findings demonstrate that point-based deep learning methods, particularly a point transformer model, outperformed traditional machine learning and image-based deep learning approaches on high-density multispectral point clouds. For the high-density ALS dataset, a point transformer model provided the best performance reaching an overall (macro-average) accuracy of 87.9% (74.5%) with a training set of 1065 segments and 92.0% (85.1%) with a larger training set of 5000 segments. With 1065 training segments, the best image-based deep learning method, DetailView, reached an overall (macro-average) accuracy of 84.3% (63.9%), whereas a shallow random forest (RF) classifier achieved an overall (macro-average) accuracy of 83.2% (61.3%). For the sparser ALS dataset, an RF model topped the list with an overall (macro-average) accuracy of 79.9% (57.6%), closely followed by the point transformer at 79.6% (56.0%). Importantly, the overall classification accuracy of the point transformer model on the HeliALS data increased from 73.0% with no spectral information to 84.7% with single-channel reflectance, and to 87.9% with spectral information of all the three channels. Furthermore, we studied the scaling of the classification accuracy as a function of point density and training set size using 5-fold cross-validation of our dataset. Based on our findings, multispectral information is especially beneficial for sparse point clouds with 1–50 &lt;span&gt;&lt;math&gt;","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 278-309"},"PeriodicalIF":12.2,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146072730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Satellite-based heat Index estimatioN modEl (SHINE): An integrated machine learning approach for the conterminous United States 基于卫星的热指数估算模型(SHINE):美国周边地区的综合机器学习方法
IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL Pub Date : 2026-01-23 DOI: 10.1016/j.isprsjprs.2026.01.018
Seyed Babak Haji Seyed Asadollah, Giorgos Mountrakis, Stephen B. Shaw
The accelerating frequency, duration and intensity of extreme heat events demand accurate, spatially complete heat exposure metrics. Here, a modeling approach is presented for estimating the daily-maximum Heat Index (HI) at 1 km spatial resolution. Our study area covered the conterminous United States (CONUS) during the warm season (May to September) between 2003 and 2023. More than 4.6 million observations from approximately 2000 weather stations were paired with weather-related, geographical, land cover and historical climatic factors to develop the proposed Satellite-based Heat Index estimatioN modEl (SHINE). Selected explanatory variables at daily temporal intervals included reanalysis products from Modern-Era Retrospective analysis for Research and Applications (MERRA) and direct satellite products from the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor.
The most influential variables for HI estimation were the MERRA surface layer height and specific humidity products and the dual-pass MODIS daily land surface temperatures. These were followed by land cover products capturing water and forest presence, historical norms of wind speed and maximum temperature, elevation information and the corresponding day of year. An Extreme Gradient Boosting (XGBoost) regressor trained with spatial cross-validation explained 93 % of the variance (R2 = 0.93) and attained a Root Mean Square Error (RMSE) of 1.9°C and a Mean Absolute Error (MAE) of 1.4°C. Comparison of alternative configurations showed that while a MERRA-only model provided slightly higher accuracy (RMSE of 1.8°C), its coarse resolution failed to capture fine-scale heat variations. Conversely, a MODIS-only model offered kilometer-scale spatial resolution but with higher estimation errors (RMSE of 2.9°C). Integrating both MERRA and MODIS sources enabled SHINE to maintain spatial detail and preserved accuracy, underscoring the complementary strengths of reanalysis and satellite products. SHINE also demonstrated resistance to missing MODIS LST observations due to clouds as the additional RMSE error was approximately 0.5°C in the worst case of missing both morning and afternoon MODIS land surface temperature observations. Spatial error analysis revealed <1.7°C RMSE in arid and Mediterranean zones but larger, more heterogeneous errors in the humid Midwest and High Plains. From the policy perspective and considering the HI operational range for public-health heat effects, the proposed SHINE approach outperformed typically used proxies, such as land surface and air temperature. The resulting 1 km daily HI estimations can potentially be used as the foundation of the first wall-to-wall, multi-decadal, high resolution heat dataset for CONUS and offer actionable information for public-health heat studies, energy-demand forecasting and environmental-justice implications.
极端高温事件的频率、持续时间和强度不断加快,需要精确、空间完整的热暴露指标。本文提出了在1 km空间分辨率下估算日最大热指数(HI)的建模方法。我们的研究区域覆盖了2003年至2023年暖季(5月至9月)的美国(CONUS)。来自大约2000个气象站的460多万份观测资料与天气、地理、土地覆盖和历史气候因素相结合,开发了拟议的基于卫星的热指数估算模型(SHINE)。选取的每日时间间隔解释变量包括来自现代研究与应用回顾性分析(MERRA)的再分析产品和来自中分辨率成像光谱仪(MODIS)传感器的直接卫星产品。对HI估算影响最大的变量是MERRA地表高度和比湿产品以及MODIS双通道地表日温度。其次是土地覆盖产品,包括水和森林的存在、风速和最高温度的历史标准、海拔信息和相应的年份。使用空间交叉验证训练的极端梯度增强(XGBoost)回归器解释了93% %的方差(R2 = 0.93),获得了1.9°C的均方根误差(RMSE)和1.4°C的平均绝对误差(MAE)。不同配置的对比表明,仅merra模式的精度略高(RMSE为1.8°C),但其粗分辨率无法捕获精细尺度的热量变化。相反,仅使用modis的模式提供千米尺度的空间分辨率,但估计误差较高(RMSE为2.9°C)。整合MERRA和MODIS源使SHINE能够保持空间细节和保持精度,强调再分析和卫星产品的互补优势。SHINE还显示出对由于云层而丢失的MODIS LST观测值的抵抗能力,因为在最坏的情况下,在丢失上午和下午MODIS陆地表面温度观测值的情况下,额外的RMSE误差约为0.5°C。空间误差分析显示,干旱和地中海地区的RMSE为 <;1.7°C,而湿润的中西部和高平原地区的误差更大,异质性更强。从政策角度来看,并考虑到公共卫生热效应的HI操作范围,拟议的SHINE方法优于通常使用的替代指标,如陆地表面和空气温度。由此产生的每天1 公里的HI估计可能被用作CONUS的第一个墙到墙、多年代际、高分辨率热量数据集的基础,并为公共卫生热量研究、能源需求预测和环境正义影响提供可操作的信息。
{"title":"Satellite-based heat Index estimatioN modEl (SHINE): An integrated machine learning approach for the conterminous United States","authors":"Seyed Babak Haji Seyed Asadollah,&nbsp;Giorgos Mountrakis,&nbsp;Stephen B. Shaw","doi":"10.1016/j.isprsjprs.2026.01.018","DOIUrl":"10.1016/j.isprsjprs.2026.01.018","url":null,"abstract":"<div><div>The accelerating frequency, duration and intensity of extreme heat events demand accurate, spatially complete heat exposure metrics. Here, a modeling approach is presented for estimating the daily-maximum Heat Index (HI) at 1 km spatial resolution. Our study area covered the conterminous United States (CONUS) during the warm season (May to September) between 2003 and 2023. More than 4.6 million observations from approximately 2000 weather stations were paired with weather-related, geographical, land cover and historical climatic factors to develop the proposed Satellite-based Heat Index estimatioN modEl (SHINE). Selected explanatory variables at daily temporal intervals included reanalysis products from Modern-Era Retrospective analysis for Research and Applications (MERRA) and direct satellite products from the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor.</div><div>The most influential variables for HI estimation were the MERRA surface layer height and specific humidity products and the dual-pass MODIS daily land surface temperatures. These were followed by land cover products capturing water and forest presence, historical norms of wind speed and maximum temperature, elevation information and the corresponding day of year. An Extreme Gradient Boosting (XGBoost) regressor trained with spatial cross-validation explained 93 % of the variance (R<sup>2</sup> = 0.93) and attained a Root Mean Square Error (RMSE) of 1.9°C and a Mean Absolute Error (MAE) of 1.4°C. Comparison of alternative configurations showed that while a MERRA-only model provided slightly higher accuracy (RMSE of 1.8°C), its coarse resolution failed to capture fine-scale heat variations. Conversely, a MODIS-only model offered kilometer-scale spatial resolution but with higher estimation errors (RMSE of 2.9°C). Integrating both MERRA and MODIS sources enabled SHINE to maintain spatial detail and preserved accuracy, underscoring the complementary strengths of reanalysis and satellite products. SHINE also demonstrated resistance to missing MODIS LST observations due to clouds as the additional RMSE error was approximately 0.5°C in the worst case of missing both morning and afternoon MODIS land surface temperature observations. Spatial error analysis revealed &lt;1.7°C RMSE in arid and Mediterranean zones but larger, more heterogeneous errors in the humid Midwest and High Plains. From the policy perspective and considering the HI operational range for public-health heat effects, the proposed SHINE approach outperformed typically used proxies, such as land surface and air temperature. The resulting 1 km daily HI estimations can potentially be used as the foundation of the first wall-to-wall, multi-decadal, high resolution heat dataset for CONUS and offer actionable information for public-health heat studies, energy-demand forecasting and environmental-justice implications.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"233 ","pages":"Pages 209-230"},"PeriodicalIF":12.2,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ISPRS Journal of Photogrammetry and Remote Sensing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1