Pub Date : 2025-12-10DOI: 10.1016/j.jag.2025.104989
Anjar Dimara Sakti , Muhammad Asa , Agung Budi Harto , Tania Septi Anggraini , Cokro Santoso , Albertus Deliar , Riantini Virtriana , Akhmad Riqqi , Budhy Soeksmantono , Dudy Darmawan Wijaya , Can Trong Nguyen , Khairul Nizam Abdul Maulud , Maya Safira , Ketut Wikantika
Road infrastructure plays a vital role in national and regional development, particularly in Southeast Asia, where rapid economic growth is increasing pressure on transport systems. However, uneven investment, environmental stressors, and limited data-driven tools continue to hinder effective road maintenance planning. Previous studies have utilized remote sensing and statistical models for infrastructure analysis, but the integration of long-term environmental indicators with spatial prioritization methods remains limited. This study addresses this gap by developing a Road Maintenance Priority Index (RMPI) using ten parameters, including nighttime lights, population density, industrial zones, land surface temperature, precipitation, and wind speed. These variables were analyzed through machine learning regression and multi-criteria decision analysis to classify road segments into priority levels. Results show that 45.08 percent of roads fall into the low-priority category, followed by moderate (39.69 percent), high (9.06 percent), and very high (0.88 percent). Countries such as Singapore, Brunei, and Malaysia exhibited the highest RMPI scores, reflecting urgent maintenance needs, while Timor-Leste, Myanmar, and Laos scored lowest. The findings offer a transferable and scalable framework to support evidence-based infrastructure planning in economically and environmentally diverse regions.
{"title":"Assessing socioeconomic and climate driven road maintenance priorities in Southeast Asia using remote sensing approach","authors":"Anjar Dimara Sakti , Muhammad Asa , Agung Budi Harto , Tania Septi Anggraini , Cokro Santoso , Albertus Deliar , Riantini Virtriana , Akhmad Riqqi , Budhy Soeksmantono , Dudy Darmawan Wijaya , Can Trong Nguyen , Khairul Nizam Abdul Maulud , Maya Safira , Ketut Wikantika","doi":"10.1016/j.jag.2025.104989","DOIUrl":"10.1016/j.jag.2025.104989","url":null,"abstract":"<div><div>Road infrastructure plays a vital role in national and regional development, particularly in Southeast Asia, where rapid economic growth is increasing pressure on transport systems. However, uneven investment, environmental stressors, and limited data-driven tools continue to hinder effective road maintenance planning. Previous studies have utilized remote sensing and statistical models for infrastructure analysis, but the integration of long-term environmental indicators with spatial prioritization methods remains limited. This study addresses this gap by developing a Road Maintenance Priority Index (RMPI) using ten parameters, including nighttime lights, population density, industrial zones, land surface temperature, precipitation, and wind speed. These variables were analyzed through machine learning regression and multi-criteria decision analysis to classify road segments into priority levels. Results show that 45.08 percent of roads fall into the low-priority category, followed by moderate (39.69 percent), high (9.06 percent), and very high (0.88 percent). Countries such as Singapore, Brunei, and Malaysia exhibited the highest RMPI scores, reflecting urgent maintenance needs, while Timor-Leste, Myanmar, and Laos scored lowest. The findings offer a transferable and scalable framework to support evidence-based infrastructure planning in economically and environmentally diverse regions.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 104989"},"PeriodicalIF":8.6,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1016/j.jag.2025.105010
Lijie Guo, Lei Deng
Gap probability (P) is a key indicator of vegetation canopy structure and can be effectively estimated using intensity data from airborne laser scanning (ALS) point clouds. However, point cloud intensity is highly susceptible to radiometric effects. Even for a specific natural target, its intensity can vary across three-dimensional space, which may reduce the accuracy of P estimation. To address this issue, we propose a novel method for estimating P that corrects the influence of radiometric effects on point cloud intensity (PRE_COR). The method consists of the following main steps: first, laser pulses are classified into vegetation-ground, pure-vegetation, and pure-ground pulses. Then, the intensity of vegetation-ground pulses is corrected using the inverse distance square law and the cosine law of incidence angle. Finally, the corrected intensity values are used to estimate the vegetation-ground reflectance ratio based on a linear relationship between their return energies. This ratio is then used to calculate the canopy P. The proposed method was evaluated using both simulated ALS point cloud data and (National Ecological Observatory Network) NEON ALS point cloud data. The results show that for the simulated data, under varying canopy cover, flight altitudes, and mean scan angles, the proposed method achieved relative root mean square errors (rRMSE) below 5.37%, 5.94%, and 21.51%, and mean absolute errors (MAE) below 0.025, 0.011, and 0.067, respectively. Compared with the traditional PFitted method, rRMSE was reduced by up to 1.90%, 1.94%, and 21.20%, and MAE decreased by up to 0.010, 0.003, and 0.096, respectively. For the NEON ALS data, when the scan angle exceeded 20°, the proposed method may improv accuracy by more than 5.39%, with possible MAE improvements exceeding 0.019. Overall, these results demonstrate that correcting radiometric effects on point cloud intensity can substantially enhance both the accuracy and stability of canopy P estimation, with particularly notable benefits under large scan angle conditions.
{"title":"An improved gap probability estimation method accounting for radiometric effects in airborne LiDAR intensity","authors":"Lijie Guo, Lei Deng","doi":"10.1016/j.jag.2025.105010","DOIUrl":"10.1016/j.jag.2025.105010","url":null,"abstract":"<div><div>Gap probability (P) is a key indicator of vegetation canopy structure and can be effectively estimated using intensity data from airborne laser scanning (ALS) point clouds. However, point cloud intensity is highly susceptible to radiometric effects. Even for a specific natural target, its intensity can vary across three-dimensional space, which may reduce the accuracy of P estimation. To address this issue, we propose a novel method for estimating P that corrects the influence of radiometric effects on point cloud intensity (P<sub>RE_COR</sub>). The method consists of the following main steps: first, laser pulses are classified into vegetation-ground, pure-vegetation, and pure-ground pulses. Then, the intensity of vegetation-ground pulses is corrected using the inverse distance square law and the cosine law of incidence angle. Finally, the corrected intensity values are used to estimate the vegetation-ground reflectance ratio based on a linear relationship between their return energies. This ratio is then used to calculate the canopy P. The proposed method was evaluated using both simulated ALS point cloud data and (National Ecological Observatory Network) NEON ALS point cloud data. The results show that for the simulated data, under varying canopy cover, flight altitudes, and mean scan angles, the proposed method achieved relative root mean square errors (rRMSE) below 5.37%, 5.94%, and 21.51%, and mean absolute errors (MAE) below 0.025, 0.011, and 0.067, respectively. Compared with the traditional P<sub>Fitted</sub> method, rRMSE was reduced by up to 1.90%, 1.94%, and 21.20%, and MAE decreased by up to 0.010, 0.003, and 0.096, respectively. For the NEON ALS data, when the scan angle exceeded 20°, the proposed method may improv accuracy by more than 5.39%, with possible MAE improvements exceeding 0.019. Overall, these results demonstrate that correcting radiometric effects on point cloud intensity can substantially enhance both the accuracy and stability of canopy P estimation, with particularly notable benefits under large scan angle conditions.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105010"},"PeriodicalIF":8.6,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The advancement of artificial intelligence (AI) has significantly improved the perception and decision-making abilities of autonomous vehicles (AVs), yet real-time and accurate road marking detection remains difficult under faded markings, nighttime scenes and adverse road–weather conditions. This paper presents RoadGPT, a vision–language pipeline that couples a CLIP-based detector (RoadCLIP) with a planner-facing, LLM-generated advisory layer. The model is trained and evaluated on 3,696 road marking images covering 14 UK Highway Code classes, comprising 2,576 real images (69.7 %) from Google Street View and 1,120 text-to-image synthetic images (30.3 %) that broaden rare appearances and degraded conditions. We use 2,968 images for training (2,072 real, 896 virtual) and 728 images for testing (504 real, 224 virtual), keeping the same 70:30 real–virtual ratio in both splits. On this test set, RoadCLIP attains 98.5 % precision, 97.9 % recall and 98.2 % F1 for non-lane markings, while lane-marking subclasses reach up to 89.8 % F1. The advisory layer transforms recognised markings into structured driving prompts and is assessed via semantic similarity using Sentence-BERT (all-mpnet-base-v2) cosine scores against Highway Code-based references, achieving 89.3 % similarity, alongside an external LLM-as-judge rating of 4.68/5 for accuracy, completeness, and concise effectiveness. The full camera-to-advisory path runs in real time at 135 FPS (batch size 1, 224 × 224) on an RTX 4070 under a unified timing protocol. A remaining limitation is that visually similar lane-marking classes and extreme low-light scenes still reduce discriminability compared with symbol-like, non-lane markings.
{"title":"Clip-based road-marking detection with LLM-guided driving prompts","authors":"Shaofan Sheng , Nicolette Formosa , Yuxiang Feng , Mohammed Quddus","doi":"10.1016/j.jag.2025.105012","DOIUrl":"10.1016/j.jag.2025.105012","url":null,"abstract":"<div><div>The advancement of artificial intelligence (AI) has significantly improved the perception and decision-making abilities of autonomous vehicles (AVs), yet real-time and accurate road marking detection remains difficult under faded markings, nighttime scenes and adverse road–weather conditions. This paper presents RoadGPT, a vision–language pipeline that couples a CLIP-based detector (RoadCLIP) with a planner-facing, LLM-generated advisory layer. The model is trained and evaluated on 3,696 road marking images covering 14 UK Highway Code classes, comprising 2,576 real images (69.7 %) from Google Street View and 1,120 text-to-image synthetic images (30.3 %) that broaden rare appearances and degraded conditions. We use 2,968 images for training (2,072 real, 896 virtual) and 728 images for testing (504 real, 224 virtual), keeping the same 70:30 real–virtual ratio in both splits. On this test set, RoadCLIP attains 98.5 % precision, 97.9 % recall and 98.2 % F1 for non-lane markings, while lane-marking subclasses reach up to 89.8 % F1. The advisory layer transforms recognised markings into structured driving prompts and is assessed via semantic similarity using Sentence-BERT (all-mpnet-base-v2) cosine scores against Highway Code-based references, achieving 89.3 % similarity, alongside an external LLM-as-judge rating of 4.68/5 for accuracy, completeness, and concise effectiveness. The full camera-to-advisory path runs in real time at 135 FPS (batch size 1, 224 × 224) on an RTX 4070 under a unified timing protocol. A remaining limitation is that visually similar lane-marking classes and extreme low-light scenes still reduce discriminability compared with symbol-like, non-lane markings.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105012"},"PeriodicalIF":8.6,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-07DOI: 10.1016/j.jag.2025.105007
Shanqing Gao , Xin Shen , Caiqin Shen , Lin Cao
Individual tree-level fine-scale (especially stem- and branch- scales) structural parameters constitute a critical foundation for tree structural trait assessments, biomass component estimations and tree physiological property evaluations. However, the Unmanned Aerial Vehicle (UAV) Laser Scanning (ULS) has limitations in sampling distance and penetrating capability in dense tree canopies, thus restricting its ability to extract detailed stem- and branch-scale structural parameters. The emergence of both advanced ULS and Backpack Laser Scanning (BLS) technologies have potential to precisely extract fine-scale structural parameters of individual trees. In this study, we proposed an advanced ALE-CS-NGMS approach for individual tree (Poplar (Populus spp.)) stem- and branch- scale structural parameters extraction by ULS and BLS point clouds. First, an Adaptive Least-squares Ellipse (ALE) fitting algorithm was developed to accurately derive the stem diameter of individual trees. Second, a Canopy-stem Separation (CS) model was built by identifying canopy point cloud through derivatives based on the vertical distribution profile of individual trees, while canopy volume was delineated by the AlphaShape as well as a voxel-based algorithm. Finally, a method integrating Neighborhood Graphs and Minimum Spanning (NGMS) was developed to extract individual tree stem, and stem taper curves were fitted to estimate individual-tree stem volume. The results demonstrated that the developed ALE approach yielded a root mean square error (RMSE) of 2.87 cm, representing an accuracy enhancement approximately 0.47 cm for DBH estimation. The NGMS approach produced RMSEs of 0.33 m3 and 0.40 m3 for stem volume estimation by using BLS and BLS + ULS data. The CS model achieved RMSEs of 6.48 m3 and 3.48 m3 for canopy volume estimation with the BLS and BLS + ULS data, respectively. Branch inclination angles exhibited an increase with stand age, generally ranging between 60° and 100°. The distribution of branch inclination across stands of varying ages revealed that in the 8-year-old and 12-year-old plots, branch angles fell within the 60°-90° interval.
{"title":"ALE-CS-NGMS: An advanced approach for individual tree stem- and branch- scale structural parameters extraction using ULS and BLS point clouds","authors":"Shanqing Gao , Xin Shen , Caiqin Shen , Lin Cao","doi":"10.1016/j.jag.2025.105007","DOIUrl":"10.1016/j.jag.2025.105007","url":null,"abstract":"<div><div>Individual tree-level fine-scale (especially stem- and branch- scales) structural parameters constitute a critical foundation for tree structural trait assessments, biomass component estimations and tree physiological property evaluations. However, the Unmanned Aerial Vehicle (UAV) Laser Scanning (ULS) has limitations in sampling distance and penetrating capability in dense tree canopies, thus restricting its ability to extract detailed stem- and branch-scale structural parameters. The emergence of both advanced ULS and Backpack Laser Scanning (BLS) technologies have potential to precisely extract fine-scale structural parameters of individual trees. In this study, we proposed an advanced ALE-CS-NGMS approach for individual tree (<em>Poplar</em> (<em>Populus</em> spp.)) stem- and branch- scale structural parameters extraction by ULS and BLS point clouds. First, an Adaptive Least-squares Ellipse (ALE) fitting algorithm was developed to accurately derive the stem diameter of individual trees. Second, a Canopy-stem Separation (CS) model was built by identifying canopy point cloud through derivatives based on the vertical distribution profile of individual trees, while canopy volume was delineated by the AlphaShape as well as a voxel-based algorithm. Finally, a method integrating Neighborhood Graphs and Minimum Spanning (NGMS) was developed to extract individual tree stem, and stem taper curves were fitted to estimate individual-tree stem volume. The results demonstrated that the developed ALE approach yielded a root mean square error (RMSE) of 2.87 cm, representing an accuracy enhancement approximately 0.47 cm for DBH estimation. The NGMS approach produced RMSEs of 0.33 m<sup>3</sup> and 0.40 m<sup>3</sup> for stem volume estimation by using BLS and BLS + ULS data. The CS model achieved RMSEs of 6.48 m<sup>3</sup> and 3.48 m<sup>3</sup> for canopy volume estimation with the BLS and BLS + ULS data, respectively. Branch inclination angles exhibited an increase with stand age, generally ranging between 60° and 100°. The distribution of branch inclination across stands of varying ages revealed that in the 8-year-old and 12-year-old plots, branch angles fell within the 60°-90° interval.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105007"},"PeriodicalIF":8.6,"publicationDate":"2025-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145697394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-06DOI: 10.1016/j.jag.2025.104994
Xiaogan Yin , Weiguo Jiang , Zhe Yang , Ziyan Ling , Nandin-Erdene Tsendbazar , Peng Hou , Yue Deng , Xiaoya Wang , Zhijie Xiao , Xiao Li , Miaolong Lin
Global wetlands are experiencing severe degradation due to climate change and human activities. Under the Ramsar Convention, the Wetland City Accreditation promotes cities to protect and sustainably manage their urban wetlands. The accreditation system was launched in 2015. To date, 43 cities worldwide have obtained this certification, whose dynamic assessment depends on precise mapping of land use and wetlands. Existing global land cover datasets often show low accuracy in identifying wetlands and limited capacity to characterize wetland types within urban areas. we developed a hybrid Wetland City Map (WCM), by fusing three global 10 m-resolution products: Dynamic World, ESA WorldCover, and ESRI Land Cover. We applied a Weighted Voting and Knowledge-based Decision Rule method to achieve this fusion. This method overcomes the limitations of the input datasets by combining their complementary strengths to improve overall wetland classification and by applying expert-derived rules to enhance the delineation of wetland types within cities. The WCM achieves an average overall accuracy of 86.93 % and a kappa of 0.825. In all cities, its accuracy surpasses the three land cover products by 2 %-26 %. The visual comparison shows WCM performs better in wetland classification and spatial detail, with F1 scores of 90.33 % (water), 64.09 % (marsh), 71.67 % (tidal flat/flooded flat), and 92.17 % (mangrove). It more accurately reflects wetland coverage and changes. Wetland coverage varies across cities, with higher coverage in Asia and lower in Europe and Africa. Individual cities experienced a maximum increase of 6.5 % and decrease of 1.3 % from 2020 to 2021.The WCM supports wetland monitoring, city accreditation, and research aligned with the Ramsar Strategic Plan and Sustainable Development Goals.
由于气候变化和人类活动,全球湿地正在经历严重退化。根据《拉姆萨尔公约》,湿地城市认证旨在促进城市保护和可持续管理其城市湿地。认证制度于2015年启动。迄今为止,全世界有43个城市获得了这一认证,其动态评估依赖于精确的土地利用和湿地地图。现有的全球土地覆盖数据在识别湿地方面往往表现出较低的准确性,并且表征城市地区湿地类型的能力有限。通过融合三个全球10米分辨率的产品:Dynamic World、ESA WorldCover和ESRI Land Cover,我们开发了一个混合型湿地城市地图(WCM)。我们采用加权投票和基于知识的决策规则方法来实现这种融合。该方法通过结合输入数据集的互补优势来改进整体湿地分类,并通过应用专家导出的规则来增强城市内湿地类型的划分,从而克服了输入数据集的局限性。WCM的平均整体准确率为86.93%,kappa为0.825。在所有城市中,其精度比三种土地覆盖产品高出2% - 26%。视觉对比表明,WCM在湿地分类和空间细节方面表现较好,F1得分分别为90.33%(水)、64.09%(沼泽)、71.67%(潮滩/淹滩)和92.17%(红树林)。它更准确地反映了湿地的覆盖和变化。不同城市的湿地覆盖率各不相同,亚洲的覆盖率较高,欧洲和非洲的覆盖率较低。从2020年到2021年,单个城市的最高增长率为6.5%,下降1.3%。WCM支持湿地监测、城市认证和与拉姆萨尔战略规划和可持续发展目标相一致的研究。
{"title":"Hybrid wetland city map: Improved wetland characterization through the synergy of global land cover products","authors":"Xiaogan Yin , Weiguo Jiang , Zhe Yang , Ziyan Ling , Nandin-Erdene Tsendbazar , Peng Hou , Yue Deng , Xiaoya Wang , Zhijie Xiao , Xiao Li , Miaolong Lin","doi":"10.1016/j.jag.2025.104994","DOIUrl":"10.1016/j.jag.2025.104994","url":null,"abstract":"<div><div>Global wetlands are experiencing severe degradation due to climate change and human activities. Under the Ramsar Convention, the Wetland City Accreditation promotes cities to protect and sustainably manage their urban wetlands. The accreditation system was launched in 2015. To date, 43 cities worldwide have obtained this certification, whose dynamic assessment depends on precise mapping of land use and wetlands. Existing global land cover datasets often show low accuracy in identifying wetlands and limited capacity to characterize wetland types within urban areas. we developed a hybrid Wetland City Map (WCM), by fusing three global 10 m-resolution products: Dynamic World, ESA WorldCover, and ESRI Land Cover. We applied a Weighted Voting and Knowledge-based Decision Rule method to achieve this fusion. This method overcomes the limitations of the input datasets by combining their complementary strengths to improve overall wetland classification and by applying expert-derived rules to enhance the delineation of wetland types within cities. The WCM achieves an average overall accuracy of 86.93 % and a kappa of 0.825. In all cities, its accuracy surpasses the three land cover products by 2 %-26 %. The visual comparison shows WCM performs better in wetland classification and spatial detail, with F1 scores of 90.33 % (water), 64.09 % (marsh), 71.67 % (tidal flat/flooded flat), and 92.17 % (mangrove). It more accurately reflects wetland coverage and changes. Wetland coverage varies across cities, with higher coverage in Asia and lower in Europe and Africa. Individual cities experienced a maximum increase of 6.5 % and decrease of 1.3 % from 2020 to 2021.The WCM supports wetland monitoring, city accreditation, and research aligned with the Ramsar Strategic Plan and Sustainable Development Goals.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 104994"},"PeriodicalIF":8.6,"publicationDate":"2025-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145685752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Growing demands in sustainable development and resource management are driving increasing reliance on remote sensing-based Earth observation and image interpretation. In parallel, multimodal collaborative processing is attracting research attention. Synthetic aperture radar (SAR) and optical images offer complementary advantages but pose challenges for simultaneous use due to platform constraints and environmental conditions, often leaving only one modality available and impeding joint analysis. Generative models, particularly generative adversarial networks (GANs) and diffusion models (DMs), address this by learning cross-modal mappings. Translated images preserve structure and semantics while adopting target characteristics, thereby facilitating collaborative use. This review systematically categorizes translation frameworks spanning GANs, DMs, and other generative models. It then details downstream tasks supported by SAR–optical translation, including cloud removal, change detection, semantic segmentation, registration, and object detection, highlighting how translation bridges data gaps and enhances interpretation robustness. Furthermore, we provide open-source code and public datasets, discuss current challenges, and outline future research directions.
{"title":"Generative models for SAR–optical image translation: A systematic review","authors":"Zhao Wang , Zheng Zhang , Xiaojun Shan , Hong-an Wei , Ping Tang","doi":"10.1016/j.jag.2025.105009","DOIUrl":"10.1016/j.jag.2025.105009","url":null,"abstract":"<div><div>Growing demands in sustainable development and resource management are driving increasing reliance on remote sensing-based Earth observation and image interpretation. In parallel, multimodal collaborative processing is attracting research attention. Synthetic aperture radar (SAR) and optical images offer complementary advantages but pose challenges for simultaneous use due to platform constraints and environmental conditions, often leaving only one modality available and impeding joint analysis. Generative models, particularly generative adversarial networks (GANs) and diffusion models (DMs), address this by learning cross-modal mappings. Translated images preserve structure and semantics while adopting target characteristics, thereby facilitating collaborative use. This review systematically categorizes translation frameworks spanning GANs, DMs, and other generative models. It then details downstream tasks supported by SAR–optical translation, including cloud removal, change detection, semantic segmentation, registration, and object detection, highlighting how translation bridges data gaps and enhances interpretation robustness. Furthermore, we provide open-source code and public datasets, discuss current challenges, and outline future research directions.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105009"},"PeriodicalIF":8.6,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145685754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-04DOI: 10.1016/j.jag.2025.105003
Luca Fibbi , Marta Chiesi , Maurizio Pieri , Giorgio Bartolini , Daniele Grifoni , Bernardo Gozzini , Fabio Maselli
A semi-empirical method based on ancillary and remotely sensed data, NDVI-Cws, is currently applied over the Tuscany region to yield daily actual evapotranspiration (ETa) estimates at moderate spatial resolution (250 m) for a 20-year period (2005–2024). The outcome of this exercise is then statistically analysed in comparison with two similar ETa products provided by the MODIS and LSA SAF systems at 500 m and 5 km resolutions, respectively. The analysis relies on the triple collocation strategy and comprises the following two steps: i) examination and inter-comparison of the spatial and temporal ETa variations which occur in 12 areas of the region representative of different climatic conditions and biome types; ii) repetition of these operations at pixel level. The experimental results indicate the existence of clear spatial and temporal ETa variations over most of the region which are differently represented by the three ETa products. The MODIS ETa estimates are significantly higher and lower than those of the other two products for forest and non-forest areas, respectively; the ETa trends estimated by MODIS are poorly concordant with those of the other products, particularly for forests. Consequently, the MODIS ETa estimates reflect only marginally the ETa increases which are evidenced by the other two methods over most of the region. Out of these methods, NDVI-Cws allows a more spatially detailed prediction of the local ETa variability depending on the NDVI dataset used. The implications and consequences of these findings are finally discussed, together with the main future research prospects.
{"title":"Assessment of three remote sensing methods for estimating actual evapotranspiration in a Mediterranean region","authors":"Luca Fibbi , Marta Chiesi , Maurizio Pieri , Giorgio Bartolini , Daniele Grifoni , Bernardo Gozzini , Fabio Maselli","doi":"10.1016/j.jag.2025.105003","DOIUrl":"10.1016/j.jag.2025.105003","url":null,"abstract":"<div><div>A semi-empirical method based on ancillary and remotely sensed data, NDVI-Cws, is currently applied over the Tuscany region to yield daily actual evapotranspiration (ETa) estimates at moderate spatial resolution (250 m) for a 20-year period (2005–2024). The outcome of this exercise is then statistically analysed in comparison with two similar ETa products provided by the MODIS and LSA SAF systems at 500 m and 5 km resolutions, respectively. The analysis relies on the triple collocation strategy and comprises the following two steps: i) examination and inter-comparison of the spatial and temporal ETa variations which occur in 12 areas of the region representative of different climatic conditions and biome types; ii) repetition of these operations at pixel level. The experimental results indicate the existence of clear spatial and temporal ETa variations over most of the region which are differently represented by the three ETa products. The MODIS ETa estimates are significantly higher and lower than those of the other two products for forest and non-forest areas, respectively; the ETa trends estimated by MODIS are poorly concordant with those of the other products, particularly for forests. Consequently, the MODIS ETa estimates reflect only marginally the ETa increases which are evidenced by the other two methods over most of the region. Out of these methods, NDVI-Cws allows a more spatially detailed prediction of the local ETa variability depending on the NDVI dataset used. The implications and consequences of these findings are finally discussed, together with the main future research prospects.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105003"},"PeriodicalIF":8.6,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-04DOI: 10.1016/j.jag.2025.104968
Ramin Farhadiani , Sayyed Mohammad Javad Mirzadeh , Ehsan Roshani , Daniel Cusson , Saeid Homayouni
The precise prediction of dam deformation is essential for ensuring infrastructure safety and mitigating geohazards, particularly in regions characterized by limited monitoring studies. This research concentrates on the Oldman River Dam in Alberta, Canada, where Interferometric Synthetic Aperture Radar (InSAR)-based deformation monitoring and prediction remain inadequately explored. A novel framework that integrates a Graph Attention Network with Long Short-Term Memory (GAT-LSTM) has been developed to address the limitations of existing methods, which neglect spatial dependencies among InSAR-derived points and the increased model complexity stemming from point clustering or InSAR time series decomposition. Sentinel-1 data from three passes were processed utilizing a full-resolution InSAR technique, resulting in semi-vertical deformation velocities that demonstrated consistent subsidence along the dam crest, with rates fluctuating from 5.08 to 6.23 mm/yr. A robust correlation between deformation and reservoir water levels was noted, with accelerated crest deformation during the 2017–2019 drawdown period and a potential risk identified due to a significant decline in water levels projected for 2023–2024. The GAT-LSTM model, which captures both spatial and temporal dynamics, outperformed the standard LSTM, achieving 83.64% accurate points compared to 76.90% for the LSTM in short-term forecasting, exhibiting notable reliability along the crest. The peak performance was observed on September 9, 2021, with a Root Mean Square Error of 0.30 ± 0.013 mm and a Mean Absolute Error of 0.22 ± 0.012 mm. The proposed framework would enhance dam safety monitoring by providing actionable short-term predictions, demonstrating potential transferability to other slow-moving infrastructure.
{"title":"InSAR and GAT-LSTM integration for dam displacement prediction: Lessons from the Oldman River Dam, Canada","authors":"Ramin Farhadiani , Sayyed Mohammad Javad Mirzadeh , Ehsan Roshani , Daniel Cusson , Saeid Homayouni","doi":"10.1016/j.jag.2025.104968","DOIUrl":"10.1016/j.jag.2025.104968","url":null,"abstract":"<div><div>The precise prediction of dam deformation is essential for ensuring infrastructure safety and mitigating geohazards, particularly in regions characterized by limited monitoring studies. This research concentrates on the Oldman River Dam in Alberta, Canada, where Interferometric Synthetic Aperture Radar (InSAR)-based deformation monitoring and prediction remain inadequately explored. A novel framework that integrates a Graph Attention Network with Long Short-Term Memory (GAT-LSTM) has been developed to address the limitations of existing methods, which neglect spatial dependencies among InSAR-derived points and the increased model complexity stemming from point clustering or InSAR time series decomposition. Sentinel-1 data from three passes were processed utilizing a full-resolution InSAR technique, resulting in semi-vertical deformation velocities that demonstrated consistent subsidence along the dam crest, with rates fluctuating from 5.08 to 6.23 mm/yr. A robust correlation between deformation and reservoir water levels was noted, with accelerated crest deformation during the 2017–2019 drawdown period and a potential risk identified due to a significant decline in water levels projected for 2023–2024. The GAT-LSTM model, which captures both spatial and temporal dynamics, outperformed the standard LSTM, achieving 83.64% accurate points compared to 76.90% for the LSTM in short-term forecasting, exhibiting notable reliability along the crest. The peak performance was observed on September 9, 2021, with a Root Mean Square Error of 0.30 ± 0.013 mm and a Mean Absolute Error of 0.22 ± 0.012 mm. The proposed framework would enhance dam safety monitoring by providing actionable short-term predictions, demonstrating potential transferability to other slow-moving infrastructure.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 104968"},"PeriodicalIF":8.6,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145685753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-03DOI: 10.1016/j.jag.2025.105000
Maximilian Eitel, Michael Schmitt
Satellite radar altimeters were originally designed for water applications, but their echoes over land surfaces remain less well understood. In this study we analyze how Sentinel-3 (S3) synthetic aperture radar (SAR) altimetry waveforms respond to different surface types and what physical characteristics are encoded in the signal. To probe this, we conduct classification experiments with a feature-enhanced one-dimensional convolutional neural network (1D-CNN) and analyze its performance. Since surface type information is relevant for climate, hydrology, and biodiversity applications, understanding these signal responses shows to what extent altimetric waveforms may provide consistent class-specific information despite their large elliptical footprint and heterogeneous landscapes. This study investigates the response of Sentinel-3 altimetry waveforms to different land cover types by employing a 1D-CNN to extract land cover information, complemented by a visual analysis of waveform patterns in relation to surface structures. Our results show that information about the underlying surface is embedded in the signals and can be extracted. They further reveal the sensitivity of Sentinel-3 altimetry to variations in land cover. By enhancing our 1D-CNN model with shape-based and contextual features, it effectively captures surface characteristics despite the large altimeter footprint. An ablation study highlights the complementary role of these features, as their removal negatively impacts performance. The best-performing 1D-CNN achieves a macro-averaged F1 (Macro-F1) score of 0.57 and an overall accuracy of 0.67, outperforming both a random forest and a dummy baseline. The classification includes six surface types: Tree, Shrub, Grass, Crop, Bare/Sparse Vegetation, and Water. Although some misclassification occurs, particularly in transition zones and among classes with similar vegetation structures and soil properties, the model provides valuable insights into systematic waveform behavior, highlighting the potential of SAR altimetry signals to capture broad surface characteristics.
{"title":"A global analysis of SAR altimetry signals over different landcover types","authors":"Maximilian Eitel, Michael Schmitt","doi":"10.1016/j.jag.2025.105000","DOIUrl":"10.1016/j.jag.2025.105000","url":null,"abstract":"<div><div>Satellite radar altimeters were originally designed for water applications, but their echoes over land surfaces remain less well understood. In this study we analyze how Sentinel-3 (S3) synthetic aperture radar (SAR) altimetry waveforms respond to different surface types and what physical characteristics are encoded in the signal. To probe this, we conduct classification experiments with a feature-enhanced one-dimensional convolutional neural network (1D-CNN) and analyze its performance. Since surface type information is relevant for climate, hydrology, and biodiversity applications, understanding these signal responses shows to what extent altimetric waveforms may provide consistent class-specific information despite their large elliptical footprint and heterogeneous landscapes. This study investigates the response of Sentinel-3 altimetry waveforms to different land cover types by employing a 1D-CNN to extract land cover information, complemented by a visual analysis of waveform patterns in relation to surface structures. Our results show that information about the underlying surface is embedded in the signals and can be extracted. They further reveal the sensitivity of Sentinel-3 altimetry to variations in land cover. By enhancing our 1D-CNN model with shape-based and contextual features, it effectively captures surface characteristics despite the large altimeter footprint. An ablation study highlights the complementary role of these features, as their removal negatively impacts performance. The best-performing 1D-CNN achieves a macro-averaged F1 (Macro-F1) score of 0.57 and an overall accuracy of 0.67, outperforming both a random forest and a dummy baseline. The classification includes six surface types: Tree, Shrub, Grass, Crop, Bare/Sparse Vegetation, and Water. Although some misclassification occurs, particularly in transition zones and among classes with similar vegetation structures and soil properties, the model provides valuable insights into systematic waveform behavior, highlighting the potential of SAR altimetry signals to capture broad surface characteristics.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105000"},"PeriodicalIF":8.6,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-03DOI: 10.1016/j.jag.2025.105008
Weisong Li , Yinwei Li , Yiming Zhu , Haipeng Wang
Multitemporal Synthetic Aperture Radar (SAR) image change detection (CD) represents a significant focus in remote sensing interpretation research. Recently, matrix low-rank decomposition theory has gained popularity in this field to exploit inherent structural information without requiring annotated data. However, existing approaches predominantly rely on an idealized assumption that defines changed regions as spatially localized and sparse. This assumption introduces critical theoretical limitations: its sensitivity to change scales results in sparsity constraints failing to characterize large-scale continuous changes and accumulating decomposition errors, while the neglect of low-rank coupling between changed regions and backgrounds further undermines theoretical completeness. To address these issues, we propose a Foreground and Background dual-path collaborative optimization CD framework, namely FBCD. Specifically, a foreground change saliency model is constructed under generalized low-rank constraints, integrating low-rank consistency and local correlation mechanisms to capture complex change patterns. In addition, a background stability model based on low-rank self-representation learning achieves precise background separation through multi-view consistency constraint. Once generating reconstructed difference map, a self-supervised graph-optimized label propagation algorithm is designed to transform binary classification into a graph partitioning optimization problem, which further improves the CD accuracy. Extensive experiments on seven bitemporal benchmark datasets validate the superiority of the proposed method: Compared to state-of-the-art approaches, it achieves average Kappa coefficient improvements of 2.50% for large-scale continuous changes and 4.48% for small-scale localized complex changes. Furthermore, the method also shows strong applicability in short-term time series datasets. The source code will be made available at https://github.com/95xiaoli/FBCD.
{"title":"Unsupervised multitemporal SAR image change detection via foreground-background collaborative optimization","authors":"Weisong Li , Yinwei Li , Yiming Zhu , Haipeng Wang","doi":"10.1016/j.jag.2025.105008","DOIUrl":"10.1016/j.jag.2025.105008","url":null,"abstract":"<div><div>Multitemporal Synthetic Aperture Radar (SAR) image change detection (CD) represents a significant focus in remote sensing interpretation research. Recently, matrix low-rank decomposition theory has gained popularity in this field to exploit inherent structural information without requiring annotated data. However, existing approaches predominantly rely on an idealized assumption that defines changed regions as spatially localized and sparse. This assumption introduces critical theoretical limitations: its sensitivity to change scales results in sparsity constraints failing to characterize large-scale continuous changes and accumulating decomposition errors, while the neglect of low-rank coupling between changed regions and backgrounds further undermines theoretical completeness. To address these issues, we propose a Foreground and Background dual-path collaborative optimization CD framework, namely FBCD. Specifically, a foreground change saliency model is constructed under generalized low-rank constraints, integrating low-rank consistency and local correlation mechanisms to capture complex change patterns. In addition, a background stability model based on low-rank self-representation learning achieves precise background separation through multi-view consistency constraint. Once generating reconstructed difference map, a self-supervised graph-optimized label propagation algorithm is designed to transform binary classification into a graph partitioning optimization problem, which further improves the CD accuracy. Extensive experiments on seven bitemporal benchmark datasets validate the superiority of the proposed method: Compared to state-of-the-art approaches, it achieves average Kappa coefficient improvements of 2.50% for large-scale continuous changes and 4.48% for small-scale localized complex changes. Furthermore, the method also shows strong applicability in short-term time series datasets. The source code will be made available at <span><span>https://github.com/95xiaoli/FBCD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"146 ","pages":"Article 105008"},"PeriodicalIF":8.6,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}