Machine learning based classification of lake ice and open water from Sentinel-3 SAR altimetry waveforms

IF 11.1 1区 地球科学 Q1 ENVIRONMENTAL SCIENCES Remote Sensing of Environment Pub Date : 2023-11-04 DOI:10.1016/j.rse.2023.113891
Jaya Sree Mugunthan , Claude R. Duguay , Elena Zakharova
{"title":"Machine learning based classification of lake ice and open water from Sentinel-3 SAR altimetry waveforms","authors":"Jaya Sree Mugunthan ,&nbsp;Claude R. Duguay ,&nbsp;Elena Zakharova","doi":"10.1016/j.rse.2023.113891","DOIUrl":null,"url":null,"abstract":"<div><p>The aim of the study was to evaluate, for the first time, the capability of different machine learning (ML) algorithms in classifying along-track lake surface conditions (open water and ice types) across ice seasons (freeze-up, ice growth and break-up periods) from Sentinel-3 A/B synthetic aperture radar altimeter (SRAL) data. To achieve this goal, over 107,500 radar waveforms extracted from 11 large lakes across the Northern Hemisphere and three ice seasons (2018–2021) were manually labelled using complementary satellite data (Sentinel-1 imaging Synthetic Aperture Radar (SAR), Sentinel-2 Multispectral Instrument (MSI) Level 1C, and MODIS Aqua/Terra data) for the training and testing of the ML algorithms in discriminating between open water, young (thin) ice, growing ice and melting ice. The four ML algorithms tested include Random Forest (RF), Gradient Boosting Trees (GBT), K Nearest Neighbor (KNN) and Support Vector Machine (SVM). To characterize the waveforms, seven waveform parameters were derived: Leading Edge Width (LEW), Offset Center of Gravity (OCOG) Width, Pulse Peakiness (PP), backscatter coefficient (Sigma0), late tail to peak power (LTPP), early tail to peak power (ETPP) and the maximum value of the echo power (Max). Accuracies &gt;95% were achieved across all classifiers using a 4-parameter combination (Sigma0, PP, OCOG Width, and LEW). Among all waveform parameters, Sigma0, OCOG width and PP were found to be the most important parameters for discriminating between lake ice types and open water. Despite showing comparable classification performances in the overall classification, RF and KNN are found to be a better fit for global lake ice mapping as both are less sensitive to their internal hyperparameters. Additionally, consistent results (&gt;93.7% accuracy in all classifiers) achieved on the accuracy assessment carried out for each lake (out-of-sample testing) revealed the strength of the classifiers for spatial transferability. Implementation of RF and KNN could be valuable in a pre-or post-processing step for identifying lake surface conditions under which the retrieval of water level and ice thickness may be limited or not possible and, therefore, inform algorithms currently used for the generation of operational or research products. While the research focused on 11 of the largest lakes of the Northern Hemisphere, the classification approach presented herein has potential for application on smaller lakes too since data in SAR mode (∼300 m along-track resolution) are used.</p></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"299 ","pages":"Article 113891"},"PeriodicalIF":11.1000,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S003442572300442X/pdfft?md5=ebf6304468b17b0856bca3e55010b8db&pid=1-s2.0-S003442572300442X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003442572300442X","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The aim of the study was to evaluate, for the first time, the capability of different machine learning (ML) algorithms in classifying along-track lake surface conditions (open water and ice types) across ice seasons (freeze-up, ice growth and break-up periods) from Sentinel-3 A/B synthetic aperture radar altimeter (SRAL) data. To achieve this goal, over 107,500 radar waveforms extracted from 11 large lakes across the Northern Hemisphere and three ice seasons (2018–2021) were manually labelled using complementary satellite data (Sentinel-1 imaging Synthetic Aperture Radar (SAR), Sentinel-2 Multispectral Instrument (MSI) Level 1C, and MODIS Aqua/Terra data) for the training and testing of the ML algorithms in discriminating between open water, young (thin) ice, growing ice and melting ice. The four ML algorithms tested include Random Forest (RF), Gradient Boosting Trees (GBT), K Nearest Neighbor (KNN) and Support Vector Machine (SVM). To characterize the waveforms, seven waveform parameters were derived: Leading Edge Width (LEW), Offset Center of Gravity (OCOG) Width, Pulse Peakiness (PP), backscatter coefficient (Sigma0), late tail to peak power (LTPP), early tail to peak power (ETPP) and the maximum value of the echo power (Max). Accuracies >95% were achieved across all classifiers using a 4-parameter combination (Sigma0, PP, OCOG Width, and LEW). Among all waveform parameters, Sigma0, OCOG width and PP were found to be the most important parameters for discriminating between lake ice types and open water. Despite showing comparable classification performances in the overall classification, RF and KNN are found to be a better fit for global lake ice mapping as both are less sensitive to their internal hyperparameters. Additionally, consistent results (>93.7% accuracy in all classifiers) achieved on the accuracy assessment carried out for each lake (out-of-sample testing) revealed the strength of the classifiers for spatial transferability. Implementation of RF and KNN could be valuable in a pre-or post-processing step for identifying lake surface conditions under which the retrieval of water level and ice thickness may be limited or not possible and, therefore, inform algorithms currently used for the generation of operational or research products. While the research focused on 11 of the largest lakes of the Northern Hemisphere, the classification approach presented herein has potential for application on smaller lakes too since data in SAR mode (∼300 m along-track resolution) are used.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的Sentinel-3 SAR测高波形对湖泊冰和开放水域的分类
该研究的目的是首次评估不同机器学习(ML)算法从Sentinel-3 A/B合成孔径雷达高度计(SRAL)数据中对不同冰季(结冰、结冰和破裂期)的沿轨道湖面条件(开放水域和冰类型)进行分类的能力。为了实现这一目标,使用互补卫星数据(Sentinel-1成像合成孔径雷达(SAR)、Sentinel-2多光谱仪器(MSI)1C级、,和MODIS Aqua/Terra数据),用于训练和测试ML算法,以区分开放水域、年轻(薄)冰、生长中的冰和融化的冰。测试的四种ML算法包括随机森林(RF)、梯度提升树(GBT)、K近邻(KNN)和支持向量机(SVM)。为了表征波形,导出了七个波形参数:前沿宽度(LEW)、偏移重心(OCG)宽度、脉冲峰值(PP)、反向散射系数(Sigma0)、后期尾峰功率(LTPP)、早期尾峰功率和回波功率的最大值(Max)。指控>;使用4参数组合(Sigma0、PP、OCG宽度和LEW)在所有分类器中实现了95%。在所有波形参数中,Sigma0、奥组委宽度和PP是区分湖泊冰类型和开放水域的最重要参数。尽管在总体分类中显示出可比的分类性能,但RF和KNN被发现更适合全球湖泊冰图绘制,因为两者对其内部超参数不太敏感。此外,在对每个湖泊进行的准确度评估(样本外测试)中获得的一致结果(所有分类器的准确度>93.7%)揭示了分类器在空间可转移性方面的优势。在前或后处理步骤中,RF和KNN的实现对于识别湖面条件可能是有价值的,在这种情况下,水位和冰厚度的反演可能受到限制或不可能,因此,为目前用于生成作战或研究产品的算法提供信息。虽然研究的重点是北半球11个最大的湖泊,但由于使用了SAR模式(沿轨道分辨率~300m)的数据,本文提出的分类方法也有可能应用于较小的湖泊。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Remote Sensing of Environment
Remote Sensing of Environment 环境科学-成像科学与照相技术
CiteScore
25.10
自引率
8.90%
发文量
455
审稿时长
53 days
期刊介绍: Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing. The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques. RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.
期刊最新文献
Two-decade surface ozone (O3) pollution in China: Enhanced fine-scale estimations and environmental health implications Assessing lead fraction derived from passive microwave images and improving estimates at pixel-wise level Estimating anthropogenic CO2 emissions from China's Yangtze River Delta using OCO-2 observations and WRF-Chem simulations A dual-branch network for crop-type mapping of scattered small agricultural fields in time series remote sensing images From theory to hydrological practice: Leveraging CYGNSS data over seven years for advanced soil moisture monitoring
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1