Comparative analysis of k-nearest neighbors distance metrics for retrieving coastal water quality based on concurrent in situ and satellite observations

IF 5.3 3区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES Marine pollution bulletin Pub Date : 2025-03-13 DOI:10.1016/j.marpolbul.2025.117816
Bonyad Ahmadi , Mehdi Gholamalifard , Seyed Mahmoud Ghasempouri , Tiit Kutser
{"title":"Comparative analysis of k-nearest neighbors distance metrics for retrieving coastal water quality based on concurrent in situ and satellite observations","authors":"Bonyad Ahmadi ,&nbsp;Mehdi Gholamalifard ,&nbsp;Seyed Mahmoud Ghasempouri ,&nbsp;Tiit Kutser","doi":"10.1016/j.marpolbul.2025.117816","DOIUrl":null,"url":null,"abstract":"<div><div>It is time consuming and expensive to monitor extensive areas of coastal waters with sufficient frequency using in situ (ship based) methods. Satellite remote sensing is much more cost effective. Satellites can detect Optically Active Constituents (OACs) in water. Therefore, it is crucial to know the concentrations of OACs in the study area in order to develop and validate remote sensing methods suitable for assessing water quality in this region. The Pars Special Economic Energy Zone (PSEEZ), a major hub of natural gas extraction in the Persian Gulf, has undergone rapid industrial expansion since 1998, intensifying environmental pressures and necessitating high-resolution, frequent water quality assessments. However, a structured, long-term monitoring framework is absent despite the significance of this region. In order to develop satellite-based remote sensing methods for this region we carried out measurements of different OACs (chlorophyll-a, coloured dissolved organic matter (CDOM) and turbidity) and tested Landsat 8, Sentinel-2, and Sentinel-3 performance in retrieving the OACs. We tested the k-Nearest Neighbors machine learning algorithm. The selection of distance metrics demonstrated a significant influence on the accuracy of retrieving OACs. In turbidity retrieval, the Euclidean Distance (ED) enhanced the regression slope to 0.90 (a 55.17 % improvement over Fuzzy Mahalanobis Distance (FD)) and reduced the RMSLE to 0.51, corresponding to an approximate 160 % enhancement in precision. For CDOM, RMSLE values for ED and FD were 0.39 and 0.48, respectively, indicating an 18.75 % improvement favoring ED. Furthermore, bias analysis revealed deviations of 1–6 % compared to reference data, with the lowest values observed for Mahalanobis Distance (MD) with MSI and FD with OLCI. In chlorophyll-a retrieval, the choice of distance metric directly impacted the accuracy of the OLCI sensor, inducing bidirectional bias, comprising both overestimation and underestimation, which varied depending on the selected metric. These results underscore the critical importance of optimizing distance metric selection to enhance prediction accuracy and mitigate systematic errors in remote sensing applications. Furthermore, the results revealed that the implementation of this algorithm exhibited substantially superior performance compared to other evaluated algorithms within the study area, achieving significantly higher accuracy metrics. Thereby establishing k-NN as the optimal framework for satellite-based water quality monitoring in environmentally sensitive regions like PSEEZ.</div></div>","PeriodicalId":18215,"journal":{"name":"Marine pollution bulletin","volume":"214 ","pages":"Article 117816"},"PeriodicalIF":5.3000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine pollution bulletin","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0025326X25002917","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

It is time consuming and expensive to monitor extensive areas of coastal waters with sufficient frequency using in situ (ship based) methods. Satellite remote sensing is much more cost effective. Satellites can detect Optically Active Constituents (OACs) in water. Therefore, it is crucial to know the concentrations of OACs in the study area in order to develop and validate remote sensing methods suitable for assessing water quality in this region. The Pars Special Economic Energy Zone (PSEEZ), a major hub of natural gas extraction in the Persian Gulf, has undergone rapid industrial expansion since 1998, intensifying environmental pressures and necessitating high-resolution, frequent water quality assessments. However, a structured, long-term monitoring framework is absent despite the significance of this region. In order to develop satellite-based remote sensing methods for this region we carried out measurements of different OACs (chlorophyll-a, coloured dissolved organic matter (CDOM) and turbidity) and tested Landsat 8, Sentinel-2, and Sentinel-3 performance in retrieving the OACs. We tested the k-Nearest Neighbors machine learning algorithm. The selection of distance metrics demonstrated a significant influence on the accuracy of retrieving OACs. In turbidity retrieval, the Euclidean Distance (ED) enhanced the regression slope to 0.90 (a 55.17 % improvement over Fuzzy Mahalanobis Distance (FD)) and reduced the RMSLE to 0.51, corresponding to an approximate 160 % enhancement in precision. For CDOM, RMSLE values for ED and FD were 0.39 and 0.48, respectively, indicating an 18.75 % improvement favoring ED. Furthermore, bias analysis revealed deviations of 1–6 % compared to reference data, with the lowest values observed for Mahalanobis Distance (MD) with MSI and FD with OLCI. In chlorophyll-a retrieval, the choice of distance metric directly impacted the accuracy of the OLCI sensor, inducing bidirectional bias, comprising both overestimation and underestimation, which varied depending on the selected metric. These results underscore the critical importance of optimizing distance metric selection to enhance prediction accuracy and mitigate systematic errors in remote sensing applications. Furthermore, the results revealed that the implementation of this algorithm exhibited substantially superior performance compared to other evaluated algorithms within the study area, achieving significantly higher accuracy metrics. Thereby establishing k-NN as the optimal framework for satellite-based water quality monitoring in environmentally sensitive regions like PSEEZ.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Marine pollution bulletin
Marine pollution bulletin 环境科学-海洋与淡水生物学
CiteScore
10.20
自引率
15.50%
发文量
1077
审稿时长
68 days
期刊介绍: Marine Pollution Bulletin is concerned with the rational use of maritime and marine resources in estuaries, the seas and oceans, as well as with documenting marine pollution and introducing new forms of measurement and analysis. A wide range of topics are discussed as news, comment, reviews and research reports, not only on effluent disposal and pollution control, but also on the management, economic aspects and protection of the marine environment in general.
期刊最新文献
Comparative analysis of k-nearest neighbors distance metrics for retrieving coastal water quality based on concurrent in situ and satellite observations Unsealing behaviour: Variation in harbour seal (Phoca vitulina) responses to anthropogenic sound in relation to individual health Degradation and habitat-dependent colonization of plastics in Caribbean coastal waters and sediments by bacterial communities Spatial analysis of total mercury concentration in surface sediments in Sulaibikhat Bay, Kuwait Enhancing Lagrangian particle tracking using objective Eulerian coherent structures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1