利用优化的光谱指数和融合地理空间信息的机器学习方法精确估算枣树叶片叶绿素含量

IF 8.5 2区 环境科学与生态学 Q1 ECOLOGY Ecological Informatics Pub Date : 2025-03-01 Epub Date: 2024-12-24 DOI:10.1016/j.ecoinf.2024.102980
Nigela Tuerxun , Sulei Naibi , Jianghua Zheng , Renjun Wang , Lei Wang , Binbin Lu , Danlin Yu
{"title":"利用优化的光谱指数和融合地理空间信息的机器学习方法精确估算枣树叶片叶绿素含量","authors":"Nigela Tuerxun ,&nbsp;Sulei Naibi ,&nbsp;Jianghua Zheng ,&nbsp;Renjun Wang ,&nbsp;Lei Wang ,&nbsp;Binbin Lu ,&nbsp;Danlin Yu","doi":"10.1016/j.ecoinf.2024.102980","DOIUrl":null,"url":null,"abstract":"<div><div>Leaf chlorophyll content (LCC) is vital for photosynthesis and ecosystem functioning; it influences carbon, water, and energy exchanges while serving as an indicator of photosynthetic activity and nitrogen levels in precision agriculture. Hyperspectral data enable precise LCC monitoring by extracting spectral indices through optimal band combination (OBC) and predicting LCC with machine learning. However, OBC faces dimensionality issues, and machine learning models often overlook geographical influences, potentially reducing prediction accuracy. This study hypothesizes that developing spectral indices from important wavelengths and integrating geospatial data into machine learning models can address these issues and increase prediction accuracy. To test this hypothesis, a framework was developed that first uses elastic net (EN) and the successive projection algorithm (SPA) for wavelength selection, followed by spectral index creation with OBC and ranking with random forest (RF). Support vector regression (SVR), random forest regression (RFR), and geographically weighted least squares support vector regression (GWLS-SVR) were then used to assess the prediction accuracy. Finally, the optimal variables and regression model were identified. The results revealed that the EN- and SPA-based indices had stronger correlations and importance than defined indices. The double-difference index (DDn) and the anti-reflectance index (ARI) are the most robust three-dimensional and two-dimensional spectral indices, respectively. GWLS-SVR requires fewer indices (1–4) to achieve optimal results, with EN-DDn (2<em>R</em><sub>519</sub>-<em>R</em><sub>775</sub>-<em>R</em><sub>936</sub>)-GWLS-SVR performing best (R<sup>2</sup> = 0.95, RMSE = 0.61, PBIAS = -0.02). This research presents a robust framework with strong adaptability for estimating LCC in a specific study area and region, demonstrating substantial potential for the precise estimation of agroforestry vegetation parameters.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"85 ","pages":"Article 102980"},"PeriodicalIF":8.5000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Accurate estimation of Jujube leaf chlorophyll content using optimized spectral indices and machine learning methods integrating geospatial information\",\"authors\":\"Nigela Tuerxun ,&nbsp;Sulei Naibi ,&nbsp;Jianghua Zheng ,&nbsp;Renjun Wang ,&nbsp;Lei Wang ,&nbsp;Binbin Lu ,&nbsp;Danlin Yu\",\"doi\":\"10.1016/j.ecoinf.2024.102980\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Leaf chlorophyll content (LCC) is vital for photosynthesis and ecosystem functioning; it influences carbon, water, and energy exchanges while serving as an indicator of photosynthetic activity and nitrogen levels in precision agriculture. Hyperspectral data enable precise LCC monitoring by extracting spectral indices through optimal band combination (OBC) and predicting LCC with machine learning. However, OBC faces dimensionality issues, and machine learning models often overlook geographical influences, potentially reducing prediction accuracy. This study hypothesizes that developing spectral indices from important wavelengths and integrating geospatial data into machine learning models can address these issues and increase prediction accuracy. To test this hypothesis, a framework was developed that first uses elastic net (EN) and the successive projection algorithm (SPA) for wavelength selection, followed by spectral index creation with OBC and ranking with random forest (RF). Support vector regression (SVR), random forest regression (RFR), and geographically weighted least squares support vector regression (GWLS-SVR) were then used to assess the prediction accuracy. Finally, the optimal variables and regression model were identified. The results revealed that the EN- and SPA-based indices had stronger correlations and importance than defined indices. The double-difference index (DDn) and the anti-reflectance index (ARI) are the most robust three-dimensional and two-dimensional spectral indices, respectively. GWLS-SVR requires fewer indices (1–4) to achieve optimal results, with EN-DDn (2<em>R</em><sub>519</sub>-<em>R</em><sub>775</sub>-<em>R</em><sub>936</sub>)-GWLS-SVR performing best (R<sup>2</sup> = 0.95, RMSE = 0.61, PBIAS = -0.02). This research presents a robust framework with strong adaptability for estimating LCC in a specific study area and region, demonstrating substantial potential for the precise estimation of agroforestry vegetation parameters.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"85 \",\"pages\":\"Article 102980\"},\"PeriodicalIF\":8.5000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954124005223\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/24 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124005223","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/24 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

叶片叶绿素含量(LCC)对光合作用和生态系统功能至关重要;它影响碳、水和能量交换,同时作为精准农业光合活性和氮水平的指标。高光谱数据通过最优波段组合(OBC)提取光谱指标,并通过机器学习预测LCC,从而实现精确的LCC监测。然而,OBC面临维度问题,机器学习模型经常忽略地理影响,可能会降低预测准确性。本研究假设从重要波长开发光谱指数并将地理空间数据集成到机器学习模型中可以解决这些问题并提高预测精度。为了验证这一假设,开发了一个框架,首先使用弹性网(EN)和连续投影算法(SPA)进行波长选择,然后使用OBC创建光谱指数,并使用随机森林(RF)进行排序。采用支持向量回归(SVR)、随机森林回归(RFR)和地理加权最小二乘支持向量回归(GWLS-SVR)评估预测精度。最后,确定了最优变量和回归模型。结果表明,基于EN和spa的指数比定义的指数具有更强的相关性和重要性。双差指数(DDn)和抗反射指数(ARI)分别是最稳健的三维和二维光谱指数。GWLS-SVR所需指标较少(1-4)即可达到最佳效果,其中EN-DDn (2R519-R775-R936)-GWLS-SVR表现最佳(R2 = 0.95, RMSE = 0.61, PBIAS = -0.02)。本研究提出了一个鲁棒框架,具有较强的适应性,可用于估算特定研究区域和区域的LCC,显示出农林植被参数精确估算的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Accurate estimation of Jujube leaf chlorophyll content using optimized spectral indices and machine learning methods integrating geospatial information
Leaf chlorophyll content (LCC) is vital for photosynthesis and ecosystem functioning; it influences carbon, water, and energy exchanges while serving as an indicator of photosynthetic activity and nitrogen levels in precision agriculture. Hyperspectral data enable precise LCC monitoring by extracting spectral indices through optimal band combination (OBC) and predicting LCC with machine learning. However, OBC faces dimensionality issues, and machine learning models often overlook geographical influences, potentially reducing prediction accuracy. This study hypothesizes that developing spectral indices from important wavelengths and integrating geospatial data into machine learning models can address these issues and increase prediction accuracy. To test this hypothesis, a framework was developed that first uses elastic net (EN) and the successive projection algorithm (SPA) for wavelength selection, followed by spectral index creation with OBC and ranking with random forest (RF). Support vector regression (SVR), random forest regression (RFR), and geographically weighted least squares support vector regression (GWLS-SVR) were then used to assess the prediction accuracy. Finally, the optimal variables and regression model were identified. The results revealed that the EN- and SPA-based indices had stronger correlations and importance than defined indices. The double-difference index (DDn) and the anti-reflectance index (ARI) are the most robust three-dimensional and two-dimensional spectral indices, respectively. GWLS-SVR requires fewer indices (1–4) to achieve optimal results, with EN-DDn (2R519-R775-R936)-GWLS-SVR performing best (R2 = 0.95, RMSE = 0.61, PBIAS = -0.02). This research presents a robust framework with strong adaptability for estimating LCC in a specific study area and region, demonstrating substantial potential for the precise estimation of agroforestry vegetation parameters.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
期刊最新文献
A multi-model ecosystem risk assessment methodology for the Massaciuccoli Lake basin wetland CBM-Net: Automatic detection and separation of cicada chorus from bee wingbeat audios Construction of long-term (2003–2024) DIN dataset in the Chinese coastal ocean using interpretable ML Bayesian source apportionment of sedimentary organic carbon along a sluice-regulated river–estuary continuum: Coupling machine learning, spatial analytics, and multi-proxy geochemistry Sampling method influences zooplankton diversity estimates in humic lakes: A case study with implications for ecological assessment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1