基于特征集优化的蒸散发分配机器学习模型构建框架

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Applied Computing and Geosciences Pub Date : 2022-12-01 DOI:10.1016/j.acags.2022.100105
Adam Stapleton , Elke Eichelmann , Mark Roantree
{"title":"基于特征集优化的蒸散发分配机器学习模型构建框架","authors":"Adam Stapleton ,&nbsp;Elke Eichelmann ,&nbsp;Mark Roantree","doi":"10.1016/j.acags.2022.100105","DOIUrl":null,"url":null,"abstract":"<div><p>A deeper understanding of the drivers of evapotranspiration and the modelling of its constituent parts (evaporation and transpiration) may be of significant importance to the monitoring and management of water resources globally over the coming decades. In this work a framework was developed to identify the best performing machine learning algorithm from a candidate set, select optimal predictive features and rank features in terms of their importance to predictive accuracy. The experiments conducted in this work used 3 separate feature sets across 4 wetland sites as input into 8 candidate machine learning algorithms, providing 96 sets of experimental configurations. Given this high number of parameters, our results show strong evidence that there is no singularly optimal machine learning algorithm or feature set across all of the wetland sites studied despite their similarities. At each of the sites at least one model was identified that improved on the predictive performance of our baseline. A key finding discovered when examining feature importance is that methane flux, a feature whose relationship with evapotranspiration is not generally examined, may contribute to further biophysical process understanding. This work demonstrates the applicability of a machine learning framework for evapotranspiration partitioning that is independent of domain knowledge, producing improved models for partitioning and identifying new and useful predictive features.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100105"},"PeriodicalIF":2.6000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000271/pdfft?md5=4bb0fbb0ea2eccd1e035569ab227461d&pid=1-s2.0-S2590197422000271-main.pdf","citationCount":"0","resultStr":"{\"title\":\"A framework for constructing machine learning models with feature set optimisation for evapotranspiration partitioning\",\"authors\":\"Adam Stapleton ,&nbsp;Elke Eichelmann ,&nbsp;Mark Roantree\",\"doi\":\"10.1016/j.acags.2022.100105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A deeper understanding of the drivers of evapotranspiration and the modelling of its constituent parts (evaporation and transpiration) may be of significant importance to the monitoring and management of water resources globally over the coming decades. In this work a framework was developed to identify the best performing machine learning algorithm from a candidate set, select optimal predictive features and rank features in terms of their importance to predictive accuracy. The experiments conducted in this work used 3 separate feature sets across 4 wetland sites as input into 8 candidate machine learning algorithms, providing 96 sets of experimental configurations. Given this high number of parameters, our results show strong evidence that there is no singularly optimal machine learning algorithm or feature set across all of the wetland sites studied despite their similarities. At each of the sites at least one model was identified that improved on the predictive performance of our baseline. A key finding discovered when examining feature importance is that methane flux, a feature whose relationship with evapotranspiration is not generally examined, may contribute to further biophysical process understanding. This work demonstrates the applicability of a machine learning framework for evapotranspiration partitioning that is independent of domain knowledge, producing improved models for partitioning and identifying new and useful predictive features.</p></div>\",\"PeriodicalId\":33804,\"journal\":{\"name\":\"Applied Computing and Geosciences\",\"volume\":\"16 \",\"pages\":\"Article 100105\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2590197422000271/pdfft?md5=4bb0fbb0ea2eccd1e035569ab227461d&pid=1-s2.0-S2590197422000271-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Computing and Geosciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2590197422000271\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Computing and Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590197422000271","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

更深入地了解蒸散的驱动因素及其组成部分(蒸发和蒸腾)的建模可能对未来几十年全球水资源的监测和管理具有重要意义。在这项工作中,开发了一个框架,用于从候选集中识别性能最佳的机器学习算法,选择最优预测特征并根据其对预测准确性的重要性对特征进行排名。在这项工作中进行的实验使用了4个湿地的3个独立特征集作为8个候选机器学习算法的输入,提供了96组实验配置。考虑到这么多的参数,我们的研究结果显示了强有力的证据,表明尽管有相似之处,但在所研究的所有湿地地点中,没有单一的最佳机器学习算法或特征集。在每个站点,至少有一个模型被确定为改进了我们基线的预测性能。在研究特征重要性时发现的一个关键发现是,甲烷通量可能有助于进一步了解生物物理过程,而这一特征与蒸散发的关系通常没有得到研究。这项工作证明了独立于领域知识的蒸散发划分的机器学习框架的适用性,产生了用于划分和识别新的和有用的预测特征的改进模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A framework for constructing machine learning models with feature set optimisation for evapotranspiration partitioning

A deeper understanding of the drivers of evapotranspiration and the modelling of its constituent parts (evaporation and transpiration) may be of significant importance to the monitoring and management of water resources globally over the coming decades. In this work a framework was developed to identify the best performing machine learning algorithm from a candidate set, select optimal predictive features and rank features in terms of their importance to predictive accuracy. The experiments conducted in this work used 3 separate feature sets across 4 wetland sites as input into 8 candidate machine learning algorithms, providing 96 sets of experimental configurations. Given this high number of parameters, our results show strong evidence that there is no singularly optimal machine learning algorithm or feature set across all of the wetland sites studied despite their similarities. At each of the sites at least one model was identified that improved on the predictive performance of our baseline. A key finding discovered when examining feature importance is that methane flux, a feature whose relationship with evapotranspiration is not generally examined, may contribute to further biophysical process understanding. This work demonstrates the applicability of a machine learning framework for evapotranspiration partitioning that is independent of domain knowledge, producing improved models for partitioning and identifying new and useful predictive features.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Computing and Geosciences
Applied Computing and Geosciences Computer Science-General Computer Science
CiteScore
5.50
自引率
0.00%
发文量
23
审稿时长
5 weeks
期刊最新文献
A machine learning approach for mapping susceptibility to land subsidence caused by ground water extraction Revolutionizing the future of hydrological science: Impact of machine learning and deep learning amidst emerging explainable AI and transfer learning Generating land gravity anomalies from satellite gravity observations using PIX2PIX GAN image translation Reconstruction of reservoir rock using attention-based convolutional recurrent neural network Mapping landforms of a hilly landscape using machine learning and high-resolution LiDAR topographic data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1