tidysdm: Leveraging the flexibility of tidymodels for species distribution modelling in R

IF 6.3 2区 环境科学与生态学 Q1 ECOLOGY Methods in Ecology and Evolution Pub Date : 2024-09-10 DOI:10.1111/2041-210x.14406
Michela Leonardi, Margherita Colucci, Andrea Vittorio Pozzi, Eleanor M. L. Scerri, Andrea Manica
{"title":"tidysdm: Leveraging the flexibility of tidymodels for species distribution modelling in R","authors":"Michela Leonardi, Margherita Colucci, Andrea Vittorio Pozzi, Eleanor M. L. Scerri, Andrea Manica","doi":"10.1111/2041-210x.14406","DOIUrl":null,"url":null,"abstract":"<jats:list> <jats:list-item>In species distribution modelling (SDM), it is common practice to explore multiple machine learning (ML) algorithms and combine their results into ensembles. In R, many implementations of different ML algorithms are available but, as they were mostly developed independently, they often use inconsistent syntax and data structures. For this reason, repeating an analysis with multiple algorithms and combining their results can be challenging.</jats:list-item> <jats:list-item>Specialised SDM packages solve this problem by providing a simpler, unified interface by wrapping the original functions to tackle each specific requirement. However, creating and maintaining such interfaces is time‐consuming, and with this approach, the user cannot easily integrate other methods that may become available.</jats:list-item> <jats:list-item>Here, we present <jats:italic>tidysdm</jats:italic>, an R package that solves this problem by taking advantage of the <jats:italic>tidymodels</jats:italic> universe. <jats:italic>tidymodels</jats:italic> provide standardised grammar, data structures and modelling interfaces, and a well‐documented infrastructure to integrate new algorithms and metrics. The wide adoption of <jats:italic>tidymodels</jats:italic> means that most ML algorithms and metrics are already integrated, and the user can add additional ones. Moreover, because of the broad adoption of <jats:italic>tidymodels</jats:italic>, new statistical approaches tend to be implemented quickly, making them easily integrated into existing pipelines and analyses.</jats:list-item> <jats:list-item><jats:italic>tidysdm</jats:italic> takes advantage of the <jats:italic>tidymodels</jats:italic> universe to provide a flexible and fully customisable pipeline to fit SDM. It includes SDM‐specific algorithms and metrics, and methods to facilitate the use of spatial data within <jats:italic>tidymodels</jats:italic>.</jats:list-item> <jats:list-item>Additionally, <jats:italic>tidysdm</jats:italic> is the first software that natively allows SDM to be performed using data from different periods, expanding the availability of SDM for scholars working in palaeontology, archaeology, palaeobiology, palaeoecology and other disciplines focussing on the past.</jats:list-item> </jats:list>","PeriodicalId":208,"journal":{"name":"Methods in Ecology and Evolution","volume":null,"pages":null},"PeriodicalIF":6.3000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods in Ecology and Evolution","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1111/2041-210x.14406","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

In species distribution modelling (SDM), it is common practice to explore multiple machine learning (ML) algorithms and combine their results into ensembles. In R, many implementations of different ML algorithms are available but, as they were mostly developed independently, they often use inconsistent syntax and data structures. For this reason, repeating an analysis with multiple algorithms and combining their results can be challenging. Specialised SDM packages solve this problem by providing a simpler, unified interface by wrapping the original functions to tackle each specific requirement. However, creating and maintaining such interfaces is time‐consuming, and with this approach, the user cannot easily integrate other methods that may become available. Here, we present tidysdm, an R package that solves this problem by taking advantage of the tidymodels universe. tidymodels provide standardised grammar, data structures and modelling interfaces, and a well‐documented infrastructure to integrate new algorithms and metrics. The wide adoption of tidymodels means that most ML algorithms and metrics are already integrated, and the user can add additional ones. Moreover, because of the broad adoption of tidymodels, new statistical approaches tend to be implemented quickly, making them easily integrated into existing pipelines and analyses. tidysdm takes advantage of the tidymodels universe to provide a flexible and fully customisable pipeline to fit SDM. It includes SDM‐specific algorithms and metrics, and methods to facilitate the use of spatial data within tidymodels. Additionally, tidysdm is the first software that natively allows SDM to be performed using data from different periods, expanding the availability of SDM for scholars working in palaeontology, archaeology, palaeobiology, palaeoecology and other disciplines focussing on the past.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
tidysdm:利用潮汐模型的灵活性在 R 中建立物种分布模型
在物种分布建模(SDM)中,通常的做法是探索多种机器学习(ML)算法,并将其结果组合成集合。在 R 语言中,有许多不同 ML 算法的实现方法,但由于它们大多是独立开发的,因此经常使用不一致的语法和数据结构。因此,使用多种算法重复分析并将其结果组合起来是一项挑战。专门的 SDM 软件包可以解决这个问题,它通过封装原始函数来提供更简单、统一的界面,以满足各种特定要求。然而,创建和维护这样的界面非常耗时,而且采用这种方法,用户无法轻松集成可能出现的其他方法。tidymodels 提供了标准化的语法、数据结构和建模接口,以及文档齐全的基础设施,可用于集成新算法和度量标准。tidymodels 的广泛采用意味着大多数 ML 算法和度量标准已经集成,用户可以添加其他算法和度量标准。此外,由于 tidymodels 被广泛采用,新的统计方法往往能很快实施,从而很容易集成到现有的管道和分析中。tidysdm 利用 tidymodels 的优势,提供了一个灵活、完全可定制的管道,以适应 SDM。它包括 SDM 专用算法和指标,以及便于在 tidymodels 中使用空间数据的方法。此外,tidysdm 还是第一款允许使用不同时期数据进行 SDM 的软件,为古生物学、考古学、古生物学、古生态学和其他关注过去的学科的学者提供了 SDM 的更多可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.60
自引率
3.00%
发文量
236
审稿时长
4-8 weeks
期刊介绍: A British Ecological Society journal, Methods in Ecology and Evolution (MEE) promotes the development of new methods in ecology and evolution, and facilitates their dissemination and uptake by the research community. MEE brings together papers from previously disparate sub-disciplines to provide a single forum for tracking methodological developments in all areas. MEE publishes methodological papers in any area of ecology and evolution, including: -Phylogenetic analysis -Statistical methods -Conservation & management -Theoretical methods -Practical methods, including lab and field -This list is not exhaustive, and we welcome enquiries about possible submissions. Methods are defined in the widest terms and may be analytical, practical or conceptual. A primary aim of the journal is to maximise the uptake of techniques by the community. We recognise that a major stumbling block in the uptake and application of new methods is the accessibility of methods. For example, users may need computer code, example applications or demonstrations of methods.
期刊最新文献
sabinaNSDM: An R package for spatially nested hierarchical species distribution modelling Introducing a unique animal ID and digital life history museum for wildlife metadata tidysdm: Leveraging the flexibility of tidymodels for species distribution modelling in R Robust characterisation of forest structure from airborne laser scanning—A systematic assessment and sample workflow for ecologists Spatial confounding in joint species distribution models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1