Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study

Charlotte Lambert, Auriane Virgili
{"title":"Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study","authors":"Charlotte Lambert, Auriane Virgili","doi":"10.24072/pcjournal.263","DOIUrl":null,"url":null,"abstract":"Species distribution models (SDM) are widely used to describe and explain how species relate to their environment and predict their spatial distributions. As such, they are the cornerstone of most of spatial planning efforts worldwide. SDM can be implemented with a wide array of data types (presence-only, presence-absence, count...), which can either be point- or areal-based, and use a wide array of environmental conditions as predictor variables. The choice of the sampling type as well as the resolution of environmental conditions to be used are recognized as of crucial importance, yet we lack any quantification of the effects these decisions may have on SDM reliability. In the present work, we fill this gap with an unprecedented simulation procedure. We simulated 100 possible distributions of two different virtual species in two different regions. Species distribution were modelled using either segment- or areal-based sampling and five different spatial resolutions of environmental conditions. The SDM performances were inspected by statistical metrics, model composition, shapes of relationships and prediction quality. We provided clear evidence of stochasticity in the modelling process (particularly in the shapes of relationships): two dataset from the same survey, species and region could yield different results. Sampling type had stronger effects than spatial resolution on the final model relevance. The effect of coarsening the resolution was directly related to the resistance of the spatial features to changes of scale: SDM failed to adequately identify spatial distributions when the spatial features targeted by the species were diluted by resolution coarsening. These results have important implications for the SDM community, backing up some commonly accepted choices, but also by highlighting some up-to-now unexpected features of SDM (stochasticity). As a whole, this work calls for carefully weighted decisions in implementing models, and for caution in interpreting results.","PeriodicalId":74413,"journal":{"name":"Peer community journal","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Peer community journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24072/pcjournal.263","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Species distribution models (SDM) are widely used to describe and explain how species relate to their environment and predict their spatial distributions. As such, they are the cornerstone of most of spatial planning efforts worldwide. SDM can be implemented with a wide array of data types (presence-only, presence-absence, count...), which can either be point- or areal-based, and use a wide array of environmental conditions as predictor variables. The choice of the sampling type as well as the resolution of environmental conditions to be used are recognized as of crucial importance, yet we lack any quantification of the effects these decisions may have on SDM reliability. In the present work, we fill this gap with an unprecedented simulation procedure. We simulated 100 possible distributions of two different virtual species in two different regions. Species distribution were modelled using either segment- or areal-based sampling and five different spatial resolutions of environmental conditions. The SDM performances were inspected by statistical metrics, model composition, shapes of relationships and prediction quality. We provided clear evidence of stochasticity in the modelling process (particularly in the shapes of relationships): two dataset from the same survey, species and region could yield different results. Sampling type had stronger effects than spatial resolution on the final model relevance. The effect of coarsening the resolution was directly related to the resistance of the spatial features to changes of scale: SDM failed to adequately identify spatial distributions when the spatial features targeted by the species were diluted by resolution coarsening. These results have important implications for the SDM community, backing up some commonly accepted choices, but also by highlighting some up-to-now unexpected features of SDM (stochasticity). As a whole, this work calls for carefully weighted decisions in implementing models, and for caution in interpreting results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数据随机性和模型参数化影响物种分布模型的性能:来自模拟研究的见解
物种分布模型(SDM)被广泛用于描述和解释物种与环境的关系,并预测其空间分布。因此,它们是世界上大多数空间规划工作的基石。SDM可以使用多种数据类型(仅存在、不存在、计数等)来实现,这些数据类型可以是基于点的,也可以是基于区域的,并且可以使用多种环境条件作为预测变量。采样类型的选择以及要使用的环境条件的分辨率被认为是至关重要的,但我们缺乏对这些决策可能对SDM可靠性产生的影响的任何量化。在目前的工作中,我们用一种前所未有的模拟程序填补了这一空白。我们模拟了两种不同虚拟物种在两个不同地区的100种可能分布。物种分布采用分段或基于区域的采样和五种不同的空间分辨率的环境条件进行建模。通过统计指标、模型组成、关系形状和预测质量来检验SDM的性能。我们在建模过程中提供了明确的随机性证据(特别是在关系的形状上):来自同一调查的两个数据集,物种和地区可能会产生不同的结果。采样类型对最终模型相关性的影响强于空间分辨率。分辨率粗化的效果与空间特征对尺度变化的抵抗力直接相关:当分辨率粗化稀释了物种所瞄准的空间特征时,SDM无法充分识别空间分布。这些结果对SDM社区具有重要意义,支持了一些普遍接受的选择,但也强调了SDM的一些迄今为止意想不到的特征(随机性)。总的来说,这项工作要求在实现模型时仔细权衡决策,并在解释结果时谨慎。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
High quality genome assembly and annotation (v1) of the eukaryotic terrestrial microalga Coccomyxa viridis SAG 216-4 Ecotoxicity of lanthanides to Daphnia magna: insights from elemental behavior and speciation in a standardized test medium T7 DNA polymerase treatment improves quantitative sequencing of both double-stranded and single-stranded DNA viruses Differences in specificity, development time and virulence between two acanthocephalan parasites, infecting two cryptic species of Gammarus fossarum Multiproxy analysis exploring patterns of diet and disease in dental calculus and skeletal remains from a 19th century Dutch population
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1