Rebecca Brown Cooper, Bethany J Allen, Daniele Silvestro
{"title":"DeepDiveR--一款通过深度学习从化石地点估算古多样性的软件","authors":"Rebecca Brown Cooper, Bethany J Allen, Daniele Silvestro","doi":"10.1101/2024.09.03.610960","DOIUrl":null,"url":null,"abstract":"The incompleteness of the fossil record, in particular variation in preservation and sampling through space and time, presents a barrier to estimating changes in biodiversity which standard statistical methods struggle to account for. Here we present DeepDiveR, an R package for the DeepDive program enabling estimation of biodiversity from fossil occurrence data. The method uses a simulation-trained deep neural network to generate predictions of biodiversity change through time, while accounting for temporal, spatial and taxonomic heterogeneities in preservation. DeepDiveR can be readily used to explore the extinct biodiversity of different clades. We demonstrate the pipeline to build and customise analyses, including consideration of changes in biogeography. We also further develop the model to integrate information about modern diversity in the case of extant clades and introduce a function that automatically adjusts the parameterization of the simulations to generate training data that reflect the distribution of empirical datasets. To demonstrate the software, we analyse the fossil record of the order Carnivora through the Cenozoic, finding a peak in diversity in the Late Miocene and a 37% species loss since the Pleistocene. Our implementation includes the generation summary statistics and plots that allow for an evaluation of the model performance and diversity estimations and a configuration file that captures all parameters required to guarantee the full reproducibility of the results.","PeriodicalId":501477,"journal":{"name":"bioRxiv - Paleontology","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DeepDiveR – A software for deep learning estimation of palaeodiversity from fossil occurrences\",\"authors\":\"Rebecca Brown Cooper, Bethany J Allen, Daniele Silvestro\",\"doi\":\"10.1101/2024.09.03.610960\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The incompleteness of the fossil record, in particular variation in preservation and sampling through space and time, presents a barrier to estimating changes in biodiversity which standard statistical methods struggle to account for. Here we present DeepDiveR, an R package for the DeepDive program enabling estimation of biodiversity from fossil occurrence data. The method uses a simulation-trained deep neural network to generate predictions of biodiversity change through time, while accounting for temporal, spatial and taxonomic heterogeneities in preservation. DeepDiveR can be readily used to explore the extinct biodiversity of different clades. We demonstrate the pipeline to build and customise analyses, including consideration of changes in biogeography. We also further develop the model to integrate information about modern diversity in the case of extant clades and introduce a function that automatically adjusts the parameterization of the simulations to generate training data that reflect the distribution of empirical datasets. To demonstrate the software, we analyse the fossil record of the order Carnivora through the Cenozoic, finding a peak in diversity in the Late Miocene and a 37% species loss since the Pleistocene. Our implementation includes the generation summary statistics and plots that allow for an evaluation of the model performance and diversity estimations and a configuration file that captures all parameters required to guarantee the full reproducibility of the results.\",\"PeriodicalId\":501477,\"journal\":{\"name\":\"bioRxiv - Paleontology\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv - Paleontology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.09.03.610960\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Paleontology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.03.610960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
化石记录的不完整性,特别是保存和取样在空间和时间上的差异,给估算生物多样性的变化带来了障碍,而标准的统计方法难以对此做出解释。在此,我们介绍 DeepDiveR,这是一个用于 DeepDive 程序的 R 软件包,可根据化石出现数据估算生物多样性。该方法使用模拟训练的深度神经网络生成生物多样性随时间变化的预测,同时考虑到保存中的时间、空间和分类异质性。DeepDiveR 可用于探索不同支系的灭绝生物多样性。我们展示了建立和定制分析的管道,包括考虑生物地理学的变化。我们还进一步开发了模型,以整合现存支系的现代多样性信息,并引入了一个自动调整模拟参数的功能,以生成反映经验数据集分布的训练数据。为了演示该软件,我们分析了食肉目在新生代的化石记录,发现晚中新世出现了一个多样性高峰,自更新世以来物种减少了 37%。我们的实施包括生成汇总统计数据和图表,以便对模型性能和多样性估算进行评估,还包括一个配置文件,其中包含保证结果完全可重现所需的所有参数。
DeepDiveR – A software for deep learning estimation of palaeodiversity from fossil occurrences
The incompleteness of the fossil record, in particular variation in preservation and sampling through space and time, presents a barrier to estimating changes in biodiversity which standard statistical methods struggle to account for. Here we present DeepDiveR, an R package for the DeepDive program enabling estimation of biodiversity from fossil occurrence data. The method uses a simulation-trained deep neural network to generate predictions of biodiversity change through time, while accounting for temporal, spatial and taxonomic heterogeneities in preservation. DeepDiveR can be readily used to explore the extinct biodiversity of different clades. We demonstrate the pipeline to build and customise analyses, including consideration of changes in biogeography. We also further develop the model to integrate information about modern diversity in the case of extant clades and introduce a function that automatically adjusts the parameterization of the simulations to generate training data that reflect the distribution of empirical datasets. To demonstrate the software, we analyse the fossil record of the order Carnivora through the Cenozoic, finding a peak in diversity in the Late Miocene and a 37% species loss since the Pleistocene. Our implementation includes the generation summary statistics and plots that allow for an evaluation of the model performance and diversity estimations and a configuration file that captures all parameters required to guarantee the full reproducibility of the results.