B. Martín, J. González-Arias, J. A. Vicente-Virseda
{"title":"Machine learning as a successful approach for predicting complex spatio–temporal patterns in animal species abundance","authors":"B. Martín, J. González-Arias, J. A. Vicente-Virseda","doi":"10.32800/abc.2021.44.0289","DOIUrl":null,"url":null,"abstract":"Our aim was to identify an optimal analytical approach for accurately predicting complex spatio–temporal patterns in animal species distribution. We compared the performance of eight modelling techniques (generalized additive models, regression trees, bagged CART, k–nearest neighbors, stochastic gradient boosting, support vector machines, neural network, and random forest –enhanced form of bootstrap. We also performed extreme gradient boosting –an enhanced form of radiant boosting– to predict spatial patterns in abundance of migrating Balearic shearwaters based on data gathered within eBird. Derived from open–source datasets, proxies of frontal systems and ocean productivity domains that have been previously used to characterize the oceanographic habitats of seabirds were quantified, and then used as predictors in the models. The random\nforest model showed the best performance according to the parameters assessed (RMSE value and R2). The correlation between observed and predicted abundance with this model was also considerably high. 
This study shows that the combination of machine learning techniques and massive data provided by open data sources is a useful approach for identifying the long–term spatial–temporal distribution of species at regional spatial scales.","PeriodicalId":49107,"journal":{"name":"Animal Biodiversity and Conservation","volume":" ","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2021-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Animal Biodiversity and Conservation","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.32800/abc.2021.44.0289","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIODIVERSITY CONSERVATION","Score":null,"Total":0}
Citation count: 4
Abstract
Our aim was to identify an optimal analytical approach for accurately predicting complex spatio–temporal patterns in animal species distribution. We compared the performance of eight modelling techniques (generalized additive models, regression trees, bagged CART, k–nearest neighbors, stochastic gradient boosting, support vector machines, neural networks, and random forest, an enhanced form of bootstrap aggregation). We also applied extreme gradient boosting, an enhanced form of gradient boosting, to predict spatial patterns in the abundance of migrating Balearic shearwaters based on data gathered within eBird. Proxies of frontal systems and ocean productivity domains, which have previously been used to characterize the oceanographic habitats of seabirds, were derived from open–source datasets, quantified, and then used as predictors in the models. The random forest model showed the best performance according to the parameters assessed (RMSE and R²). The correlation between observed and predicted abundance with this model was also high. This study shows that combining machine learning techniques with the massive data provided by open data sources is a useful approach for identifying the long–term spatio–temporal distribution of species at regional spatial scales.
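As a rough illustration of the evaluation criteria named in the abstract (RMSE, R², and the correlation between observed and predicted abundance), the sketch below scores two sets of predictions and picks the one with the lowest RMSE. It is not the authors' code: the model names and all numbers are invented for illustration, and R² is computed here as 1 − SS_res/SS_tot, which is an assumption, since the abstract does not say which definition was used.

```python
import math

def rmse(observed, predicted):
    """Root mean squared error between observed and predicted values."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def r_squared(observed, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

def pearson_r(observed, predicted):
    """Pearson correlation between observed and predicted values."""
    mean_o = sum(observed) / len(observed)
    mean_p = sum(predicted) / len(predicted)
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)

# Hypothetical abundance counts, invented for illustration (not eBird data).
observed = [2.0, 5.0, 9.0, 4.0, 12.0]

# Hypothetical held-out predictions from two placeholder models.
predictions = {
    "model_a": [3.0, 5.0, 8.0, 4.0, 11.0],
    "model_b": [6.0, 6.0, 6.0, 7.0, 7.0],
}

# Score every model with the three criteria.
scores = {
    name: {
        "rmse": rmse(observed, pred),
        "r2": r_squared(observed, pred),
        "r": pearson_r(observed, pred),
    }
    for name, pred in predictions.items()
}

# Select the model with the lowest RMSE.
best = min(scores, key=lambda name: scores[name]["rmse"])

for name, s in scores.items():
    print(f"{name}: RMSE={s['rmse']:.3f}  R2={s['r2']:.3f}  r={s['r']:.3f}")
print("lowest RMSE:", best)
```

In a real comparison the predictions would come from models fitted to training data and evaluated on held-out observations or by cross-validation; the selection rule (lowest RMSE, highest R²) would stay the same.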
About the journal:
Animal Biodiversity and Conservation (formerly Miscel·lània Zoològica) is an interdisciplinary journal published since 1958 by the Museu de Ciències Naturals de Barcelona. It includes empirical and theoretical research articles in all areas of zoology (systematics, taxonomy, morphology, biogeography, ecology, ethology, physiology, and genetics) from all regions of the world. The journal gives special attention to studies that pose a new problem or introduce a new topic, with clear hypotheses and predictions, and to work that is in some way relevant to conservation biology. Purely descriptive articles, and faunistic or chorological articles that describe the distribution of zoological organisms in space or time, are not published.