Filip Dorm, Christian Lange, Scott Loarie, Oisin Mac Aodha
{"title":"Generating Binary Species Range Maps","authors":"Filip Dorm, Christian Lange, Scott Loarie, Oisin Mac Aodha","doi":"arxiv-2408.15956","DOIUrl":null,"url":null,"abstract":"Accurately predicting the geographic ranges of species is crucial for\nassisting conservation efforts. Traditionally, range maps were manually created\nby experts. However, species distribution models (SDMs) and, more recently,\ndeep learning-based variants offer a potential automated alternative. Deep\nlearning-based SDMs generate a continuous probability representing the\npredicted presence of a species at a given location, which must be binarized by\nsetting per-species thresholds to obtain binary range maps. However, selecting\nappropriate per-species thresholds to binarize these predictions is non-trivial\nas different species can require distinct thresholds. In this work, we evaluate\ndifferent approaches for automatically identifying the best thresholds for\nbinarizing range maps using presence-only data. This includes approaches that\nrequire the generation of additional pseudo-absence data, along with ones that\nonly require presence data. We also propose an extension of an existing\npresence-only technique that is more robust to outliers. We perform a detailed\nevaluation of different thresholding techniques on the tasks of binary range\nestimation and large-scale fine-grained visual classification, and we\ndemonstrate improved performance over existing pseudo-absence free approaches\nusing our method.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.15956","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting the geographic ranges of species is crucial for
assisting conservation efforts. Traditionally, range maps were manually created
by experts. However, species distribution models (SDMs) and, more recently,
deep learning-based variants offer a potential automated alternative. Deep
learning-based SDMs generate a continuous probability representing the
predicted presence of a species at a given location, which must be binarized by
setting per-species thresholds to obtain binary range maps. However, selecting
appropriate per-species thresholds to binarize these predictions is non-trivial
as different species can require distinct thresholds. In this work, we evaluate
different approaches for automatically identifying the best thresholds for
binarizing range maps using presence-only data. This includes approaches that
require the generation of additional pseudo-absence data, along with ones that
only require presence data. We also propose an extension of an existing
presence-only technique that is more robust to outliers. We perform a detailed
evaluation of different thresholding techniques on the tasks of binary range
estimation and large-scale fine-grained visual classification, and we
demonstrate improved performance over existing pseudo-absence free approaches
using our method.