Software performance for the automated identification of bird vocalisations: the case of two closely related species

J. Marchal, François Fabianek, Y. Aubry
{"title":"Software performance for the automated identification of bird vocalisations: the case of two closely related species","authors":"J. Marchal, François Fabianek, Y. Aubry","doi":"10.1080/09524622.2021.1945952","DOIUrl":null,"url":null,"abstract":"ABSTRACT Autonomous recording units now facilitate the large collection of audio recordings. However, the analysis of large amounts of acoustic data remains a challenge. The time required for manually searching for bird vocalisations may be equivalent or greater to the duration of audio recordings. This major constraint can be significantly reduced through the use of software developed for automated identification of bird vocalisations in audio recordings. We have compared the performance of four software (CallSeeker, Kaleidoscope Pro, Raven Pro, and Song Scope) and a Convolutional Neural Network (CNN) using audio recordings containing calls of Bicknell’s Thrush and Gray-Cheeked Thrush, as well as the vocalisations of other bird species whose acoustic characteristics overlap with those of our target species. We evaluated all the software on the basis of two main criteria, their ability to detect calls and their ability to classify them correctly by species. Software performance ranged from 30 to 90% in terms of call detection (recall) and from 27 to 99% in terms of correct call classification (precision). CNNs offer a promising solution to the long-standing problem of detecting animal vocalisations in noisy soundscapes, while eliminating the tedious manual step of configuring the algorithms to maximise software performance.","PeriodicalId":55385,"journal":{"name":"Bioacoustics-The International Journal of Animal Sound and Its Recording","volume":"31 1","pages":"397 - 413"},"PeriodicalIF":2.1000,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/09524622.2021.1945952","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioacoustics-The International Journal of Animal Sound and Its Recording","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1080/09524622.2021.1945952","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ZOOLOGY","Score":null,"Total":0}
引用次数: 11

Abstract

ABSTRACT Autonomous recording units now facilitate the large collection of audio recordings. However, the analysis of large amounts of acoustic data remains a challenge. The time required for manually searching for bird vocalisations may be equivalent or greater to the duration of audio recordings. This major constraint can be significantly reduced through the use of software developed for automated identification of bird vocalisations in audio recordings. We have compared the performance of four software (CallSeeker, Kaleidoscope Pro, Raven Pro, and Song Scope) and a Convolutional Neural Network (CNN) using audio recordings containing calls of Bicknell’s Thrush and Gray-Cheeked Thrush, as well as the vocalisations of other bird species whose acoustic characteristics overlap with those of our target species. We evaluated all the software on the basis of two main criteria, their ability to detect calls and their ability to classify them correctly by species. Software performance ranged from 30 to 90% in terms of call detection (recall) and from 27 to 99% in terms of correct call classification (precision). CNNs offer a promising solution to the long-standing problem of detecting animal vocalisations in noisy soundscapes, while eliminating the tedious manual step of configuring the algorithms to maximise software performance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自动识别鸟类鸣叫的软件性能:两个密切相关物种的案例
摘要:现在,自主录音单元可以方便地收集大量录音。然而,对大量声学数据的分析仍然是一个挑战。手动搜索鸟类发声所需的时间可能等于或大于音频记录的持续时间。通过使用为录音中鸟类发声的自动识别而开发的软件,可以显著减少这一主要限制。我们比较了四种软件(CallSeeker、Kaleidoscope Pro、Raven Pro和Song Scope)和卷积神经网络(CNN)的性能,使用的录音包括比克内尔画眉和灰颊画眉的叫声,以及其他鸟类的发声,这些鸟类的声学特征与我们的目标物种的声学特征重叠。我们根据两个主要标准对所有软件进行了评估,即它们检测叫声的能力和按物种正确分类的能力。软件性能在呼叫检测(召回)方面为30%至90%,在正确的呼叫分类(精度)方面为27%至99%。细胞神经网络为在嘈杂声景中检测动物发声这一长期存在的问题提供了一个很有前途的解决方案,同时消除了配置算法以最大限度地提高软件性能的繁琐手动步骤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.50
自引率
0.00%
发文量
25
审稿时长
>12 weeks
期刊介绍: Bioacoustics primarily publishes high-quality original research papers and reviews on sound communication in birds, mammals, amphibians, reptiles, fish, insects and other invertebrates, including the following topics : -Communication and related behaviour- Sound production- Hearing- Ontogeny and learning- Bioacoustics in taxonomy and systematics- Impacts of noise- Bioacoustics in environmental monitoring- Identification techniques and applications- Recording and analysis- Equipment and techniques- Ultrasound and infrasound- Underwater sound- Bioacoustical sound structures, patterns, variation and repertoires
期刊最新文献
Bioacoustic fundamental frequency estimation: a cross-species dataset and deep learning baseline. Unsupervised discrimination of male Tawny owls ( Strix aluco ) individual calls using robust measurements of the acoustic signal The sound of hope: searching for critically endangered species using acoustic template matching Parselmouth for bioacoustics: automated acoustic analysis in Python Sound production in sea turtle nests and hatchlings ( Eretmochelys imbricata and Caretta caretta ) in Northeast Brazil
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1