Automated detection of Bornean white-bearded gibbon (Hylobates albibarbis) vocalizations using an open-source framework for deep learning.

IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Journal of the Acoustical Society of America Pub Date : 2024-09-01 DOI:10.1121/10.0028268
A F Owens, Kimberley J Hockings, Muhammed Ali Imron, Shyam Madhusudhana, Mariaty, Tatang Mitra Setia, Manmohan Sharma, Siti Maimunah, F J F Van Veen, Wendy M Erb
{"title":"Automated detection of Bornean white-bearded gibbon (Hylobates albibarbis) vocalizations using an open-source framework for deep learning.","authors":"A F Owens, Kimberley J Hockings, Muhammed Ali Imron, Shyam Madhusudhana, Mariaty, Tatang Mitra Setia, Manmohan Sharma, Siti Maimunah, F J F Van Veen, Wendy M Erb","doi":"10.1121/10.0028268","DOIUrl":null,"url":null,"abstract":"<p><p>Passive acoustic monitoring is a promising tool for monitoring at-risk populations of vocal species, yet, extracting relevant information from large acoustic datasets can be time-consuming, creating a bottleneck at the point of analysis. To address this, an open-source framework for deep learning in bioacoustics to automatically detect Bornean white-bearded gibbon (Hylobates albibarbis) \"great call\" vocalizations in a long-term acoustic dataset from a rainforest location in Borneo is adapted. The steps involved in developing this solution are described, including collecting audio recordings, developing training and testing datasets, training neural network models, and evaluating model performance. The best model performed at a satisfactory level (F score = 0.87), identifying 98% of the highest-quality calls from 90 h of manually annotated audio recordings and greatly reduced analysis times when compared to a human observer. No significant difference was found in the temporal distribution of great call detections between the manual annotations and the model's output. Future work should seek to apply this model to long-term acoustic datasets to understand spatiotemporal variations in H. albibarbis' calling activity. Overall, a roadmap is presented for applying deep learning to identify the vocalizations of species of interest, which can be adapted for monitoring other endangered vocalizing species.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0028268","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Passive acoustic monitoring is a promising tool for monitoring at-risk populations of vocal species, yet, extracting relevant information from large acoustic datasets can be time-consuming, creating a bottleneck at the point of analysis. To address this, an open-source framework for deep learning in bioacoustics to automatically detect Bornean white-bearded gibbon (Hylobates albibarbis) "great call" vocalizations in a long-term acoustic dataset from a rainforest location in Borneo is adapted. The steps involved in developing this solution are described, including collecting audio recordings, developing training and testing datasets, training neural network models, and evaluating model performance. The best model performed at a satisfactory level (F score = 0.87), identifying 98% of the highest-quality calls from 90 h of manually annotated audio recordings and greatly reduced analysis times when compared to a human observer. No significant difference was found in the temporal distribution of great call detections between the manual annotations and the model's output. Future work should seek to apply this model to long-term acoustic datasets to understand spatiotemporal variations in H. albibarbis' calling activity. Overall, a roadmap is presented for applying deep learning to identify the vocalizations of species of interest, which can be adapted for monitoring other endangered vocalizing species.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用深度学习开源框架自动检测婆罗洲白须长臂猿(Hylobates albibarbis)的发声。
被动声学监测是监测高危发声物种种群的一种前景广阔的工具,然而,从大型声学数据集中提取相关信息可能非常耗时,从而在分析时遇到瓶颈。为了解决这个问题,我们采用了生物声学深度学习的开源框架,以自动检测婆罗洲热带雨林长期声学数据集中的婆罗洲白须长臂猿(Hylobates albibarbis)"大叫 "发声。文中介绍了开发该解决方案的步骤,包括收集录音、开发训练和测试数据集、训练神经网络模型以及评估模型性能。最佳模型的表现令人满意(F 分数 = 0.87),能从 90 小时人工标注的音频记录中识别出 98% 的最高质量的叫声,与人类观察者相比大大缩短了分析时间。在人工标注和模型输出之间,未发现巨大呼叫检测的时间分布有明显差异。未来的工作应寻求将该模型应用于长期的声学数据集,以了解白喉姬蛙叫声活动的时空变化。总之,本文提出了应用深度学习识别相关物种发声的路线图,该路线图可用于监测其他濒危发声物种。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.60
自引率
16.70%
发文量
1433
审稿时长
4.7 months
期刊介绍: Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.
期刊最新文献
Influence of variable sound-absorbing devices on room acoustical parameters of reverberation and intelligibility in medium-to-large multipurpose halls. Integer multi-wavelength gradient phase metagrating for perfect refraction: Phase choice freedom in supercella). Measurement of ocean currents by seafloor distributed optical-fiber acoustic sensing. Neville Fletcher's vibrant valve voyage. Office soundscape assessment: A model of acoustic environment perception in open-plan officesa).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1