Segmentation and Analysis of Bird Trill Vocalizations

Hagai Barmatz, Dana Klein, Y. Vortman, Sivan Toledo, Y. Lavner
{"title":"Segmentation and Analysis of Bird Trill Vocalizations","authors":"Hagai Barmatz, Dana Klein, Y. Vortman, Sivan Toledo, Y. Lavner","doi":"10.1109/ICSEE.2018.8646070","DOIUrl":null,"url":null,"abstract":"Animal communication and specifically acoustic communication is in the focus of ecological and biological research. With the advancement of monitoring technology, a vast amount of acoustic recordings of birds is continuously accumulated. As manual segmentation and annotation of this data is impractical, development of efficient algorithms for accurate detection, classification and segmentation of birdsong is therefore a prerequisite for further analysis. In this study we present an algorithm for automatic segmentation and parameters estimation of one type of bird vocalization, namely, the trill song. The algorithm is based on computing the short-time variance of the fundamental frequency derivative of bird acoustic signal for initial detection of syllables. The boundaries of each syllable are consequently obtained using a Gaussian smoothed short-time energy function and an adaptive threshold based on the energy envelope. The performance of the algorithm was evaluated using a comparison to human expert segmentation, as well as to ground-truth values of synthetic trills produced by the Harmonic + Noise model. A correct detection rate of more than 95% was yielded for SNR levels of -5 dB or higher for signals with additive colored noise, and for signals with additive white Gaussian noise more than 92% was obtained for SNR>-5dB. In addition, a high correlation between the automatic segmentation and that of a human expert was exemplified.","PeriodicalId":254455,"journal":{"name":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEE.2018.8646070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Animal communication and specifically acoustic communication is in the focus of ecological and biological research. With the advancement of monitoring technology, a vast amount of acoustic recordings of birds is continuously accumulated. As manual segmentation and annotation of this data is impractical, development of efficient algorithms for accurate detection, classification and segmentation of birdsong is therefore a prerequisite for further analysis. In this study we present an algorithm for automatic segmentation and parameters estimation of one type of bird vocalization, namely, the trill song. The algorithm is based on computing the short-time variance of the fundamental frequency derivative of bird acoustic signal for initial detection of syllables. The boundaries of each syllable are consequently obtained using a Gaussian smoothed short-time energy function and an adaptive threshold based on the energy envelope. The performance of the algorithm was evaluated using a comparison to human expert segmentation, as well as to ground-truth values of synthetic trills produced by the Harmonic + Noise model. A correct detection rate of more than 95% was yielded for SNR levels of -5 dB or higher for signals with additive colored noise, and for signals with additive white Gaussian noise more than 92% was obtained for SNR>-5dB. In addition, a high correlation between the automatic segmentation and that of a human expert was exemplified.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
鸟类颤音发声的分割与分析
动物交流,特别是声音交流是生态学和生物学研究的重点。随着监测技术的进步,大量鸟类的声音记录不断积累。由于对这些数据进行人工分割和标注是不切实际的,因此开发高效的算法来准确地检测、分类和分割鸟鸣是进一步分析的先决条件。在这项研究中,我们提出了一种算法来自动分割和参数估计一种鸟类发声,即颤音歌。该算法基于计算鸟声信号基频导数的短时方差进行音节的初始检测。然后利用高斯平滑的短时能量函数和基于能量包络的自适应阈值获得每个音节的边界。通过与人类专家分割的比较,以及谐波+噪声模型产生的合成颤音的真值,评估了该算法的性能。对于加性彩色噪声信号信噪比为-5dB及以上的信号,正确率达到95%以上;对于加性高斯白噪声信号,信噪比>-5dB的信号,正确率达到92%以上。此外,还举例说明了自动分割与人类专家分割之间的高度相关性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Robust Motion Compensation for Forensic Analysis of Egocentric Video using Joint Stabilization and Tracking DC low current Hall effect measurements Examining Change Detection Methods For Hyperspectral Data Effect of Reverberation in Speech-based Emotion Recognition Traveling-Wave Ring Oscillator – Simulations and Prototype Measurements for a New Architecture for a Transmission Line Based Oscillator
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1