Pitch tracking of bird vocalizations and an automated process using YIN-bird

Colm O'Reilly, N. Harte
Journal: Cogent Biology
Publication date: 2017-01-01
Publication type: Journal Article
DOI: 10.1080/23312025.2017.1322025 (https://doi.org/10.1080/23312025.2017.1322025)
Citations: 0

Abstract

Pitch tracking of bird vocalizations and an automated process using YIN-bird
Pitch or fundamental frequency is an important feature of bird song, from which scientists can learn much about a population. To use pitch as a feature, researchers need confidence in their pitch extraction system. Pitch detection algorithms (PDAs) proven to work on human speech may not be suitable for all types of bird vocalizations. This paper discusses pitch estimation performance on a variety of common bird vocalizations. The presence of multiple partials or tones simultaneously, extended frequency sweeps through multiple octaves, and rapid pitch modulations are just some of the difficulties encountered when estimating the pitch of bird song. Carefully tuned parameters improve pitch tracking with YIN, but optimal parameters can change quickly even within one song. YIN is a PDA which estimates pitch of human speech very well. This paper presents YIN-bird, a modified version of YIN which exploits spectrogram properties to automatically set a minimum fundamental frequency parameter for YIN. Gross pitch errors on whistles and trills were reduced by up to 4% on a ground truth data-set of synthetic bird song with known pitch. This data-set was evaluated by expert listeners and described as “sounding like original & can hardly tell it is synthetic”. A qualitative analysis showing YIN-bird not to be suitable for more complex bird vocalizations, such as nasals, is also presented.
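The role of the minimum fundamental frequency parameter can be illustrated with a minimal, pure-Python sketch of the basic YIN steps (difference function, cumulative mean normalized difference, absolute threshold, parabolic interpolation). This is not the authors' implementation: the function names, the default threshold of 0.1, and the 20% tolerance used for gross pitch error are assumptions made for illustration, and the spectrogram-based selection of the minimum frequency that YIN-bird performs is not reproduced here, so `fmin` is supplied by hand.

```python
import math

def yin_pitch(frame, sample_rate, fmin, fmax, threshold=0.1):
    """Estimate the pitch (Hz) of one frame with a basic YIN; None if no pitch is found."""
    tau_min = max(1, int(sample_rate / fmax))   # shortest candidate period (samples)
    tau_max = int(sample_rate / fmin)           # longest candidate period, set by fmin
    window = len(frame) - tau_max               # samples compared at every lag
    if window <= 0:
        raise ValueError("frame too short for the requested fmin")

    # Step 1: squared-difference function d(tau).
    diff = [0.0] * (tau_max + 1)
    for tau in range(1, tau_max + 1):
        diff[tau] = sum((frame[j] - frame[j + tau]) ** 2 for j in range(window))

    # Step 2: cumulative mean normalized difference d'(tau).
    cmnd = [1.0] * (tau_max + 1)
    running = 0.0
    for tau in range(1, tau_max + 1):
        running += diff[tau]
        cmnd[tau] = diff[tau] * tau / running if running > 0 else 1.0

    # Step 3: take the first dip below the threshold and follow it to its local minimum.
    tau = tau_min
    while tau <= tau_max:
        if cmnd[tau] < threshold:
            while tau + 1 <= tau_max and cmnd[tau + 1] < cmnd[tau]:
                tau += 1
            break
        tau += 1
    else:
        return None  # no lag was periodic enough

    # Step 4: parabolic interpolation around the chosen lag for sub-sample precision.
    if tau - 1 >= 1 and tau + 1 <= tau_max:
        a, b, c = cmnd[tau - 1], cmnd[tau], cmnd[tau + 1]
        denom = a - 2 * b + c
        if denom != 0:
            tau = tau + 0.5 * (a - c) / denom
    return sample_rate / tau


def gross_pitch_error(estimates, truth, tolerance=0.2):
    """Fraction of voiced frames whose estimate is missing or off by more than
    `tolerance` (relative). The 20% default is an assumed, commonly used value."""
    voiced = [(e, t) for e, t in zip(estimates, truth) if t > 0]
    if not voiced:
        return 0.0
    errors = sum(1 for e, t in voiced if e is None or abs(e - t) > tolerance * t)
    return errors / len(voiced)
```

In this sketch `fmin` fixes the longest lag that is searched (`tau_max`), and with it the number of samples left for comparison in a fixed-length frame. That is one reason a single hand-tuned value may not suit a song whose pitch sweeps across several octaves. A per-frame call might look like `yin_pitch(frame, 44100, fmin=1000, fmax=8000)`, with the estimates for a whole recording then compared against known pitch values using `gross_pitch_error`.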
Source journal: Cogent Biology (Multidisciplinary Sciences)