Hongwei Wu, Yibiao Yu, Heming Zhao, Xueqin Chen, Chunjuan Wang
{"title":"Pitch estimation using mean shift algorithm on multitaper spectrum of noisy speech","authors":"Hongwei Wu, Yibiao Yu, Heming Zhao, Xueqin Chen, Chunjuan Wang","doi":"10.1109/ICIST.2014.6920512","DOIUrl":null,"url":null,"abstract":"In this paper, we apply the mean shift algorithm to estimate pitches of noisy speech from its multitaper spectrum. The noisy speech is first transformed into the multitaper spectrum, which can reduce the stationary noise. The pitch is extracted from the multitaper spectrum using the mean shift algorithm. After all estimates are collected, dynamic programming is used to obtain a smoothed pitch contour. We compare the performance of our method with two well-known algorithms on the Keele pitch database and demonstrate that it performs much well even at SNR as low as -15dB.","PeriodicalId":306383,"journal":{"name":"2014 4th IEEE International Conference on Information Science and Technology","volume":"11 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th IEEE International Conference on Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST.2014.6920512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we apply the mean shift algorithm to estimate pitches of noisy speech from its multitaper spectrum. The noisy speech is first transformed into the multitaper spectrum, which can reduce the stationary noise. The pitch is extracted from the multitaper spectrum using the mean shift algorithm. After all estimates are collected, dynamic programming is used to obtain a smoothed pitch contour. We compare the performance of our method with two well-known algorithms on the Keele pitch database and demonstrate that it performs much well even at SNR as low as -15dB.