A new method of fundamental frequency extraction in frequency domain
Hong Zhang, Taiyi Huang, Junshou Song
ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344), 1998-10-12
DOI: 10.1109/ICOSP.1998.770305 (https://doi.org/10.1109/ICOSP.1998.770305)
This paper proposes a novel fundamental frequency (pitch) extraction method in the frequency domain. The algorithm proceeds in two steps: a binary inhibition network extracts the harmonic peaks of the short-time spectrum of the speech signal, and the pitch is then determined from the average interval between the harmonics. The method is evaluated on the COSDIC speech database. For comparison, corresponding results for cepstrum-based and autocorrelation-based pitch extraction are also presented. The results show that the new method outperforms both the cepstrum-based and the autocorrelation-based methods, and that it is more robust to white noise than either.
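The second step, taking the pitch as the average interval between harmonic peaks, can be illustrated with a minimal sketch. The abstract does not describe the binary inhibition network, so the sketch substitutes a plain thresholded local-maximum peak picker for it; this is a stand-in, not the authors' peak extractor. The function name, the parameters `max_freq`, `rel_threshold`, and `n_fft`, and their default values are all assumptions made for illustration.

```python
import numpy as np


def estimate_pitch_from_harmonic_spacing(frame, sample_rate, max_freq=2000.0,
                                         rel_threshold=0.1, n_fft=8192):
    """Estimate pitch (Hz) as the mean spacing between spectral peaks.

    Hypothetical helper: peak picking here is a simple thresholded
    local-maximum search, NOT the binary inhibition network of the paper.
    Returns None when fewer than two peaks are found.
    """
    # Short-time magnitude spectrum of one Hann-windowed, zero-padded frame.
    windowed = frame * np.hanning(len(frame))
    spectrum = np.abs(np.fft.rfft(windowed, n=n_fft))
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)

    # Restrict the search to the band where speech harmonics are expected.
    band = freqs <= max_freq
    spectrum, freqs = spectrum[band], freqs[band]

    # A bin is a peak if it exceeds its left neighbour, is not below its
    # right neighbour, and is above a fraction of the strongest bin.
    interior = spectrum[1:-1]
    is_peak = (
        (interior > spectrum[:-2])
        & (interior >= spectrum[2:])
        & (interior > rel_threshold * spectrum.max())
    )
    peak_freqs = freqs[1:-1][is_peak]

    if len(peak_freqs) < 2:
        return None  # not enough harmonics to measure an interval

    # Pitch estimate: average interval between adjacent harmonic peaks.
    return float(np.mean(np.diff(peak_freqs)))


if __name__ == "__main__":
    # Synthetic voiced frame: five harmonics of a 200 Hz fundamental.
    sr = 16000
    t = np.arange(1024) / sr
    amplitudes = [1.0, 0.8, 0.6, 0.5, 0.4]
    frame = sum(a * np.sin(2 * np.pi * 200.0 * k * t)
                for k, a in enumerate(amplitudes, start=1))
    print(estimate_pitch_from_harmonic_spacing(frame, sr))
```

On the synthetic frame the estimate should land close to 200 Hz. Because the mean of adjacent intervals equals the distance between the first and last detected peaks divided by the number of intervals, a missed or spurious peak shifts the estimate directly, which is presumably why the quality of the peak extractor matters to the method.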