Adapted Weighted Linear Prediction with Attenuated Main Excitation for formant frequency estimation in high-pitched singing
Authors: Eduardo Barrientos, Edson Cataldo
Journal: Speech Communication, vol. 156, Article 103006 (impact factor 2.4; JCR Q2, Acoustics)
Published: 2023-11-09
DOI: 10.1016/j.specom.2023.103006
URL: https://www.sciencedirect.com/science/article/pii/S0167639323001401
This paper aims to show how to improve the accuracy of formant frequency estimation in the singing voice of a lyric soprano. Conventional methods of formant frequency estimation may not accurately capture the formant frequencies of the singing voice, particularly in the highest pitch range of a lyric soprano, where the lowest formants are biased by the pitch harmonics. To address this issue, the study proposes adapting the Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME) method for formant frequency estimation. Specific methods for glottal closure instant estimation were required because glottal closure patterns differ between speech and singing. The study evaluates the accuracy of the proposed method by comparing its performance with that of the LPC method across different pitch series arranged in an ascending musical scale. The results indicated that the adapted WLP-AME method consistently outperformed the LPC method in estimating formant frequencies of vowels sung by a lyric soprano. In addition, by estimating the formant frequencies of a synthetic /i/ vowel sung by a soprano singer at the musical note E5, the study showed that the adapted WLP-AME method provided formant frequency values closer to the correct values than those estimated by the LPC method. In general, these results suggest parameter values of the AME function that optimize its performance, which can have applications in fields such as singing and medicine.
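The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simplified rectangular AME weight (no ramps), a covariance-style weighted least-squares predictor, known glottal closure instants, and a synthetic two-formant signal whose frequencies and bandwidths are illustrative values. The names `ame_weights`, `wlp`, `lpc`, `formants` and the parameters `dq`, `pq`, `d` are hypothetical choices made for this example.

```python
import numpy as np

def ame_weights(n_samples, gcis, period, dq=0.1, pq=0.05, d=0.01):
    """Simplified AME weights (assumption): 1 everywhere, d near each GCI.

    The attenuated window is dq*period samples long and starts
    pq*period samples before each glottal closure instant.
    """
    w = np.ones(n_samples)
    length = max(1, int(round(dq * period)))
    lead = int(round(pq * period))
    for g in gcis:
        start = max(0, g - lead)
        w[start:min(n_samples, start + length)] = d
    return w

def wlp(x, weights, order):
    """Weighted least squares: minimise sum_n w[n]*(x[n] - sum_k c[k]*x[n-1-k])^2."""
    rows = np.array([x[n - order:n][::-1] for n in range(order, len(x))])
    sw = np.sqrt(weights[order:])
    c, *_ = np.linalg.lstsq(rows * sw[:, None], x[order:] * sw, rcond=None)
    return c

def lpc(x, order):
    """Conventional (unweighted) linear prediction as the special case w = 1."""
    return wlp(x, np.ones(len(x)), order)

def formants(c, fs):
    """Formant candidates in Hz from the angles of the predictor's complex poles."""
    poles = np.roots(np.concatenate(([1.0], -c)))
    poles = poles[poles.imag > 1e-9]
    return np.sort(np.angle(poles) * fs / (2 * np.pi))

# Synthetic test signal: impulse train near E5 driving a two-formant all-pole filter.
fs, f0, order = 16000, 659.26, 4
period = int(round(fs / f0))                       # 24 samples, i.e. about 667 Hz
true_formants = [(300.0, 80.0), (2300.0, 120.0)]   # (frequency, bandwidth) in Hz, illustrative
a = np.array([1.0])
for f, bw in true_formants:
    r = np.exp(-np.pi * bw / fs)
    a = np.convolve(a, [1.0, -2 * r * np.cos(2 * np.pi * f / fs), r * r])
c_true = -a[1:]                                    # predictor coefficients of the true filter

n_total = 100 * period
e = np.zeros(n_total)
e[::period] = 1.0                                  # one excitation impulse per pitch period
x = np.zeros(n_total)
for n in range(n_total):
    past = x[max(0, n - order):n][::-1]
    x[n] = e[n] + np.dot(c_true[:len(past)], past)

# Analyse a 20-period frame taken after the start-up transient has died out.
start = 80 * period
frame = x[start:start + 20 * period]
gcis = np.arange(0, len(frame), period)            # GCIs are known here by construction
weights = ame_weights(len(frame), gcis, period)

c_lpc = lpc(frame, order)
c_wlp = wlp(frame, weights, order)
print("true formants (Hz):", formants(c_true, fs))
print("LPC estimate  (Hz):", formants(c_lpc, fs))
print("WLP estimate  (Hz):", formants(c_wlp, fs))
```

In this sketch the first formant lies below the fundamental, so only the samples at the excitation impulses violate the all-pole model. Down-weighting those samples is what pulls the weighted predictor back toward the true filter. On real singing the glottal closure instants would have to be estimated, which is the step the abstract says needed singing-specific methods.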
About the journal:
Speech Communication is an interdisciplinary journal whose primary objective is to fulfil the need for the rapid dissemination and thorough discussion of basic and applied research results.
The journal's primary objectives are:
• to present a forum for the advancement of human and human-machine speech communication science;
• to stimulate cross-fertilization between different fields of this domain;
• to contribute towards the rapid and wide diffusion of scientifically sound contributions in this domain.