{"title":"Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique","authors":"H. Hermansky, L. Cox","doi":"10.21437/Eurospeech.1991-88","DOIUrl":null,"url":null,"abstract":"A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/Eurospeech.1991-88","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.