{"title":"感知线性预测(PLP)分析-再合成技术","authors":"H. Hermansky, L. Cox","doi":"10.21437/Eurospeech.1991-88","DOIUrl":null,"url":null,"abstract":"A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":"{\"title\":\"Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique\",\"authors\":\"H. Hermansky, L. Cox\",\"doi\":\"10.21437/Eurospeech.1991-88\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.\",\"PeriodicalId\":146017,\"journal\":{\"name\":\"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1991-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"37\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21437/Eurospeech.1991-88\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/Eurospeech.1991-88","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique
A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.