The generation of waveform fluctuation for the enhancement of the quality of sustained vowels
N. Aoki, T. Ifukube
DOI: 10.1109/ICICS.1997.652130 (https://doi.org/10.1109/ICICS.1997.652130)
Published: 1997-09-09, pp. 998-1002 vol.2
Citations: 1
Abstract
Evidence indicates that subtle, pitch-synchronous waveform fluctuations in sustained vowels serve as an acoustic cue for naturalness when the vowels contain no fundamental frequency fluctuations. From the viewpoint of speech synthesis, more natural-sounding sustained vowels should therefore be obtainable by incorporating appropriately modeled waveform fluctuations. In this study, the waveform fluctuations of inverse-filtered normal sustained vowels were analyzed with the aim of high-quality speech synthesis. The analyses suggested that the waveform fluctuations could be modeled by the same rule for all speech samples, which were obtained from ten male subjects. In addition, psychoacoustic experiments confirmed the validity of a method that generates completely artificial waveform fluctuations and thereby remarkably enhances the quality of sustained vowels.
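To illustrate what "waveform fluctuation without fundamental frequency fluctuation" can mean in practice, the following is a minimal sketch and not the authors' method. It repeats a fixed-length pulse, so the pitch period is constant, and adds a small, independent perturbation to the waveform in every period. The pulse shape, the white Gaussian perturbation, and the names `base_period`, `synthesize_source`, and `fluct_std` are all assumptions made for illustration; the abstract does not state the modeling rule that the study actually derived.

```python
import math
import random


def base_period(period_len):
    """Return one period of a simple glottal-like pulse.

    The raised-cosine shape and the 60% open phase are hypothetical
    choices, not taken from the paper.
    """
    open_len = period_len * 6 // 10  # open phase: first 60% of the period
    pulse = []
    for n in range(period_len):
        if n < open_len:
            pulse.append(0.5 * (1.0 - math.cos(2.0 * math.pi * n / open_len)))
        else:
            pulse.append(0.0)  # closed phase
    return pulse


def synthesize_source(n_periods, period_len, fluct_std, seed=0):
    """Concatenate identical-length periods, each with its own perturbation.

    The period length never changes, so there is no fundamental frequency
    fluctuation. The white Gaussian perturbation is a placeholder for
    whatever fluctuation model is actually appropriate.
    """
    rng = random.Random(seed)
    template = base_period(period_len)
    signal = []
    for _ in range(n_periods):
        for sample in template:
            signal.append(sample + rng.gauss(0.0, fluct_std))
    return signal


# Example: 100 periods of 80 samples each (100 Hz at an assumed 8 kHz rate).
source = synthesize_source(n_periods=100, period_len=80, fluct_std=0.01)
```

With `fluct_std=0.0` every period is an exact copy of the template, which corresponds to the perfectly periodic case. A source like this would still need to be passed through a vocal tract filter to produce a vowel; that step is outside this sketch.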