Title: How prosodic cues could lead to information center in speech - An alternative to ASR
Authors: Chao-yu Su, Chiu-yu Tseng
Venue: 2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)
Publication date: 2017-11-01
DOI: https://doi.org/10.1109/ICSDA.2017.8384443
Citations: 1
Abstract
The ASR literature has reported that prosody helps retrieve important textual information at the word level. We therefore believe that prosodic information in the speech signal could be used to facilitate speech processing more directly. The prosodic word, a perceptually identifiable unit that is usually slightly larger than the lexical word, is a possible alternative for locating important information in speech. We compare acoustic analyses across two sets of labels: perceived prosodic highlights within prosodic words and semantic foci in words. The results demonstrate that prosodic highlights occur before targeted key information and function as advance prompts that outline upcoming semantic foci ahead of time. The semantic saliency of targeted words is thus enhanced beforehand, and correct anticipation can be facilitated prior to detailed lexical processing. A further approach that automatically identifies key content from prosodic features also shows that important information can be retrieved through prosodic words. We believe the results demonstrate that not all information in speech is equally important, that locating the information center is the key to speech communication, and that the contribution of prosody is critical.