{"title":"Spoken language processing: people versus machines","authors":"William S-Y. Wang","doi":"10.1109/CHINSL.2004.1409567","DOIUrl":null,"url":null,"abstract":"Summary form only given. A fundamental challenge we must meet for computers to eventually process spoken language as effectively as humans to capture the immensely rich fund of information we have in our heads that is not in the speech signal. This information is what gives us the ability to supply acoustic cues when these are degraded or missing, or to zero in on one speaker amid a chorus of other voices. While the powerful statistical methods currently used in speech recognition and synthesis have brought some success and useful applications, future progress will depend crucially on a deeper knowledge and greater use of this information. Some of this information is applicable to all languages, and some of it is specific to individual language types. In this discussion, special attention is given to the processing of spoken Chinese.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2004 International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2004.1409567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Summary form only given. A fundamental challenge we must meet for computers to eventually process spoken language as effectively as humans to capture the immensely rich fund of information we have in our heads that is not in the speech signal. This information is what gives us the ability to supply acoustic cues when these are degraded or missing, or to zero in on one speaker amid a chorus of other voices. While the powerful statistical methods currently used in speech recognition and synthesis have brought some success and useful applications, future progress will depend crucially on a deeper knowledge and greater use of this information. Some of this information is applicable to all languages, and some of it is specific to individual language types. In this discussion, special attention is given to the processing of spoken Chinese.