Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-61
D. O'Shaughnessy
Auditory processing of speech is an important stage in the closed-loop human speech communication system. A computational auditory model for temporal processing of speech is described, with details of its numerical solution and of the method used to extract temporal information. The model is used to process fluent speech utterances and is applied to phonetic classification using both clean and noisy speech materials. The need to integrate auditory speech processing and phonetic modeling components in machine speech recognizer design is discussed within a proposed computational framework for speech recognition, motivated by the closed-loop speech chain model of integrated human speech production and perception.
{"title":"Computational Models for Auditory Speech Processing","authors":"D. O'Shaughnessy","doi":"10.1201/9781482276237-61","DOIUrl":"https://doi.org/10.1201/9781482276237-61","url":null,"abstract":"Auditory processing of speech is an important stage in the closed-loop human speech communication system. A computational auditory model for temporal processing of speech is described with details of numerical solution and of the temporal information extraction method given. The model is used to process fluent speech utterances and is applied to phonetic classification using both clean and noisy speech materials. The need for integrating auditory speech processing and phonetic modeling components in machine speech recognizer design is discussed within a proposed computational framework of speech recognition motivated by the closed-loop speech chain model for integrated human speech production and perception behaviors.","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74121150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-25
Li Deng, D. O'Shaughnessy
{"title":"Optimization Methods and Estimation Theory","authors":"Li Deng, D. O'Shaughnessy","doi":"10.1201/9781482276237-25","DOIUrl":"https://doi.org/10.1201/9781482276237-25","url":null,"abstract":"","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"198 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87733732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-34
Anil K. Jain, Robert P. W. Duin, Jianchang Mao
The primary goal of pattern recognition is supervised or unsupervised classification. Among the various frameworks in which pattern recognition has been traditionally formulated, the statistical ap...
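The primary goal of pattern recognition is supervised or unsupervised classification. Among the various traditional frameworks for pattern recognition, the statistical approach is the most important.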
{"title":"Statistical Pattern Recognition","authors":"K. JainAnil, P. W. DuinRobert, MaoJianchang","doi":"10.1201/9781482276237-34","DOIUrl":"https://doi.org/10.1201/9781482276237-34","url":null,"abstract":"The primary goal of pattern recognition is supervised or unsupervised classification. Among the various frameworks in which pattern recognition has been traditionally formulated, the statistical ap...","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88243768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-12
Li Deng, D. O'Shaughnessy
{"title":"Analysis of Discrete-Time Speech Signals","authors":"Li Deng, D. O'Shaughnessy","doi":"10.1201/9781482276237-12","DOIUrl":"https://doi.org/10.1201/9781482276237-12","url":null,"abstract":"","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"3 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91422509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-57
{"title":"Q(Ole) = E E foogp(zgi.) + Epog p(xz fin ;:_ifin) +","authors":"","doi":"10.1201/9781482276237-57","DOIUrl":"https://doi.org/10.1201/9781482276237-57","url":null,"abstract":"","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76797518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-43
Liandizhai Deng, Douglas O'Shaughnessy
{"title":"n p ou z [+ con","authors":"Liandizhai Deng, Douglas O'Shaughnessy","doi":"10.1201/9781482276237-43","DOIUrl":"https://doi.org/10.1201/9781482276237-43","url":null,"abstract":"","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"24 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83944620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-10-03 DOI: 10.1201/9781482276237-17
Li Deng, D. O'Shaughnessy
{"title":"2 Conditioning, Total Probability Theorem, and Bayes' Rule","authors":"Li Deng, D. O'Shaughnessy","doi":"10.1201/9781482276237-17","DOIUrl":"https://doi.org/10.1201/9781482276237-17","url":null,"abstract":"","PeriodicalId":90534,"journal":{"name":"International Conference on Auditory-Visual Speech Processing","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83955131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}