{"title":"基于MFCC的基于k-NN和LDA的口吃语音重复和延长识别","authors":"L. Chee, Ooi Chia Ai, M. Hariharan, S. Yaacob","doi":"10.1109/SCORED.2009.5443210","DOIUrl":null,"url":null,"abstract":"Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are a high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. The well known Mel Frequency Cepstral Coefficient (MFCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in a stuttered speech. In this work, two classifiers such as Linear Discriminant Analysis based classifier (LDA) and ¿-nearest neighbors (Λ-1Ν1Ν) are employed and ¿-fold cross-validation was applied to measure classifiers performances. The result of this work shows that the MFCC and classifiers (LDA and Λ-1Ν1Ν) can be used for recognition of repetitions and prolongations in stuttered speech with the average accuracy of 90%.","PeriodicalId":443287,"journal":{"name":"2009 IEEE Student Conference on Research and Development (SCOReD)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"67","resultStr":"{\"title\":\"MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA\",\"authors\":\"L. Chee, Ooi Chia Ai, M. Hariharan, S. Yaacob\",\"doi\":\"10.1109/SCORED.2009.5443210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are a high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. The well known Mel Frequency Cepstral Coefficient (MFCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in a stuttered speech. In this work, two classifiers such as Linear Discriminant Analysis based classifier (LDA) and ¿-nearest neighbors (Λ-1Ν1Ν) are employed and ¿-fold cross-validation was applied to measure classifiers performances. The result of this work shows that the MFCC and classifiers (LDA and Λ-1Ν1Ν) can be used for recognition of repetitions and prolongations in stuttered speech with the average accuracy of 90%.\",\"PeriodicalId\":443287,\"journal\":{\"name\":\"2009 IEEE Student Conference on Research and Development (SCOReD)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"67\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 IEEE Student Conference on Research and Development (SCOReD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCORED.2009.5443210\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCORED.2009.5443210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA
Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are a high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. The well known Mel Frequency Cepstral Coefficient (MFCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in a stuttered speech. In this work, two classifiers such as Linear Discriminant Analysis based classifier (LDA) and ¿-nearest neighbors (Λ-1Ν1Ν) are employed and ¿-fold cross-validation was applied to measure classifiers performances. The result of this work shows that the MFCC and classifiers (LDA and Λ-1Ν1Ν) can be used for recognition of repetitions and prolongations in stuttered speech with the average accuracy of 90%.