{"title":"Fuzzy k-nearest neighbors applied to phoneme recognition","authors":"I. Fredj, K. Ouni","doi":"10.1109/SETIT.2016.7939907","DOIUrl":null,"url":null,"abstract":"In this work, the Fuzzy kNN (FkNN), an alternative of the standard kNN algorithm, is used for Timit phoneme recognition. Phoneme is the smallest unit that composes speech. For this reason, if phoneme recognition is performed, it can achieve a significant word and text recognition. Thus, the main idea consists on assigning phoneme membership to the data phonemes by measuring the distance to its kNN. FkNN compute the fuzzy distances between the data phonemes that define the cluster fuzziness. Mel Frequency Cepstral Coefficients (MFCC) associated with their first and second derivatives and energy coefficient were extracted from the speech signals as an input of the recognition system. A comparison of a crisp and fuzzy kNN was performed. Experiments show that FkNN algorithm not only can lead to significant recognition rates, but also may supersede in some ways Hidden Markov Model (HMM) the reference of speech recognition.","PeriodicalId":426951,"journal":{"name":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SETIT.2016.7939907","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this work, the Fuzzy kNN (FkNN), an alternative of the standard kNN algorithm, is used for Timit phoneme recognition. Phoneme is the smallest unit that composes speech. For this reason, if phoneme recognition is performed, it can achieve a significant word and text recognition. Thus, the main idea consists on assigning phoneme membership to the data phonemes by measuring the distance to its kNN. FkNN compute the fuzzy distances between the data phonemes that define the cluster fuzziness. Mel Frequency Cepstral Coefficients (MFCC) associated with their first and second derivatives and energy coefficient were extracted from the speech signals as an input of the recognition system. A comparison of a crisp and fuzzy kNN was performed. Experiments show that FkNN algorithm not only can lead to significant recognition rates, but also may supersede in some ways Hidden Markov Model (HMM) the reference of speech recognition.