{"title":"Speech recognition based on k-means clustering and neural network ensembles","authors":"Xin-guang Li, Min-feng Yao, Wen-Tao Huang","doi":"10.1109/ICNC.2011.6022159","DOIUrl":null,"url":null,"abstract":"Aiming at the disadvantages of the single BP neural network in speech recognition, a method of speech recognition based on k-means clustering and neural network ensembles is presented in this paper. At first, a number of individual neural networks are trained, and then the k-means clustering algorithm is used to select a part of the trained individuals' weights and thresholds for improving diversity. After that, the individuals of the nearest clustering center are selected to make up the membership's initial weights and thresholds of the ensemble learning. The method not only overcomes the shortcomings that single BP neural network model is easy to local convergence and is lack of stability, but also solves the problems that the traditional adaboost method in training time is too long and the diversity of individual network is not obvious. The final experiment results prove the effectiveness of this method when applied to speakers of independent speech recognition.","PeriodicalId":299503,"journal":{"name":"2011 Seventh International Conference on Natural Computation","volume":"113 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Seventh International Conference on Natural Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNC.2011.6022159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Aiming at the disadvantages of the single BP neural network in speech recognition, a method of speech recognition based on k-means clustering and neural network ensembles is presented in this paper. At first, a number of individual neural networks are trained, and then the k-means clustering algorithm is used to select a part of the trained individuals' weights and thresholds for improving diversity. After that, the individuals of the nearest clustering center are selected to make up the membership's initial weights and thresholds of the ensemble learning. The method not only overcomes the shortcomings that single BP neural network model is easy to local convergence and is lack of stability, but also solves the problems that the traditional adaboost method in training time is too long and the diversity of individual network is not obvious. The final experiment results prove the effectiveness of this method when applied to speakers of independent speech recognition.