Santiago Omar Caballero Morales, Yara Pérez Maldonado, F. Trujillo-Romero
Improvement on Automatic Speech Recognition Using Micro-genetic Algorithm
2012 11th Mexican International Conference on Artificial Intelligence, published 2012-10-27. DOI: 10.1109/MICAI.2012.14 (https://doi.org/10.1109/MICAI.2012.14)
Citations: 1
Abstract
In this paper we extend previous work on applying Genetic Algorithms (GAs) to optimize the transition structure of phoneme Hidden Markov Models (HMMs) for Automatic Speech Recognition (ASR). We focus on the development of a micro-GA in which, in contrast to other GA approaches, each individual in the initial population consists of an element of the transition matrix of an HMM. Each individual's fitness is measured at the phoneme recognition level, which speeds up execution of the algorithm. Performance was evaluated on test speech data from the Wall Street Journal (WSJ) database. At the word recognition level, the optimized HMMs yielded statistically significant improvements over a standard speaker adaptation technique.
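
To make the general idea of a micro-GA over HMM transition probabilities concrete, the following is a minimal sketch and not the authors' implementation. It assumes a classic micro-GA layout (a very small population, elitism, crossover without mutation, and restarts around the best individual), and for simplicity it treats a whole row-normalized transition matrix as one individual, whereas the abstract describes individuals built from single matrix elements. The names `micro_ga`, `toy_fitness`, `BASE` and `TARGET` are hypothetical, and the toy fitness merely stands in for the phoneme recognition accuracy that a real recognizer would supply.

```python
import random


def normalize_rows(matrix):
    """Rescale each row to sum to 1 so it stays a valid probability row."""
    out = []
    for row in matrix:
        total = sum(row)
        out.append([v / total for v in row] if total > 0 else list(row))
    return out


def micro_ga(base, fitness, pop_size=5, generations=30, restarts=4, seed=0):
    """Search for a transition matrix with higher fitness than `base`.

    Assumption: `fitness` maps a matrix to a score (higher is better).
    """
    rng = random.Random(seed)

    def perturb(matrix, scale):
        # Only non-zero transitions change, so the HMM topology is preserved.
        new = [[v * (1 + rng.uniform(-scale, scale)) if v > 0 else 0.0
                for v in row] for row in matrix]
        return normalize_rows(new)

    def crossover(a, b):
        # Row-wise uniform crossover keeps every row normalized.
        return [list(ra) if rng.random() < 0.5 else list(rb)
                for ra, rb in zip(a, b)]

    best = normalize_rows(base)
    best_fit = fitness(best)
    for _ in range(restarts):
        # Restart: keep the elite, refill the rest with fresh variants.
        population = [best] + [perturb(best, 0.5) for _ in range(pop_size - 1)]
        for _ in range(generations):
            ranked = sorted(population, key=fitness, reverse=True)
            children = [ranked[0]]  # elitism
            while len(children) < pop_size:
                p1, p2 = rng.sample(ranked[:3], 2)
                children.append(crossover(p1, p2))
            population = children
        top = max(population, key=fitness)
        if fitness(top) > best_fit:
            best, best_fit = top, fitness(top)
    return best, best_fit


# Hypothetical 3-state left-to-right phoneme HMM and a made-up target.
BASE = [[0.6, 0.4, 0.0], [0.0, 0.6, 0.4], [0.0, 0.0, 1.0]]
TARGET = [[0.8, 0.2, 0.0], [0.0, 0.5, 0.5], [0.0, 0.0, 1.0]]


def toy_fitness(matrix):
    """Stand-in for phoneme recognition accuracy (assumption)."""
    return -sum((a - b) ** 2
                for row, ref in zip(matrix, TARGET)
                for a, b in zip(row, ref))


best_matrix, best_score = micro_ga(BASE, toy_fitness)
```

In a real setting the fitness call would run phoneme recognition with the candidate matrix, so the small population is what keeps the number of expensive evaluations manageable.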