Syed Abdul Rahman Al-Haddad, S. Samad, Aini Hussain, K. A. Ishak, Hamid Mirvaziri
{"title":"Decision Fusion for Isolated Malay Digit Recognition Using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM)","authors":"Syed Abdul Rahman Al-Haddad, S. Samad, Aini Hussain, K. A. Ishak, Hamid Mirvaziri","doi":"10.1109/SCORED.2007.4451425","DOIUrl":null,"url":null,"abstract":"This paper is focused on Malay speech recognition with the intention to introduce a decision fusion technique for isolated Malay digit recognition using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM). This study proposes an algorithm for decision fusion of the recognition models. The endpoint detection, framing, normalization, Mel Frequency Cepstral Coefficient (MFCC) and vector quantization techniques are used to process speech samples to accomplish the recognition. Decision fusion technique is then used to combine the results of DTW and HMM. The algorithm is tested on speech samples that is a part of a Malay corpus.","PeriodicalId":443652,"journal":{"name":"2007 5th Student Conference on Research and Development","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 5th Student Conference on Research and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCORED.2007.4451425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
This paper is focused on Malay speech recognition with the intention to introduce a decision fusion technique for isolated Malay digit recognition using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM). This study proposes an algorithm for decision fusion of the recognition models. The endpoint detection, framing, normalization, Mel Frequency Cepstral Coefficient (MFCC) and vector quantization techniques are used to process speech samples to accomplish the recognition. Decision fusion technique is then used to combine the results of DTW and HMM. The algorithm is tested on speech samples that is a part of a Malay corpus.