Fatima Barkani, Mohamed Hamidi, Ouissam Zealouk, H. Satori
{"title":"Speech Recognition Algorithms based Cough Recognition System","authors":"Fatima Barkani, Mohamed Hamidi, Ouissam Zealouk, H. Satori","doi":"10.3991/ijoe.v19i12.40471","DOIUrl":null,"url":null,"abstract":"This paper introduces an innovative technique for creating a cough detection system that relies on speech recognition algorithms. The strategy utilizes the Kaldi platform, which is open source and incorporates a hybrid system of Gaussian Mixture Model-based Hidden Markov Models (GMM-HMM) through a straightforward monophone training model. Additionally, the study examines the effectiveness of two different feature extraction approaches, Mel Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP). The proposed system can function as a collection tool for gathering natural and spontaneous cough data from conversations or continuous speech. The paper also compares the Kaldi and CMU Sphinx4 toolkits, concluding that Kaldi’s use of GMM-HMM outperforms CMU Sphinx4.","PeriodicalId":36900,"journal":{"name":"International Journal of Online and Biomedical Engineering","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Online and Biomedical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3991/ijoe.v19i12.40471","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces an innovative technique for creating a cough detection system that relies on speech recognition algorithms. The strategy utilizes the Kaldi platform, which is open source and incorporates a hybrid system of Gaussian Mixture Model-based Hidden Markov Models (GMM-HMM) through a straightforward monophone training model. Additionally, the study examines the effectiveness of two different feature extraction approaches, Mel Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP). The proposed system can function as a collection tool for gathering natural and spontaneous cough data from conversations or continuous speech. The paper also compares the Kaldi and CMU Sphinx4 toolkits, concluding that Kaldi’s use of GMM-HMM outperforms CMU Sphinx4.