Hala S. Abuelmakarem, S. Fawzi, A. Quriba, Ahmed Elbialy, A. Kandil
{"title":"使用自动语音识别技术的计算机辅助治疗语言发育迟缓儿童","authors":"Hala S. Abuelmakarem, S. Fawzi, A. Quriba, Ahmed Elbialy, A. Kandil","doi":"10.4015/s1016237222500235","DOIUrl":null,"url":null,"abstract":"Objectives: This study aims to develop a computer-aided therapy (CAT) application to help children who suffer from delayed language development (DLD) improve their language, especially during the COVID-19 pandemic. Methods: The implemented system teaches the children their body parts using the Egyptian dialect. Two datasets were collected from healthy children (2800 words) and unhealthy children (236 words) who have DLD at the clinic. The model is implemented using a speaker-independent isolated word recognizer based on a discrete-observation hidden Markov model (DHMM) classifier. After the speech signal preprocessing step, K-means algorithm generated a codebook to cluster the speech segments. This task was completed under the MATLAB environment. The graphical user interface was implemented successfully under the C# umbrella to complete the CAT application task. The system was tested on healthy and DLD children. Also, in a small clinical trial, five children who have DLD tested the program in an actual trial to monitor their pronunciation progress during therapeutic sessions. Results: The max recognition rate was 95.25% for the healthy children dataset, while 93.82% for the DLD dataset. Conclusion: DHMM was implemented successfully using nine and five states based on different codebook sizes (160, 200). The implemented system achieved a high recognition rate using both datasets. The children enjoyed using the application because it was interactive. Children who have DLD can use speech recognition applications.","PeriodicalId":8862,"journal":{"name":"Biomedical Engineering: Applications, Basis and Communications","volume":"397 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"COMPUTER-AIDED THERAPY USING AUTOMATIC SPEECH RECOGNITION TECHNIQUE FOR DELAYED LANGUAGE DEVELOPMENT CHILDREN\",\"authors\":\"Hala S. Abuelmakarem, S. Fawzi, A. Quriba, Ahmed Elbialy, A. Kandil\",\"doi\":\"10.4015/s1016237222500235\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives: This study aims to develop a computer-aided therapy (CAT) application to help children who suffer from delayed language development (DLD) improve their language, especially during the COVID-19 pandemic. Methods: The implemented system teaches the children their body parts using the Egyptian dialect. Two datasets were collected from healthy children (2800 words) and unhealthy children (236 words) who have DLD at the clinic. The model is implemented using a speaker-independent isolated word recognizer based on a discrete-observation hidden Markov model (DHMM) classifier. After the speech signal preprocessing step, K-means algorithm generated a codebook to cluster the speech segments. This task was completed under the MATLAB environment. The graphical user interface was implemented successfully under the C# umbrella to complete the CAT application task. The system was tested on healthy and DLD children. Also, in a small clinical trial, five children who have DLD tested the program in an actual trial to monitor their pronunciation progress during therapeutic sessions. Results: The max recognition rate was 95.25% for the healthy children dataset, while 93.82% for the DLD dataset. Conclusion: DHMM was implemented successfully using nine and five states based on different codebook sizes (160, 200). The implemented system achieved a high recognition rate using both datasets. The children enjoyed using the application because it was interactive. Children who have DLD can use speech recognition applications.\",\"PeriodicalId\":8862,\"journal\":{\"name\":\"Biomedical Engineering: Applications, Basis and Communications\",\"volume\":\"397 1\",\"pages\":\"\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2022-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Engineering: Applications, Basis and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4015/s1016237222500235\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Engineering: Applications, Basis and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4015/s1016237222500235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
COMPUTER-AIDED THERAPY USING AUTOMATIC SPEECH RECOGNITION TECHNIQUE FOR DELAYED LANGUAGE DEVELOPMENT CHILDREN
Objectives: This study aims to develop a computer-aided therapy (CAT) application to help children who suffer from delayed language development (DLD) improve their language, especially during the COVID-19 pandemic. Methods: The implemented system teaches the children their body parts using the Egyptian dialect. Two datasets were collected from healthy children (2800 words) and unhealthy children (236 words) who have DLD at the clinic. The model is implemented using a speaker-independent isolated word recognizer based on a discrete-observation hidden Markov model (DHMM) classifier. After the speech signal preprocessing step, K-means algorithm generated a codebook to cluster the speech segments. This task was completed under the MATLAB environment. The graphical user interface was implemented successfully under the C# umbrella to complete the CAT application task. The system was tested on healthy and DLD children. Also, in a small clinical trial, five children who have DLD tested the program in an actual trial to monitor their pronunciation progress during therapeutic sessions. Results: The max recognition rate was 95.25% for the healthy children dataset, while 93.82% for the DLD dataset. Conclusion: DHMM was implemented successfully using nine and five states based on different codebook sizes (160, 200). The implemented system achieved a high recognition rate using both datasets. The children enjoyed using the application because it was interactive. Children who have DLD can use speech recognition applications.
期刊介绍:
Biomedical Engineering: Applications, Basis and Communications is an international, interdisciplinary journal aiming at publishing up-to-date contributions on original clinical and basic research in the biomedical engineering. Research of biomedical engineering has grown tremendously in the past few decades. Meanwhile, several outstanding journals in the field have emerged, with different emphases and objectives. We hope this journal will serve as a new forum for both scientists and clinicians to share their ideas and the results of their studies.
Biomedical Engineering: Applications, Basis and Communications explores all facets of biomedical engineering, with emphasis on both the clinical and scientific aspects of the study. It covers the fields of bioelectronics, biomaterials, biomechanics, bioinformatics, nano-biological sciences and clinical engineering. The journal fulfils this aim by publishing regular research / clinical articles, short communications, technical notes and review papers. Papers from both basic research and clinical investigations will be considered.