Using Voice Technologies to Support Disabled People
H. E. Semary, Khamis A. AL-KARAWI, Mahmoud M. Abdelwahab
Journal of Disability Research, published 2024-01-05
DOI: https://doi.org/10.57197/jdr-2023-0063
Abstract
Speech and speaker recognition systems have advanced considerably in recent years, driven by rapid growth in data processing capability. Speech recognition offers a simple and efficient means of interaction, particularly for people with disabilities. This article presents an automatic speech recognition (ASR) system designed to be adapted easily across platforms and intended as an addition to educational platforms that improves accessibility for people with visual disabilities. The model is described in enough detail for other researchers to reproduce it. Its architecture comprises four stages: data acquisition, preprocessing, feature extraction, and pattern recognition. The Experiments and Results section details the system's behavior. The recognition accuracy achieved is promising, although it may not match that of some commercial systems. The proposed model is nevertheless a cost-effective solution with low computational requirements: it integrates with a range of platforms, is straightforward for developers to modify, and can be tailored to the needs of individual users. New words can be added to its database from a single recording.
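
As an illustration only, the four stages named in the abstract (data acquisition, preprocessing, feature extraction, pattern recognition) and the single-recording enrollment of new words could be sketched as below. The abstract does not say which features or classifier the authors use, so every choice here is an assumption: the 8 kHz sample rate, pre-emphasis, log-spectral frames, dynamic time warping against one stored template per word, and all names (WordRecognizer, synthetic_recording, and so on) are hypothetical. Synthetic tones stand in for microphone input.

```python
import numpy as np

SAMPLE_RATE = 8000  # Hz; assumed for this sketch, not taken from the paper


def synthetic_recording(freq_hz, seconds=0.25, amplitude=1.0):
    """Stage 1 stand-in (data acquisition): a pure tone instead of a microphone."""
    t = np.arange(int(seconds * SAMPLE_RATE)) / SAMPLE_RATE
    return amplitude * np.sin(2 * np.pi * freq_hz * t)


def preprocess(signal):
    """Stage 2: remove DC offset, apply pre-emphasis, normalise peak amplitude."""
    x = np.asarray(signal, dtype=float)
    x = x - x.mean()
    x = np.append(x[0], x[1:] - 0.97 * x[:-1])  # first-order pre-emphasis
    peak = np.max(np.abs(x))
    return x / peak if peak > 0 else x


def extract_features(signal, frame_len=256, hop=128):
    """Stage 3: one log-magnitude spectrum per Hamming-windowed frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))


def dtw_distance(a, b):
    """Dynamic time warping cost between two sequences of feature vectors."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m] / (n + m)  # length-normalised


class WordRecognizer:
    """Stage 4 (pattern recognition): nearest stored template under DTW."""

    def __init__(self):
        self.templates = {}  # word -> feature sequence from a single recording

    def enroll(self, word, recording):
        """Add a word to the database from one recording."""
        self.templates[word] = extract_features(preprocess(recording))

    def recognize(self, recording):
        if not self.templates:
            raise ValueError("no words enrolled")
        feats = extract_features(preprocess(recording))
        return min(
            self.templates, key=lambda w: dtw_distance(feats, self.templates[w])
        )


recognizer = WordRecognizer()
recognizer.enroll("yes", synthetic_recording(440))
recognizer.enroll("no", synthetic_recording(1200))
# A quieter, longer utterance of the first "word" is matched to its template.
print(recognizer.recognize(synthetic_recording(440, seconds=0.3, amplitude=0.5)))
```

Template matching is chosen for the sketch because it makes single-recording enrollment trivial: adding a word is one call to enroll, with no retraining. A real system would replace the synthetic tones with recorded audio and would likely use more robust features.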