P. Cerna, Charisma S. Ututalum, R. S. Evangelista, Aldaruhz T. Darkis, Masnona Sabdani Asiri, Jehana A. Muallam-Darkis
{"title":"An IOT-based Language Recognition System for Indigenous Languages using Integrated CNN and RNN","authors":"P. Cerna, Charisma S. Ututalum, R. S. Evangelista, Aldaruhz T. Darkis, Masnona Sabdani Asiri, Jehana A. Muallam-Darkis","doi":"10.1109/ICSMDI57622.2023.00086","DOIUrl":null,"url":null,"abstract":"Automatic Speech Recognition (ASR) aims to establish communication between humans and computers in a more natural way. The main aim of this study is to build hardware-based automatic speech recognition for Indigenous People (IP)'s ancestral dialects, in particular for Manobo, Mandaya, and B'laan using Raspberry Pi. Jasper is an open source toolkit used for creating voice-activated, always-on applications. The researcher recording audio data from research participants, the study's participants will be located in Davao Occidental and Sarangani for B'laan, Agusan Del Sur for Manobo, and Davao Oriental for Mandaya. A functional microphone and raspberry pi boards serve as the experiment's hardware where audi o input is being fine-tuned from a raspberry pi-powered device that records audio in waveform format, which includes Mandaya, Manobo, and Malita words and phrases. The Tensorflow STFT technique will be used to analyze, generate, transform, and characterize audio signals. JiWER plugins for Similarity measures will also be used The WER output is 98.53%, an acceptable percentage for the number of datasets used","PeriodicalId":373017,"journal":{"name":"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Smart Data Intelligence (ICSMDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSMDI57622.2023.00086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic Speech Recognition (ASR) aims to establish communication between humans and computers in a more natural way. The main aim of this study is to build hardware-based automatic speech recognition for Indigenous People (IP)'s ancestral dialects, in particular for Manobo, Mandaya, and B'laan using Raspberry Pi. Jasper is an open source toolkit used for creating voice-activated, always-on applications. The researcher recording audio data from research participants, the study's participants will be located in Davao Occidental and Sarangani for B'laan, Agusan Del Sur for Manobo, and Davao Oriental for Mandaya. A functional microphone and raspberry pi boards serve as the experiment's hardware where audi o input is being fine-tuned from a raspberry pi-powered device that records audio in waveform format, which includes Mandaya, Manobo, and Malita words and phrases. The Tensorflow STFT technique will be used to analyze, generate, transform, and characterize audio signals. JiWER plugins for Similarity measures will also be used The WER output is 98.53%, an acceptable percentage for the number of datasets used