基于ML和DL的印度口语分类综述

Riya Shah, Barkha M. Joshi, J. Shah, Milin M Patel, A. Rana, Ronak Roy
{"title":"基于ML和DL的印度口语分类综述","authors":"Riya Shah, Barkha M. Joshi, J. Shah, Milin M Patel, A. Rana, Ronak Roy","doi":"10.1109/ICECA55336.2022.10009380","DOIUrl":null,"url":null,"abstract":"Unlike in some other parts of the world, speech recognition technology is legal in the West. It's not to the same degree that this happens in East Asian countries. It's possible that linguistic barriers are a major cause of this chasm. In addition, countries with many languages, such as India, must be taken into account if voice-based language identification is ever going to be practical. The challenge is in finding a technique to clearly and effectively identify the features that may differentiate across languages. The model processes audio data, creating spectrogram images from them before extracting features. Then, the Deep Learning (DL) is employed to streamline the output identification process by emphasizing the most crucial characteristics and attributes. Realizing that a person's vocal signal may be understood or observed was a major inspiration for the concept. This research work employ spectrograms (for visual data) and deep learning techniques to categorize Indic languages inside the IIITH Indic voice database. Finally, a model-based comparative analysis has been conducted by analyzing the accuracy, precision, recall, and f1-score to show that the proposed approach is more robust than existing models.","PeriodicalId":356949,"journal":{"name":"2022 6th International Conference on Electronics, Communication and Aerospace Technology","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Summary of Spoken Indian Languages Classification Using ML and DL\",\"authors\":\"Riya Shah, Barkha M. Joshi, J. Shah, Milin M Patel, A. Rana, Ronak Roy\",\"doi\":\"10.1109/ICECA55336.2022.10009380\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Unlike in some other parts of the world, speech recognition technology is legal in the West. It's not to the same degree that this happens in East Asian countries. It's possible that linguistic barriers are a major cause of this chasm. In addition, countries with many languages, such as India, must be taken into account if voice-based language identification is ever going to be practical. The challenge is in finding a technique to clearly and effectively identify the features that may differentiate across languages. The model processes audio data, creating spectrogram images from them before extracting features. Then, the Deep Learning (DL) is employed to streamline the output identification process by emphasizing the most crucial characteristics and attributes. Realizing that a person's vocal signal may be understood or observed was a major inspiration for the concept. This research work employ spectrograms (for visual data) and deep learning techniques to categorize Indic languages inside the IIITH Indic voice database. Finally, a model-based comparative analysis has been conducted by analyzing the accuracy, precision, recall, and f1-score to show that the proposed approach is more robust than existing models.\",\"PeriodicalId\":356949,\"journal\":{\"name\":\"2022 6th International Conference on Electronics, Communication and Aerospace Technology\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 6th International Conference on Electronics, Communication and Aerospace Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECA55336.2022.10009380\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Electronics, Communication and Aerospace Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECA55336.2022.10009380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

与世界上其他一些地区不同,语音识别技术在西方是合法的。这种情况在东亚国家发生的程度不同。语言障碍可能是造成这种鸿沟的主要原因。此外,如果基于语音的语言识别变得实用,必须考虑到拥有多种语言的国家,如印度。挑战在于找到一种技术来清晰有效地识别可能区分不同语言的特征。该模型处理音频数据,在提取特征之前从中创建频谱图图像。然后,通过强调最关键的特征和属性,采用深度学习(DL)来简化输出识别过程。意识到一个人的声音信号可以被理解或观察是这个概念的主要灵感。本研究工作采用频谱图(用于视觉数据)和深度学习技术对IIITH印度语音数据库中的印度语言进行分类。最后,通过对准确率、精密度、召回率和f1-score的分析,进行了基于模型的对比分析,表明本文方法比现有模型具有更强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Summary of Spoken Indian Languages Classification Using ML and DL
Unlike in some other parts of the world, speech recognition technology is legal in the West. It's not to the same degree that this happens in East Asian countries. It's possible that linguistic barriers are a major cause of this chasm. In addition, countries with many languages, such as India, must be taken into account if voice-based language identification is ever going to be practical. The challenge is in finding a technique to clearly and effectively identify the features that may differentiate across languages. The model processes audio data, creating spectrogram images from them before extracting features. Then, the Deep Learning (DL) is employed to streamline the output identification process by emphasizing the most crucial characteristics and attributes. Realizing that a person's vocal signal may be understood or observed was a major inspiration for the concept. This research work employ spectrograms (for visual data) and deep learning techniques to categorize Indic languages inside the IIITH Indic voice database. Finally, a model-based comparative analysis has been conducted by analyzing the accuracy, precision, recall, and f1-score to show that the proposed approach is more robust than existing models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Multi-Objective Artificial Flora Algorithm Based Optimal Handover Scheme for LTE-Advanced Networks Named Entity Recognition using CRF with Active Learning Algorithm in English Texts FPGA Implementation of Lattice-Wave Half-Order Digital Integrator using Radix-$2^{r}$ Digit Recoding Green Cloud Computing- Next Step Towards Eco-friendly Work Stations Diabetes Prediction using Support Vector Machine, Naive Bayes and Random Forest Machine Learning Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1