基于亲和力传播的预聚类特征选择在低资源语言语音分类中的应用

Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das
{"title":"基于亲和力传播的预聚类特征选择在低资源语言语音分类中的应用","authors":"Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das","doi":"10.1109/IICAIET51634.2021.9573696","DOIUrl":null,"url":null,"abstract":"Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.","PeriodicalId":234229,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages\",\"authors\":\"Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das\",\"doi\":\"10.1109/IICAIET51634.2021.9573696\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.\",\"PeriodicalId\":234229,\"journal\":{\"name\":\"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET51634.2021.9573696\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET51634.2021.9573696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

语音分析是一个活跃的研究领域,人们研究了不同的特征提取技术来解决各种问题。这样的研究通过理解必要的线索来选择特征,有助于提高解决方案的时间复杂度。通过去除不相关信息来选择基本特征是特征工程中的一个重要步骤。感知线性预测(PLP)建模的重点是通过关注听者端感知到的特征来理解语音信号。它们已成功地应用于许多语音处理应用中。在语音识别任务中,有效分类语音单元的PLP系数顺序的选择是至关重要的。传统的语音处理系统需要大量的训练才能开发出自动语音识别系统。这样的系统对于拥有足够资源(即数据)的语言是有效的。但是,资源匮乏的语言,尤其是亚洲语言,还没有开发出能够为这些任务提供足够数据的语言。在这方面,鼓励采用其他方法和技术,以较少的数据量加强或优化开发过程。为了在资源较少的情况下提高分类率,提出了一种预聚类技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages
Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Text Analytics on Twitter Text-based Public Sentiment for Covid-19 Vaccine: A Machine Learning Approach Eye-Tank: Monitoring and Predicting Water and pH Level in Smart Farming Particle Swarm Optimization for Tuning Power System Stabilizer towards Transient Stability Improvement in Power System Network Multi-Scale Texture Analysis For Finger Vein Anti-Spoofing Utilization of Response Surface Methodology and Regression Model in Optimizing Bioretention Performance
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1