基于亲和力传播的预聚类特征选择在低资源语言语音分类中的应用

2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET) Pub Date : 2021-09-13 DOI:10.1109/IICAIET51634.2021.9573696

Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das

{"title":"基于亲和力传播的预聚类特征选择在低资源语言语音分类中的应用","authors":"Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das","doi":"10.1109/IICAIET51634.2021.9573696","DOIUrl":null,"url":null,"abstract":"Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.","PeriodicalId":234229,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages\",\"authors\":\"Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das\",\"doi\":\"10.1109/IICAIET51634.2021.9573696\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.\",\"PeriodicalId\":234229,\"journal\":{\"name\":\"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET51634.2021.9573696\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET51634.2021.9573696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

语音分析是一个活跃的研究领域，人们研究了不同的特征提取技术来解决各种问题。这样的研究通过理解必要的线索来选择特征，有助于提高解决方案的时间复杂度。通过去除不相关信息来选择基本特征是特征工程中的一个重要步骤。感知线性预测(PLP)建模的重点是通过关注听者端感知到的特征来理解语音信号。它们已成功地应用于许多语音处理应用中。在语音识别任务中，有效分类语音单元的PLP系数顺序的选择是至关重要的。传统的语音处理系统需要大量的训练才能开发出自动语音识别系统。这样的系统对于拥有足够资源(即数据)的语言是有效的。但是，资源匮乏的语言，尤其是亚洲语言，还没有开发出能够为这些任务提供足够数据的语言。在这方面，鼓励采用其他方法和技术，以较少的数据量加强或优化开发过程。为了在资源较少的情况下提高分类率，提出了一种预聚类技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages

Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)

自引率

0.00%

发文量