基于Sincnet-CNN模型的原始语音孤立词识别研究

2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS) Pub Date : 2022-07-22 DOI:10.1109/ISPDS56360.2022.9874177

Gao Hu, Qingwei Zeng, Chao Long, Dianyou Geng

{"title":"基于Sincnet-CNN模型的原始语音孤立词识别研究","authors":"Gao Hu, Qingwei Zeng, Chao Long, Dianyou Geng","doi":"10.1109/ISPDS56360.2022.9874177","DOIUrl":null,"url":null,"abstract":"In order to effectively speed up the model training time, reduce the model training parameters and improve the accuracy of raw speech isolated word recognition. An interpretable convolutional filter structure (sincnet) combined with convolutional neural network (CNN) is proposed for the task of raw speech isolated word recognition. On the premise of ensuring the speech recognition rate, the model structure becomes lightweight and the computational complexity is reduced. The experimental results show that compared with the traditional neural network model, the proposed model can effectively improve the performance of raw speech isolated word recognition.","PeriodicalId":280244,"journal":{"name":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on raw speech isolated word recognition based on Sincnet-CNN model\",\"authors\":\"Gao Hu, Qingwei Zeng, Chao Long, Dianyou Geng\",\"doi\":\"10.1109/ISPDS56360.2022.9874177\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to effectively speed up the model training time, reduce the model training parameters and improve the accuracy of raw speech isolated word recognition. An interpretable convolutional filter structure (sincnet) combined with convolutional neural network (CNN) is proposed for the task of raw speech isolated word recognition. On the premise of ensuring the speech recognition rate, the model structure becomes lightweight and the computational complexity is reduced. The experimental results show that compared with the traditional neural network model, the proposed model can effectively improve the performance of raw speech isolated word recognition.\",\"PeriodicalId\":280244,\"journal\":{\"name\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPDS56360.2022.9874177\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDS56360.2022.9874177","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了有效加快模型训练时间，减少模型训练参数，提高原始语音孤立词识别的准确率。针对原始语音孤立词识别问题，提出了一种结合卷积神经网络的可解释卷积滤波结构(sincnet)。在保证语音识别率的前提下，模型结构轻量化，降低了计算复杂度。实验结果表明，与传统的神经网络模型相比，该模型能有效提高原始语音孤立词识别的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Research on raw speech isolated word recognition based on Sincnet-CNN model

In order to effectively speed up the model training time, reduce the model training parameters and improve the accuracy of raw speech isolated word recognition. An interpretable convolutional filter structure (sincnet) combined with convolutional neural network (CNN) is proposed for the task of raw speech isolated word recognition. On the premise of ensuring the speech recognition rate, the model structure becomes lightweight and the computational complexity is reduced. The experimental results show that compared with the traditional neural network model, the proposed model can effectively improve the performance of raw speech isolated word recognition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)

自引率

0.00%

发文量