基于深度学习的语音识别模型及其在语音质量评价系统中的应用

Proceedings of the 2019 International Conference on Data Mining and Machine Learning Pub Date : 2019-04-28 DOI:10.1145/3335656.3335657

Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue

{"title":"基于深度学习的语音识别模型及其在语音质量评价系统中的应用","authors":"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue","doi":"10.1145/3335656.3335657","DOIUrl":null,"url":null,"abstract":"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.","PeriodicalId":396772,"journal":{"name":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System\",\"authors\":\"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue\",\"doi\":\"10.1145/3335656.3335657\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.\",\"PeriodicalId\":396772,\"journal\":{\"name\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3335656.3335657\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335656.3335657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

为了提高语音识别与语音质量评价系统的性能，基于深度学习的计算机辅助外语语音评价与学习系统成为当前人工智能技术的研究热点。结合当前先进的语音信息技术理论，在前人研究成果的基础上，提出将稀疏自编码器深度学习神经网络应用于语音识别，利用基于MFCC的稀疏自编码器特征进行深入研究，模仿听觉神经的稀疏触点深度特征提取信号，有利于提高HMM模型用于语音识别的准确率;满足当前计算机辅助英语教学的需要。仿真结果表明，深度学习神经网络的识别率明显优于传统语音识别算法，实现了更精确的人机交互，提高了外语语音质量评价的可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System

In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2019 International Conference on Data Mining and Machine Learning

自引率

0.00%

发文量