基于深度学习的语音识别模型及其在语音质量评价系统中的应用

Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue
{"title":"基于深度学习的语音识别模型及其在语音质量评价系统中的应用","authors":"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue","doi":"10.1145/3335656.3335657","DOIUrl":null,"url":null,"abstract":"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.","PeriodicalId":396772,"journal":{"name":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System\",\"authors\":\"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue\",\"doi\":\"10.1145/3335656.3335657\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.\",\"PeriodicalId\":396772,\"journal\":{\"name\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3335656.3335657\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335656.3335657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

为了提高语音识别与语音质量评价系统的性能,基于深度学习的计算机辅助外语语音评价与学习系统成为当前人工智能技术的研究热点。结合当前先进的语音信息技术理论,在前人研究成果的基础上,提出将稀疏自编码器深度学习神经网络应用于语音识别,利用基于MFCC的稀疏自编码器特征进行深入研究,模仿听觉神经的稀疏触点深度特征提取信号,有利于提高HMM模型用于语音识别的准确率;满足当前计算机辅助英语教学的需要。仿真结果表明,深度学习神经网络的识别率明显优于传统语音识别算法,实现了更精确的人机交互,提高了外语语音质量评价的可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System
In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Code Plagiarism Detection Model Based on Random Forest and Gradient Boosting Decision Tree Research on Vehicle Identification Method Based on Computer Vision Simulation for Agglomeration Effect of Internet Crowdfunding Model An improved FCM clustering algorithm based on cosine similarity Research on offline behavior similarity of consumers based on Spatio-temporal data set mining
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1