Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue
{"title":"基于深度学习的语音识别模型及其在语音质量评价系统中的应用","authors":"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue","doi":"10.1145/3335656.3335657","DOIUrl":null,"url":null,"abstract":"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.","PeriodicalId":396772,"journal":{"name":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System\",\"authors\":\"Teng Haikun, W. Shiying, Liu Xinsheng, Xiaodong Yue\",\"doi\":\"10.1145/3335656.3335657\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.\",\"PeriodicalId\":396772,\"journal\":{\"name\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 International Conference on Data Mining and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3335656.3335657\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335656.3335657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System
In order to improve the performance of the speech recognition and pronunciation quality evaluation system, the deep learning-based computer aided foreign language pronunciation evaluation and learning system has become a research focus of current artificial intelligence technology. Combining with the current advanced voice information technology theory, based on the previous research results, proposed to sparse since the encoder deep learning neural network is applied to speech recognition, using sparse automatic encoder based on MFCC features in-depth study, the imitation of the auditory nerve sparse touches the depth of feature extracting signal, beneficial to the improvement of the HMM model for speech recognition accuracy, meet the needs of the current computer assisted English teaching. The simulation results show that the recognition rate of the deep learning neural network is obviously superior to that of the traditional speech recognition algorithm, which realizes more accurate human-computer interaction and improves the reliability of the evaluation of the quality of foreign language pronunciation.