{"title":"一种基于语音识别的农产品价格信息采集方法","authors":"Jinpu Xu, Yeping Zhu, Hailong Liu, J. Zhao","doi":"10.1109/ICNC.2014.6975957","DOIUrl":null,"url":null,"abstract":"Speech recognition technology was applied to information collection of agricultural prices, with the acoustic models trained for agricultural prices information collection environment so as to minimize the environmental influence. Firstly, we constructed the speech corpus by collecting speech under the operating scene, and then selected tri-phone modeling as the decode unit to train hidden Markov model (HMM) for the recognition of male and female voices. Secondly, decision tree-based clustering of states was used to solve the problem caused by insufficiency in training samples, and then increased mixture of Gaussian components to make the model more accurately described. In the end, we adopted the CMN and CVN methods (often used in conjunction, called CMVN) to reduce the mismatch between testing and the training environment. From the test results of different locations and different speakers, the ultimate recognition rate reached 95.04% for males, and 97.62% for females.","PeriodicalId":208779,"journal":{"name":"2014 10th International Conference on Natural Computation (ICNC)","volume":"11 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An approach of agricultural price information collection based on speech recognition\",\"authors\":\"Jinpu Xu, Yeping Zhu, Hailong Liu, J. Zhao\",\"doi\":\"10.1109/ICNC.2014.6975957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech recognition technology was applied to information collection of agricultural prices, with the acoustic models trained for agricultural prices information collection environment so as to minimize the environmental influence. Firstly, we constructed the speech corpus by collecting speech under the operating scene, and then selected tri-phone modeling as the decode unit to train hidden Markov model (HMM) for the recognition of male and female voices. Secondly, decision tree-based clustering of states was used to solve the problem caused by insufficiency in training samples, and then increased mixture of Gaussian components to make the model more accurately described. In the end, we adopted the CMN and CVN methods (often used in conjunction, called CMVN) to reduce the mismatch between testing and the training environment. From the test results of different locations and different speakers, the ultimate recognition rate reached 95.04% for males, and 97.62% for females.\",\"PeriodicalId\":208779,\"journal\":{\"name\":\"2014 10th International Conference on Natural Computation (ICNC)\",\"volume\":\"11 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 10th International Conference on Natural Computation (ICNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNC.2014.6975957\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 10th International Conference on Natural Computation (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNC.2014.6975957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An approach of agricultural price information collection based on speech recognition
Speech recognition technology was applied to information collection of agricultural prices, with the acoustic models trained for agricultural prices information collection environment so as to minimize the environmental influence. Firstly, we constructed the speech corpus by collecting speech under the operating scene, and then selected tri-phone modeling as the decode unit to train hidden Markov model (HMM) for the recognition of male and female voices. Secondly, decision tree-based clustering of states was used to solve the problem caused by insufficiency in training samples, and then increased mixture of Gaussian components to make the model more accurately described. In the end, we adopted the CMN and CVN methods (often used in conjunction, called CMVN) to reduce the mismatch between testing and the training environment. From the test results of different locations and different speakers, the ultimate recognition rate reached 95.04% for males, and 97.62% for females.