M Ram Reddy, S Alivelu Mangamma, P. Laxminarayana, A V Ramana, P. Gangadhar
{"title":"Effects of channel conditions on the performance of ASR over GSM networks using full rate speech codec","authors":"M Ram Reddy, S Alivelu Mangamma, P. Laxminarayana, A V Ramana, P. Gangadhar","doi":"10.1109/ICCCT2.2014.7066729","DOIUrl":null,"url":null,"abstract":"In this paper, the performance of Automatic Speech Recognition (ASR) and Mean Opinion Score (MOS) are evaluated under different channel conditions over GSM Network using standard narrow band Full Rate speech and channel coding algorithms. CMU SPHINX toolkit and TIMIT speech database (8 kHz raw files) are used to evaluate the ASR accuracy. MOS values are estimated, using PESQ P.862, as recommended by ITU-T (International Telecommunication Union). In this evaluation, Rayleigh distribution for generation of channel noise is used to simulate the channel conditions. The results obtained in the experiments are reported and analyzed.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"33 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCT2.2014.7066729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, the performance of Automatic Speech Recognition (ASR) and Mean Opinion Score (MOS) are evaluated under different channel conditions over GSM Network using standard narrow band Full Rate speech and channel coding algorithms. CMU SPHINX toolkit and TIMIT speech database (8 kHz raw files) are used to evaluate the ASR accuracy. MOS values are estimated, using PESQ P.862, as recommended by ITU-T (International Telecommunication Union). In this evaluation, Rayleigh distribution for generation of channel noise is used to simulate the channel conditions. The results obtained in the experiments are reported and analyzed.