{"title":"语音编码对说话人识别的影响","authors":"A. Vuppala, K. S. Sekhara Rao, S. Chakrabarti","doi":"10.1109/INDCON.2010.5712604","DOIUrl":null,"url":null,"abstract":"The increasing use of wireless systems is creating great deal of interest in the development of robust speech systems in wireless environment. The major degradations involved in wireless environment are: effect of varying background conditions, degradation due to speech coders and errors due to wireless channels. In this paper, we presented the effect of speech coding on text independent speaker identification (SI). Speech coders considered in this work are GSM full rate (ETSI 06.10), CELP (FS-1016), and MELP (TI 2.4kbps). The amount of distortion introduced by coding is measured using log-likelihood ratio (LLR), weighted spectral slope (WSS) and log-spectral distance (LSD). The effect of coding on SI is analyzed by building SI system using both vocal track system and excitation source features. We observed that there is a significant reduction of performance in SI system due to coding, and effect is more prominent in case of SI system build with source features. We also observed that, speaker characteristics are well preserved in case of MELP compared to CELP even though MELP coder bit rate is less than CELP.","PeriodicalId":109071,"journal":{"name":"2010 Annual IEEE India Conference (INDICON)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Effect of speech coding on speaker identification\",\"authors\":\"A. Vuppala, K. S. Sekhara Rao, S. Chakrabarti\",\"doi\":\"10.1109/INDCON.2010.5712604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing use of wireless systems is creating great deal of interest in the development of robust speech systems in wireless environment. The major degradations involved in wireless environment are: effect of varying background conditions, degradation due to speech coders and errors due to wireless channels. In this paper, we presented the effect of speech coding on text independent speaker identification (SI). Speech coders considered in this work are GSM full rate (ETSI 06.10), CELP (FS-1016), and MELP (TI 2.4kbps). The amount of distortion introduced by coding is measured using log-likelihood ratio (LLR), weighted spectral slope (WSS) and log-spectral distance (LSD). The effect of coding on SI is analyzed by building SI system using both vocal track system and excitation source features. We observed that there is a significant reduction of performance in SI system due to coding, and effect is more prominent in case of SI system build with source features. We also observed that, speaker characteristics are well preserved in case of MELP compared to CELP even though MELP coder bit rate is less than CELP.\",\"PeriodicalId\":109071,\"journal\":{\"name\":\"2010 Annual IEEE India Conference (INDICON)\",\"volume\":\"204 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Annual IEEE India Conference (INDICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDCON.2010.5712604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Annual IEEE India Conference (INDICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDCON.2010.5712604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The increasing use of wireless systems is creating great deal of interest in the development of robust speech systems in wireless environment. The major degradations involved in wireless environment are: effect of varying background conditions, degradation due to speech coders and errors due to wireless channels. In this paper, we presented the effect of speech coding on text independent speaker identification (SI). Speech coders considered in this work are GSM full rate (ETSI 06.10), CELP (FS-1016), and MELP (TI 2.4kbps). The amount of distortion introduced by coding is measured using log-likelihood ratio (LLR), weighted spectral slope (WSS) and log-spectral distance (LSD). The effect of coding on SI is analyzed by building SI system using both vocal track system and excitation source features. We observed that there is a significant reduction of performance in SI system due to coding, and effect is more prominent in case of SI system build with source features. We also observed that, speaker characteristics are well preserved in case of MELP compared to CELP even though MELP coder bit rate is less than CELP.