Effect of speech coding on speaker identification

A. Vuppala, K. S. Sekhara Rao, S. Chakrabarti
Published in: 2010 Annual IEEE India Conference (INDICON), 2010-12-01. DOI: 10.1109/INDCON.2010.5712604 (https://doi.org/10.1109/INDCON.2010.5712604)
Citations: 21

Abstract

The increasing use of wireless systems is creating a great deal of interest in the development of robust speech systems for wireless environments. The major degradations in a wireless environment are varying background conditions, distortion introduced by speech coders, and errors introduced by wireless channels. In this paper, we present the effect of speech coding on text-independent speaker identification (SI). The speech coders considered in this work are GSM full rate (ETSI 06.10), CELP (FS-1016), and MELP (TI 2.4 kbps). The amount of distortion introduced by coding is measured using the log-likelihood ratio (LLR), weighted spectral slope (WSS), and log-spectral distance (LSD). The effect of coding on SI is analyzed by building SI systems using both vocal tract system features and excitation source features. We observe a significant reduction in SI performance due to coding, and the effect is more prominent for the SI system built with source features. We also observe that speaker characteristics are better preserved by MELP than by CELP, even though the MELP coder operates at a lower bit rate than CELP.
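
Of the three distortion measures named in the abstract, the log-spectral distance is the simplest to illustrate. The sketch below is a minimal example using one common textbook form of LSD: the per-frame root-mean-square difference between the log power spectra (in dB) of the original and the coded signal, averaged over frames. The abstract does not give the authors' exact formulation, so the function name `log_spectral_distance`, the frame length, the hop size, and the Hann window are all assumptions made for illustration.

```python
import numpy as np

def log_spectral_distance(reference, degraded, frame_len=256, hop=128, eps=1e-10):
    """Mean per-frame log-spectral distance (dB) between two equal-length signals.

    Illustrative sketch only: framing parameters and window are assumptions,
    not values taken from the paper.
    """
    reference = np.asarray(reference, dtype=float)
    degraded = np.asarray(degraded, dtype=float)
    if reference.shape != degraded.shape:
        raise ValueError("signals must have the same length")
    if reference.ndim != 1 or len(reference) < frame_len:
        raise ValueError("signals must be 1-D and at least one frame long")

    window = np.hanning(frame_len)
    distances = []
    for start in range(0, len(reference) - frame_len + 1, hop):
        ref_frame = reference[start:start + frame_len] * window
        deg_frame = degraded[start:start + frame_len] * window
        # Power spectra; eps avoids log(0) on silent frames.
        ref_power = np.abs(np.fft.rfft(ref_frame)) ** 2 + eps
        deg_power = np.abs(np.fft.rfft(deg_frame)) ** 2 + eps
        diff_db = 10.0 * np.log10(ref_power) - 10.0 * np.log10(deg_power)
        # RMS of the dB difference across frequency bins for this frame.
        distances.append(np.sqrt(np.mean(diff_db ** 2)))
    return float(np.mean(distances))
```

In practice the `degraded` signal would be the output of a coder such as GSM full rate, CELP, or MELP, time-aligned with the original; this sketch assumes that alignment has already been done and does not perform it.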