Bandwidth Extension of a Narrowband Speech Coder for Music Streaming Services Over IP Networks

Young Han Lee, H. Kim
Published in: Proceedings. IEEE Workshop on Signal Processing Systems (2007-2014), volume 37 1, pages 552-555
Publication date: 2007-11-21
DOI: 10.1109/SIPS.2007.4387608 (https://doi.org/10.1109/SIPS.2007.4387608)

Abstract

We propose a bandwidth extension (BWE) algorithm that turns a low-bit-rate narrowband CELP coder into a wideband speech coder by means of spectral envelope sharing. The resulting wideband coder, referred to here as the BWE coder, has an embedded structure: an enhancement layer is added on top of the narrowband CELP coder. To minimize the bit-rate increase caused by the enhancement layer, the BWE coder shares the spectral envelope and excitation parameters between the narrowband CELP coder and the enhancement layer. We choose G.729EV layer 2 as the baseline narrowband speech coder, and the enhancement layer uses mel-frequency cepstral coefficients (MFCCs) to reconstruct the higher-frequency components. With this design, the bit rate of the BWE coder is 12.7 kbit/s, just 0.7 kbit/s higher than that of G.729EV layer 2. A MUSHRA test with audio signals from four music genres shows that the BWE coder gives better quality than G.729EV layer 2 and quality comparable to G.729EV layer 3, which corresponds to a bit-rate reduction of 1.3 kbit/s relative to layer 3.
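The bit-rate figures in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the 12.7, 0.7 and 1.3 kbit/s values come from the abstract, the layer 2 and layer 3 rates are derived from them, and the 20 ms frame length is an assumption introduced here to express the enhancement-layer overhead as bits per frame. The names `bits_per_frame`, `BWE_RATE` and the other constants are hypothetical.

```python
# Bit rates in bit/s. BWE_RATE and both deltas are quoted in the abstract;
# the layer 2 and layer 3 rates below are derived from them, not quoted.
BWE_RATE = 12_700           # proposed BWE coder
DELTA_OVER_LAYER2 = 700     # increase over G.729EV layer 2
SAVING_VS_LAYER3 = 1_300    # reduction at quality comparable to layer 3

# Assumption (not stated in the abstract): the coder works on 20 ms frames.
FRAME_MS = 20


def bits_per_frame(rate_bps: int, frame_ms: int = FRAME_MS) -> int:
    """Return the number of bits carried by one frame at the given bit rate."""
    return rate_bps * frame_ms // 1000


# Rates implied by the abstract's figures.
layer2_rate = BWE_RATE - DELTA_OVER_LAYER2
layer3_rate = BWE_RATE + SAVING_VS_LAYER3

if __name__ == "__main__":
    print("implied layer 2 rate (bit/s):", layer2_rate)
    print("implied layer 3 rate (bit/s):", layer3_rate)
    print("enhancement-layer bits per frame:", bits_per_frame(DELTA_OVER_LAYER2))
    print("BWE coder bits per frame:", bits_per_frame(BWE_RATE))
```

Under the 20 ms assumption, the enhancement layer has only a small per-frame bit budget, which is consistent with the abstract's point that parameters are shared with the narrowband layer rather than transmitted again.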