Min Gang, Guo Jian, Yang Jibin, Tan Wei, Chen Yanpu
{"title":"An extreme low bit rate speech coding algorithm around 300bps","authors":"Min Gang, Guo Jian, Yang Jibin, Tan Wei, Chen Yanpu","doi":"10.1109/WCSP.2009.5371427","DOIUrl":null,"url":null,"abstract":"An extreme low bit rate speech coding algorithm around 300bps is proposed in this paper. The algorithm builds mixed excitation segment coding model by taking advantage of the segment coder and the MELP coder. Variable dimension matrix quantization (VDMQ) and Variable dimension vector quantization (VDVQ) scheme are presented for quantizing LSP and excitation parameters. These quantization schemes achieve acceptable performance at very low bit rate. Also, the codebook storage is reduced dramatically. Informal subjective listening test shows that the reconstructed speech has high intelligibility and moderate naturalness, the PESQ score can achieve 2.02.","PeriodicalId":244652,"journal":{"name":"2009 International Conference on Wireless Communications & Signal Processing","volume":"2000 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Communications & Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2009.5371427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
An extreme low bit rate speech coding algorithm around 300bps is proposed in this paper. The algorithm builds mixed excitation segment coding model by taking advantage of the segment coder and the MELP coder. Variable dimension matrix quantization (VDMQ) and Variable dimension vector quantization (VDVQ) scheme are presented for quantizing LSP and excitation parameters. These quantization schemes achieve acceptable performance at very low bit rate. Also, the codebook storage is reduced dramatically. Informal subjective listening test shows that the reconstructed speech has high intelligibility and moderate naturalness, the PESQ score can achieve 2.02.