Diphone-like units without phonemes - option for very low bit rate speech coding

P. Motlícek, G. Baudoin, J. Černocký
Published in: EUROCON'2001. International Conference on Trends in Communications. Technical Program, Proceedings (Cat. No.01EX439)
Publication date: 2001-07-04
DOI: 10.1109/EURCON.2001.938162 (https://doi.org/10.1109/EURCON.2001.938162)
Citations: 5

Abstract

The aim of this work is to improve the quality of speech produced by a very low bit rate (VLBR) segmental coder. The basic units are found automatically in a training database using temporal decomposition and vector quantization, and are modeled by hidden Markov models (HMMs). Two re-segmentation methods are then used to derive new, longer units. In the first, unit borders are placed at the centers of the original units. In the second, borders are placed at the centers of the middle HMM states of the original units. In both cases the number of frames in a new unit is required to exceed a fixed constant, so a new unit can span several original segments. These techniques reduced the transition noise in the resulting speech.
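
The two re-segmentation rules in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the representation of units as lists of frame indices, the greedy left-to-right selection, and the use of integer division for centers are all assumptions made for illustration. The abstract also does not say how many HMM states each unit has or how the beginning and end of an utterance are handled, so the sketch picks the middle state by index and leaves the utterance edges out.

```python
from typing import List, Sequence


def unit_centers(boundaries: Sequence[int]) -> List[int]:
    """Method 1 (assumed form): candidate borders at the centers of the
    original units. `boundaries` holds the frame indices delimiting the
    original units, so unit i spans boundaries[i] .. boundaries[i + 1]."""
    return [(start + end) // 2 for start, end in zip(boundaries, boundaries[1:])]


def middle_state_centers(state_bounds: Sequence[Sequence[int]]) -> List[int]:
    """Method 2 (assumed form): candidate borders at the centers of the
    middle HMM state of each original unit. Each entry of `state_bounds`
    holds the frame indices delimiting the states of one unit, taken from
    a hypothetical state-level alignment."""
    centers = []
    for bounds in state_bounds:
        n_states = len(bounds) - 1
        mid = n_states // 2  # index of the middle state
        centers.append((bounds[mid] + bounds[mid + 1]) // 2)
    return centers


def select_borders(candidates: Sequence[int], min_frames: int) -> List[int]:
    """Keep a candidate border only if the new unit it closes would have
    more than `min_frames` frames; otherwise skip it, so the new unit
    grows to cover several original segments."""
    borders = [candidates[0]]
    for candidate in candidates[1:]:
        if candidate - borders[-1] > min_frames:
            borders.append(candidate)
    return borders


if __name__ == "__main__":
    # Hypothetical segmentation of one utterance into five original units.
    original = [0, 4, 10, 12, 20, 30]
    candidates = unit_centers(original)
    print(candidates)                     # [2, 7, 11, 16, 25]
    print(select_borders(candidates, 5))  # [2, 11, 25]
```

With a minimum length of 5 frames, the candidates at frames 7 and 16 are skipped, so the first new unit (frames 2 to 11) covers parts of three original units. Swapping `unit_centers` for `middle_state_centers` changes only where the candidate borders come from; the length constraint is applied in the same way.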