Digital singing voice synthesis using a new alternating reflection model

M.E. Lee, M.J.T. Smith
Published in: 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353)
Publication date: 2002-08-07
DOI: 10.1109/ISCAS.2002.1011490 (https://doi.org/10.1109/ISCAS.2002.1011490)
Citations: 4

Abstract

Many models for computer-generated singing voices have been proposed and have been shown to produce a wide variety of synthesized voices. While many of these models can synthesize a particular singing voice with high musical quality, they are typically limited in naturalness, in range, in the ability to synthesize both male and female voices, and in the ability to capture the identity of the singer. The analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model has proven effective in producing high-quality voices at manageable computational cost. It combines a block overlap-add sinusoidal representation with an analysis-by-synthesis parameter estimation technique. ABS/OLA is flexible enough to allow modifications such as time and pitch scaling; however, quality can degrade under such modifications. This paper presents an analysis/synthesis model that incorporates new methods to improve synthesis. These improvements add naturalness and flexibility in controlling perceptually important musical characteristics.
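
The block overlap-add sinusoidal representation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation and it omits the analysis-by-synthesis parameter estimation entirely; it only shows generic overlap-add synthesis of sinusoidal frames. The function names, the triangular window, and the convention that each partial's phase is referenced to the frame centre are all assumptions made for illustration.

```python
import math


def triangular_window(hop):
    """Length-2*hop triangular window; copies shifted by `hop` samples sum to 1."""
    return [1.0 - abs(n - hop) / hop for n in range(2 * hop)]


def synthesize_ola(frames, hop, sample_rate):
    """Overlap-add synthesis of sinusoidal frames (illustrative only).

    frames: one list per frame of (amplitude, frequency_hz, phase) tuples,
            with phase measured at the frame centre (an assumed convention).
    """
    window = triangular_window(hop)
    out = [0.0] * (hop * (len(frames) + 1))
    for k, partials in enumerate(frames):
        start = k * hop
        for n in range(2 * hop):
            t = (n - hop) / sample_rate  # time relative to the frame centre
            s = sum(a * math.cos(2 * math.pi * f * t + ph) for a, f, ph in partials)
            out[start + n] += window[n] * s  # windowed frame added into the output
    return out


def steady_tone_frames(freq_hz, n_frames, hop, sample_rate):
    """Hypothetical helper: frames for one steady partial with coherent phases."""
    frames = []
    for k in range(n_frames):
        centre = (k + 1) * hop  # sample index of the centre of frame k
        phase = 2 * math.pi * freq_hz * centre / sample_rate
        frames.append([(1.0, freq_hz, phase)])
    return frames


frames = steady_tone_frames(220.0, 10, 80, 8000)
signal = synthesize_ola(frames, 80, 8000)
```

When the per-frame phases are coherent, as in `steady_tone_frames`, the overlapping windows sum to one and the interior of `signal` reproduces the underlying sinusoid. Changing the frequencies of existing frames without updating their phases breaks that coherence between neighbouring frames, which is one plausible way pitch scaling can introduce artifacts in an overlap-add model.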