基于主动形状模型的舌形综合

Chan Song, Jianguo Wei, Qiang Fang, Shen Liu, Yuguang Wang, J. Dang
{"title":"基于主动形状模型的舌形综合","authors":"Chan Song, Jianguo Wei, Qiang Fang, Shen Liu, Yuguang Wang, J. Dang","doi":"10.1109/ISCSLP.2012.6423537","DOIUrl":null,"url":null,"abstract":"Nowadays magnetic resonance imaging (MRI) technique has been widely used in speech production research since it acquires high spatial resolution data of vocal tract shape without any known harm of radiation. However, it would be time consuming and expensive to establish an overall articulatory database using MRI technique due to its low temporal resolution and the large expense of the MRI equipment. In this study, we propose a method to interpolate tongue shapes between static vowels to acquire dynamic tongue shapes. Firstly, a set of parameters is extracted to control tongue shape based on Active Shape Model (ASM). Then, control parameters are interpolated to synthesize dynamic tongue shapes from static vowels' articulation. To evaluate the method, a set of key points were chosen from both the MRI images and the synthesize tongue shapes. Results suggested that the dynamic properties of these key points from the synthesized tongue shapes resemble those of the actual dynamic tongue shapes.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Tongue shape synthesis based on Active Shape Model\",\"authors\":\"Chan Song, Jianguo Wei, Qiang Fang, Shen Liu, Yuguang Wang, J. Dang\",\"doi\":\"10.1109/ISCSLP.2012.6423537\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays magnetic resonance imaging (MRI) technique has been widely used in speech production research since it acquires high spatial resolution data of vocal tract shape without any known harm of radiation. However, it would be time consuming and expensive to establish an overall articulatory database using MRI technique due to its low temporal resolution and the large expense of the MRI equipment. In this study, we propose a method to interpolate tongue shapes between static vowels to acquire dynamic tongue shapes. Firstly, a set of parameters is extracted to control tongue shape based on Active Shape Model (ASM). Then, control parameters are interpolated to synthesize dynamic tongue shapes from static vowels' articulation. To evaluate the method, a set of key points were chosen from both the MRI images and the synthesize tongue shapes. Results suggested that the dynamic properties of these key points from the synthesized tongue shapes resemble those of the actual dynamic tongue shapes.\",\"PeriodicalId\":186099,\"journal\":{\"name\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCSLP.2012.6423537\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423537","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

目前,磁共振成像技术在语音生成研究中得到了广泛的应用,因为它可以获得高空间分辨率的声道形状数据,而且没有已知的辐射危害。然而,由于MRI技术的时间分辨率较低,且MRI设备价格昂贵,因此建立一个完整的关节数据库将是耗时且昂贵的。在本研究中,我们提出了一种在静态元音之间插入舌形以获得动态舌形的方法。首先,基于主动形状模型(ASM)提取一组参数来控制舌形;然后,通过插值控制参数,从静态元音的发音合成动态舌形。为了评估该方法,从MRI图像和合成舌形中选择了一组关键点。结果表明,合成舌形的这些关键点的动态特性与实际舌形相似。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Tongue shape synthesis based on Active Shape Model
Nowadays magnetic resonance imaging (MRI) technique has been widely used in speech production research since it acquires high spatial resolution data of vocal tract shape without any known harm of radiation. However, it would be time consuming and expensive to establish an overall articulatory database using MRI technique due to its low temporal resolution and the large expense of the MRI equipment. In this study, we propose a method to interpolate tongue shapes between static vowels to acquire dynamic tongue shapes. Firstly, a set of parameters is extracted to control tongue shape based on Active Shape Model (ASM). Then, control parameters are interpolated to synthesize dynamic tongue shapes from static vowels' articulation. To evaluate the method, a set of key points were chosen from both the MRI images and the synthesize tongue shapes. Results suggested that the dynamic properties of these key points from the synthesized tongue shapes resemble those of the actual dynamic tongue shapes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation Effects of excitation spread on the intelligibility of Mandarin speech in cochlear implant simulations A comparative study of fMPE and RDLT approaches to LVCSR Keyword-specific normalization based keyword spotting for spontaneous speech A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1