口咽特定对象生物力学建模:迈向语音生成。

IF 1.3 Q4 ENGINEERING, BIOMEDICAL Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization Pub Date : 2017-01-01 Epub Date: 2015-05-05 DOI:10.1080/21681163.2015.1033756
{"title":"口咽特定对象生物力学建模:迈向语音生成。","authors":"","doi":"10.1080/21681163.2015.1033756","DOIUrl":null,"url":null,"abstract":"<p><p>Biomechanical models of the oropharynx are beneficial to treatment planning of speech impediments by providing valuable insight into the speech function such as motor control. In this paper, we develop a subject-specific model of the oropharynx and investigate its utility in speech production. Our approach adapts a generic tongue-jaw-hyoid model (Stavness et al. 2011) to fit and track dynamic volumetric MRI data of a normal speaker, subsequently coupled to a source-filter based acoustic synthesizer. We demonstrate our model's ability to track tongue tissue motion, simulate plausible muscle activation patterns, as well as generate acoustic results that have comparable spectral features to the associated recorded audio. Finally, we propose a method to adjust the spatial resolution of our subject-specific tongue model to match the fidelity level of our MRI data and speech synthesizer. Our findings suggest that a higher resolution tongue model - using similar muscle fibre definition - does not show a significant improvement in acoustic performance, for our speech utterance and at this level of fidelity; however we believe that our approach enables further refinements of the muscle fibres suitable for studying longer speech sequences and finer muscle innervation using higher resolution dynamic data.</p>","PeriodicalId":51800,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5699225/pdf/nihms699796.pdf","citationCount":"0","resultStr":"{\"title\":\"Subject-Specific Biomechanical Modelling of the Oropharynx: Towards Speech Production.\",\"authors\":\"\",\"doi\":\"10.1080/21681163.2015.1033756\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Biomechanical models of the oropharynx are beneficial to treatment planning of speech impediments by providing valuable insight into the speech function such as motor control. In this paper, we develop a subject-specific model of the oropharynx and investigate its utility in speech production. Our approach adapts a generic tongue-jaw-hyoid model (Stavness et al. 2011) to fit and track dynamic volumetric MRI data of a normal speaker, subsequently coupled to a source-filter based acoustic synthesizer. We demonstrate our model's ability to track tongue tissue motion, simulate plausible muscle activation patterns, as well as generate acoustic results that have comparable spectral features to the associated recorded audio. Finally, we propose a method to adjust the spatial resolution of our subject-specific tongue model to match the fidelity level of our MRI data and speech synthesizer. Our findings suggest that a higher resolution tongue model - using similar muscle fibre definition - does not show a significant improvement in acoustic performance, for our speech utterance and at this level of fidelity; however we believe that our approach enables further refinements of the muscle fibres suitable for studying longer speech sequences and finer muscle innervation using higher resolution dynamic data.</p>\",\"PeriodicalId\":51800,\"journal\":{\"name\":\"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2017-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5699225/pdf/nihms699796.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/21681163.2015.1033756\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2015/5/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering-Imaging and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21681163.2015.1033756","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/5/5 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

口咽部的生物力学模型有助于制定语言障碍的治疗计划,为运动控制等语言功能提供有价值的见解。在本文中,我们开发了针对特定对象的口咽模型,并研究了该模型在语音生成中的实用性。我们的方法采用通用的舌-颚-舌骨模型(Stavness 等人,2011 年)来拟合和跟踪正常说话者的动态容积磁共振成像数据,随后将其与基于声源滤波器的声学合成器相结合。我们展示了我们的模型跟踪舌头组织运动、模拟可信肌肉激活模式以及生成与相关录音具有相似频谱特征的声学结果的能力。最后,我们提出了一种调整特定对象舌头模型空间分辨率的方法,以匹配核磁共振成像数据和语音合成器的保真度。我们的研究结果表明,使用类似的肌肉纤维定义的更高分辨率舌头模型,对于我们的语音语篇和这种保真度水平,并没有显示出声学性能的明显改善;但是我们相信,我们的方法能够进一步完善肌肉纤维,适合使用更高分辨率的动态数据研究更长的语音序列和更精细的肌肉神经支配。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Subject-Specific Biomechanical Modelling of the Oropharynx: Towards Speech Production.

Biomechanical models of the oropharynx are beneficial to treatment planning of speech impediments by providing valuable insight into the speech function such as motor control. In this paper, we develop a subject-specific model of the oropharynx and investigate its utility in speech production. Our approach adapts a generic tongue-jaw-hyoid model (Stavness et al. 2011) to fit and track dynamic volumetric MRI data of a normal speaker, subsequently coupled to a source-filter based acoustic synthesizer. We demonstrate our model's ability to track tongue tissue motion, simulate plausible muscle activation patterns, as well as generate acoustic results that have comparable spectral features to the associated recorded audio. Finally, we propose a method to adjust the spatial resolution of our subject-specific tongue model to match the fidelity level of our MRI data and speech synthesizer. Our findings suggest that a higher resolution tongue model - using similar muscle fibre definition - does not show a significant improvement in acoustic performance, for our speech utterance and at this level of fidelity; however we believe that our approach enables further refinements of the muscle fibres suitable for studying longer speech sequences and finer muscle innervation using higher resolution dynamic data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.80
自引率
6.20%
发文量
102
期刊介绍: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization is an international journal whose main goals are to promote solutions of excellence for both imaging and visualization of biomedical data, and establish links among researchers, clinicians, the medical technology sector and end-users. The journal provides a comprehensive forum for discussion of the current state-of-the-art in the scientific fields related to imaging and visualization, including, but not limited to: Applications of Imaging and Visualization Computational Bio- imaging and Visualization Computer Aided Diagnosis, Surgery, Therapy and Treatment Data Processing and Analysis Devices for Imaging and Visualization Grid and High Performance Computing for Imaging and Visualization Human Perception in Imaging and Visualization Image Processing and Analysis Image-based Geometric Modelling Imaging and Visualization in Biomechanics Imaging and Visualization in Biomedical Engineering Medical Clinics Medical Imaging and Visualization Multi-modal Imaging and Visualization Multiscale Imaging and Visualization Scientific Visualization Software Development for Imaging and Visualization Telemedicine Systems and Applications Virtual Reality Visual Data Mining and Knowledge Discovery.
期刊最新文献
Optimization of deep neural networks for multiclassification of dental X-rays using transfer learning A prototype smartphone jaw tracking application to quantitatively model tooth contact Computer-aided diagnosis of Canine Hip Dysplasia using deep learning approach in a novel X-ray image dataset Decorrelation stretch for enhancing colour fundus photographs affected by cataracts Genetic algorithm for feature selection in mammograms for breast masses classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1