Assessment of Tracks of Resonance Frequencies of the Vocal Tract

IF 0.9 4区 物理与天体物理 Q4 ACOUSTICS Acoustical Physics Pub Date : 2024-02-28 DOI:10.1134/S1063771023601140
A. S. Leonov, V. N. Sorokin
{"title":"Assessment of Tracks of Resonance Frequencies of the Vocal Tract","authors":"A. S. Leonov,&nbsp;V. N. Sorokin","doi":"10.1134/S1063771023601140","DOIUrl":null,"url":null,"abstract":"<div><p>A new method for estimating formant frequency tracks of the vocal tract for arbitrary speech segments is proposed. The method uses the ratio of two Fourier transforms of a speech signal with special exponential-type windows depending on some parameter. This ratio is used for specific points in time and is considered as a function of frequency and parameter. By analyzing, for several parameter values, the distribution of minimum points (in terms of frequency) for the phase of this ratio and/or a similar distribution of extreme points for its amplitude, it is possible to estimate formant frequencies from the peaks of these distributions. A mathematical study is presented that substantiates this approach. A series of numerical experiments were carried out on the processing of synthetic and real speech signals, which confirmed the performance capabilities of the proposed formant evaluation method. In particular, in experiments with synthesized vowels, it was found that the error in estimating their resonance frequencies is small and stable with respect to additive noise up to a signal-to-noise ratio of 5 dB. For real speech, the method makes it possible to calculate the formant frequency tracks for both sounds with vocal excitation and for voiceless fricatives, aspirated plosives, and whispered speech.</p></div>","PeriodicalId":455,"journal":{"name":"Acoustical Physics","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustical Physics","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1134/S1063771023601140","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

A new method for estimating formant frequency tracks of the vocal tract for arbitrary speech segments is proposed. The method uses the ratio of two Fourier transforms of a speech signal with special exponential-type windows depending on some parameter. This ratio is used for specific points in time and is considered as a function of frequency and parameter. By analyzing, for several parameter values, the distribution of minimum points (in terms of frequency) for the phase of this ratio and/or a similar distribution of extreme points for its amplitude, it is possible to estimate formant frequencies from the peaks of these distributions. A mathematical study is presented that substantiates this approach. A series of numerical experiments were carried out on the processing of synthetic and real speech signals, which confirmed the performance capabilities of the proposed formant evaluation method. In particular, in experiments with synthesized vowels, it was found that the error in estimating their resonance frequencies is small and stable with respect to additive noise up to a signal-to-noise ratio of 5 dB. For real speech, the method makes it possible to calculate the formant frequency tracks for both sounds with vocal excitation and for voiceless fricatives, aspirated plosives, and whispered speech.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
声道共振频率轨迹评估
本文提出了一种估算任意语音片段声道心形频率轨迹的新方法。该方法使用语音信号的两个傅立叶变换的比值,并根据某些参数使用特殊的指数型窗口。该比率用于特定的时间点,并被视为频率和参数的函数。通过分析几个参数值,该比率相位的最小点(频率)分布和/或其振幅的极值点的类似分布,可以根据这些分布的峰值估算出声母频率。本文提出的数学研究证实了这一方法。对合成和真实语音信号的处理进行了一系列数值实验,证实了所提出的声像评估方法的性能。特别是在合成元音的实验中发现,在信噪比不超过 5 dB 的情况下,估计元音共振频率的误差很小,而且相对于加性噪声来说很稳定。对于真实语音,该方法可以计算出带有发声激励的声音、无声摩擦音、吸气复音和耳语的共振频率轨迹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Acoustical Physics
Acoustical Physics 物理-声学
CiteScore
1.60
自引率
50.00%
发文量
58
审稿时长
3.5 months
期刊介绍: Acoustical Physics is an international peer reviewed journal published with the participation of the Russian Academy of Sciences. It covers theoretical and experimental aspects of basic and applied acoustics: classical problems of linear acoustics and wave theory; nonlinear acoustics; physical acoustics; ocean acoustics and hydroacoustics; atmospheric and aeroacoustics; acoustics of structurally inhomogeneous solids; geological acoustics; acoustical ecology, noise and vibration; chamber acoustics, musical acoustics; acoustic signals processing, computer simulations; acoustics of living systems, biomedical acoustics; physical principles of engineering acoustics. The journal publishes critical reviews, original articles, short communications, and letters to the editor. It covers theoretical and experimental aspects of basic and applied acoustics. The journal welcomes manuscripts from all countries in the English or Russian language.
期刊最新文献
Peculiarities of Flexural Wave Propagation in a Notched Bar Interference of Echo Signals from Spherical Scatterers Located Near the Bottom Theoretical and Experimental Study of Diffraction by a Thin Cone Thermal Ablation of Biological Tissue by Sonicating Discrete Foci in a Specified Volume with a Single Wave Burst with Shocks On the Evolution of a System of Shock Waves Created by Engine Fan Blades
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1