Nonlinear dynamical analysis of normal voices

M. E. Dajer, J. Pereira, Carlos Dias Maciel
Seventh IEEE International Symposium on Multimedia (ISM'05), published 2005-12-12
DOI: 10.1109/ISM.2005.84 (https://doi.org/10.1109/ISM.2005.84)
Citations: 19

Abstract

The human voice has been studied across many areas of science. Research over the last two decades has established the existence of chaos in human voice production. The purpose of this paper is to apply nonlinear dynamics methods to the analysis of normal voices from healthy subjects and to correlate the results with traditional acoustic parameters and perceptual analysis. Twelve human voice signals from healthy subjects, 6 males and 6 females aged 19 to 39 years, were used. Sustained vowels /a/, /e/ and /i/ from Brazilian Portuguese were recorded at a sampling rate of 22,050 Hz and analyzed to obtain acoustic perturbation measures: jitter, shimmer, coefficient of excess (EX), and pitch amplitude (PA). The phase space reconstruction method was used to describe the nonlinear dynamic characteristics of the voice signal samples. This paper shows that nonlinear dynamical methods such as phase space reconstruction seem to be a suitable technique for voice signal analysis, owing to the chaotic component of the human voice. The results suggest that nonlinear dynamic analysis does not replace existing techniques; instead, it may improve and complement the voice analysis methods currently available to health professionals, speech therapists and clinicians.
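Phase space reconstruction is commonly carried out by time-delay embedding, in which each sample of the signal is paired with later samples taken at a fixed lag. The following is a minimal sketch of that general technique, not the authors' implementation: the abstract does not state the embedding dimension or the delay, so `dim`, `tau`, the 220 Hz fundamental, and the synthetic sine wave standing in for a sustained vowel are all assumptions made for illustration. Only the 22,050 Hz sampling rate comes from the abstract.

```python
import math


def delay_embed(signal, dim, tau):
    """Return the delay vectors (x[i], x[i + tau], ..., x[i + (dim - 1) * tau])."""
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_vectors = len(signal) - (dim - 1) * tau
    if n_vectors <= 0:
        raise ValueError("signal is too short for this dim and tau")
    return [
        tuple(signal[i + k * tau] for k in range(dim))
        for i in range(n_vectors)
    ]


SAMPLE_RATE = 22050  # Hz, the sampling rate given in the abstract
F0 = 220.0           # Hz, hypothetical fundamental frequency (assumption)

# Synthetic stand-in for a recorded vowel: 0.1 s of a pure sine wave.
signal = [math.sin(2 * math.pi * F0 * n / SAMPLE_RATE) for n in range(2205)]

# Assumed lag of about a quarter of the fundamental period.
tau = round(SAMPLE_RATE / F0 / 4)

# Two-dimensional reconstruction: each point is (x[i], x[i + tau]).
trajectory = delay_embed(signal, dim=2, tau=tau)
```

With a quarter-period lag, a purely periodic signal such as this sine traces a closed, nearly circular orbit in the reconstructed plane. A real voice signal would be expected to give a less regular trajectory, and that irregularity is the kind of feature this analysis examines.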