Speaker-specificity in speech production: The contribution of source and filter

IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Journal of Phonetics Pub Date : 2023-03-01 DOI:10.1016/j.wocn.2023.101224
Vincent Hughes , Amanda Cardoso , Paul Foulkes , Peter French , Amelia Gully , Philip Harrison
{"title":"Speaker-specificity in speech production: The contribution of source and filter","authors":"Vincent Hughes ,&nbsp;Amanda Cardoso ,&nbsp;Paul Foulkes ,&nbsp;Peter French ,&nbsp;Amelia Gully ,&nbsp;Philip Harrison","doi":"10.1016/j.wocn.2023.101224","DOIUrl":null,"url":null,"abstract":"<div><p>This study examines the extent to which speaker-specific information is encoded in different features of vocal output and the relationships between those features. A range of acoustic features, grouped as source (laryngeal voice quality measures and fundamental frequency) and filter features (formants and Mel-frequency cepstral coefficients; MFCCs), were extracted from the vocalic portion of the hesitation marker <em>um</em> for 90 male speakers of Standard Southern British English. Little overall correlation between the sets of features was observed, suggesting no strong interdependence between source and filter in our data. Although filter features were consistently better at discriminating between same- and different-speaker pairs compared with source features, combining source and filter has the potential of producing the lowest error rates and the strongest speaker discrimination scores. Taken together, results show that source and filter provide complementary speaker-specific information. However, the extent of the improvements in speaker discrimination performance when combining source and filter varied across speakers. We explore potential explanations for this finding and discuss the implications for source-filter theory, and for applied fields such as speaker recognition and forensic speech science.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Phonetics","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S009544702300013X","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

This study examines the extent to which speaker-specific information is encoded in different features of vocal output and the relationships between those features. A range of acoustic features, grouped as source (laryngeal voice quality measures and fundamental frequency) and filter features (formants and Mel-frequency cepstral coefficients; MFCCs), were extracted from the vocalic portion of the hesitation marker um for 90 male speakers of Standard Southern British English. Little overall correlation between the sets of features was observed, suggesting no strong interdependence between source and filter in our data. Although filter features were consistently better at discriminating between same- and different-speaker pairs compared with source features, combining source and filter has the potential of producing the lowest error rates and the strongest speaker discrimination scores. Taken together, results show that source and filter provide complementary speaker-specific information. However, the extent of the improvements in speaker discrimination performance when combining source and filter varied across speakers. We explore potential explanations for this finding and discuss the implications for source-filter theory, and for applied fields such as speaker recognition and forensic speech science.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
语音生成中的说话人特异性:源和滤波器的贡献
这项研究考察了说话者特定信息在多大程度上被编码在声音输出的不同特征中,以及这些特征之间的关系。一系列声学特征,分为源(喉音质量测量和基频)和滤波器特征(共振峰和梅尔频率倒谱系数;MFCC),从90名标准南方英国英语男性说话者的犹豫标记um的发声部分提取。观察到特征集之间的总体相关性很小,这表明我们的数据中的源和过滤器之间没有很强的相互依赖性。尽管与源特征相比,滤波器特征在区分相同和不同的说话者对方面始终更好,但将源和滤波器相结合有可能产生最低的错误率和最强的说话者区分分数。总之,结果表明,源和滤波器提供了互补的说话者特定信息。然而,当组合源和滤波器时,扬声器辨别性能的改善程度因扬声器而异。我们探索了这一发现的潜在解释,并讨论了对源滤波器理论以及说话人识别和取证语音科学等应用领域的启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.50
自引率
26.30%
发文量
49
期刊介绍: The Journal of Phonetics publishes papers of an experimental or theoretical nature that deal with phonetic aspects of language and linguistic communication processes. Papers dealing with technological and/or pathological topics, or papers of an interdisciplinary nature are also suitable, provided that linguistic-phonetic principles underlie the work reported. Regular articles, review articles, and letters to the editor are published. Themed issues are also published, devoted entirely to a specific subject of interest within the field of phonetics.
期刊最新文献
Talker variability versus variability of vowel context in training naïve learners on an unfamiliar class of foreign language contrasts Effects of syllable position and place of articulation on secondary dorsal contrasts: An ultrasound study of Irish On the target of phonetic convergence: Acoustic and linguistic aspects of pitch accent imitation Effects of word-level structure on oral stop realization in Hawaiian Lexically-guided perceptual recalibration from acoustically unambiguous input in second language learners
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1