Discriminative segmental cues to vowel height and consonantal place and voicing in whispered speech

Journal of Phonetics (Language & Linguistics) | Impact factor: 1.9 | Zone 1 (Literature) | Pub Date: 2023-03-01 | DOI: 10.1016/j.wocn.2023.101223
Luis M.T. Jesus, Sara Castilho, Aníbal Ferreira, Maria Conceição Costa
Citations: 1

Abstract

Purpose

The acoustic signal attributes of whispered speech potentially carry sufficiently distinct information to define vowel spaces and to disambiguate consonant place and voicing, but what these attributes are and the underlying production mechanisms are not fully known. The purpose of this study was to define segmental cues to place and voicing of vowels and sibilant fricatives and to develop an articulatory interpretation of acoustic data.

Method

Seventeen speakers produced sustained sibilants and oral vowels, disyllabic words, and sentences, and read a phonetically balanced text. All tasks were repeated in voiced and whispered speech, and the sound source and filter were analysed using the following parameters: fundamental frequency, spectral peak frequencies and levels, spectral slopes, sound pressure level and durations. Logistic linear mixed-effects models were developed to understand which acoustic signal attributes carry sufficiently distinct information to disambiguate /i, a/ and /s, ʃ/.
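As a rough illustration of two of the parameters listed above, the following sketch estimates a spectral slope and a spectral peak frequency from a single frame. It is not the authors' analysis procedure: the function names, the Hann window, the least-squares line fit, the band limits and the 16 kHz sampling rate are all assumptions made for this example.

```python
import numpy as np


def band_spectrum(signal, sample_rate, f_lo, f_hi):
    """Return (frequencies in Hz, levels in dB) of a Hann-windowed frame within a band."""
    windowed = signal * np.hanning(len(signal))
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    level_db = 20.0 * np.log10(magnitude + 1e-12)  # small offset avoids log(0)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    return freqs[band], level_db[band]


def spectral_slope_db_per_khz(signal, sample_rate, f_lo, f_hi):
    """Slope of a least-squares line fitted to level (dB) against frequency (kHz)."""
    freqs, level_db = band_spectrum(signal, sample_rate, f_lo, f_hi)
    slope, _intercept = np.polyfit(freqs / 1000.0, level_db, 1)
    return float(slope)


def peak_frequency_hz(signal, sample_rate, f_lo, f_hi):
    """Frequency of the highest-level spectral bin inside the band."""
    freqs, level_db = band_spectrum(signal, sample_rate, f_lo, f_hi)
    return float(freqs[np.argmax(level_db)])


if __name__ == "__main__":
    # Hypothetical test frame: a pure 4 kHz tone, 0.1 s long, sampled at 16 kHz.
    sample_rate = 16000
    t = np.arange(1600) / sample_rate
    tone = np.sin(2.0 * np.pi * 4000.0 * t)
    print("peak (Hz):", peak_frequency_hz(tone, sample_rate, 500.0, 7500.0))
    print("slope (dB/kHz):", spectral_slope_db_per_khz(tone, sample_rate, 500.0, 7500.0))
```

In practice the band limits would be chosen per measure (for example, a low-frequency band for the low-frequency slope), and a smoothed or averaged spectrum would likely give a more stable peak estimate than a single frame.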

Results

Vowels were produced with significantly different spectral slope, sound pressure level, and first and second formant frequencies in voiced and whispered speech. The low-frequency spectral slope of voiced sibilants differed significantly between whispered and voiced speech. The odds of choosing /a/ instead of /i/ were estimated to be lower for whispered speech than for voiced speech. Fricatives' broad peak frequency was a statistically significant predictor when discriminating between /s/ and /ʃ/.
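To make the "odds of choosing /a/ instead of /i/" statement concrete, the sketch below fits a plain fixed-effects logistic regression to synthetic tokens and converts the coefficient for speech mode into an odds ratio. This is a simplification, not the study's model: the abstract reports logistic mixed-effects models, whose random-effects structure is not given here and is omitted in this example. The predictors (a z-scored first formant and a whispered/voiced indicator), the coefficient values and all function names are assumptions, and the data are simulated rather than taken from the paper.

```python
import numpy as np


def fit_logistic(X, y, n_iter=25):
    """Fit a fixed-effects logistic regression by Newton-Raphson; returns coefficients."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        gradient = X.T @ (y - p)
        hessian = X.T @ (X * (p * (1.0 - p))[:, None])
        beta = beta + np.linalg.solve(hessian, gradient)
    return beta


def simulate_tokens(n_tokens, true_beta, seed=0):
    """Simulate vowel tokens: y = 1 for /a/, 0 for /i/ (synthetic data only)."""
    rng = np.random.default_rng(seed)
    f1_z = rng.standard_normal(n_tokens)                    # z-scored F1 (assumed predictor)
    whispered = rng.integers(0, 2, n_tokens).astype(float)  # 1 = whispered, 0 = voiced
    X = np.column_stack([np.ones(n_tokens), f1_z, whispered])
    p_a = 1.0 / (1.0 + np.exp(-(X @ true_beta)))
    y = (rng.random(n_tokens) < p_a).astype(float)
    return X, y


if __name__ == "__main__":
    # Illustrative coefficients: intercept, F1 effect, whispered-mode effect (log-odds).
    X, y = simulate_tokens(4000, np.array([0.5, 2.0, -1.0]))
    beta = fit_logistic(X, y)
    # exp(coefficient) is the odds ratio; a value below 1 means lower odds of /a/.
    print("odds ratio, whispered vs voiced:", np.exp(beta[2]))
```

An odds ratio below 1 for the whispered indicator corresponds to the direction of the effect described above; the magnitude produced by this simulation carries no information about the study's estimates.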

Conclusions

First formant frequency and relative duration of vowels are consistently used as height cues, and spectral slope and broad peak frequency are attributes associated with consonantal place of articulation. The relative duration of same-place voiceless fricatives was higher than that of voiced fricatives in both voiced and whispered speech. The evidence presented in this paper can be used to restore voiced speech signals, and to inform rehabilitation strategies that can safely explore the production mechanisms of whispering.

Source journal
CiteScore: 3.50
Self-citation rate: 26.30%
Articles published: 49
About the journal: The Journal of Phonetics publishes papers of an experimental or theoretical nature that deal with phonetic aspects of language and linguistic communication processes. Papers dealing with technological and/or pathological topics, or papers of an interdisciplinary nature are also suitable, provided that linguistic-phonetic principles underlie the work reported. Regular articles, review articles, and letters to the editor are published. Themed issues are also published, devoted entirely to a specific subject of interest within the field of phonetics.