Discriminative segmental cues to vowel height and consonantal place and voicing in whispered speech

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS Journal of Phonetics Pub Date : 2023-03-01 Epub Date: 2023-02-28 DOI:10.1016/j.wocn.2023.101223

Luis M.T. Jesus , Sara Castilho , Aníbal Ferreira , Maria Conceição Costa

{"title":"Discriminative segmental cues to vowel height and consonantal place and voicing in whispered speech","authors":"Luis M.T. Jesus , Sara Castilho , Aníbal Ferreira , Maria Conceição Costa","doi":"10.1016/j.wocn.2023.101223","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>The acoustic signal attributes of whispered speech potentially carry sufficiently distinct information to define vowel spaces and to disambiguate consonant place and voicing, but what these attributes are and the underlying production mechanisms are not fully known. The purpose of this study was to define segmental cues to place and voicing of vowels and sibilant fricatives and to develop an articulatory interpretation of acoustic data.</p></div><div><h3>Method</h3><p>Seventeen speakers produced sustained sibilants and oral vowels, disyllabic words, sentences and read a phonetically balanced text. All the tasks were repeated in voiced and whispered speech, and the sound source and filter analysed using the following parameters: Fundamental frequency, spectral peak frequencies and levels, spectral slopes, sound pressure level and durations. Logistic linear mixed-effects models were developed to understand what acoustic signal attributes carry sufficiently distinct information to disambiguate /i, a/ and /s, ʃ/.</p></div><div><h3>Results</h3><p>Vowels were produced with significantly different spectral slope, sound pressure level, first and second formant frequencies in voiced and whispered speech. The low frequencies spectral slope of voiced sibilants was significantly different between whispered and voiced speech. The odds of choosing /a/ instead of /i/ were estimated to be lower for whispered speech when compared to voiced speech. Fricatives’ broad peak frequency was statistically significant when discriminating between /s/ and /ʃ/.</p></div><div><h3>Conclusions</h3><p>First formant frequency and relative duration of vowels are consistently used as height cues, and spectral slope and broad peak frequency are attributes associated with consonantal place of articulation. The relative duration of same-place voiceless fricatives was higher than voiced fricatives both in voiced and whispered speech. The evidence presented in this paper can be used to restore voiced speech signals, and to inform rehabilitation strategies that can safely explore the production mechanisms of whispering.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"97 ","pages":"Article 101223"},"PeriodicalIF":2.4000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Phonetics","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0095447023000128","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/2/28 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}

引用次数: 1

Abstract

Purpose

The acoustic signal attributes of whispered speech potentially carry sufficiently distinct information to define vowel spaces and to disambiguate consonant place and voicing, but what these attributes are and the underlying production mechanisms are not fully known. The purpose of this study was to define segmental cues to place and voicing of vowels and sibilant fricatives and to develop an articulatory interpretation of acoustic data.

Method

Seventeen speakers produced sustained sibilants and oral vowels, disyllabic words, sentences and read a phonetically balanced text. All the tasks were repeated in voiced and whispered speech, and the sound source and filter analysed using the following parameters: Fundamental frequency, spectral peak frequencies and levels, spectral slopes, sound pressure level and durations. Logistic linear mixed-effects models were developed to understand what acoustic signal attributes carry sufficiently distinct information to disambiguate /i, a/ and /s, ʃ/.

Results

Vowels were produced with significantly different spectral slope, sound pressure level, first and second formant frequencies in voiced and whispered speech. The low frequencies spectral slope of voiced sibilants was significantly different between whispered and voiced speech. The odds of choosing /a/ instead of /i/ were estimated to be lower for whispered speech when compared to voiced speech. Fricatives’ broad peak frequency was statistically significant when discriminating between /s/ and /ʃ/.

Conclusions

First formant frequency and relative duration of vowels are consistently used as height cues, and spectral slope and broad peak frequency are attributes associated with consonantal place of articulation. The relative duration of same-place voiceless fricatives was higher than voiced fricatives both in voiced and whispered speech. The evidence presented in this paper can be used to restore voiced speech signals, and to inform rehabilitation strategies that can safely explore the production mechanisms of whispering.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

轻声语音中元音高度、辅音位置和发音的判别性分段线索

目的低声说话的声学信号属性可能携带足够独特的信息来定义元音空间，消除辅音位置和发音的歧义，但这些属性是什么以及潜在的产生机制尚不完全清楚。本研究的目的是定义元音和嘶嘶擦音的位置和发音的分段线索，并开发声学数据的发音解释。方法17名说话人发出持续的嘶嘶声和口头元音、双音节单词、句子，并阅读语音平衡的文本。所有任务都在浊音和耳语中重复，并使用以下参数分析声源和滤波器：基频、频谱峰值频率和电平、频谱斜率、声压电平和持续时间。建立了Logistic线性混合效应模型，以了解哪些声学信号属性携带足够清晰的信息来消除/i、a/和/s的歧义。轻声和浊音的浊音的低频谱斜率有显著差异。据估计，与浊音语音相比，选择/a/而不是/i/的几率更低。在区分/s/和/？/时，擦音的宽峰频率具有统计学意义。结论元音的第一共振峰频率和相对持续时间一直被用作高度线索，谱斜率和宽峰频率是与辅音发音位置相关的属性。在浊音和轻声两种语音中，同一地点的无声擦音的相对时长均高于有声擦音。本文提供的证据可用于恢复有声语音信号，并为安全探索窃窃私语产生机制的康复策略提供信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Phonetics Multiple-

CiteScore

3.50

自引率

26.30%

发文量

期刊介绍： The Journal of Phonetics publishes papers of an experimental or theoretical nature that deal with phonetic aspects of language and linguistic communication processes. Papers dealing with technological and/or pathological topics, or papers of an interdisciplinary nature are also suitable, provided that linguistic-phonetic principles underlie the work reported. Regular articles, review articles, and letters to the editor are published. Themed issues are also published, devoted entirely to a specific subject of interest within the field of phonetics.