Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-158
Heini Kallio, Rosa Suviranta, M. Kuronen, Anna von Zansen
While utterance fluency measures are often studied in relation to perceived L2 fluency and proficiency, the effect of creaky voice has remained largely unexamined. However, creaky voice is frequent in a number of languages, including Finnish, where it serves as a cue to phrase boundaries and turn-taking. In this study we investigate the roles of creaky voice and utterance fluency measures in predicting fluency and proficiency ratings of spontaneous L2 Finnish (F2) speech. To this end, 16 expert raters assessed narrative spontaneous speech samples from 160 learners of Finnish. The effect of creaky voice and utterance fluency measures on proficiency and fluency ratings was studied using linear regression models. The results indicate that creaky voice can contribute to both oral proficiency and fluency alongside utterance fluency measures. Furthermore, the average duration of composite breaks, a measure combining breakdown and repair phenomena, proved to be the most significant predictor of fluency. Based on these findings we recommend further investigation of the effect of creaky voice on the assessment of L2 speech, as well as reconsideration of the utterance fluency measures used in predicting L2 fluency or proficiency.
Title: Creaky voice and utterance fluency measures in predicting fluency and oral proficiency of spontaneous L2 Finnish (Speech Prosody 2022)
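The modelling step described in this abstract (linear regression of ratings on creak and utterance fluency measures) can be sketched with ordinary least squares. Everything below is invented for illustration: the predictor names, coefficients, and data are not the study's variables or values, only the general shape of such a model.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 160  # the study assessed speech samples from 160 learners

# Hypothetical predictors (names and scales are assumptions)
articulation_rate = rng.normal(4.0, 0.8, n)    # syllables per second
composite_break_dur = rng.normal(0.9, 0.3, n)  # seconds, breakdown + repair
creak_proportion = rng.uniform(0.0, 0.4, n)    # fraction of voiced speech

# Simulated ratings: faster speech and shorter breaks -> higher fluency
fluency = (2.0 + 0.5 * articulation_rate - 1.2 * composite_break_dur
           + 0.8 * creak_proportion + rng.normal(0, 0.3, n))

# Ordinary least squares with an intercept column
X = np.column_stack([np.ones(n), articulation_rate,
                     composite_break_dur, creak_proportion])
coef, *_ = np.linalg.lstsq(X, fluency, rcond=None)
print(dict(zip(["intercept", "art_rate", "break_dur", "creak"],
               coef.round(2))))
```

With enough data, the fitted coefficients recover the simulated effects: a negative weight on break duration and positive weights on rate and creak.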
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-70
Zhiqiang Zhu, P. Mok
Whether speech rate can transfer between languages with distinctive speech rates is an understudied issue. Impressionistically, Japanese is faster than Mandarin Chinese. We investigated the speech rates of native Japanese and Mandarin speakers, advanced L2 learners, and simultaneous bilinguals. Nine native Beijing Mandarin speakers, five native Japanese speakers, thirteen advanced L1 Japanese learners of Mandarin, and eleven Japanese-Mandarin simultaneous bilinguals participated in a passage reading task and a spontaneous speech task. The comparison of the two languages by native Mandarin speakers and native Japanese speakers confirmed that the speech rate of native Japanese is faster than that of Mandarin. Comparison of the Japanese and Mandarin speech rates of advanced Japanese learners and simultaneous bilinguals showed that both groups consistently produced Japanese faster than their Mandarin. Both advanced Japanese learners and simultaneous bilinguals produced Japanese at rates similar to those of native Japanese speakers. However, the Mandarin speech rate of advanced Japanese learners was significantly slower than that of native Mandarin speakers, while the difference between simultaneous bilinguals and native Mandarin speakers was non-significant. The findings challenge previous proposals that speech rate transfer could happen at the language level. Moreover, simultaneous bilinguals showed an advantage over advanced L2 learners in speech rate mastery.
Title: Can speech rate transfer between languages? Evidence from Japanese and Mandarin Chinese (Speech Prosody 2022)
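Speech rate in studies like this is commonly operationalized as articulation rate: syllables per second of phonation time, with silent pauses excluded. Whether the paper used exactly this measure is an assumption; the sketch below only illustrates the standard computation, with invented numbers.

```python
def articulation_rate(n_syllables, total_dur_s, pause_dur_s):
    """Syllables per second of phonation time (silent pauses excluded)."""
    phonation_time = total_dur_s - pause_dur_s
    if phonation_time <= 0:
        raise ValueError("pause time must be shorter than total duration")
    return n_syllables / phonation_time

# Invented example: a 20 s stretch with 5 s of pauses and 105 syllables
# gives 7.0 syllables/s, in the impressionistically fast Japanese range.
print(articulation_rate(105, 20.0, 5.0))  # 7.0
```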
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-48
Zhenyang Xi, Yan Gu, G. Vigliocco
Past research has shown that children under three years of age often use gesture to supplement speech rather than as a system integrated with speech, and that the relationship between speech and gesture may relate to vocabulary development. However, this relationship is unknown in 3-4-year-old children, a period in which we can capture key developmental changes from using gestures alone to using them along with speech. Using a new corpus of semi-naturalistic interaction between caregivers and their 3-4-year-old children (the ECOLANG corpus), this study investigates (1) the effect of age on children’s speaking and gesture rates, (2) the relationship between speaking and gesture rates, and (3) their correlation with word learning. Specifically, we studied the speaking and gesture rates of 32 English-speaking children while they talked with their caregivers about sets of pre-selected toys. The children completed a vocabulary test at the time of the experiment and one year later. Results show no effect of age on speaking and gesture rates in this age range, but children with a faster speaking rate also had a higher gesture rate. Additionally, neither speaking rate nor gesture rate correlated with word learning. Thus, our findings show that by this age, children use gestures that are integrated with speech, and the speech-gesture relationship is no longer a predictor of vocabulary learning. We speculate that this transition is mainly a result of enhanced conceptual representation ability.
Title: Speaking Rate in 3-4-Year-Old Children: Its Correlation with Gesture Rate and Word Learning (Speech Prosody 2022)
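The reported association between speaking rate and gesture rate is a simple correlation over per-child rates. The sketch below is hypothetical: the sample values are simulated, not the study's annotated data, and serve only to show the computation.

```python
import numpy as np

rng = np.random.default_rng(7)
n_children = 32  # matching the study's sample size

# Invented per-child rates; units are illustrative only
speaking_rate = rng.normal(3.0, 0.5, n_children)
# Simulate the reported positive association: faster talkers gesture more
gesture_rate = 0.8 * speaking_rate + rng.normal(0, 0.3, n_children)

# Pearson correlation between the two per-child rates
r = np.corrcoef(speaking_rate, gesture_rate)[0, 1]
print(round(r, 2))  # positive, mirroring the paper's finding
```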
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-15
Marita K Everhardt, A. Sarampalis, M. Coler, D. Başkent, W. Lowie
This study assesses how a cochlear implant (CI) simulation influences the interpretation of prosodically marked linguistic focus in a non-native language. In an online experiment, two groups of normal-hearing native Dutch learners of English, differing in age (12–14-year-old adolescents vs. adults aged 18+) and in English proficiency (A2 vs. B2/C1), were asked to listen to CI-simulated and non-CI-simulated English sentences differing in prosodically marked focus and to indicate which of four possible context questions the speaker was answering. Results show that, as expected, focus interpretation is significantly less accurate in the CI-simulated condition than in the non-CI-simulated condition, and that more proficient non-native listeners outperform less proficient ones. However, there was no interaction between the influence of the spectro-temporal degradation of the CI-simulated speech signal and that of the listeners’ English proficiency level, suggesting that less proficient non-native listeners are not more strongly affected by the spectro-temporal degradation of an electric speech signal than more proficient non-native listeners.
Title: Interpretation of prosodically marked focus in cochlear implant-simulated speech by non-native listeners (Speech Prosody 2022)
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-111
Yaqian Huang
Period-doubled phonation is a type of creaky voice that contains two alternating periods. By presenting data from Mandarin Chinese read speech recordings, this study probes the articulatory properties of period-doubled phonation and its tonal distribution based on time-domain measures using electroglottography (EGG). Period doubling (PD) was found across all the tones (T3: 43% > T2 > T4 > T1: 11%), which was more prevalent than vocal fry, found mainly in T3 (48%) and T2 (43%), and only sporadically in T4 (7%) and T1 (2%). We calculated the two alternating glottal periods in PD, and they exhibited a ratio close to 3:2 or 2:1. The two pulses also alternated between higher and lower amplitudes with a mean ratio approximating 2 or 1.6. Women tended to produce more PD than men. Moreover, the contact quotient of PD, measured via EGG using the hybrid method, was around 0.5, similar to modal voice (0.54) and smaller than that of vocal fry (0.74), implying a more balanced opening and contact phase during phonation. Alternation of contact quotient and symmetry quotient was also seen in a few samples, suggesting that PD is likely articulated through two alternating pulses with distinct voice qualities and pitches.
Title: Articulatory properties of period-doubled voice in Mandarin (Speech Prosody 2022)
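The reported period and amplitude ratios come from comparing the alternating long and short glottal cycles. The numbers below are made up to match the abstract's approximate 3:2 period ratio and amplitude ratio near 2; this is an arithmetic illustration, not the study's EGG data.

```python
# Alternating glottal periods (ms) and pulse amplitudes in a
# period-doubled stretch: long/short cycles interleave.
periods_ms = [9.0, 6.0, 9.2, 6.1, 8.8, 5.9]
amplitudes = [1.0, 0.5, 0.98, 0.52, 1.02, 0.49]

long_periods = periods_ms[0::2]    # every other cycle, starting long
short_periods = periods_ms[1::2]
period_ratio = sum(long_periods) / sum(short_periods)   # ~1.5, i.e. 3:2

amp_ratio = sum(amplitudes[0::2]) / sum(amplitudes[1::2])  # ~2

print(round(period_ratio, 2), round(amp_ratio, 2))
```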
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-96
L. Plug, R. Lennon, Rachel Smith
We report on an experiment designed to test the hypothesis that listeners orient to canonical forms when judging the tempo of reduced speech. Orientation to canonical forms should yield higher tempo estimates than orientation to surface phone strings when canonical phones are deleted. We tested the hypothesis for English, capitalizing on the fact that the non-realization of schwa in an unstressed syllable (e.g. ‘support’) may result in a surface phone string associated with a different word than the intended one (‘sport’). We presented listeners with sentences containing ambiguous surface realizations, along with orthographic representations which convinced some that they were listening to disyllabic words (‘support’ etc.) and others that they were listening to monosyllabic ones (‘sport’ etc.). Asking listeners to judge the tempo of the sentences allowed us to assess whether the difference in imposed lexical interpretation had an impact on perceived tempo. Our results reveal the predicted effect of the imposed interpretation: sentences with a ‘disyllabic’ interpretation of the ambiguous word form were judged faster than (the same) sentences with a ‘monosyllabic’ interpretation.
Title: Schwa deletion and perceived tempo in English (Speech Prosody 2022)
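The hypothesis reduces to simple arithmetic: the same acoustic stretch yields a higher syllable-rate estimate under the canonical (disyllabic, ‘support’) reading than under the surface (monosyllabic, ‘sport’) reading. The durations and counts below are invented for illustration.

```python
# One sentence token, heard under two imposed interpretations
sentence_dur_s = 1.8
syllables_surface = 5    # 'sport' counted as one syllable
syllables_canonical = 6  # deleted schwa restored: 'support'

rate_surface = syllables_surface / sentence_dur_s
rate_canonical = syllables_canonical / sentence_dur_s

# Orientation to the canonical form predicts the higher tempo estimate
assert rate_canonical > rate_surface
print(round(rate_surface, 2), round(rate_canonical, 2))  # 2.78 3.33
```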
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-27
Laurence White, H. Grimes
Prosodic features anecdotally associated with the speech of people with clinical depression include slower rate, lower pitch range and reduced loudness, but there is a significant degree of contradiction in the literature regarding depressed prosody. This complex picture reflects the heterogeneity of depression aetiology, symptomatology and prognosis. It is also likely to be influenced by elicitation methods, in particular, whether natural dialogue contexts are employed and whether the interlocutor’s prosody is also considered. We analysed 40 patient-therapist dialogues from the first and last of 29 weekly sessions of a behavioural therapy for refractory depression, sampling early and late in both sessions. Across all dialogues, we found that therapists spoke faster than patients, as expected, but in female-female therapist-patient dialogues (the majority of our sample), patients’ articulation rate increased substantially over the first session. Moreover, and contrary to expectations, there was a positive correlation between articulation rate and assessed depression severity (PHQ-9 scale) in the final therapy session, also evident in therapists’ speech for female-female dialogues. We suggest that this may reflect features of anxiety in speakers with ongoing depression and possibly also personality characteristics. We also consider evidence for prosodic convergence between patients and therapists.
Title: Articulation rate in psychotherapeutic dialogues for depression: patients and therapists (Speech Prosody 2022)
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-16
Kexin Du, S. Avrutin, Aoju Chen
Past research on the role of prosody in reference has been primarily concerned with how adults and children use prosodic cues to signal a change in accessibility from givenness to newness of the same noun phrase. This study explores the role of prosody in the referential dependencies between an antecedent noun phrase and the reflexive anaphor ‘zi-ji’ (‘oneself’) in Mandarin-speaking adults and children. In sentences like “Boris dreamed that Miffy painted ‘zi-ji’”, ‘zi-ji’ can establish two types of anaphor-antecedent dependencies: (1) a local dependency, where ‘zi-ji’ refers to Miffy, and (2) a non-local dependency, where ‘zi-ji’ refers to Boris. Such sentences were elicited in both interpretations from Mandarin-speaking adults and 6- to 10-year-olds in a picture-matching game. Duration analysis of ‘zi-ji’ shows that adults produced ‘zi-ji’ with a longer duration in the non-local dependency condition than in the local dependency condition. This result can be explained by the economy hierarchy model, whereby the local antecedent, made more accessible by the locality constraint, is preferred, thus necessitating more prosodic prominence to mark the less accessible non-local antecedent. This pattern was not found in children’s production, suggesting prolonged acquisition of the use of prosody to build anaphor-antecedent dependencies for ‘zi-ji’.
Title: Building bridges: The role of prosody in Mandarin-speaking adults' and children's anaphora resolution (Speech Prosody 2022)
Pub Date: 2022-05-23 | DOI: 10.21437/speechprosody.2022-114
Bogdan Ludusan, P. Wagner
Laughter is one of the most widely encountered paralinguistic phenomena in human interaction, and it has been studied from different perspectives over the years, including its acoustic-prosodic realization. However, previous studies have mostly focused on fundamental frequency and duration measures, with other prosodic features less studied. We examine here the acoustic marking of laughter in terms of intensity and voice quality characteristics, using a corpus of spontaneous conversations. We operationalized the two cues by means of root-mean-square energy and cepstral peak prominence, respectively. Examining laughs, speech-laughs, and speech instances at two different levels (the entire event and the syllable nucleus), we observed the least regular phonation for laughs and the most regular for speech, while intensity was highest for speech-laughs, followed by laughter, and lowest for speech. Using mixed-effects models, we determined that all three vocalization classes differ significantly from one another in terms of both acoustic cues. Moreover, an interesting effect of syllable position was seen for laughter, with phonation becoming more regular for later syllables.
Title: ha-HA-hha? Intensity and voice quality characteristics of laughter (Speech Prosody 2022)
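The two cues named in this abstract can be sketched as follows. Root-mean-square energy is standard; the cepstral peak prominence (CPP) function below is a simplified approximation (established implementations, e.g. Praat's CPPS, differ in windowing, smoothing, and trend-fitting details), and the test signals are synthetic stand-ins, not corpus data.

```python
import numpy as np

def rms_energy(x):
    """Root-mean-square energy of a signal frame."""
    return np.sqrt(np.mean(np.square(x)))

def cpp(x, sr, f0_range=(60.0, 330.0)):
    """Simplified cepstral peak prominence: height of the cepstral peak
    in the expected pitch-period range above a linear trend line."""
    n = len(x)
    spec = np.abs(np.fft.rfft(x * np.hanning(n)))
    log_spec = np.log(spec + 1e-12)
    ceps = np.fft.irfft(log_spec, n=n)      # real cepstrum
    q_lo = int(sr / f0_range[1])            # shortest pitch period (samples)
    q_hi = int(sr / f0_range[0])            # longest pitch period (samples)
    qs = np.arange(q_lo, q_hi)
    window = ceps[q_lo:q_hi]
    slope, intercept = np.polyfit(qs, window, 1)  # linear trend
    peak = int(np.argmax(window))
    return window[peak] - (slope * qs[peak] + intercept)

sr = 16000
# Glottal-pulse-like train at ~120 Hz (regular phonation) vs white noise
voiced = np.zeros(sr)
voiced[::133] = 1.0                          # pitch period of 133 samples
noise = np.random.default_rng(1).normal(size=sr) * 0.1

# Regular phonation shows a prominent cepstral peak; noise does not,
# which is why higher CPP indicates more regular phonation.
print(rms_energy(voiced) > 0, cpp(voiced, sr) > cpp(noise, sr))
```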