Speech Prosody 2022最新文献

英文中文

A preliminary analysis on children’s phonation contrast in Kunshan Wu Chinese tones 昆山吴语儿童语音对比初步分析

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-85

Wenwei Xu, Chunyu Ge, Wentao Gu, P. Mok

Previous studies have established that phonation contrasts can be, apart from pitch, an important dimension of tonal contrasts in some languages, and modern Wu Chinese is a good example in which the lower register tones are produced with breathier phonation than the upper register tones. Nevertheless, researchers have shown that such phonation contrast is declining among young speakers in Shanghai and Suzhou Wu. This pilot study is thus motivated to investigate children’s production in Kunshan Wu, a neighboring yet rather understudied dialect with more tones, in order to see if a similar trend is ongoing. Two male and two female school-age children (8;4 to 10;4) were recorded reading isolated monosyllabic words with different lexical tones, and simultaneous acoustic and electroglottographic (EGG) data were collected. Results of EGG and acoustic parameters demonstrate that at least near the onset of the vowel, glottal constriction is smaller and glottal closure is less abrupt in the lower register tones than in the upper register tones, suggesting that the lower register tones are generally produced with breathier phonation. Therefore, school-age child speakers of Kunshan Wu are still able to produce the phonation contrast between the tone registers.

以往的研究已经证实，在某些语言中，发声对比除了音高之外，也是声调对比的一个重要维度，现代吴语就是一个很好的例子，其低音域的发声比上音域的发声更呼吸。然而，研究人员发现，在上海和苏州的年轻人中，这种发音对比正在下降。因此，这项试点研究的动机是调查昆山吴语的儿童生产，昆山吴语是一种邻近但研究较少的方言，声调更多，以了解是否有类似的趋势正在进行。记录两名男、两名女学龄儿童(8岁、4岁至10岁)阅读不同声调的单音节孤立单词，同时收集声学和声门电图(EGG)数据。EGG和声学参数的结果表明，至少在元音开始附近，与上声区音相比，下声区音的声门收缩较小，声门关闭不那么突然，这表明下声区音通常是用呼吸发声的。因此，昆山吴语学龄儿童说话者仍然能够产生音域之间的语音对比。

{"title":"A preliminary analysis on children’s phonation contrast in Kunshan Wu Chinese tones","authors":"Wenwei Xu, Chunyu Ge, Wentao Gu, P. Mok","doi":"10.21437/speechprosody.2022-85","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-85","url":null,"abstract":"Previous studies have established that phonation contrasts can be, apart from pitch, an important dimension of tonal contrasts in some languages, and modern Wu Chinese is a good example in which the lower register tones are produced with breathier phonation than the upper register tones. Nevertheless, researchers have shown that such phonation contrast is declining among young speakers in Shanghai and Suzhou Wu. This pilot study is thus motivated to investigate children’s production in Kunshan Wu, a neighboring yet rather understudied dialect with more tones, in order to see if a similar trend is ongoing. Two male and two female school-age children (8;4 to 10;4) were recorded reading isolated monosyllabic words with different lexical tones, and simultaneous acoustic and electroglottographic (EGG) data were collected. Results of EGG and acoustic parameters demonstrate that at least near the onset of the vowel, glottal constriction is smaller and glottal closure is less abrupt in the lower register tones than in the upper register tones, suggesting that the lower register tones are generally produced with breathier phonation. Therefore, school-age child speakers of Kunshan Wu are still able to produce the phonation contrast between the tone registers.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121591501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prosodic and lexical entrainment in adults with and without schizophrenia 有和没有精神分裂症的成人的韵律和词汇的干扰

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-26

J. Kruyt, S. Benus, C. Faget, C. Lançon, M. Champagne-Lavau

Entrainment refers to the tendency people have to speak more similarly during a conversation. Although entrainment has been observed frequently, the underlying mechanisms of the phe-nomenon are debated. A speciﬁc point of disagreement is the role of social or higher-order cognitive factors in entrainment. The present study aimed to explore prosodic and lexical entrainment in small groups of individuals with schizophrenia, a dis-order that has been associated with theory of mind impairments and social difﬁculties, and a control group without schizophrenia. All participants completed a referential communication task with an experimenter. To determine prosodic entrainment, the measures proposed by Levitan and Hirshberg [1] were used. Results seem to suggest that the effect of task role on prosodic entrainment was larger than any possible effects of group, suggesting that social factors affect prosodic entrainment behaviour more than individual differences in cognition or other factors. Conversely, lexical entrainment was not affected by task role or group. Importantly, no clear patterns in entrainment on different dimensions, levels, or features could be observed, highlighting the complex and multifaceted nature of entrainment.

“娱乐化”指的是人们在谈话中说话更加相似的趋势。虽然经常观察到夹带现象，但这一现象的潜在机制仍存在争议。一个具体的分歧点是社会或高阶认知因素在娱乐中的作用。本研究旨在探索精神分裂症患者(一种与心理障碍理论和社交困难相关的疾病)和非精神分裂症对照组的韵律和词汇参与情况。所有的参与者都完成了一项与实验者交流的任务。为了确定韵律蕴意，我们采用Levitan和Hirshberg[1]提出的测量方法。结果似乎表明，任务角色对韵律夹带行为的影响大于任何可能的群体影响，这表明社会因素对韵律夹带行为的影响大于个体认知差异或其他因素的影响。相反，词汇吸收不受任务角色或群体的影响。重要的是，在不同的维度、层次或特征上，没有观察到明显的夹带模式，突出了夹带的复杂性和多面性。

{"title":"Prosodic and lexical entrainment in adults with and without schizophrenia","authors":"J. Kruyt, S. Benus, C. Faget, C. Lançon, M. Champagne-Lavau","doi":"10.21437/speechprosody.2022-26","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-26","url":null,"abstract":"Entrainment refers to the tendency people have to speak more similarly during a conversation. Although entrainment has been observed frequently, the underlying mechanisms of the phe-nomenon are debated. A speciﬁc point of disagreement is the role of social or higher-order cognitive factors in entrainment. The present study aimed to explore prosodic and lexical entrainment in small groups of individuals with schizophrenia, a dis-order that has been associated with theory of mind impairments and social difﬁculties, and a control group without schizophrenia. All participants completed a referential communication task with an experimenter. To determine prosodic entrainment, the measures proposed by Levitan and Hirshberg [1] were used. Results seem to suggest that the effect of task role on prosodic entrainment was larger than any possible effects of group, suggesting that social factors affect prosodic entrainment behaviour more than individual differences in cognition or other factors. Conversely, lexical entrainment was not affected by task role or group. Importantly, no clear patterns in entrainment on different dimensions, levels, or features could be observed, highlighting the complex and multifaceted nature of entrainment.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121455741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An EEG-study on L2 categorization of emotional prosody in German 德语情绪韵律二语分类的脑电图研究

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-128

Hua Wei, Yifei He, C. Kauschke, Mathias Scharinger, Ulrike Domahs

Previous behavioral studies on the processing of emotional prosody in L2 learners showed similarities and differences between L1- and L2-processing and suggested that emotional perception has both universal and culture-specific aspects. However, little is known about the processing of emotional prosody in L2 learners' brains. Therefore, the present study used event-related potentials to compare the processing of emotional prosodies between German native speakers and Chinese L2 learners of German. Participants performed a prosody recognition task with semantically neutral German words recorded with emotional "neutral" , "like" , and "disgust" prosodies. The accuracy ratings of categorizing emotional prosodies of L2 learners were above chance but significantly better for the L1 speakers. Both groups yielded an early and a late positivity for processing "like" in comparison to "disgust" , reflecting the emotional prosodic predictive processing. However, an early left anterior negativity (ELAN) and a late anterior negativity observed in the L2 learners suggest that they are more sensitive to acoustic differences of the presented stimuli. Overall, our findings support the assumption that the processing of emotional prosody is in principle universal across languages, but that in addition to the general mechanisms involved in the processing of emotional speech language-specific aspects also modify emotional processing.

以往关于二语学习者情绪韵律加工的行为研究表明，二语学习者的情绪知觉具有共性和文化特异性。然而，人们对二语学习者大脑中情绪韵律的加工过程知之甚少。因此，本研究采用事件相关电位对德语母语者和汉语第二语言学习者的情绪韵律加工进行了比较。参与者完成了一项韵律识别任务，他们用语义中性的德语单词记录了情感上的“中性”、“喜欢”和“厌恶”韵律。第二语言学习者对情绪韵律的分类准确率高于随机，而第一语言学习者对情绪韵律的分类准确率明显高于随机。与“厌恶”相比，两组对“喜欢”的处理都产生了早期和晚期的积极反应，反映了情绪韵律预测处理。然而，在二语学习者中观察到的早期左前负性(ELAN)和晚期前负性表明他们对所呈现刺激的声音差异更敏感。总的来说，我们的研究结果支持这样的假设，即情绪韵律的处理原则上是跨语言通用的，但除了涉及情绪言语处理的一般机制外，语言特定方面也会改变情绪处理。

{"title":"An EEG-study on L2 categorization of emotional prosody in German","authors":"Hua Wei, Yifei He, C. Kauschke, Mathias Scharinger, Ulrike Domahs","doi":"10.21437/speechprosody.2022-128","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-128","url":null,"abstract":"Previous behavioral studies on the processing of emotional prosody in L2 learners showed similarities and differences between L1- and L2-processing and suggested that emotional perception has both universal and culture-specific aspects. However, little is known about the processing of emotional prosody in L2 learners' brains. Therefore, the present study used event-related potentials to compare the processing of emotional prosodies between German native speakers and Chinese L2 learners of German. Participants performed a prosody recognition task with semantically neutral German words recorded with emotional \"neutral\" , \"like\" , and \"disgust\" prosodies. The accuracy ratings of categorizing emotional prosodies of L2 learners were above chance but significantly better for the L1 speakers. Both groups yielded an early and a late positivity for processing \"like\" in comparison to \"disgust\" , reflecting the emotional prosodic predictive processing. However, an early left anterior negativity (ELAN) and a late anterior negativity observed in the L2 learners suggest that they are more sensitive to acoustic differences of the presented stimuli. Overall, our findings support the assumption that the processing of emotional prosody is in principle universal across languages, but that in addition to the general mechanisms involved in the processing of emotional speech language-specific aspects also modify emotional processing.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115926113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Rising declaratives in Veneto dialects 威尼托方言中的上升陈述句

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-36

G. Magistro, Claudia Crocco

引用次数: 0

Intonation in advice-giving in Kenyan English and Kiswahili 肯尼亚英语和斯瓦希里语建议中的语调

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-31

B. Otundo, M. Grice

We examine salient prosodic features used in advice-giving in Kenyan English and Kiswahili from a radio phone-in programme. Our pilot corpus constitutes 40 sequences taken from The Breakfast Show , a Kenyan radio phone-in aired on Classic 105 fm. Although the programme is moderated in English, advice is given in both English and Kiswahili, since Kenya is highly multilingual with frequent code-switching. In this paper, we focus on the pragmatic strategies of expressing advice involving forms that furnish the recipient with little optionality in carrying out the suggested action, including, imperatives, declaratives with modal verbs, and conditional forms. In both languages, we observe a terminal falling intonation in advice-giving. However, whilst the global pitch contours in Kenyan English follow a marked downtrend for expressing advice in imperative, declarative and conditional forms, interpreted as a downstepping sequence of H* accents, those in Kiswahili have alternating rises and falls, suggesting a more elaborate intonational phonology. In instances of code-switching, imperative forms of advice generally reveal alternating rises and falls. This pattern is also found in declarative and conditional forms, although with a greater pitch range. These preliminary findings are useful in applications such as identification of language and variety, especially in multilingual interactions.

我们研究了肯尼亚英语和斯瓦希里语在电台电话节目中提出建议时所使用的显著韵律特征。我们的试点语料库由40个片段组成，取自《早餐秀》，这是一档在Classic 105调频播出的肯尼亚电台电话节目。虽然该方案以英语主持，但建议是用英语和斯瓦希里语提供的，因为肯尼亚是高度多语言的国家，经常进行代码转换。在本文中，我们关注的是表达建议的语用策略，这些建议包括祈使句、情态动词陈述句和条件句，这些形式使接受者在执行建议的行动时几乎没有选择余地。在这两种语言中，我们都可以观察到在给出建议时语调的最终降调。然而，肯尼亚英语的整体音高轮廓在祈使、陈述句和条件句中表达建议时呈明显的下降趋势，被解释为H*口音的降调序列，而斯瓦希里语的音高轮廓则有上升和下降的交替，这表明了一种更复杂的语调音系。在代码转换的情况下，祈使句形式的建议通常显示交替的上升和下降。这种模式也出现在陈述句和条件句中，尽管它们的音高范围更大。这些初步发现在语言和多样性的识别等应用中非常有用，特别是在多语言互动中。

{"title":"Intonation in advice-giving in Kenyan English and Kiswahili","authors":"B. Otundo, M. Grice","doi":"10.21437/speechprosody.2022-31","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-31","url":null,"abstract":"We examine salient prosodic features used in advice-giving in Kenyan English and Kiswahili from a radio phone-in programme. Our pilot corpus constitutes 40 sequences taken from The Breakfast Show , a Kenyan radio phone-in aired on Classic 105 fm. Although the programme is moderated in English, advice is given in both English and Kiswahili, since Kenya is highly multilingual with frequent code-switching. In this paper, we focus on the pragmatic strategies of expressing advice involving forms that furnish the recipient with little optionality in carrying out the suggested action, including, imperatives, declaratives with modal verbs, and conditional forms. In both languages, we observe a terminal falling intonation in advice-giving. However, whilst the global pitch contours in Kenyan English follow a marked downtrend for expressing advice in imperative, declarative and conditional forms, interpreted as a downstepping sequence of H* accents, those in Kiswahili have alternating rises and falls, suggesting a more elaborate intonational phonology. In instances of code-switching, imperative forms of advice generally reveal alternating rises and falls. This pattern is also found in declarative and conditional forms, although with a greater pitch range. These preliminary findings are useful in applications such as identification of language and variety, especially in multilingual interactions.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123240362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

The role of audio-visual phrasal prosody in bootstrapping the acquisition of word order 视听短语韵律在引导语序习得中的作用

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-47

Irene De la Cruz-Pavía

From early in development infants integrate auditory and visual facial information while processing language. The potential role of visual cues in the acquisition of grammar remains however virtually unexplored. Phrasal prosodic prominence correlates systematically with basic word order in natural languages. Co-verbal gestures—head and eyebrow motion—act in turn as markers of auditory prosody. Here, we examine whether co-verbal gestures could help infants parse the input into prosodic units such as phrases, and discover the basic word order of the native language. In a first study we show that adult talkers spontaneously produce co-verbal gestures signaling phrase boundaries across languages and speech styles: Japanese and English, adult- and infant-directed speech. A second study shows that adult speakers use co-verbal information, specifically head nods marking phrasal prosodic prominence, to parse an artificial language into phrase-like units that follow the native language’s word order. Finally, a third study shows that the presence of co-verbal gestures—i.e. head nods—also impacts 8-month-old infants’ segmentation preferences of a structurally ambiguous artificial language. However, infants’ ability to use this cue is still limited, suggesting that co-verbal gestures might be acquired later in development than visual speech, presumably due to their greater inter-/intra-speaker variability.

从发育早期开始，婴儿在处理语言时就整合了听觉和视觉面部信息。然而，视觉线索在语法习得中的潜在作用实际上尚未被探索。自然语言中的短语韵律突出与基本语序有系统的联系。共同语言的手势——头部和眉毛的动作——依次作为听觉韵律的标志。在这里，我们研究了共语手势是否可以帮助婴儿将输入解析成韵律单位，如短语，并发现母语的基本词序。在第一项研究中，我们发现，成年人说话时会自发地做出共同语言手势，表明不同语言和说话风格之间的短语界限:日语和英语，成人和婴儿指向语。另一项研究表明，成年说话者使用共同语言信息，特别是标记短语韵律突出的头部点头，将人工语言解析成遵循母语词序的短语单元。最后，第三项研究表明，共同语言手势的存在-即。头部点头也会影响8个月大的婴儿对结构模糊的人工语言的分割偏好。然而，婴儿使用这种线索的能力仍然有限，这表明共同语言手势可能比视觉语言在发育的后期获得，可能是由于他们更大的说话人之间/说话人内部的可变性。

{"title":"The role of audio-visual phrasal prosody in bootstrapping the acquisition of word order","authors":"Irene De la Cruz-Pavía","doi":"10.21437/speechprosody.2022-47","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-47","url":null,"abstract":"From early in development infants integrate auditory and visual facial information while processing language. The potential role of visual cues in the acquisition of grammar remains however virtually unexplored. Phrasal prosodic prominence correlates systematically with basic word order in natural languages. Co-verbal gestures—head and eyebrow motion—act in turn as markers of auditory prosody. Here, we examine whether co-verbal gestures could help infants parse the input into prosodic units such as phrases, and discover the basic word order of the native language. In a first study we show that adult talkers spontaneously produce co-verbal gestures signaling phrase boundaries across languages and speech styles: Japanese and English, adult- and infant-directed speech. A second study shows that adult speakers use co-verbal information, specifically head nods marking phrasal prosodic prominence, to parse an artificial language into phrase-like units that follow the native language’s word order. Finally, a third study shows that the presence of co-verbal gestures—i.e. head nods—also impacts 8-month-old infants’ segmentation preferences of a structurally ambiguous artificial language. However, infants’ ability to use this cue is still limited, suggesting that co-verbal gestures might be acquired later in development than visual speech, presumably due to their greater inter-/intra-speaker variability.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124317281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

New evidence for melodic speech in Autism Spectrum Disorder 自闭症谱系障碍中旋律语言的新证据

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-8

Simon Wehrle, F. Cangemi, K. Vogeley, M. Grice

Since the very beginnings of research into Autism Spectrum Disorder (ASD), there have been contradicting descriptions of speech in ASD as being “singsongy” or melodic on the one hand and “robotic” or monotonous on the other. We highlight some issues regarding the terminology and methodologies used in previous studies as well as their comparability, concluding that previous accounts, particularly of monotonous speech in ASD, may have been misleading. We expand on a previous pilot study in using the same method of quantifying the spaciousness and liveliness of speech along two dimensions in order to analyse an extended data set (~ 5 hours) of semi-spontaneous conversations. We compare 14 German adults diagnosed with ASD and 14 matched control speakers (CTR), recorded in disposition-matched dyads (ASD-ASD; CTR-CTR). Using Bayesian modelling, we present evidence that most (but not all) ASD speakers in our corpus produced a more melodic intonation style than non-autistic CTR speakers, while, crucially, none produced a more monotonous intonation style. We emphasise the importance of inter-individual variability in groups of autistic speakers and point out that our results align with a clear tendency in recent studies to report more melodic speech in ASD.

自从对自闭症谱系障碍(ASD)的研究开始以来，对ASD的语言的描述就一直存在矛盾，一方面是“歌唱”或旋律优美，另一方面是“机器人”或单调。我们强调了以前研究中使用的术语和方法的一些问题，以及它们的可比性，结论是以前的描述，特别是ASD中单调的语言，可能具有误导性。为了分析半自发对话的扩展数据集(约5小时)，我们扩展了先前的试点研究，使用相同的方法沿着两个维度量化语音的空间和活泼度。我们比较了14名诊断为ASD的德国成年人和14名匹配的对照说话者(CTR)，他们记录在性格匹配的双染色体组(ASD-ASD;CTR-CTR)。使用贝叶斯模型，我们提供的证据表明，在我们的语料库中，大多数(但不是全部)ASD说话者比非自闭症CTR说话者产生了更有旋律的语调风格，而至关重要的是，没有人产生更单调的语调风格。我们强调自闭症说话者群体中个体间差异的重要性，并指出我们的结果与最近研究中报道的ASD中旋律性语言的明显趋势一致。

{"title":"New evidence for melodic speech in Autism Spectrum Disorder","authors":"Simon Wehrle, F. Cangemi, K. Vogeley, M. Grice","doi":"10.21437/speechprosody.2022-8","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-8","url":null,"abstract":"Since the very beginnings of research into Autism Spectrum Disorder (ASD), there have been contradicting descriptions of speech in ASD as being “singsongy” or melodic on the one hand and “robotic” or monotonous on the other. We highlight some issues regarding the terminology and methodologies used in previous studies as well as their comparability, concluding that previous accounts, particularly of monotonous speech in ASD, may have been misleading. We expand on a previous pilot study in using the same method of quantifying the spaciousness and liveliness of speech along two dimensions in order to analyse an extended data set (~ 5 hours) of semi-spontaneous conversations. We compare 14 German adults diagnosed with ASD and 14 matched control speakers (CTR), recorded in disposition-matched dyads (ASD-ASD; CTR-CTR). Using Bayesian modelling, we present evidence that most (but not all) ASD speakers in our corpus produced a more melodic intonation style than non-autistic CTR speakers, while, crucially, none produced a more monotonous intonation style. We emphasise the importance of inter-individual variability in groups of autistic speakers and point out that our results align with a clear tendency in recent studies to report more melodic speech in ASD.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114557457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Does prosody influence segments differently in Cantonese and Mandarin? A case study of the open vowel /a/ 粤语和普通话中韵律对音段的影响不同吗?一个关于开元音/ A /的案例研究

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-137

Yike Yang, Si Chen

The interaction between segment and prosody has been receiving increasing attention. While speakers of European languages are found to hyper-articulate their speech to maintain the distinction between the focused and unfocused portions, little is known about focus effects on vowels in Chinese languages. This study investigated the potential interaction between prosodic focus and vowels and tested whether the effects of focus function differently in Cantonese and Mandarin, two closely related Chinese languages. In a focus production experiment, the target vowels were analysed on the duration, formants and distances. The results showed that prosodic focus influenced the open vowel /a/ differently in Cantonese and Mandarin. Although focus increased the vowel duration in both languages, the on-focus vowels were lengthened to a greater extent in Cantonese. The effect of focus was minimal on the vowel formants, especially in Cantonese. For the Euclidean distances between the vowels under broad focus and those under the remaining focus types, no difference was found, but Cantonese and Mandarin diverged in the directions in which each focus type moved away from broad focus. These results suggest that, while speakers of both languages hyper-articulate on-focus vowels, there are more differences than similarities between the two languages.

语段与韵律之间的相互作用越来越受到人们的关注。人们发现，说欧洲语言的人会用超清晰的发音来区分焦点部分和非焦点部分，但人们对汉语中焦点对元音的影响知之甚少。本研究探讨了韵律焦点和元音之间潜在的相互作用，并测试了焦点在粤语和普通话这两种密切相关的汉语语言中的作用是否不同。在焦点产生实验中，对目标元音的持续时间、共振峰和距离进行了分析。结果表明，粤语和普通话的韵律焦点对开元音/a/的影响存在差异。虽然在两种语言中，焦点都增加了元音的持续时间，但在广东话中，非焦点元音的延长程度更大。焦点对元音共振峰的影响很小，尤其是在广东话中。广焦点下的元音与其余焦点类型下的元音之间的欧氏距离没有差异，但粤语和普通话在各焦点类型远离广焦点的方向上存在分歧。这些结果表明，虽然两种语言的使用者都非常清楚地表达出焦点元音，但两种语言之间的差异多于相似之处。

{"title":"Does prosody influence segments differently in Cantonese and Mandarin? A case study of the open vowel /a/","authors":"Yike Yang, Si Chen","doi":"10.21437/speechprosody.2022-137","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-137","url":null,"abstract":"The interaction between segment and prosody has been receiving increasing attention. While speakers of European languages are found to hyper-articulate their speech to maintain the distinction between the focused and unfocused portions, little is known about focus effects on vowels in Chinese languages. This study investigated the potential interaction between prosodic focus and vowels and tested whether the effects of focus function differently in Cantonese and Mandarin, two closely related Chinese languages. In a focus production experiment, the target vowels were analysed on the duration, formants and distances. The results showed that prosodic focus influenced the open vowel /a/ differently in Cantonese and Mandarin. Although focus increased the vowel duration in both languages, the on-focus vowels were lengthened to a greater extent in Cantonese. The effect of focus was minimal on the vowel formants, especially in Cantonese. For the Euclidean distances between the vowels under broad focus and those under the remaining focus types, no difference was found, but Cantonese and Mandarin diverged in the directions in which each focus type moved away from broad focus. These results suggest that, while speakers of both languages hyper-articulate on-focus vowels, there are more differences than similarities between the two languages.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122370500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Building a Persian-English OMProDat Database Read by Persian Speakers 建立波斯语使用者可阅读的波斯语-英语OMProDat数据库

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-90

Mortaza Taheri-Ardali, D. Hirst

OMProDat is an open multilingual prosodic database, which aims to collect, archive and distribute recordings and annotations of directly comparable data from different languages. As part of the OMProDat project, this paper focuses on the creation of a bilingual Persian-English prosodic database read by native speakers of Persian. This collection contains 40 continuous, thematically connected paragraphs, each of five sentences, originally created during the European SAM project. Our collection was recorded by 5 male and 5 female speakers of standard Persian, all from monolingual families. The Persian texts were romanised and transcribed phonetically using the ASCII phonetic alphabet SAMPA. The database includes TextGrid annotations, which will be obtained semi-automatically from the sound and the orthographic transcription using the SPPAS alignment software. The Momel and INSINT algorithms will be used to provide prosodic annotation of the corpus. This considerable amount of data will allow us to compare the production of Persian and English as L1 and L2, respectively. In addition, a cross-linguistic comparison with other languages in OMProDat is easily feasible.

OMProDat是一个开放的多语言韵律数据库，旨在收集、存档和分发来自不同语言的直接可比数据的记录和注释。作为OMProDat项目的一部分，本文着重于创建一个波斯语-英语双语韵律数据库，供母语为波斯语的人阅读。这个集合包含40个连续的、主题相连的段落，每个段落有5个句子，最初是在欧洲SAM项目期间创建的。我们的收集是由5名说标准波斯语的男性和5名女性记录的，他们都来自单语家庭。波斯语文本被罗马化，并使用ASCII音标字母SAMPA进行语音转录。数据库包括TextGrid注释，这些注释将使用SPPAS对齐软件从声音和正字法转录中半自动获得。将使用Momel和INSINT算法对语料库进行韵律标注。这些相当多的数据将使我们能够将波斯语和英语分别作为第一语言和第二语言进行比较。此外，与OMProDat中的其他语言进行跨语言比较也很容易实现。

{"title":"Building a Persian-English OMProDat Database Read by Persian Speakers","authors":"Mortaza Taheri-Ardali, D. Hirst","doi":"10.21437/speechprosody.2022-90","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-90","url":null,"abstract":"OMProDat is an open multilingual prosodic database, which aims to collect, archive and distribute recordings and annotations of directly comparable data from different languages. As part of the OMProDat project, this paper focuses on the creation of a bilingual Persian-English prosodic database read by native speakers of Persian. This collection contains 40 continuous, thematically connected paragraphs, each of five sentences, originally created during the European SAM project. Our collection was recorded by 5 male and 5 female speakers of standard Persian, all from monolingual families. The Persian texts were romanised and transcribed phonetically using the ASCII phonetic alphabet SAMPA. The database includes TextGrid annotations, which will be obtained semi-automatically from the sound and the orthographic transcription using the SPPAS alignment software. The Momel and INSINT algorithms will be used to provide prosodic annotation of the corpus. This considerable amount of data will allow us to compare the production of Persian and English as L1 and L2, respectively. In addition, a cross-linguistic comparison with other languages in OMProDat is easily feasible.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129915457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Developing and validating a rating scale of speaking prosody ability for learners of Chinese as a second language 开发并验证了汉语作为第二语言学习者口语韵律能力评定量表

Speech Prosody 2022

Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-166

Sichang Gao, Mingwei Pan

This study aims to develop a rating scale for evaluating speech prosody of learners of Chinese as a second language (CSL). The researchers first gathered 41 descriptors that were perceived as crucial indicators of prosody ability through interviewing ten CSL teachers, analyzing existing Chinese speaking proficiency scales from five universities in Mainland China. After rating the perception of the selected descriptors by ninety-four CSL teachers and consulting with four expert-teachers, 15 out of 41 descriptors remained to form a rating scale. Principal component analysis revealed that 15 descriptors with three different dimensions (prosodic strategic competence, fluency, prosodic naturalness) could meaningfully describe CSL prosody. Finally, using the 15 descriptors, 29 samples of CSL learners’ speech were evaluated by four raters. A combination of the structural equation modeling and the Many-Facets Rasch modeling confirmed that all the 15 descriptors fit well with the construct of prosody ability measured, demonstrating a good validity of this rating scale.

本研究旨在建立一套评价汉语学习者语音韵律的量表。研究人员首先通过对10名对外汉语教师的访谈，分析了中国大陆5所大学现有的汉语口语水平量表，收集了41个被认为是韵律能力关键指标的描述词。在对94名汉语教学教师对所选描述词的感知进行评分并咨询了4名专家教师后，41个描述词中有15个被保留下来形成评分量表。主成分分析表明，韵律策略能力、韵律流畅性、韵律自然度三个维度的15个描述词都能有效地描述汉语韵律。最后，利用这15个描述词，对29个汉语学习者的语音样本进行了4位评分者的评价。结构方程模型和多面Rasch模型相结合，证实了15个描述符都与韵律能力测量的结构吻合良好，证明了该量表的有效性。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Speech Prosody 2022

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀