首页 > 最新文献

Speech Prosody 2022最新文献

英文 中文
Syllable rate and speech rhythm in multiethnolectal Zurich German: a comparison of speaking styles 多民族苏黎世德语的音节率和语音节奏:说话风格的比较
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-69
Marie-Anne Morand, M. Bruno, Sandra Schwab, Stephan Schmid
Multiethnolectal ways of speaking have been emerging for 30 years in culturally and linguistically diverse neighborhoods of European cities, including Zurich (Switzerland). Among the prosodic features of Germanic multiethnolects, a so-called ‘staccato’ rhythm has been mentioned in several studies. For instance, a comparison between two groups of adolescents (12 speakers each) showed that speakers of multiethnolectal Zurich German displayed slower syllable rates and less vowel duration variability than speakers of a rather traditional dialect. This study compares syllable rate and speech rhythm metrics ( nPVI-V, nPVI-C ) in spontaneous and read speech of 48 Zurich German adolescents. In a regression analysis, rhythmic measures were compared with the perception of how multiethnolectal the speakers sounded ( rating score ). The results showed that syllable rate and nPVI-V were related to rating score independently of speaking style (read, spontaneous speech): Speakers who were perceived as more multiethnolectal had a slower syllable rate and less vowel duration variability. Such findings were not observed for nPVI-C. These results suggest that syllable rate and speech rhythm (at least, vowel duration variability) are stable phonetic features of multiethnolectal Zurich German, since the relationship between these features and the perception of multiethnolectal speech was observed in both read and spontaneous speech.
30年来,在包括瑞士苏黎世在内的欧洲城市的文化和语言多样化的社区中,多民族的说话方式已经出现。在日耳曼多民族的韵律特征中,一种所谓的“断奏”节奏在一些研究中被提到。例如,对两组青少年(每组12人)的比较表明,说多民族苏黎世德语的人比说传统方言的人表现出更慢的音节速率和更少的元音持续时间变化。本研究比较了48名苏黎世德语青少年的自然语音和诵读语音的音节率和语音节奏指标(nPVI-V, nPVI-C)。在回归分析中,节奏测量与说话者听起来多民族的感觉(评分)进行了比较。结果表明,音节率和nPVI-V与评分分数的关系独立于说话风格(阅读,自发演讲):被认为是多民族的说话者有更慢的音节率和更少的元音持续时间变化。在nPVI-C中未观察到这些结果。这些结果表明,音节速率和语音节奏(至少,元音持续时间的变化)是多民族苏黎世德语的稳定语音特征,因为这些特征与多民族语音感知之间的关系在阅读和自发语音中都被观察到。
{"title":"Syllable rate and speech rhythm in multiethnolectal Zurich German: a comparison of speaking styles","authors":"Marie-Anne Morand, M. Bruno, Sandra Schwab, Stephan Schmid","doi":"10.21437/speechprosody.2022-69","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-69","url":null,"abstract":"Multiethnolectal ways of speaking have been emerging for 30 years in culturally and linguistically diverse neighborhoods of European cities, including Zurich (Switzerland). Among the prosodic features of Germanic multiethnolects, a so-called ‘staccato’ rhythm has been mentioned in several studies. For instance, a comparison between two groups of adolescents (12 speakers each) showed that speakers of multiethnolectal Zurich German displayed slower syllable rates and less vowel duration variability than speakers of a rather traditional dialect. This study compares syllable rate and speech rhythm metrics ( nPVI-V, nPVI-C ) in spontaneous and read speech of 48 Zurich German adolescents. In a regression analysis, rhythmic measures were compared with the perception of how multiethnolectal the speakers sounded ( rating score ). The results showed that syllable rate and nPVI-V were related to rating score independently of speaking style (read, spontaneous speech): Speakers who were perceived as more multiethnolectal had a slower syllable rate and less vowel duration variability. Such findings were not observed for nPVI-C. These results suggest that syllable rate and speech rhythm (at least, vowel duration variability) are stable phonetic features of multiethnolectal Zurich German, since the relationship between these features and the perception of multiethnolectal speech was observed in both read and spontaneous speech.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125939024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building a Persian-English OMProDat Database Read by Persian Speakers 建立波斯语使用者可阅读的波斯语-英语OMProDat数据库
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-90
Mortaza Taheri-Ardali, D. Hirst
OMProDat is an open multilingual prosodic database, which aims to collect, archive and distribute recordings and annotations of directly comparable data from different languages. As part of the OMProDat project, this paper focuses on the creation of a bilingual Persian-English prosodic database read by native speakers of Persian. This collection contains 40 continuous, thematically connected paragraphs, each of five sentences, originally created during the European SAM project. Our collection was recorded by 5 male and 5 female speakers of standard Persian, all from monolingual families. The Persian texts were romanised and transcribed phonetically using the ASCII phonetic alphabet SAMPA. The database includes TextGrid annotations, which will be obtained semi-automatically from the sound and the orthographic transcription using the SPPAS alignment software. The Momel and INSINT algorithms will be used to provide prosodic annotation of the corpus. This considerable amount of data will allow us to compare the production of Persian and English as L1 and L2, respectively. In addition, a cross-linguistic comparison with other languages in OMProDat is easily feasible.
OMProDat是一个开放的多语言韵律数据库,旨在收集、存档和分发来自不同语言的直接可比数据的记录和注释。作为OMProDat项目的一部分,本文着重于创建一个波斯语-英语双语韵律数据库,供母语为波斯语的人阅读。这个集合包含40个连续的、主题相连的段落,每个段落有5个句子,最初是在欧洲SAM项目期间创建的。我们的收集是由5名说标准波斯语的男性和5名女性记录的,他们都来自单语家庭。波斯语文本被罗马化,并使用ASCII音标字母SAMPA进行语音转录。数据库包括TextGrid注释,这些注释将使用SPPAS对齐软件从声音和正字法转录中半自动获得。将使用Momel和INSINT算法对语料库进行韵律标注。这些相当多的数据将使我们能够将波斯语和英语分别作为第一语言和第二语言进行比较。此外,与OMProDat中的其他语言进行跨语言比较也很容易实现。
{"title":"Building a Persian-English OMProDat Database Read by Persian Speakers","authors":"Mortaza Taheri-Ardali, D. Hirst","doi":"10.21437/speechprosody.2022-90","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-90","url":null,"abstract":"OMProDat is an open multilingual prosodic database, which aims to collect, archive and distribute recordings and annotations of directly comparable data from different languages. As part of the OMProDat project, this paper focuses on the creation of a bilingual Persian-English prosodic database read by native speakers of Persian. This collection contains 40 continuous, thematically connected paragraphs, each of five sentences, originally created during the European SAM project. Our collection was recorded by 5 male and 5 female speakers of standard Persian, all from monolingual families. The Persian texts were romanised and transcribed phonetically using the ASCII phonetic alphabet SAMPA. The database includes TextGrid annotations, which will be obtained semi-automatically from the sound and the orthographic transcription using the SPPAS alignment software. The Momel and INSINT algorithms will be used to provide prosodic annotation of the corpus. This considerable amount of data will allow us to compare the production of Persian and English as L1 and L2, respectively. In addition, a cross-linguistic comparison with other languages in OMProDat is easily feasible.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129915457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shape matters: Machine classification and listeners’ perceptual discrimination of American English intonational tunes 形状关系:机器分类与听者对美式英语语调语调的知觉辨别
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-61
J. Cole, Jeremy Steffman, Sam Tilsen
In Autosegmental-Metrical models of intonational phonology, pitch accents, phrase accents and boundary tones may combine freely to create a predicted set of phonologically distinct phrase-final “nuclear” tunes. In this study we ask if an 8-way distinction in nuclear tune shape in American English, predicted from combinations of 2 (monotonal) pitch accents, 2 phrase accents and 2 boundary tones, is manifest in speech production and in speech perception. F0 trajectories from an imitative speech production experiment were analyzed using (i) neural net classification, and (ii) human listeners’ perceptual discrimination of the model utterances. Pairwise classification accuracy of the imitative productions is highest for tune pairs that differ in holistic shape (high-rising vs. rise-fall), and poorest for tunes with the same shape that differ in (higher vs. lower) final f0. Perception results show a similar pattern, with poor pairwise discrimination for tunes that differ primarily, but by a small degree, in final f0. Together the results suggest a hierarchy of distinctiveness among nuclear tunes, with a robust distinction based on holistic tune shape, which only partly aligns with distinctions in tonal specification, and a weak/poorly differentiated distinction between tunes with the same holistic shape but small differences in final f0.
在语调音韵学的自分音格律模型中,音高重音、短语重音和边界音可以自由地组合在一起,形成一组在音韵学上截然不同的短语末“核”曲调。在这项研究中,我们提出了一个问题,即从两个(单调的)音高重音、两个短语重音和两个边界音的组合中预测的美式英语核调形状的8向区别是否在语音产生和语音感知中表现出来。使用(i)神经网络分类和(ii)人类听者对模型话语的感知辨别来分析来自模仿语音产生实验的F0轨迹。模仿作品的两两分类精度对于整体形状不同的曲调对(高升与低升)是最高的,而对于相同形状的曲调(高与低)最终f0不同的曲调是最差的。感知结果也显示了类似的模式,对于主要不同的曲调,配对辨别能力很差,但程度很小,在最后的60中。总之,结果表明,核曲调之间存在着层次结构的独特性,其中基于整体曲调形状的强大区分,仅部分与音调规格的区分一致,而具有相同整体形状但最终音高差异很小的曲调之间存在弱/差区分。
{"title":"Shape matters: Machine classification and listeners’ perceptual discrimination of American English intonational tunes","authors":"J. Cole, Jeremy Steffman, Sam Tilsen","doi":"10.21437/speechprosody.2022-61","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-61","url":null,"abstract":"In Autosegmental-Metrical models of intonational phonology, pitch accents, phrase accents and boundary tones may combine freely to create a predicted set of phonologically distinct phrase-final “nuclear” tunes. In this study we ask if an 8-way distinction in nuclear tune shape in American English, predicted from combinations of 2 (monotonal) pitch accents, 2 phrase accents and 2 boundary tones, is manifest in speech production and in speech perception. F0 trajectories from an imitative speech production experiment were analyzed using (i) neural net classification, and (ii) human listeners’ perceptual discrimination of the model utterances. Pairwise classification accuracy of the imitative productions is highest for tune pairs that differ in holistic shape (high-rising vs. rise-fall), and poorest for tunes with the same shape that differ in (higher vs. lower) final f0. Perception results show a similar pattern, with poor pairwise discrimination for tunes that differ primarily, but by a small degree, in final f0. Together the results suggest a hierarchy of distinctiveness among nuclear tunes, with a robust distinction based on holistic tune shape, which only partly aligns with distinctions in tonal specification, and a weak/poorly differentiated distinction between tunes with the same holistic shape but small differences in final f0.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"21 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129701349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A preliminary analysis on children’s phonation contrast in Kunshan Wu Chinese tones 昆山吴语儿童语音对比初步分析
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-85
Wenwei Xu, Chunyu Ge, Wentao Gu, P. Mok
Previous studies have established that phonation contrasts can be, apart from pitch, an important dimension of tonal contrasts in some languages, and modern Wu Chinese is a good example in which the lower register tones are produced with breathier phonation than the upper register tones. Nevertheless, researchers have shown that such phonation contrast is declining among young speakers in Shanghai and Suzhou Wu. This pilot study is thus motivated to investigate children’s production in Kunshan Wu, a neighboring yet rather understudied dialect with more tones, in order to see if a similar trend is ongoing. Two male and two female school-age children (8;4 to 10;4) were recorded reading isolated monosyllabic words with different lexical tones, and simultaneous acoustic and electroglottographic (EGG) data were collected. Results of EGG and acoustic parameters demonstrate that at least near the onset of the vowel, glottal constriction is smaller and glottal closure is less abrupt in the lower register tones than in the upper register tones, suggesting that the lower register tones are generally produced with breathier phonation. Therefore, school-age child speakers of Kunshan Wu are still able to produce the phonation contrast between the tone registers.
以往的研究已经证实,在某些语言中,发声对比除了音高之外,也是声调对比的一个重要维度,现代吴语就是一个很好的例子,其低音域的发声比上音域的发声更呼吸。然而,研究人员发现,在上海和苏州的年轻人中,这种发音对比正在下降。因此,这项试点研究的动机是调查昆山吴语的儿童生产,昆山吴语是一种邻近但研究较少的方言,声调更多,以了解是否有类似的趋势正在进行。记录两名男、两名女学龄儿童(8岁、4岁至10岁)阅读不同声调的单音节孤立单词,同时收集声学和声门电图(EGG)数据。EGG和声学参数的结果表明,至少在元音开始附近,与上声区音相比,下声区音的声门收缩较小,声门关闭不那么突然,这表明下声区音通常是用呼吸发声的。因此,昆山吴语学龄儿童说话者仍然能够产生音域之间的语音对比。
{"title":"A preliminary analysis on children’s phonation contrast in Kunshan Wu Chinese tones","authors":"Wenwei Xu, Chunyu Ge, Wentao Gu, P. Mok","doi":"10.21437/speechprosody.2022-85","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-85","url":null,"abstract":"Previous studies have established that phonation contrasts can be, apart from pitch, an important dimension of tonal contrasts in some languages, and modern Wu Chinese is a good example in which the lower register tones are produced with breathier phonation than the upper register tones. Nevertheless, researchers have shown that such phonation contrast is declining among young speakers in Shanghai and Suzhou Wu. This pilot study is thus motivated to investigate children’s production in Kunshan Wu, a neighboring yet rather understudied dialect with more tones, in order to see if a similar trend is ongoing. Two male and two female school-age children (8;4 to 10;4) were recorded reading isolated monosyllabic words with different lexical tones, and simultaneous acoustic and electroglottographic (EGG) data were collected. Results of EGG and acoustic parameters demonstrate that at least near the onset of the vowel, glottal constriction is smaller and glottal closure is less abrupt in the lower register tones than in the upper register tones, suggesting that the lower register tones are generally produced with breathier phonation. Therefore, school-age child speakers of Kunshan Wu are still able to produce the phonation contrast between the tone registers.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121591501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An EEG-study on L2 categorization of emotional prosody in German 德语情绪韵律二语分类的脑电图研究
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-128
Hua Wei, Yifei He, C. Kauschke, Mathias Scharinger, Ulrike Domahs
Previous behavioral studies on the processing of emotional prosody in L2 learners showed similarities and differences between L1- and L2-processing and suggested that emotional perception has both universal and culture-specific aspects. However, little is known about the processing of emotional prosody in L2 learners' brains. Therefore, the present study used event-related potentials to compare the processing of emotional prosodies between German native speakers and Chinese L2 learners of German. Participants performed a prosody recognition task with semantically neutral German words recorded with emotional "neutral" , "like" , and "disgust" prosodies. The accuracy ratings of categorizing emotional prosodies of L2 learners were above chance but significantly better for the L1 speakers. Both groups yielded an early and a late positivity for processing "like" in comparison to "disgust" , reflecting the emotional prosodic predictive processing. However, an early left anterior negativity (ELAN) and a late anterior negativity observed in the L2 learners suggest that they are more sensitive to acoustic differences of the presented stimuli. Overall, our findings support the assumption that the processing of emotional prosody is in principle universal across languages, but that in addition to the general mechanisms involved in the processing of emotional speech language-specific aspects also modify emotional processing.
以往关于二语学习者情绪韵律加工的行为研究表明,二语学习者的情绪知觉具有共性和文化特异性。然而,人们对二语学习者大脑中情绪韵律的加工过程知之甚少。因此,本研究采用事件相关电位对德语母语者和汉语第二语言学习者的情绪韵律加工进行了比较。参与者完成了一项韵律识别任务,他们用语义中性的德语单词记录了情感上的“中性”、“喜欢”和“厌恶”韵律。第二语言学习者对情绪韵律的分类准确率高于随机,而第一语言学习者对情绪韵律的分类准确率明显高于随机。与“厌恶”相比,两组对“喜欢”的处理都产生了早期和晚期的积极反应,反映了情绪韵律预测处理。然而,在二语学习者中观察到的早期左前负性(ELAN)和晚期前负性表明他们对所呈现刺激的声音差异更敏感。总的来说,我们的研究结果支持这样的假设,即情绪韵律的处理原则上是跨语言通用的,但除了涉及情绪言语处理的一般机制外,语言特定方面也会改变情绪处理。
{"title":"An EEG-study on L2 categorization of emotional prosody in German","authors":"Hua Wei, Yifei He, C. Kauschke, Mathias Scharinger, Ulrike Domahs","doi":"10.21437/speechprosody.2022-128","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-128","url":null,"abstract":"Previous behavioral studies on the processing of emotional prosody in L2 learners showed similarities and differences between L1- and L2-processing and suggested that emotional perception has both universal and culture-specific aspects. However, little is known about the processing of emotional prosody in L2 learners' brains. Therefore, the present study used event-related potentials to compare the processing of emotional prosodies between German native speakers and Chinese L2 learners of German. Participants performed a prosody recognition task with semantically neutral German words recorded with emotional \"neutral\" , \"like\" , and \"disgust\" prosodies. The accuracy ratings of categorizing emotional prosodies of L2 learners were above chance but significantly better for the L1 speakers. Both groups yielded an early and a late positivity for processing \"like\" in comparison to \"disgust\" , reflecting the emotional prosodic predictive processing. However, an early left anterior negativity (ELAN) and a late anterior negativity observed in the L2 learners suggest that they are more sensitive to acoustic differences of the presented stimuli. Overall, our findings support the assumption that the processing of emotional prosody is in principle universal across languages, but that in addition to the general mechanisms involved in the processing of emotional speech language-specific aspects also modify emotional processing.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115926113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of intonational rises on serial recall in German 语调上升对德语连续记忆的影响
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-154
Christine T. Röhr, Michelina Savino, M. Grice
This paper uses a serial recall task to investigate the role of rising intonation in the allocation of attentional resources in German. It has been shown for Italian that rising intonation at prosodic boundaries enhances recall of digits in auditorily presented lists. Since resources are usually allocated to prominent items, and since pitch accents are primary encoders of prominence in both languages, we investigate whether an accentual rise leads to better recall than a boundary rise. In a serial recall task on nine-digit sequences in German we compare the effect on working memory of sequences grouped by marking the last item of the two non-final triplets with (i) a high/rising accent followed by an equally high boundary, (ii) a low accent followed by a boundary rise, or (iii) a low/falling accent-boundary sequence, as compared to (iv) ungrouped sequences as controls. Results reveal that items with a rise are recalled more accurately than items without a rise, with no evidence for superior recall of items with accent rises over those with boundary rises. However, boundary rises appear to facilitate recall over a larger domain than accentual rises.
本文采用连续回忆实验研究了升语调在德语注意资源分配中的作用。有研究表明,意大利语在韵律边界处的升调可以增强对听觉呈现列表中数字的记忆。由于资源通常分配给突出的项目,并且由于音高重音是两种语言中突出的主要编码器,因此我们研究了重音上升是否比边界上升能更好地唤起记忆。在一个九位数德语序列的连续回忆任务中,我们比较了通过标记两个非最后三联体的最后一个项目来分组的序列对工作记忆的影响,(i)高/上升重音后面跟着同样高的边界,(ii)低重音后面跟着边界上升,或(iii)低/下降重音边界序列,与(iv)未分组的序列作为对照。结果显示,有音调上升的项目比没有音调上升的项目回忆起来更准确,没有证据表明有口音上升的项目比有边界上升的项目回忆起来更好。然而,边界上升似乎比重音上升更容易在更大的范围内回忆。
{"title":"The effect of intonational rises on serial recall in German","authors":"Christine T. Röhr, Michelina Savino, M. Grice","doi":"10.21437/speechprosody.2022-154","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-154","url":null,"abstract":"This paper uses a serial recall task to investigate the role of rising intonation in the allocation of attentional resources in German. It has been shown for Italian that rising intonation at prosodic boundaries enhances recall of digits in auditorily presented lists. Since resources are usually allocated to prominent items, and since pitch accents are primary encoders of prominence in both languages, we investigate whether an accentual rise leads to better recall than a boundary rise. In a serial recall task on nine-digit sequences in German we compare the effect on working memory of sequences grouped by marking the last item of the two non-final triplets with (i) a high/rising accent followed by an equally high boundary, (ii) a low accent followed by a boundary rise, or (iii) a low/falling accent-boundary sequence, as compared to (iv) ungrouped sequences as controls. Results reveal that items with a rise are recalled more accurately than items without a rise, with no evidence for superior recall of items with accent rises over those with boundary rises. However, boundary rises appear to facilitate recall over a larger domain than accentual rises.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126692316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing and validating a rating scale of speaking prosody ability for learners of Chinese as a second language 开发并验证了汉语作为第二语言学习者口语韵律能力评定量表
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-166
Sichang Gao, Mingwei Pan
This study aims to develop a rating scale for evaluating speech prosody of learners of Chinese as a second language (CSL). The researchers first gathered 41 descriptors that were perceived as crucial indicators of prosody ability through interviewing ten CSL teachers, analyzing existing Chinese speaking proficiency scales from five universities in Mainland China. After rating the perception of the selected descriptors by ninety-four CSL teachers and consulting with four expert-teachers, 15 out of 41 descriptors remained to form a rating scale. Principal component analysis revealed that 15 descriptors with three different dimensions (prosodic strategic competence, fluency, prosodic naturalness) could meaningfully describe CSL prosody. Finally, using the 15 descriptors, 29 samples of CSL learners’ speech were evaluated by four raters. A combination of the structural equation modeling and the Many-Facets Rasch modeling confirmed that all the 15 descriptors fit well with the construct of prosody ability measured, demonstrating a good validity of this rating scale.
本研究旨在建立一套评价汉语学习者语音韵律的量表。研究人员首先通过对10名对外汉语教师的访谈,分析了中国大陆5所大学现有的汉语口语水平量表,收集了41个被认为是韵律能力关键指标的描述词。在对94名汉语教学教师对所选描述词的感知进行评分并咨询了4名专家教师后,41个描述词中有15个被保留下来形成评分量表。主成分分析表明,韵律策略能力、韵律流畅性、韵律自然度三个维度的15个描述词都能有效地描述汉语韵律。最后,利用这15个描述词,对29个汉语学习者的语音样本进行了4位评分者的评价。结构方程模型和多面Rasch模型相结合,证实了15个描述符都与韵律能力测量的结构吻合良好,证明了该量表的有效性。
{"title":"Developing and validating a rating scale of speaking prosody ability for learners of Chinese as a second language","authors":"Sichang Gao, Mingwei Pan","doi":"10.21437/speechprosody.2022-166","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-166","url":null,"abstract":"This study aims to develop a rating scale for evaluating speech prosody of learners of Chinese as a second language (CSL). The researchers first gathered 41 descriptors that were perceived as crucial indicators of prosody ability through interviewing ten CSL teachers, analyzing existing Chinese speaking proficiency scales from five universities in Mainland China. After rating the perception of the selected descriptors by ninety-four CSL teachers and consulting with four expert-teachers, 15 out of 41 descriptors remained to form a rating scale. Principal component analysis revealed that 15 descriptors with three different dimensions (prosodic strategic competence, fluency, prosodic naturalness) could meaningfully describe CSL prosody. Finally, using the 15 descriptors, 29 samples of CSL learners’ speech were evaluated by four raters. A combination of the structural equation modeling and the Many-Facets Rasch modeling confirmed that all the 15 descriptors fit well with the construct of prosody ability measured, demonstrating a good validity of this rating scale.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126867657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparison of Rhythm Metrics for L2 Speech 二语语音节奏度量的比较
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-68
Kakeru Yazawa, M. Kondo
A wide range of rhythm metrics (global metrics: %V, Δ, Varco, and segVarco; pairwise metrics: rPVI, nPVI, CCI, and D nCCI) was applied to L1 Japanese speakers’ L2 English speech data. Less proficient Japanese speakers of English are expected to show less durational variability for both vocalic and consonantal intervals (because of insufficient stress realization and transfer of CV syllable structure), although this pattern may be obscured by their slower speech rate (which increases interval durations in general). To test if the metrics can capture the L2 rhythmic characteristics, each metric was applied to read speech samples of “The North Wind and the Sun” by 183 Japanese speakers in the J-AESOP corpus. Only %V, VarcoV, and segVarcoV/C were successful; other metrics yielded inconsistent or implausible results likely due to insufficient rate normalization. The overall results indicate that global metrics can effectively quantify L2 rhythm if speech rate is normalized by the mean duration of segments (which is a good predictor of tempo) rather than the mean interval duration (which is popular but susceptible to syllable complexity).
广泛的节奏指标(全局指标:%V, Δ, Varco和segVarco;两两指标:rPVI、nPVI、CCI和D nCCI)应用于L1日语使用者的L2英语语音数据。不太精通英语的日本人在元音和辅音音程上表现出较小的持续时间变化(因为重音实现和CV音节结构的转移不足),尽管这种模式可能被他们较慢的说话速度所掩盖(这通常会增加音程持续时间)。为了测试这些度量是否能够捕捉L2节奏特征,我们将每个度量应用于183名日本人在J-AESOP语料库中朗读的《北风与太阳》的语音样本。只有%V、VarcoV和segVarcoV/C成功;其他指标产生了不一致或不可信的结果,可能是由于利率正常化不够。总体结果表明,如果语音速率是通过片段的平均持续时间(这是一个很好的节奏预测器)而不是平均间隔持续时间(这很流行,但容易受到音节复杂性的影响)来标准化的,那么全局指标可以有效地量化第二语言节奏。
{"title":"A Comparison of Rhythm Metrics for L2 Speech","authors":"Kakeru Yazawa, M. Kondo","doi":"10.21437/speechprosody.2022-68","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-68","url":null,"abstract":"A wide range of rhythm metrics (global metrics: %V, Δ, Varco, and segVarco; pairwise metrics: rPVI, nPVI, CCI, and D nCCI) was applied to L1 Japanese speakers’ L2 English speech data. Less proficient Japanese speakers of English are expected to show less durational variability for both vocalic and consonantal intervals (because of insufficient stress realization and transfer of CV syllable structure), although this pattern may be obscured by their slower speech rate (which increases interval durations in general). To test if the metrics can capture the L2 rhythmic characteristics, each metric was applied to read speech samples of “The North Wind and the Sun” by 183 Japanese speakers in the J-AESOP corpus. Only %V, VarcoV, and segVarcoV/C were successful; other metrics yielded inconsistent or implausible results likely due to insufficient rate normalization. The overall results indicate that global metrics can effectively quantify L2 rhythm if speech rate is normalized by the mean duration of segments (which is a good predictor of tempo) rather than the mean interval duration (which is popular but susceptible to syllable complexity).","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125986412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Top-Down and Bottom-up Processing of Familiar and Unfamiliar Mandarin Dialect Tone Systems 熟悉与陌生普通话方言声调系统的自顶向下与自底向上加工
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-171
Liang Zhao, Shayne Sloggett, Eleanor Chodroff
Speech processing involves active integration of bottom-up and top-down information types. In the present study, we investigated the relative weighting of top-down expectedness and bottom-up lexical tone in the perception of familiar and unfamiliar lexical tone systems. Standard Mandarin and Chengdu Mandarin are mutually intelligible language varieties with comparable segmental and highly distinct tonal realizations. In a spoken semantic-plausibility judgment task, we manipulated whether a word was high-surprisal or low-surprisal given the preceding context and dialect-specific tone. All participants were native Standard Mandarin speakers with minimal Chengdu Mandarin experience. Lower judgment accuracy was observed when the stimulus was Chengdu Mandarin, and suggested that expectedness (i.e., top-down) information overrides tonal (i.e., bottom-up) information in sentence plausibility judgments. However, judgment response times to sentence surprisal were uniform across stimuli from both dialects, suggesting that speakers are aware of the surprisal conveyed by a non-standard tone, even if not used in their final decision. These findings reveal listener sensitivity to both top-down expectedness and bottom-up tone regardless of the initial tone reliability. For unfamiliar tone systems, top-down influence overrides bottom-up processing to access utterance meaning, but bottom-up processing is indeed present and may reflect rapid learning of the unfamiliar tone system.
语音处理涉及自底向上和自顶向下信息类型的主动整合。在本研究中,我们研究了自上而下的期望和自下而上的词汇语调在熟悉和不熟悉的词汇语调系统感知中的相对权重。标准普通话和成都普通话是相互可理解的语言变体,具有可比较的分段和高度不同的音调实现。在口语语义合理性判断任务中,我们根据前面的上下文和方言特定的语气来操纵一个词是高惊奇还是低惊奇。所有参与者的母语都是标准普通话,只有很少的成都普通话经验。当刺激为成都普通话时,判断正确率较低,这表明在句子可信性判断中,期望性(即自上而下)信息优于调性(即自下而上)信息。然而,在两种方言的刺激下,对句子“惊讶”的判断反应时间是一致的,这表明说话者意识到非标准语气传达的“惊讶”,即使他们没有在最终决定中使用。这些发现揭示了听者对自上而下的期望和自下而上的语调的敏感性,而不管最初的语调可靠性如何。对于不熟悉的声调系统,自上而下的影响压倒自下而上的加工以获取话语意义,但自下而上的加工确实存在,并且可能反映了对不熟悉的声调系统的快速学习。
{"title":"Top-Down and Bottom-up Processing of Familiar and Unfamiliar Mandarin Dialect Tone Systems","authors":"Liang Zhao, Shayne Sloggett, Eleanor Chodroff","doi":"10.21437/speechprosody.2022-171","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-171","url":null,"abstract":"Speech processing involves active integration of bottom-up and top-down information types. In the present study, we investigated the relative weighting of top-down expectedness and bottom-up lexical tone in the perception of familiar and unfamiliar lexical tone systems. Standard Mandarin and Chengdu Mandarin are mutually intelligible language varieties with comparable segmental and highly distinct tonal realizations. In a spoken semantic-plausibility judgment task, we manipulated whether a word was high-surprisal or low-surprisal given the preceding context and dialect-specific tone. All participants were native Standard Mandarin speakers with minimal Chengdu Mandarin experience. Lower judgment accuracy was observed when the stimulus was Chengdu Mandarin, and suggested that expectedness (i.e., top-down) information overrides tonal (i.e., bottom-up) information in sentence plausibility judgments. However, judgment response times to sentence surprisal were uniform across stimuli from both dialects, suggesting that speakers are aware of the surprisal conveyed by a non-standard tone, even if not used in their final decision. These findings reveal listener sensitivity to both top-down expectedness and bottom-up tone regardless of the initial tone reliability. For unfamiliar tone systems, top-down influence overrides bottom-up processing to access utterance meaning, but bottom-up processing is indeed present and may reflect rapid learning of the unfamiliar tone system.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126205707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Prosodic and lexical entrainment in adults with and without schizophrenia 有和没有精神分裂症的成人的韵律和词汇的干扰
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-26
J. Kruyt, S. Benus, C. Faget, C. Lançon, M. Champagne-Lavau
Entrainment refers to the tendency people have to speak more similarly during a conversation. Although entrainment has been observed frequently, the underlying mechanisms of the phe-nomenon are debated. A specific point of disagreement is the role of social or higher-order cognitive factors in entrainment. The present study aimed to explore prosodic and lexical entrainment in small groups of individuals with schizophrenia, a dis-order that has been associated with theory of mind impairments and social difficulties, and a control group without schizophrenia. All participants completed a referential communication task with an experimenter. To determine prosodic entrainment, the measures proposed by Levitan and Hirshberg [1] were used. Results seem to suggest that the effect of task role on prosodic entrainment was larger than any possible effects of group, suggesting that social factors affect prosodic entrainment behaviour more than individual differences in cognition or other factors. Conversely, lexical entrainment was not affected by task role or group. Importantly, no clear patterns in entrainment on different dimensions, levels, or features could be observed, highlighting the complex and multifaceted nature of entrainment.
“娱乐化”指的是人们在谈话中说话更加相似的趋势。虽然经常观察到夹带现象,但这一现象的潜在机制仍存在争议。一个具体的分歧点是社会或高阶认知因素在娱乐中的作用。本研究旨在探索精神分裂症患者(一种与心理障碍理论和社交困难相关的疾病)和非精神分裂症对照组的韵律和词汇参与情况。所有的参与者都完成了一项与实验者交流的任务。为了确定韵律蕴意,我们采用Levitan和Hirshberg[1]提出的测量方法。结果似乎表明,任务角色对韵律夹带行为的影响大于任何可能的群体影响,这表明社会因素对韵律夹带行为的影响大于个体认知差异或其他因素的影响。相反,词汇吸收不受任务角色或群体的影响。重要的是,在不同的维度、层次或特征上,没有观察到明显的夹带模式,突出了夹带的复杂性和多面性。
{"title":"Prosodic and lexical entrainment in adults with and without schizophrenia","authors":"J. Kruyt, S. Benus, C. Faget, C. Lançon, M. Champagne-Lavau","doi":"10.21437/speechprosody.2022-26","DOIUrl":"https://doi.org/10.21437/speechprosody.2022-26","url":null,"abstract":"Entrainment refers to the tendency people have to speak more similarly during a conversation. Although entrainment has been observed frequently, the underlying mechanisms of the phe-nomenon are debated. A specific point of disagreement is the role of social or higher-order cognitive factors in entrainment. The present study aimed to explore prosodic and lexical entrainment in small groups of individuals with schizophrenia, a dis-order that has been associated with theory of mind impairments and social difficulties, and a control group without schizophrenia. All participants completed a referential communication task with an experimenter. To determine prosodic entrainment, the measures proposed by Levitan and Hirshberg [1] were used. Results seem to suggest that the effect of task role on prosodic entrainment was larger than any possible effects of group, suggesting that social factors affect prosodic entrainment behaviour more than individual differences in cognition or other factors. Conversely, lexical entrainment was not affected by task role or group. Importantly, no clear patterns in entrainment on different dimensions, levels, or features could be observed, highlighting the complex and multifaceted nature of entrainment.","PeriodicalId":442842,"journal":{"name":"Speech Prosody 2022","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121455741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Speech Prosody 2022
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1