Speech Prosody 2022: Latest Articles

Perception of the strength of prosodic breaks in three conditions: Explicit pause, implicit pause, and no pause
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-97
V. Silber-Varod, Ella Alfon, N. Amir
In this study we examine the perceptual strength of prosodic boundaries in Hebrew speech. The stimuli consisted of 28 sequences of two inter-pausal units (IPUs) taken from the Map Task recordings in Hebrew. Listeners were exposed only to the silent pause following the first IPU (hence, explicit pauses), while the second pause was omitted (hence, implicit pauses), thus creating a stimulus model of IPU-pause-IPU. Ten female listeners labeled the strength of each break between adjacent words on a scale from 1 (no break) to 5 (strong break). Higher average scores were assigned to the implicit pauses than to the explicit ones; however, scores for explicit pauses showed higher agreement between raters. Moreover, we found only a borderline significant influence of explicit pause duration on the raters' scores. Looking at gender differences, the results suggest that raters' scores were higher when the speakers were female. Further, an interaction was found between the gender of the speaker and the gender of the recipient (i.e., the interlocutor). In particular, female speakers received higher scores overall, and male speakers were rated higher when they spoke to males than to females.
Citations: 0
Using prosody to organize the signal: Sensitivities across species set the stage for prosodic bootstrapping
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-1
J. M. Toro
Prosody is a major source of information that both adults and infants use to organize the speech signal, from segmenting words to inferring syntactic structures. Here, I will explore the extent to which the ability to take advantage of prosodic cues that we observe in humans might emerge from sensitivities already present in other species. I will review recent studies along two lines of research. The first covers research into how listeners follow the principles described by the Iambic-Trochaic Law to group sounds. The second explores how they take advantage of sonority differences and natural prosodic contours to better identify words. Together, the evidence gathered so far suggests that, like humans, non-human animals use certain acoustic cues present in the signal to extract difficult-to-find regularities. More broadly, these studies support the idea that the general perceptual biases that form the basis for prosodic bootstrapping are already present in other animals. Importantly, in humans but not in other animals, such biases are combined with domain-specific representations that guide the discovery of linguistic structures.
Citations: 0
Affect Expression: Global and Local Control of Voice Source Parameters
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-107
Andy Murphy, Irena Yanushevskaya, A. N. Chasaide, C. Gobl
This paper explores how the acoustic characteristics of the voice signal affect. It considers the proposition that the cueing of affect relies on variations in voice source parameters (including f0) that involve both global, uniform shifts across an utterance, and local, within-utterance changes, at prosodically relevant points. To test this, a perception test was conducted with stimuli where modifications were made to voice source parameters of a synthesised baseline utterance, to target angry and sad renditions. The baseline utterance was generated with the ABAIR Irish TTS system, for one male and one female voice. The voice parameter manipulations drew on earlier production and perception experiments, and involved three stimulus series: those with global, local and a combination of global and local adjustments. 65 listeners judged the stimuli as one of the following: angry, interested, no emotion, relaxed and sad, and indicated how strongly any affect was perceived. Results broadly support the initial proposition, in that the most effective signalling of both angry and sad affect tended to involve those stimuli which combined global and local adjustments. However, results for stimuli targeting angry were often judged as interested, indicating that the negative valence is not consistently cued by the manipulations in these stimuli.
Citations: 2
Gender effects on perception of emotional speech- and visual-prosody in a second language: Emotion recognition in English-speaking films
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-126
S. Verheul, Adriana Hartman, Roselinde Supheert, Aoju Chen
Speakers use both speech prosody and visual prosody (facial expressions, gestures, body postures) to express emotion. Receivers register and recognise emotion via both types of prosodic cues. In this study, we examined gender differences in both recognition of type of emotion (e.g. anger vs. joy) and perceived emotionality (e.g. the degree of anger) expressed via speech prosody and visual prosody in a second language (L2). In a perception experiment using film scenes, proficient Dutch learners of English rated the emotionality of each protagonist and identified the specific type of emotion expressed by each protagonist in each scene in both the visual-only and audio-only modality. We have found no evidence for gender-related differences in perceived emotionality, possibly due to potential difficulty of participants in identifying with the protagonists portrayed in a different society. However, the female Dutch learners of English were more accurate in recognising type of emotion than the male Dutch learners of English from both speech prosody and visual prosody. These findings suggest that there is transfer of learners’ ability in recognising type of emotion in the native language to L2 and that female L2 learners may be better at learning cues in speech prosody to emotion in L2.
Citations: 1
Effects of delayed auditory feedback interacting with prosodic structure
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-65
Jinyu Li, L. Lancia
Speakers usually respond to time-delayed auditory feedback (DAF) by decreasing their speech rate (i.e., lengthening syllables). However, the position of a syllable in prosodic structure may affect its prominence and duration. In the present study, we investigated whether the lengthening effect of DAF on syllables depends on their position in French utterances. We analyzed recordings of several repetitions of three five-syllable French sentences from 10 French speakers under three DAF conditions (0, 60, 120 ms). The results suggest that syllables are generally longer when DAF is present, and that their duration increases with the DAF level. Accented vowels are lengthened more by DAF than unaccented vowels in the same accentual group. Sentence-final vowels, which bear the nuclear pitch accent and may be additionally affected by final lengthening, can be lengthened even more by DAF. Given that the extent of the lengthening effect is not correlated with the original syllable duration, we assume that the greater lengthening of accented vowels is not due to the generally longer duration of these vowels. Overall, our results suggest that speakers’ responses to DAF depend on the status of the syllable in the prosodic hierarchy.
Citations: 2
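The comparison of lengthening across syllable positions described in the Li and Lancia abstract above can be illustrated with a small sketch. It is not the authors' analysis: the proportional measure, the function names, and the nested-dictionary layout are all assumptions made here for illustration, and any durations passed in would be hypothetical.

```python
from statistics import fmean


def relative_lengthening(baseline_ms, daf_ms):
    """Proportional duration increase relative to the no-delay baseline."""
    if baseline_ms <= 0:
        raise ValueError("baseline duration must be positive")
    return (daf_ms - baseline_ms) / baseline_ms


def mean_lengthening_by_position(durations, baseline_delay=0):
    """Map {position: {delay_ms: [durations in ms]}} to
    {position: {delay_ms: proportional lengthening vs. the baseline delay}}.

    The layout of `durations` is a hypothetical choice for this sketch.
    """
    result = {}
    for position, by_delay in durations.items():
        base = fmean(by_delay[baseline_delay])
        result[position] = {
            delay: relative_lengthening(base, fmean(values))
            for delay, values in by_delay.items()
            if delay != baseline_delay
        }
    return result
```

Expressing lengthening as a proportion of the baseline, rather than in raw milliseconds, is one way to keep positions with inherently longer vowels from dominating the comparison; whether the study used such a normalisation is not stated in the abstract.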
Acoustic correlates of Dutch lexical stress re-examined: Spectral tilt is not always more reliable than intensity
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-57
G. Severijnen, H. Bosker, J. McQueen
The present study examined two acoustic cues in the production of lexical stress in Dutch: spectral tilt and overall intensity. Sluijter and Van Heuven (1996) reported that spectral tilt is a more reliable cue to stress than intensity. However, that study included only a small number of talkers (10) and only syllables with the vowels /aː/ and /ɔ/. The present study re-examined this issue in a larger and more variable dataset. We recorded 38 native speakers of Dutch (20 females) producing 744 tokens of Dutch segmentally overlapping words (e.g., VOORnaam vs. voorNAAM, “first name” vs. “respectable”), targeting 10 different vowels, in variable sentence contexts. For each syllable, we measured overall intensity and spectral tilt following Sluijter and Van Heuven (1996). Results from Linear Discriminant Analyses showed that, for the vowel /aː/ alone, spectral tilt showed an advantage over intensity, as evidenced by higher stressed/unstressed syllable classification accuracy scores for spectral tilt. However, when all vowels were included in the analysis, the advantage disappeared. These findings confirm that spectral tilt plays a larger role in signaling stress in Dutch /aː/ but show that, for a larger sample of Dutch vowels, overall intensity and spectral tilt are equally important.
Citations: 0
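The cue comparison in the Dutch lexical stress abstract above rests on stressed/unstressed classification accuracy from Linear Discriminant Analysis. A minimal sketch of that idea for a single cue follows. It assumes equal class priors and a shared variance, in which case a one-feature LDA reduces to a cut at the midpoint of the two class means. The function names and the synthetic values are hypothetical, accuracy is computed on the same tokens used to place the cut, and nothing here reproduces the authors' procedure or data.

```python
import random
from statistics import fmean


def lda_threshold(stressed, unstressed):
    """Decision boundary of a one-feature, two-class LDA.

    Assumes equal priors and a shared (pooled) variance, in which case
    the boundary is the midpoint between the two class means.
    """
    return (fmean(stressed) + fmean(unstressed)) / 2.0


def lda_accuracy(stressed, unstressed):
    """Share of tokens the midpoint rule labels correctly (resubstitution)."""
    cut = lda_threshold(stressed, unstressed)
    stressed_is_high = fmean(stressed) >= fmean(unstressed)
    hits = sum((x >= cut) == stressed_is_high for x in stressed)
    hits += sum((x >= cut) != stressed_is_high for x in unstressed)
    return hits / (len(stressed) + len(unstressed))


if __name__ == "__main__":
    rng = random.Random(0)  # synthetic, hypothetical measurements only
    cues = {
        "spectral tilt (dB)": ([rng.gauss(-8, 3) for _ in range(200)],
                               [rng.gauss(-12, 3) for _ in range(200)]),
        "overall intensity (dB)": ([rng.gauss(70, 4) for _ in range(200)],
                                   [rng.gauss(66, 4) for _ in range(200)]),
    }
    for name, (stressed, unstressed) in cues.items():
        print(f"{name}: accuracy = {lda_accuracy(stressed, unstressed):.2f}")
```

Running the same function once per cue, first on tokens of a single vowel and then on all vowels pooled, would mirror the kind of comparison the abstract reports, under the simplifying assumptions stated above.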
Social and situational factors of speaker variability in collaborative dialogues
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-93
Tatiana V. Kachkovskaia, A. Menshikova, D. Kocharov, Pavel Kholiavin, Anna Mamushina
The acoustic features of the speaker’s voice in dialogues are liable to change due to various situational factors, such as success of communication, social distance between the interlocutors, conversational roles, etc. This paper presents an analysis of variation in the basic prosodic features (pitch, intensity, and speech tempo) across speakers’ gender, conversational role (information leader vs. follower), and social distance. The research is based on the SibLing speech corpus, where five degrees of social distance between the interlocutors are presented: there are dialogues between same-gender siblings, same-gender friends, same-gender and opposite-gender strangers, and strangers of different age and social status. Each pair of interlocutors played a card-matching game and performed a classical map task. The factor of conversational role revealed a significant influence on all the analysed speech features: pitch, intensity, and speech tempo. Gender was not found to influence speech tempo, unlike pitch and loudness. Social distance was shown to play a significant role for speech tempo (e.g., it tends to be lower in dialogues with strangers of different age and social status), and also, in interaction with other factors, for pitch and loudness. There was also a significant influence of the type of task: card-matching game vs. map task.
Citations: 2
Prosody and cognitive accessibility in left-detached topics: lessons from Nigerian Pidgin
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-4
E. Strickland, Anne Lacheret-Dujour, C. Simard
Citations: 0
Mandarin Disyllabic Word Imitation in Children with and without Autism Spectrum Disorder
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-22
Tingbo Wang, Heng Ding
Atypical pitch production and perception in individuals with autism spectrum disorders (ASD) have been reported mainly from non-tonal language backgrounds. In tonal languages such as Mandarin, changes of pitch not only signal prosody at the sentence level but also contrast word meanings, as tones, at the lexical level. It remains unclear whether children with ASD from tonal language backgrounds show a deficit in the use of pitch at both levels. Therefore, the current study aims to examine whether Mandarin-speaking children with ASD exhibit atypical lexical pitch production and whether their performance is influenced by semantic information in a disyllabic real-word and pseudo-word imitation task. Results from acoustic analysis demonstrated significant differences in pitch and duration measures between the two subject groups and between word types.
Citations: 0
Children’s Use of Uptalk in Narratives
Pub Date : 2022-05-23 DOI: 10.21437/speechprosody.2022-7
Yujia Song, Cynthia G. Clopper, Laura Wagner
Uptalk refers to the use of rising intonation on declarative utterances. Previous research has shown that, at age 6 years, children use rising contours with declaratives more frequently than adults, and this pattern appears to persist until 14 years of age. However, it is unclear why such a trend persists. To gain a clearer developmental picture of uptalk, the present study analyzed the form and function of uptalk produced by children aged 6 to 7 and 10 to 11 years from the American Midwest, using a storytelling task. Contrary to previous findings, the results indicate that children of both age groups use uptalk in an adult-like way: they overwhelmingly favor L-H% over H-H% boundary tones, and most strongly associate the contour with continuation. The lack of age differences suggests that children’s use of uptalk is comparable to that of adults by the age of 6, at least in certain narrative contexts. The use of a familiar storytelling task in the current study may explain the greater success observed for children than in previous studies, suggesting the relative importance of the elicitation task in the investigation of child speech.
Citations: 0