首页 > 最新文献

Language and Speech最新文献

英文 中文
Revisiting the Benefits of Hand Gestures in L2 Pronunciation: Generalization Effects in Multi-Session Multimodal Phonetic Training. 回顾手势在二语发音中的益处:多会话多模态语音训练的推广效应。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2026-01-13 DOI: 10.1177/00238309251399140
Sichang Gao, Xiaotong Xi, Peng Li

Multimodal phonetic training with hand gestures can support L2 speech learning, but few studies have examined its generalization effects. In this between-subjects study with a pretest/post-test paradigm, 39 Japanese learners of Mandarin practiced Mandarin aspirated stops, high back rounded vowel /u/, and T3 Sandhi over four training sessions. The gesture (G) group (n = 20) received training with hand gestures illustrating the phonetic and articulatory features of the target sounds, while the no gesture (NG) group (n = 19) received the same training without gestures. Participants read trained words during the pre- and post-tests and untrained words in a generalization test. Results showed that the G group outperformed the NG group in improving the pronunciation accuracy of aspirated stops and vowels, but not T3 Sandhi. Importantly, the G group showed better generalization to untrained items across all targets. These findings highlight the robust effects of hand gestures in multimodal phonetic training and the role of embodied cognition in L2 speech acquisition.

手势的多模态语音训练可以支持第二语言学习,但很少有研究考察其泛化效应。在这项采用测试前/测试后范式的受试者间研究中,39名日语普通话学习者在四个训练阶段中练习了普通话送气停顿、高背圆元音/u/和T3连音。手势(G)组(n = 20)接受用手势说明目标声音的语音和发音特征的训练,而无手势(NG)组(n = 19)接受相同的不使用手势的训练。参与者在前后测试中阅读训练过的单词,在泛化测试中阅读未训练过的单词。结果表明,G组在提高送气顿音和元音的发音准确率方面优于NG组,而T3连音则不优于NG组。重要的是,G组在所有目标上对未训练的项目表现出更好的泛化。这些发现强调了手势在多模态语音训练中的强大作用,以及体现认知在二语语音习得中的作用。
{"title":"Revisiting the Benefits of Hand Gestures in L2 Pronunciation: Generalization Effects in Multi-Session Multimodal Phonetic Training.","authors":"Sichang Gao, Xiaotong Xi, Peng Li","doi":"10.1177/00238309251399140","DOIUrl":"https://doi.org/10.1177/00238309251399140","url":null,"abstract":"<p><p>Multimodal phonetic training with hand gestures can support L2 speech learning, but few studies have examined its generalization effects. In this between-subjects study with a pretest/post-test paradigm, 39 Japanese learners of Mandarin practiced Mandarin aspirated stops, high back rounded vowel /u/, and T3 Sandhi over four training sessions. The gesture (G) group (<i>n</i> = 20) received training with hand gestures illustrating the phonetic and articulatory features of the target sounds, while the no gesture (NG) group (<i>n</i> = 19) received the same training without gestures. Participants read trained words during the pre- and post-tests and untrained words in a generalization test. Results showed that the G group outperformed the NG group in improving the pronunciation accuracy of aspirated stops and vowels, but not T3 Sandhi. Importantly, the G group showed better generalization to untrained items across all targets. These findings highlight the robust effects of hand gestures in multimodal phonetic training and the role of embodied cognition in L2 speech acquisition.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251399140"},"PeriodicalIF":1.1,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145967744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is "Intrinsic Vowel Duration" Bio-Mechanical or More? Preliminary Results from Northwestern Italian. “固有元音时长”是生物力学还是更多?意大利西北部的初步结果。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2026-01-02 DOI: 10.1177/00238309251399657
Stefano Coretta

A well-known property of vowel duration is that it tends to be mediated by vowel height: higher vowels are usually shorter than lower vowels (aka "intrinsic vowel duration"). Results from previous work on a limited number of languages suggest two possible scenarios: (A) vowel duration is entirely driven by the duration of the gesture necessary to reach the tongue height target of the specific vowel, (B) vowel duration targets are part of the cognitive representation of the vowel. However, a third scenario has been put forward theoretically: (C) the duration of the vocalic gesture partially determines vowel duration, but a vowel duration target for each vowel category is also necessary. This study set out to investigate vowel duration data from Northern Italian to assess which of the three scenarios finds support. The first formant frequency (F1) of vowels was used as a proxy for tongue height position, and directed acyclic graph theory was employed to determine the causal relationship between vowel duration, vowel category, and F1. Bayesian modeling results suggest a robust non-linear effect of F1 on vowel duration, together with a direct effect of vowel category, which matches scenario (C).

元音持续时间的一个众所周知的特性是,它往往是由元音高度调节的:高元音通常比低元音短(又名“固有元音持续时间”)。先前对有限数量语言的研究结果提出了两种可能的情况:(a)元音持续时间完全由达到特定元音舌高目标所需的手势持续时间驱动;(B)元音持续时间目标是元音认知表征的一部分。然而,从理论上提出了第三种情况:(C)发声手势的持续时间部分决定了元音的持续时间,但也需要为每个元音类别设定一个元音持续时间目标。这项研究着手调查意大利北部的元音持续时间数据,以评估三种情况中哪一种得到支持。以元音的第一形成峰频率(F1)作为舌高位置的代表,利用有向无环图理论确定元音时长、元音类别与F1之间的因果关系。贝叶斯建模结果表明,F1对元音持续时间有强大的非线性影响,同时元音类别也有直接影响,这与情景(C)相匹配。
{"title":"Is \"Intrinsic Vowel Duration\" Bio-Mechanical or More? Preliminary Results from Northwestern Italian.","authors":"Stefano Coretta","doi":"10.1177/00238309251399657","DOIUrl":"https://doi.org/10.1177/00238309251399657","url":null,"abstract":"<p><p>A well-known property of vowel duration is that it tends to be mediated by vowel height: higher vowels are usually shorter than lower vowels (aka \"intrinsic vowel duration\"). Results from previous work on a limited number of languages suggest two possible scenarios: (A) vowel duration is entirely driven by the duration of the gesture necessary to reach the tongue height target of the specific vowel, (B) vowel duration targets are part of the cognitive representation of the vowel. However, a third scenario has been put forward theoretically: (C) the duration of the vocalic gesture partially determines vowel duration, but a vowel duration target for each vowel category is also necessary. This study set out to investigate vowel duration data from Northern Italian to assess which of the three scenarios finds support. The first formant frequency (F1) of vowels was used as a proxy for tongue height position, and directed acyclic graph theory was employed to determine the causal relationship between vowel duration, vowel category, and F1. Bayesian modeling results suggest a robust non-linear effect of F1 on vowel duration, together with a direct effect of vowel category, which matches scenario (C).</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251399657"},"PeriodicalIF":1.1,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Perceptual Learning Extends Across Vowels. 感知学习是如何跨元音扩展的。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2026-01-02 DOI: 10.1177/00238309251395663
Chelsea Sanker

Speakers' perception of phonemes can be shifted based on hearing tokens of them with altered acoustic characteristics, and those shifts are extended to phonemes not heard during exposure. The patterns of extension from one vowel to others can help clarify the phonological representation of vowels and the processes that underlie extension of acoustic shifts. Three perceptual learning tasks tested how exposure to shifted F1 or F2 in a single vowel quality in American English influences other vowels with a range of characteristics, and how differences between dialects interact with those patterns of extension. In Experiment 1, shifted F1 in /ɪ/ exposure items produced perceptual shifts in the boundary between several high and mid vowels, as well as the /ε-æ/ boundary. In Experiment 2, shifted F2 in /u/ exposure items produced perceptual shifts in the boundary between front and back vowels. In Experiment 3, shifted F2 in /ε/ or /ei/ produced different patterns; shifted /ei/ only impacted the /ou-ei/ boundary, while shifted /ε/ impacted /ʌ-ε/ and /ʊ-ɪ/. The results can be explained by shifts in perception extending to vowels that share phonological features which are linked to the manipulated acoustic characteristic. However, the results are also largely consistent with extension based on acoustic similarity. There was little evidence for the listener's dialect affecting patterns of extension.

说话者对音素的感知可以根据他们听到的具有改变的声学特征的符号而转移,这些转移扩展到在暴露期间听不到的音素。从一个元音到另一个元音的延伸模式可以帮助阐明元音的音系表征和声学转移延伸的过程。三个感知学习任务测试了美国英语中单个元音质量中移位的F1或F2如何影响具有一系列特征的其他元音,以及方言之间的差异如何与这些扩展模式相互作用。在实验1中,在/ / /暴露项中移位的F1会在几个高元音和中元音之间的边界以及/ε-æ/边界产生感知移位。在实验2中,/u/暴露项中F2移位会产生前后元音边界的感知移位。在实验3中,/ε/或/ei/中F2移位产生不同的模式;移位的/ei/只影响了/ou-ei/的边界,而移位的/ε/影响了/ u- ε/和/ u- æ /。这一结果可以用感知的变化来解释,这些变化延伸到元音,这些元音具有与被操纵的声学特性相关的语音特征。然而,结果也与基于声学相似性的扩展基本一致。几乎没有证据表明听者的方言会影响延伸模式。
{"title":"How Perceptual Learning Extends Across Vowels.","authors":"Chelsea Sanker","doi":"10.1177/00238309251395663","DOIUrl":"https://doi.org/10.1177/00238309251395663","url":null,"abstract":"<p><p>Speakers' perception of phonemes can be shifted based on hearing tokens of them with altered acoustic characteristics, and those shifts are extended to phonemes not heard during exposure. The patterns of extension from one vowel to others can help clarify the phonological representation of vowels and the processes that underlie extension of acoustic shifts. Three perceptual learning tasks tested how exposure to shifted F1 or F2 in a single vowel quality in American English influences other vowels with a range of characteristics, and how differences between dialects interact with those patterns of extension. In Experiment 1, shifted F1 in /ɪ/ exposure items produced perceptual shifts in the boundary between several high and mid vowels, as well as the /ε-æ/ boundary. In Experiment 2, shifted F2 in /u/ exposure items produced perceptual shifts in the boundary between front and back vowels. In Experiment 3, shifted F2 in /ε/ or /ei/ produced different patterns; shifted /ei/ only impacted the /ou-ei/ boundary, while shifted /ε/ impacted /ʌ-ε/ and /ʊ-ɪ/. The results can be explained by shifts in perception extending to vowels that share phonological features which are linked to the manipulated acoustic characteristic. However, the results are also largely consistent with extension based on acoustic similarity. There was little evidence for the listener's dialect affecting patterns of extension.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251395663"},"PeriodicalIF":1.1,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-CEFR-Level Learner Corpus Study to Quantify Fluency and Accuracy in Speech. 多cefr水平学习者语料库研究量化语言流畅性和准确性。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-26 DOI: 10.1177/00238309251393170
Lan-Fen Huang, Tomáš Gráf

This study strengthens the validation of learner speech assessment in the Common European Framework of Reference (CEFR) by analyzing the quantitative variables related to fluency and accuracy across four CEFR levels (A2, B1, B2, and C1). Drawing on a learner corpus approach, we examine 500,000 tokens from the Louvain International Database of Spoken English Interlanguage (LINDSEI) and its extensions, supplemented by post hoc rater evaluations. Three task types-a semi-monologic topic discussion, a dialogic interaction, and a monologic picture description-are used to elicit variation in speech production. The analysis focuses on speech rates, the frequency of filled and unfilled pauses, and error rates to unveil developmental trends in learner speech. The results reveal strong correlations between these fluency and accuracy metrics and CEFR levels, with speech rate emerging as the most reliable indicator of proficiency. The frequency of unfilled pauses decreases as proficiency increases, while filled pauses, although less critical to fluency assessment, offer insights into speech planning mechanisms. Error rates similarly decline with higher proficiency, reflecting greater accuracy in speech production. Exemplary instances for each CEFR level are presented, offering practical metrics for teaching, assessment, and rater training. While the study's limitations include an overrepresentation of Mandarin Chinese learners and the exclusion of pronunciation errors, these gaps highlight avenues for future research. This study provides empirical, task-sensitive evidence to enrich CEFR can-do descriptors, enhance rater training, and refine speaking assessments, contributing to more effective language teaching, learning, and assessment practices.

本研究通过分析欧洲共同参考框架(CEFR)四个水平(A2、B1、B2和C1)与流利性和准确性相关的定量变量,加强了学习者语言评估在CEFR中的有效性。利用学习者语料库方法,我们检查了来自鲁汶国际英语口语中介语言数据库(LINDSEI)及其扩展的500,000个令牌,并辅以事后评估。三种任务类型——半单一主题讨论、对话互动和单一图片描述——用于引发语音产生的变化。分析的重点是语速,填充和未填充停顿的频率,错误率,以揭示学习者语言的发展趋势。结果显示,这些流利度和准确性指标与CEFR水平之间存在很强的相关性,其中语速是衡量熟练程度的最可靠指标。随着熟练程度的提高,未填充停顿的频率会降低,而填充停顿虽然对流利度评估不那么重要,但却能让我们深入了解语音规划机制。错误率也随着熟练程度的提高而下降,这反映了语音生成的准确性。为每个CEFR级别提供了示例实例,为教学、评估和评分员培训提供了实用的度量。虽然这项研究的局限性包括对普通话学习者的过度代表和排除发音错误,但这些差距突出了未来研究的途径。本研究为丰富CEFR can-do描述符、加强语言培训、完善口语评估提供了实证、任务敏感的证据,有助于更有效的语言教学、学习和评估实践。
{"title":"A Multi-CEFR-Level Learner Corpus Study to Quantify Fluency and Accuracy in Speech.","authors":"Lan-Fen Huang, Tomáš Gráf","doi":"10.1177/00238309251393170","DOIUrl":"https://doi.org/10.1177/00238309251393170","url":null,"abstract":"<p><p>This study strengthens the validation of learner speech assessment in the Common European Framework of Reference (CEFR) by analyzing the quantitative variables related to fluency and accuracy across four CEFR levels (A2, B1, B2, and C1). Drawing on a learner corpus approach, we examine 500,000 tokens from the Louvain International Database of Spoken English Interlanguage (LINDSEI) and its extensions, supplemented by post hoc rater evaluations. Three task types-a semi-monologic topic discussion, a dialogic interaction, and a monologic picture description-are used to elicit variation in speech production. The analysis focuses on speech rates, the frequency of filled and unfilled pauses, and error rates to unveil developmental trends in learner speech. The results reveal strong correlations between these fluency and accuracy metrics and CEFR levels, with speech rate emerging as the most reliable indicator of proficiency. The frequency of unfilled pauses decreases as proficiency increases, while filled pauses, although less critical to fluency assessment, offer insights into speech planning mechanisms. Error rates similarly decline with higher proficiency, reflecting greater accuracy in speech production. Exemplary instances for each CEFR level are presented, offering practical metrics for teaching, assessment, and rater training. While the study's limitations include an overrepresentation of Mandarin Chinese learners and the exclusion of pronunciation errors, these gaps highlight avenues for future research. This study provides empirical, task-sensitive evidence to enrich CEFR can-do descriptors, enhance rater training, and refine speaking assessments, contributing to more effective language teaching, learning, and assessment practices.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251393170"},"PeriodicalIF":1.1,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145835380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What Determines Personality Impressions of Synthetic and Natural Voices? The Effects of Voice Quality and Intonation. 是什么决定了合成声音和自然声音的个性印象?语音质量和语调的影响。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-26 DOI: 10.1177/00238309251389567
Minjeong Kim, Jaehan Park, Minhong Jeong, Jieun Song

The present study investigated how acoustic and phonetic characteristics of synthetic and natural voices affect personality impressions of the voices. To this end, we conducted a personality rating experiment in which 30 native Korean speakers judged the perceived personality of natural Korean utterances and their synthetic counterparts (voice clones) using the Big-Five personality model. Various acoustic parameters, including measures of voice quality, F0, and articulation rate, were then extracted from the speech, and Intonational Phrase boundary tones were annotated. The ratings of the Big-Five personality traits were reduced to two dimensions (P1: agreeableness, conscientiousness, and emotional stability; P2: extraversion and openness) using a principal component analysis. The results suggest that the acoustic differences between state-of-the-art synthetic speech and its original counterpart can produce varying effects on personality perception. For example, speech produced with a narrower F0 range received lower scores on P1 and P2, but for male speakers, this effect was only observed in synthetic voices, likely due to the less-natural intonational patterns used. The intonation analysis further demonstrates that across speech type, using context-appropriate tones or those conveying positive attitudes improves the overall impression of the voice (both P1 and P2). The results also suggest that a less-modal voice enhances the personality scores overall, but specific voice qualities (i.e., breathiness and creakiness) and voice pitch seem to affect P1 and P2 differently. The present study demonstrates a range of acoustic and phonetic characteristics that should be considered when designing personas for AI voices or developing more likable synthetic voices.

本研究探讨了合成声音和自然声音的声学和语音特征如何影响声音的个性印象。为此,我们进行了一项人格评定实验,让30名母语为韩语的人用大五人格模型来判断自然的韩语话语和合成的韩语话语(语音克隆)的感知人格。然后从语音中提取各种声学参数,包括语音质量、F0和发音率,并对语调短语边界音调进行注释。采用主成分分析法,将大五人格特征的评分降至两个维度(P1:宜人性、尽责性和情绪稳定性;P2:外向性和开放性)。结果表明,最先进的合成语音与原始语音之间的声学差异会对人格感知产生不同的影响。例如,F0范围较窄的语音在P1和P2上得分较低,但对于男性说话者来说,这种影响只在合成声音中观察到,可能是由于使用的语调模式不太自然。语调分析进一步表明,在不同的语音类型中,使用适合上下文的语调或传达积极态度的语调可以改善声音的整体印象(P1和P2)。结果还表明,少模态的声音总体上提高了个性得分,但特定的声音质量(即呼吸和嘎吱声)和音高似乎对P1和P2的影响不同。目前的研究表明,在为人工智能语音设计人物角色或开发更讨人喜欢的合成语音时,应该考虑一系列声学和语音特征。
{"title":"What Determines Personality Impressions of Synthetic and Natural Voices? The Effects of Voice Quality and Intonation.","authors":"Minjeong Kim, Jaehan Park, Minhong Jeong, Jieun Song","doi":"10.1177/00238309251389567","DOIUrl":"https://doi.org/10.1177/00238309251389567","url":null,"abstract":"<p><p>The present study investigated how acoustic and phonetic characteristics of synthetic and natural voices affect personality impressions of the voices. To this end, we conducted a personality rating experiment in which 30 native Korean speakers judged the perceived personality of natural Korean utterances and their synthetic counterparts (voice clones) using the Big-Five personality model. Various acoustic parameters, including measures of voice quality, F0, and articulation rate, were then extracted from the speech, and Intonational Phrase boundary tones were annotated. The ratings of the Big-Five personality traits were reduced to two dimensions (P1: agreeableness, conscientiousness, and emotional stability; P2: extraversion and openness) using a principal component analysis. The results suggest that the acoustic differences between state-of-the-art synthetic speech and its original counterpart can produce varying effects on personality perception. For example, speech produced with a narrower F0 range received lower scores on P1 and P2, but for male speakers, this effect was only observed in synthetic voices, likely due to the less-natural intonational patterns used. The intonation analysis further demonstrates that across speech type, using context-appropriate tones or those conveying positive attitudes improves the overall impression of the voice (both P1 and P2). The results also suggest that a less-modal voice enhances the personality scores overall, but specific voice qualities (i.e., breathiness and creakiness) and voice pitch seem to affect P1 and P2 differently. The present study demonstrates a range of acoustic and phonetic characteristics that should be considered when designing personas for AI voices or developing more likable synthetic voices.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251389567"},"PeriodicalIF":1.1,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145835357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling Denasalization as an Ongoing Sound Change: The Role of Prosody and Gender in Seoul Korean. 揭示去中性化是一种持续的声音变化:首尔韩语中韵律和性别的作用。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-17 DOI: 10.1177/00238309251390431
Jiyoung Jang, Jungah Lee, Jiyoung Lee, Sahyang Kim, Taehong Cho

This study examines variation in coarticulatory vowel nasalization in Seoul Korean as a function of prosodic boundaries and gender, exploring its role in an emerging denasalization sound change. Coarticulatory vowel nasality, measured by A1-P0, was analyzed in the word-initial vowels of /ma.mi/ across three prosodic boundary conditions (IP-initial, AP-initial, and Wd-initial) in 35 speakers in their 20s. Results show that phrase-initial vowels exhibit reduced nasality as part of domain-initial articulatory strengthening, suggesting that denasalization of word-initial nasal consonants extends to the following vowel, reducing its coarticulatory nasalization and thus signaling the progression of a position-driven sound change. Significant gender differences were found: male speakers consistently adhere to this change throughout the vowel, exhibiting greater reductions in coarticulatory vowel nasalization in phrase-initial contexts. In contrast, female speakers retain higher nasality levels in both phrase-initial and phrase-medial positions by regulating the coarticulatory process. These gender-related differences may reflect socially grounded perceptions of nasality and/or female speakers' tendency to preserve phonological features, influencing speech production choices. These findings highlight the interplay between prosodically driven phonetic variation and gender: speakers actively control the degree of vowel nasalization, and this phonetic variation, in turn, is further shaped by gender, potentially evolving into a systematic sound change.

本研究考察了首尔朝鲜语元音协同发音的鼻音化随韵律边界和性别的变化,探讨了其在新兴的去鼻音化声音变化中的作用。用A1-P0测量/ma的单词开头元音,分析辅音元音的鼻音性。35名20多岁的说话者在三个韵律边界条件(ip首、ap首和wd首)上的发音差异。结果表明,短语初始元音作为域初始发音强化的一部分表现出鼻音减弱,这表明单词初始鼻辅音的去鼻音化延伸到下一个元音,减少了它的协同发音,从而标志着位置驱动的声音变化的进展。研究发现了显著的性别差异:男性说话者始终坚持元音的这种变化,在短语开头的语境中,元音的协同发音明显减少。相比之下,女性说话者通过调节协同发音过程,在短语起始和短语中间位置都保持了较高的鼻音水平。这些与性别相关的差异可能反映了社会对鼻音的认知和/或女性说话者倾向于保留语音特征,从而影响语音产生的选择。这些发现强调了韵律驱动的语音变化与性别之间的相互作用:说话者主动控制元音鼻音化的程度,而这种语音变化反过来又受到性别的进一步影响,有可能演变成一种系统的语音变化。
{"title":"Unveiling Denasalization as an Ongoing Sound Change: The Role of Prosody and Gender in Seoul Korean.","authors":"Jiyoung Jang, Jungah Lee, Jiyoung Lee, Sahyang Kim, Taehong Cho","doi":"10.1177/00238309251390431","DOIUrl":"https://doi.org/10.1177/00238309251390431","url":null,"abstract":"<p><p>This study examines variation in coarticulatory vowel nasalization in Seoul Korean as a function of prosodic boundaries and gender, exploring its role in an emerging denasalization sound change. Coarticulatory vowel nasality, measured by A1-P0, was analyzed in the word-initial vowels of /ma.mi/ across three prosodic boundary conditions (IP-initial, AP-initial, and Wd-initial) in 35 speakers in their 20s. Results show that phrase-initial vowels exhibit reduced nasality as part of domain-initial articulatory strengthening, suggesting that denasalization of word-initial nasal consonants extends to the following vowel, reducing its coarticulatory nasalization and thus signaling the progression of a position-driven sound change. Significant gender differences were found: male speakers consistently adhere to this change throughout the vowel, exhibiting greater reductions in coarticulatory vowel nasalization in phrase-initial contexts. In contrast, female speakers retain higher nasality levels in both phrase-initial and phrase-medial positions by regulating the coarticulatory process. These gender-related differences may reflect socially grounded perceptions of nasality and/or female speakers' tendency to preserve phonological features, influencing speech production choices. These findings highlight the interplay between prosodically driven phonetic variation and gender: speakers actively control the degree of vowel nasalization, and this phonetic variation, in turn, is further shaped by gender, potentially evolving into a systematic sound change.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251390431"},"PeriodicalIF":1.1,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Listening Effort Across Non-Native and Regional Accents: A Pupillometry Study. 非母语和地方口音的听力努力:一项瞳孔测量研究。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-17 DOI: 10.1177/00238309251389573
Marc Barnard, Scott Kunkel, Rémi Lamarque, Adam J Chong

Previous work has shown that L2-accented speech incurs a processing cost even when accurately understood. It remains unknown, however, whether an online processing cost is found when listeners process speech produced in L1 accents that are not their own. In this study, we examine this question by using comparative pupil dilation as a measure of cognitive load. Participants from the South of England heard sentences produced in four different accents: Southern British English (the listeners' own familiar accent), American English (a standard L1 accent widely used in media), Glaswegian English (a less-familiar regional L1 accent), and Mandarin Chinese-accented English (an L2 English accent). Results show that Chinese-accented speech elicited significantly larger pupil dilation responses compared with Southern British English. Speech from less-familiar L1 accents elicited pupil dilation responses of different shapes and trajectories, suggesting differences in processing of these accents. Furthermore, participants showed larger mean pupil dilation when they heard relatively less-familiar L1 American-accented speech than when hearing Glaswegian English. Interestingly, this effect was found despite participants self-reporting that they were less familiar with the Glaswegian accent and found it more effortful to comprehend compared with American English. These results suggest that accurately perceived and highly intelligible L1 accents such as American English also incur a cognitive cost in processing, but to a smaller extent compared with L2-accented speech. We discuss the implications of our findings for the relationship between exposure, subjective effortfulness measures, and pupil dilation responses.

先前的研究表明,l2口音的语音即使被准确理解也会产生处理成本。然而,当听者处理非母语口音的语音时,是否会产生在线处理成本,目前尚不清楚。在这项研究中,我们通过使用比较瞳孔扩张作为认知负荷的测量来检验这个问题。来自英格兰南部的参与者听到了四种不同口音的句子:英国南部英语(听众自己熟悉的口音),美国英语(媒体中广泛使用的标准L1口音),格拉斯哥英语(不太熟悉的地区L1口音)和普通话中国口音英语(第二语言英语口音)。结果表明,与英国南部英语相比,中国口音语音引起的瞳孔扩张反应明显更大。来自不太熟悉的母语口音的语音引起不同形状和轨迹的瞳孔扩张反应,表明这些口音的处理存在差异。此外,当参与者听到相对不太熟悉的L1美国口音讲话时,他们的平均瞳孔扩张幅度比听到格拉斯哥英语时要大。有趣的是,尽管参与者自我报告说他们对格拉斯哥口音不太熟悉,并且发现与美式英语相比,格拉斯哥口音更难以理解,但还是发现了这种影响。这些结果表明,准确感知和高度可理解的L1口音(如美式英语)在处理过程中也会产生认知成本,但与l2口音相比,其程度较小。我们讨论了我们的发现对暴露、主观努力测量和瞳孔扩张反应之间关系的影响。
{"title":"Listening Effort Across Non-Native and Regional Accents: A Pupillometry Study.","authors":"Marc Barnard, Scott Kunkel, Rémi Lamarque, Adam J Chong","doi":"10.1177/00238309251389573","DOIUrl":"https://doi.org/10.1177/00238309251389573","url":null,"abstract":"<p><p>Previous work has shown that L2-accented speech incurs a processing cost even when accurately understood. It remains unknown, however, whether an online processing cost is found when listeners process speech produced in L1 accents that are not their own. In this study, we examine this question by using comparative pupil dilation as a measure of cognitive load. Participants from the South of England heard sentences produced in four different accents: Southern British English (the listeners' own familiar accent), American English (a standard L1 accent widely used in media), Glaswegian English (a less-familiar regional L1 accent), and Mandarin Chinese-accented English (an L2 English accent). Results show that Chinese-accented speech elicited significantly larger pupil dilation responses compared with Southern British English. Speech from less-familiar L1 accents elicited pupil dilation responses of different shapes and trajectories, suggesting differences in processing of these accents. Furthermore, participants showed larger mean pupil dilation when they heard relatively less-familiar L1 American-accented speech than when hearing Glaswegian English. Interestingly, this effect was found despite participants self-reporting that they were less familiar with the Glaswegian accent and found it more effortful to comprehend compared with American English. These results suggest that accurately perceived and highly intelligible L1 accents such as American English also incur a cognitive cost in processing, but to a smaller extent compared with L2-accented speech. We discuss the implications of our findings for the relationship between exposure, subjective effortfulness measures, and pupil dilation responses.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251389573"},"PeriodicalIF":1.1,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrigendum to "Sources of Intelligibility of Distant Languages: An Empirical Study". “遥远语言可理解性的来源:一项实证研究”的勘误表。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-16 DOI: 10.1177/00238309251410999
{"title":"Corrigendum to \"Sources of Intelligibility of Distant Languages: An Empirical Study\".","authors":"","doi":"10.1177/00238309251410999","DOIUrl":"https://doi.org/10.1177/00238309251410999","url":null,"abstract":"","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251410999"},"PeriodicalIF":1.1,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145764562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Apparent Talker Variability and Speaking Style Similarity Can Enhance Comprehension of Novel L2-Accented Talkers. 说话人的明显变异性和说话风格的相似性可以增强对l2口音新说话人的理解。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-15 DOI: 10.1177/00238309251390505
Nicholas B Aoki, Georgia Zellou

Certain studies report facilitatory effects of multiple-talker exposure on cross-talker generalization of L2-accented speech (often defined as greater comprehension of novel talkers). However, a confound exists in prior work: do multiple-talker exposure benefits stem from the greater number of talkers (numerosity) or greater phonological variability (heterogeneity)? This study examined how apparent talker variability and speaking style affect L2-accent adaptation, while keeping phonological variation as constant as possible across exposure conditions. L1-English participants transcribed sentences in noise for a single Mandarin-accented English talker in an exposure phase and a novel Mandarin-accented English speaker in a test phase (a control condition received no exposure). Although all exposure stimuli came from one speaker, half of the listeners who received exposure were led to believe that multiple talkers were present by shifting the F0 and formants of a subset of sentences. We find: (a) when the test talker produces casual speech, all critical conditions with exposure enhance generalization (i.e., greater comprehension of the test talker relative to control); (b) when the test talker produces hard-of-hearing-directed speech, there is no difference in transcription accuracy between the control and critical conditions; and (c) when the test talker produces casual speech, generalization is greatest when listeners are exposed to multiple apparent talkers, but only given speaking style similarity between exposure and test (i.e., when the exposure phase also presents casual speech). This work lends credence to numerosity accounts-given a minimal change in phonological variability, the illusion of multiple-talker exposure can facilitate cross-talker generalization of L2-accented speech.

某些研究报告了多重说话者暴露对相声者l2口音语音的泛化(通常定义为对新说话者的更好理解)的促进作用。然而,在先前的研究中存在一个困惑:多说话者暴露的好处是来自于更多的说话者(数量)还是更大的语音变异性(异质性)?本研究考察了说话者的明显变化和说话风格如何影响l2口音适应,同时在不同的暴露条件下尽可能保持语音变化不变。l1 -英语的参与者在噪音中为一个有普通话口音的英语说话者转录句子,在一个暴露阶段,为一个有普通话口音的英语说话者转录句子,在一个测试阶段(控制条件没有暴露)。尽管所有的暴露刺激都来自一个说话者,但通过改变句子子集的F0和共振峰,接受暴露的一半听众被引导相信有多个说话者在场。我们发现:(a)当测试说话者随意说话时,所有暴露的关键条件都增强了泛化(即,相对于对照组,对测试说话者有更好的理解);(b)当测试说话者产生听力困难的定向语音时,对照和临界条件下的转录准确性没有差异;(c)当测试说话者产生随意言语时,当听者暴露于多个明显的说话者,但只有在暴露和测试之间的说话风格相似时(即,当暴露阶段也呈现随意言语时),泛化效果最大。这项工作为数字解释提供了可信度——考虑到语音可变性的最小变化,多说话者暴露的错觉可以促进对口说话者对l2口音的概括。
{"title":"Apparent Talker Variability and Speaking Style Similarity Can Enhance Comprehension of Novel L2-Accented Talkers.","authors":"Nicholas B Aoki, Georgia Zellou","doi":"10.1177/00238309251390505","DOIUrl":"https://doi.org/10.1177/00238309251390505","url":null,"abstract":"<p><p>Certain studies report facilitatory effects of multiple-talker exposure on cross-talker generalization of L2-accented speech (often defined as greater comprehension of novel talkers). However, a confound exists in prior work: do multiple-talker exposure benefits stem from the greater number of talkers (numerosity) or greater phonological variability (heterogeneity)? This study examined how apparent talker variability and speaking style affect L2-accent adaptation, while keeping phonological variation as constant as possible across exposure conditions. L1-English participants transcribed sentences in noise for a single Mandarin-accented English talker in an exposure phase and a novel Mandarin-accented English speaker in a test phase (a control condition received no exposure). Although all exposure stimuli came from one speaker, half of the listeners who received exposure were led to believe that multiple talkers were present by shifting the F0 and formants of a subset of sentences. We find: (a) when the test talker produces casual speech, all critical conditions with exposure enhance generalization (i.e., greater comprehension of the test talker relative to control); (b) when the test talker produces hard-of-hearing-directed speech, there is no difference in transcription accuracy between the control and critical conditions; and (c) when the test talker produces casual speech, generalization is greatest when listeners are exposed to multiple apparent talkers, but only given speaking style similarity between exposure and test (i.e., when the exposure phase also presents casual speech). This work lends credence to numerosity accounts-given a minimal change in phonological variability, the illusion of multiple-talker exposure can facilitate cross-talker generalization of L2-accented speech.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251390505"},"PeriodicalIF":1.1,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145764569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing Online and Face-to-Face Administration of the Polish Sentence Repetition Task in Monolingual and Multilingual Children: Higher Scores in Face-to-Face Testing. 单语和多语儿童波兰语句子重复任务在线管理和面对面管理的比较:面对面测试得分更高。
IF 1.1 2区 文学 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2025-12-14 DOI: 10.1177/00238309251394372
Natalia Banasik-Jemielniak, Magdalena Kochańska, Maria Obarska, Maria Zajączkowska, Joanna Świderska, Ewa Haman

This study compared online and face-to-face (f2f) testing using the short Polish version of the LITMUS Sentence Repetition Task (SRep) with multilingual and monolingual Polish-speaking children. The shift to remote testing during the COVID-19 pandemic prompted questions about whether online methods yield results comparable with in-person testing for assessing multilingual children's grammatical abilities. Reliable online testing could enhance access to underrepresented populations, enabling families from diverse backgrounds to participate from home. We tested 92 multilingual children (speaking Polish and English or German) and 55 monolingual Polish-speaking children aged 4;6-7;6. Each child completed the SRep task twice (online and f2f) in a counterbalanced order. Results showed better performance on f2f tasks for both groups. Multilingual children improved on their second attempt, regardless of format, while monolinguals consistently scored higher in the f2f condition. These findings indicate differences in performance across testing modalities and the need to adapt and norm the SRep task for both online and f2f administration separately.

这项研究比较了在线测试和面对面测试(f2f),使用短波兰语版本的LITMUS句子重复任务(SRep)对多语和单语波兰语儿童进行测试。在COVID-19大流行期间转向远程测试引发了人们的质疑,即在线方法是否能产生与现场测试相当的结果,以评估多语言儿童的语法能力。可靠的在线测试可以增加对代表性不足的人群的访问,使来自不同背景的家庭能够在家参与。我们测试了92名多语儿童(讲波兰语、英语或德语)和55名4岁、6-7岁和6岁的单语波兰语儿童。每个孩子以平衡的顺序完成两次SRep任务(在线和f2f)。结果显示,两组在f2f任务上的表现都更好。多语言儿童在第二次测试中有所提高,无论测试形式如何,而单语言儿童在第二次测试中得分始终较高。这些发现表明,不同测试方式的表现存在差异,需要分别适应和规范在线和在线管理的SRep任务。
{"title":"Comparing Online and Face-to-Face Administration of the Polish Sentence Repetition Task in Monolingual and Multilingual Children: Higher Scores in Face-to-Face Testing.","authors":"Natalia Banasik-Jemielniak, Magdalena Kochańska, Maria Obarska, Maria Zajączkowska, Joanna Świderska, Ewa Haman","doi":"10.1177/00238309251394372","DOIUrl":"https://doi.org/10.1177/00238309251394372","url":null,"abstract":"<p><p>This study compared online and face-to-face (f2f) testing using the short Polish version of the LITMUS Sentence Repetition Task (SRep) with multilingual and monolingual Polish-speaking children. The shift to remote testing during the COVID-19 pandemic prompted questions about whether online methods yield results comparable with in-person testing for assessing multilingual children's grammatical abilities. Reliable online testing could enhance access to underrepresented populations, enabling families from diverse backgrounds to participate from home. We tested 92 multilingual children (speaking Polish and English or German) and 55 monolingual Polish-speaking children aged 4;6-7;6. Each child completed the SRep task twice (online and f2f) in a counterbalanced order. Results showed better performance on f2f tasks for both groups. Multilingual children improved on their second attempt, regardless of format, while monolinguals consistently scored higher in the f2f condition. These findings indicate differences in performance across testing modalities and the need to adapt and norm the SRep task for both online and f2f administration separately.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309251394372"},"PeriodicalIF":1.1,"publicationDate":"2025-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145758397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Language and Speech
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1