Journal of Phonetics最新文献

The effect of contrast-specific minimal pair competitor in hyperarticulation of VOT and F0 phonetic cues in Korean initial stops in tonogenetic sound change 对比特异性最小对竞争对手对朝鲜语VOT和F0语音线索高发音的影响

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2026-01-31 DOI: 10.1016/j.wocn.2026.101474

Cheonkam Jeong, Andrew Wedel

In Seoul Korean, VOT is historically a primary cue distinguishing the aspirated

\sim

lenis contrast in stops, with fundamental frequency (F0) of the following vowel as a secondary cue. However, for many speakers a transphonologization is in progress in which the VOT cue progresses toward neutralization, with a concomitant expansion of the F0 contrast. Prior research shows that phonetic cues which distinguish a lexical minimal pair are hyperarticulated, and suggests that this contrastive hyperarticulation influences the trajectory of change in that phoneme contrast. Here we investigate minimal pair-associated hyperarticulaton of the VOT and F0 contrasts of aspirated/lenis stops in both a production study and a study of a corpus of natural speech. We ask (i) if we in fact find contrastive hyperarticulation of the aspirated

\sim

lenis distinction in minimal pairs, and (ii) if the degree of VOT hyperarticulation tracks the degree to which a speaker employs the VOT cue in the aspirated

\sim

lenis distinction. We find that speakers contrastively hyperarticulate both the VOT and F0 cues to the aspirated

\sim

lenis distinction, although hyperarticulation of the F0 contrast is less robust. Moreover, using a new measure of advancement in this sound change, we show that VOT still shows minimal pair-associated hyperarticulation even in advanced speakers who do not detectably use VOT in their general speech. We discuss possible explanations for why minimal pair-associated hyperarticulation of VOT appears to persist in these advanced speakers, and its implications for theories of sound change.

在首尔韩语中，VOT历来是区分送气~ lenis停顿对比的主要线索，下面元音的基频（F0）作为次要线索。然而，对于许多说话者来说，语音转义化正在进行中，VOT线索向中和发展，伴随着F0对比的扩大。先前的研究表明，区分词汇最小对的语音线索是高清晰度的，并表明这种对比高清晰度影响了音素对比的变化轨迹。在这里，我们研究了最小对相关的高发音VOT和F0对比的吸气/lenis停顿在生产研究和自然语音语料库的研究。我们要问：(i)我们是否在最小对中发现了吸气~ lenis区分的对比高清晰度，以及（ii） VOT高清晰度的程度是否跟踪说话者在吸气~ lenis区分中使用VOT线索的程度。我们发现，说话者在VOT和F0的对比上都有高清晰度提示吸气~ lenis的区别，尽管F0对比的高清晰度不那么强。此外，使用这种声音变化的新测量方法，我们表明即使在一般讲话中没有检测到VOT的高级说话者中，VOT仍然显示出最小的成对相关的高发音。我们讨论了为什么在这些高级说话者中存在最小对相关的VOT高清晰度的可能解释，以及它对声音变化理论的影响。

{"title":"The effect of contrast-specific minimal pair competitor in hyperarticulation of VOT and F0 phonetic cues in Korean initial stops in tonogenetic sound change","authors":"Cheonkam Jeong, Andrew Wedel","doi":"10.1016/j.wocn.2026.101474","DOIUrl":"10.1016/j.wocn.2026.101474","url":null,"abstract":"<div><div>In Seoul Korean, VOT is historically a primary cue distinguishing the aspirated<span><math><mrow><mo>∼</mo></mrow></math></span>lenis contrast in stops, with fundamental frequency (F0) of the following vowel as a secondary cue. However, for many speakers a transphonologization is in progress in which the VOT cue progresses toward neutralization, with a concomitant expansion of the F0 contrast. Prior research shows that phonetic cues which distinguish a lexical minimal pair are hyperarticulated, and suggests that this contrastive hyperarticulation influences the trajectory of change in that phoneme contrast. Here we investigate minimal pair-associated hyperarticulaton of the VOT and F0 contrasts of aspirated/lenis stops in both a production study and a study of a corpus of natural speech. We ask (i) if we in fact find contrastive hyperarticulation of the aspirated<span><math><mrow><mo>∼</mo></mrow></math></span>lenis distinction in minimal pairs, and (ii) if the degree of VOT hyperarticulation tracks the degree to which a speaker employs the VOT cue in the aspirated<span><math><mrow><mo>∼</mo></mrow></math></span>lenis distinction. We find that speakers contrastively hyperarticulate both the VOT and F0 cues to the aspirated<span><math><mrow><mo>∼</mo></mrow></math></span>lenis distinction, although hyperarticulation of the F0 contrast is less robust. Moreover, using a new measure of advancement in this sound change, we show that VOT still shows minimal pair-associated hyperarticulation even in advanced speakers who do not detectably use VOT in their general speech. We discuss possible explanations for why minimal pair-associated hyperarticulation of VOT appears to persist in these advanced speakers, and its implications for theories of sound change.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"115 ","pages":"Article 101474"},"PeriodicalIF":2.4,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146090807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Interplay of Planning and Prosody: Investigating the Bidirectional Influences of Planning and Prosody in Speech Production 计划与韵律的相互作用：研究计划与韵律在言语产生中的双向影响

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2026-01-01 DOI: 10.1016/j.wocn.2025.101461

Jungyun Seo, Ruaridh Purse, Jelena Krivokapić

This study investigates how planning and prosodic structure interact in speech production. Planning is operationalized in this study as the selection of one lexical item from two possible candidates. In an Electromagnetic Articulometry study that elicited this planning at a word or phrase boundary, two questions were examined. The first question tested whether planning has an effect on the kinematic properties of prosodic phrase boundaries. The results show that an increase in planning load does not affect the scope of boundary-related gestural lengthening but only leads to an increase in pause duration. The second question tested the effect of planning at word boundaries, specifically whether an increase in planning load at a word boundary leads to the production of a prosodic phrase boundary or just the insertion of a pause. The results show that an increase in the planning load at word boundaries leads to lengthening of gestures at a boundary and the insertion of pauses, indicating that speakers insert prosodic phrase boundaries when they need more planning time.

本研究探讨计划与韵律结构在言语产生中的相互作用。计划在本研究中被操作为从两个可能的候选词汇中选择一个词汇。在一项电磁发音测量研究中，在单词或短语边界引出了这种规划，研究了两个问题。第一个问题是测试规划是否对韵律短语边界的运动特性有影响。结果表明，规划负荷的增加不影响边界相关手势延长的范围，而只导致暂停时间的增加。第二个问题测试了在单词边界处规划的效果，特别是在单词边界处增加规划负荷是否会导致韵律短语边界的产生或只是插入停顿。结果表明，单词边界规划负荷的增加导致边界手势的延长和停顿的插入，表明说话者在需要更多规划时间时插入韵律短语边界。

{"title":"The Interplay of Planning and Prosody: Investigating the Bidirectional Influences of Planning and Prosody in Speech Production","authors":"Jungyun Seo, Ruaridh Purse, Jelena Krivokapić","doi":"10.1016/j.wocn.2025.101461","DOIUrl":"10.1016/j.wocn.2025.101461","url":null,"abstract":"<div><div>This study investigates how planning and prosodic structure interact in speech production. Planning is operationalized in this study as the selection of one lexical item from two possible candidates. In an Electromagnetic Articulometry study that elicited this planning at a word or phrase boundary, two questions were examined. The first question tested whether planning has an effect on the kinematic properties of prosodic phrase boundaries. The results show that an increase in planning load does not affect the scope of boundary-related gestural lengthening but only leads to an increase in pause duration. The second question tested the effect of planning at word boundaries, specifically whether an increase in planning load at a word boundary leads to the production of a prosodic phrase boundary or just the insertion of a pause. The results show that an increase in the planning load at word boundaries leads to lengthening of gestures at a boundary and the insertion of pauses, indicating that speakers insert prosodic phrase boundaries when they need more planning time.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"114 ","pages":"Article 101461"},"PeriodicalIF":2.4,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advancements of phonetics in the 21st century: Intonation 21世纪语音学的进展：语调

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2026-01-01 DOI: 10.1016/j.wocn.2025.101459

Amalia Arvaniti , Martine Grice , Mariapaola D’Imperio

Intonation, the linguistic use of voice pitch, is critical for acquisition, phonetic organisation at the phrasal level and subsequent speech processing. By indicating syntactic constituency and encoding information structure and conversational implicatures, intonation is not only crucial for communication, but also for understanding grammatical phenomena such as information structure and syntax. Nevertheless, existing research has not always done justice to the roles that intonation plays in grammar, acquisition, and communication. Here, we first present an overview of what intonation is (and what it is not), and briefly discuss the essential tenets of the autosegmental-metrical (AM) theory of intonational phonology, currently the most influential and widely used approach to intonation. We critically review the standard methods of conducting intonation research within AM and present newer methodologies developed in response to shortcomings of the standard methods. We conclude by reviewing potential changes to AM in light of the findings stemming from these newer methods, followed by suggestions for future research.

语调，即音高的语言使用，对于习得、短语水平的语音组织和随后的语音处理至关重要。语调通过指示句法组成、编码信息结构和会话含义，不仅对交际至关重要，而且对理解信息结构和句法等语法现象也至关重要。然而，现有的研究并不总是公正地对待语调在语法、习得和交际中的作用。在这里，我们首先概述了什么是语调（什么不是），并简要讨论了语调音韵学的自音段格律（AM）理论的基本原则，这是目前最具影响力和广泛使用的语调研究方法。我们批判性地回顾了在AM中进行语调研究的标准方法，并提出了针对标准方法缺点而开发的新方法。最后，根据这些新方法的发现，我们回顾了AM的潜在变化，然后对未来的研究提出了建议。

引用次数: 0

Tone sandhi and tonal coarticulation in disyllabic sequences in Changsha Xiang 长沙方言双音节序列中的连读和连音

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2026-01-01 DOI: 10.1016/j.wocn.2025.101472

Miao Zhang , Shuxiao Gong , Chengyu Guo

This study investigates tone sandhi and tonal coarticulation in disyllabic sequences in Changsha Xiang, a Sinitic language with six lexical tones in Hunan Province, China. Changsha Xiang exhibits two distinct tone sandhi patterns: right-dominant and left-dominant. Previous descriptions of these patterns have been largely impressionistic, lacking an instrumental phonetic analysis. Building on data from 16 native speakers, this study provides an acoustic analysis of the tonal realization, coarticulation, and phonetic reduction patterns in Changsha Xiang. Our results indicate that in right-dominant sequences, the non-dominant syllable exhibits reduced F0 excursion without neutralization, whereas in left-dominant sequences, the non-dominant syllable undergoes paradigmatic neutralization to four short level tones. Additionally, the study finds that the tone sandhi pattern affects tonal coarticulation, with a larger carryover effect in non-dominant final syllables. These findings contribute to a broader understanding of tonal processes in Sinitic languages and highlight parallels between tone sandhi and stress-tone interactions in other languages.

本文研究了湖南六声调汉语长沙乡的双音节连读和连音现象。长沙语有两种截然不同的变调模式：左调和右调。以前对这些模式的描述在很大程度上是印象派的，缺乏工具语音分析。本研究以16位长沙乡话为研究对象，对长沙乡话的声调实现、协同发音和语音弱化模式进行了声学分析。结果表明，在右优势序列中，非优势音节的F0偏移在不中和的情况下减少，而在左优势序列中，非优势音节的F0偏移在四个短水平音中进行范式中和。此外，研究还发现变调模式会影响音调的协同发音，在非优势音节的最后音节中，变调模式的结转效应更大。这些发现有助于更广泛地理解汉语的声调过程，并突出了其他语言中变调和重音互动之间的相似之处。

{"title":"Tone sandhi and tonal coarticulation in disyllabic sequences in Changsha Xiang","authors":"Miao Zhang , Shuxiao Gong , Chengyu Guo","doi":"10.1016/j.wocn.2025.101472","DOIUrl":"10.1016/j.wocn.2025.101472","url":null,"abstract":"<div><div>This study investigates tone sandhi and tonal coarticulation in disyllabic sequences in Changsha Xiang, a Sinitic language with six lexical tones in Hunan Province, China. Changsha Xiang exhibits two distinct tone sandhi patterns: right-dominant and left-dominant. Previous descriptions of these patterns have been largely impressionistic, lacking an instrumental phonetic analysis. Building on data from 16 native speakers, this study provides an acoustic analysis of the tonal realization, coarticulation, and phonetic reduction patterns in Changsha Xiang. Our results indicate that in right-dominant sequences, the non-dominant syllable exhibits reduced F0 excursion without neutralization, whereas in left-dominant sequences, the non-dominant syllable undergoes paradigmatic neutralization to four short level tones. Additionally, the study finds that the tone sandhi pattern affects tonal coarticulation, with a larger carryover effect in non-dominant final syllables. These findings contribute to a broader understanding of tonal processes in Sinitic languages and highlight parallels between tone sandhi and stress-tone interactions in other languages.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"114 ","pages":"Article 101472"},"PeriodicalIF":2.4,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamic formant trajectories in L2 English: evidence from Arabic-speaking adolescent learners 二语英语动态形成轨迹：来自阿拉伯语青少年学习者的证据

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-12-26 DOI: 10.1016/j.wocn.2025.101471

Hicham Adem

Previous research on Arabic-speaking learners of English (L2) has often focused on adults and relied on static formant measurements, underscoring the need for dynamic approaches that better capture the temporal characteristics of L2 vowel production. Grounded in the revised Speech Learning Model (SLM-r), this study analyzes the formant trajectories of seven English vowels in CVC contexts, produced by 65 adolescent female learners whose L1 is Palestinian Arabic. Static vowel targets were descriptively compared to normative values from a corpus of L1 English speakers (N = 436; Lee et al., 1999), with age, gender, and CVC context matched across datasets; dynamic analyses were conducted on the L2 learner data only. Formant trajectories were modeled as multidimensional time series to assess Vowel-Inherent Spectral Change (VISC), using three trajectory-based measures—Vector Length (VL), Trajectory Length (TL), and Formant Velocity (FV)—alongside angular displacement as an additional cue. A multi-method approach combining Linear Mixed-Effects modeling (LME), Functional Principal Component Analysis (fPCA), and trajectory analysis identified group-level patterns of vowel-space compression, slower formant transitions, and reduced spectral–temporal modulation. fPCA showed joint reductions in F1 and F2 dynamics, reflecting a centralized, flattened vowel space with limited temporal modulation and diminished contrast. These outcomes suggest system-level reorganization of the L2 vowel space, shaped by L1-based constraints. Trajectory orientation emerged as a secondary cue when integrated with other time-varying measures, with directional characteristics providing limited but detectable acoustic distinctiveness. Classification models comparing dynamic, static, combined, and direction-only features showed that time-resolved spectral cues most consistently supported within-speaker classification and were more sensitive to spectral variation. Trajectory and fPCA analyses extend the empirical scope of the SLM-r by highlighting that equivalence classification may involve not only reduced acoustic category separation but also a reorganization of the spectral–temporal geometry of the L2 vowel system. These findings offer a signal-level perspective on vowel restructuring as it unfolds acoustically in underrepresented L2 learners and broaden our understanding of the acoustic structure of L2 vowels across diverse learning environments.

先前对说阿拉伯语的英语学习者（L2）的研究通常集中在成年人身上，并依赖于静态的形成峰测量，强调需要动态的方法来更好地捕捉L2元音产生的时间特征。本研究以修正后的言语学习模型（SLM-r）为基础，分析了65名母语为巴勒斯坦阿拉伯语的青春期女性学习者在CVC语境中7个英语元音的形成轨迹。静态元音目标与来自母语英语者语料库的规范值进行描述性比较（N = 436; Lee et al., 1999），年龄、性别和CVC上下文在数据集之间匹配；仅对第二语言学习者的数据进行动态分析。形成峰轨迹被建模为多维时间序列，以评估元音固有谱变化（VISC），使用三种基于轨迹的测量-矢量长度（VL），轨迹长度（TL）和形成峰速度(FV) -以及角位移作为额外线索。结合线性混合效应建模（LME）、功能主成分分析（fPCA）和轨迹分析的多方法方法确定了元音空间压缩、较慢的形成峰转换和减少的频谱时间调制的群体水平模式。fPCA表现出F1和F2动态的联合减弱，反映了集中、平坦的元音空间，时间调制有限，对比度减弱。这些结果表明L2元音空间的系统级重组是由基于l1的约束形成的。当与其他时变测量相结合时，轨迹方向成为次要线索，方向特征提供有限但可检测的声学特征。对比动态、静态、组合和方向特征的分类模型表明，时间分辨光谱线索最一致地支持说话人内部分类，并且对光谱变化更敏感。轨迹和fPCA分析扩展了SLM-r的经验范围，强调等效分类可能不仅涉及减少声学类别分离，还涉及L2元音系统的频谱-时间几何结构的重组。这些发现提供了一个信号层面的元音重组视角，因为它在代表性不足的二语学习者中展现了声学特征，并拓宽了我们对不同学习环境中二语元音声学结构的理解。

{"title":"Dynamic formant trajectories in L2 English: evidence from Arabic-speaking adolescent learners","authors":"Hicham Adem","doi":"10.1016/j.wocn.2025.101471","DOIUrl":"10.1016/j.wocn.2025.101471","url":null,"abstract":"<div><div>Previous research on Arabic-speaking learners of English (L2) has often focused on adults and relied on static formant measurements, underscoring the need for dynamic approaches that better capture the temporal characteristics of L2 vowel production. Grounded in the revised Speech Learning Model (SLM-r), this study analyzes the formant trajectories of seven English vowels in CVC contexts, produced by 65 adolescent female learners whose L1 is Palestinian Arabic. Static vowel targets were descriptively compared to normative values from a corpus of L1 English speakers (N = 436; Lee et al., 1999), with age, gender, and CVC context matched across datasets; dynamic analyses were conducted on the L2 learner data only. Formant trajectories were modeled as multidimensional time series to assess Vowel-Inherent Spectral Change (VISC), using three trajectory-based measures—Vector Length (VL), Trajectory Length (TL), and Formant Velocity (FV)—alongside angular displacement as an additional cue. A multi-method approach combining Linear Mixed-Effects modeling (LME), Functional Principal Component Analysis (fPCA), and trajectory analysis identified group-level patterns of vowel-space compression, slower formant transitions, and reduced spectral–temporal modulation. fPCA showed joint reductions in F1 and F2 dynamics, reflecting a centralized, flattened vowel space with limited temporal modulation and diminished contrast. These outcomes suggest system-level reorganization of the L2 vowel space, shaped by L1-based constraints. Trajectory orientation emerged as a secondary cue when integrated with other time-varying measures, with directional characteristics providing limited but detectable acoustic distinctiveness. Classification models comparing dynamic, static, combined, and direction-only features showed that time-resolved spectral cues most consistently supported within-speaker classification and were more sensitive to spectral variation. Trajectory and fPCA analyses extend the empirical scope of the SLM-r by highlighting that equivalence classification may involve not only reduced acoustic category separation but also a reorganization of the spectral–temporal geometry of the L2 vowel system. These findings offer a signal-level perspective on vowel restructuring as it unfolds acoustically in underrepresented L2 learners and broaden our understanding of the acoustic structure of L2 vowels across diverse learning environments.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"114 ","pages":"Article 101471"},"PeriodicalIF":2.4,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145842578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Linguistic and cognitive functions of fine phonetic detail underlying sound systems and sound change 语音系统和语音变化背后的语音细节的语言和认知功能

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-12-23 DOI: 10.1016/j.wocn.2025.101470

Taehong Cho , Sahyang Kim , Holger Mitterer

This special issue examines how fine phonetic detail participates in the shaping of sound systems. Across fourteen studies, the central theme is that subtle temporal, spectral, and articulatory patterns are not incidental by-products of articulation, but are systematically regulated aspects of speakers’ phonetic knowledge. They provide the means through which phonological contrasts and prosodic structure are realized, maintained, and sometimes reorganized. The contributions show how languages allocate continuous phonetic parameters—such as timing, coordination, voice quality, and nasality—within prosodic domains (e.g., phrases, words, and syllables) and under general biomechanical and communicative pressures. Studies of Irish, Hawaiian, Japanese, and Mandarin illustrate how prosodic structure guides segmental and suprasegmental realization. Work on English, German, Danish, and Cantonese demonstrates how fine phonetic detail underlies patterns of variation and creates potential pathways for change. Production connects naturally to perception and learning: findings from English accent adaptation and Samoan iterated learning reveal how listeners stabilize or reinterpret detail, linking individual processing to community-level patterning. A set of studies on Italian, Korean, English, and L2 German show how prominence reorganizes cues across articulation, interaction, and acquisition, shaping how speakers signal and listeners recover linguistic structure. These studies converge on a view in which fine phonetic detail arises from a central phonetic component (or the phonetic grammar) of linguistic structure—controlled by speakers, shaped by universal motor and perceptual constraints, and continually adjusted through perception and learning. In this perspective, sound systems emerge from the interplay of these regulated patterns, which sustain contrasts, support communication, and open principled routes for change.

本期特刊探讨了语音细节如何参与声音系统的形成。在14项研究中，中心主题是微妙的时间、频谱和发音模式不是发音的偶然副产品，而是说话者语音知识的系统调节方面。它们提供了实现、维持和有时重组语音对比和韵律结构的手段。这些贡献展示了语言如何在韵律域（如短语、单词和音节）和一般生物力学和交际压力下分配连续的语音参数，如时间、协调、语音质量和鼻音。对爱尔兰语、夏威夷语、日语和普通话的研究说明了韵律结构如何指导音段和超音段的实现。对英语、德语、丹麦语和广东话的研究表明，细微的语音细节是变化模式的基础，并为变化创造了潜在的途径。生产与感知和学习自然相关：来自英语口音适应和萨摩亚迭代学习的研究结果揭示了听众如何稳定或重新解释细节，将个人处理与社区水平的模式联系起来。一系列关于意大利语、韩语、英语和德语第二语言的研究表明，突出性如何在发音、互动和习得过程中重组线索，塑造说话者和听者如何发出信号和恢复语言结构。这些研究集中于这样一种观点，即精细的语音细节来自语言结构的中心语音成分（或语音语法），由说话者控制，由普遍的运动和感知约束形成，并通过感知和学习不断调整。从这个角度来看，健全的系统是从这些受调节模式的相互作用中产生的，这些模式维持了对比，支持了交流，并为变革开辟了原则性的路线。

{"title":"Linguistic and cognitive functions of fine phonetic detail underlying sound systems and sound change","authors":"Taehong Cho , Sahyang Kim , Holger Mitterer","doi":"10.1016/j.wocn.2025.101470","DOIUrl":"10.1016/j.wocn.2025.101470","url":null,"abstract":"<div><div>This special issue examines how fine phonetic detail participates in the shaping of sound systems. Across fourteen studies, the central theme is that subtle temporal, spectral, and articulatory patterns are not incidental by-products of articulation, but are systematically regulated aspects of speakers’ phonetic knowledge. They provide the means through which phonological contrasts and prosodic structure are realized, maintained, and sometimes reorganized. The contributions show how languages allocate continuous phonetic parameters—such as timing, coordination, voice quality, and nasality—within prosodic domains (e.g., phrases, words, and syllables) and under general biomechanical and communicative pressures. Studies of Irish, Hawaiian, Japanese, and Mandarin illustrate how prosodic structure guides segmental and suprasegmental realization. Work on English, German, Danish, and Cantonese demonstrates how fine phonetic detail underlies patterns of variation and creates potential pathways for change. Production connects naturally to perception and learning: findings from English accent adaptation and Samoan iterated learning reveal how listeners stabilize or reinterpret detail, linking individual processing to community-level patterning. A set of studies on Italian, Korean, English, and L2 German show how prominence reorganizes cues across articulation, interaction, and acquisition, shaping how speakers signal and listeners recover linguistic structure. These studies converge on a view in which fine phonetic detail arises from a central phonetic component (or the phonetic grammar) of linguistic structure—controlled by speakers, shaped by universal motor and perceptual constraints, and continually adjusted through perception and learning. In this perspective, sound systems emerge from the interplay of these regulated patterns, which sustain contrasts, support communication, and open principled routes for change.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"114 ","pages":"Article 101470"},"PeriodicalIF":2.4,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Imitation of F0 tone contours by Mandarin and English speakers is both categorical and continuous 普通话和英语使用者对F0音调轮廓的模仿既有绝对的，也有连续的

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-11-01 DOI: 10.1016/j.wocn.2025.101457

Wei Zhang , Meghan Clayards , Morgan Sonderegger

Native speakers imitate F0 contours that vary between two lexical tones non-linearly–they do not precisely reproduce the presented F0 features but instead cluster them toward tonal categories, the so-called contrast mediation effect. However, less is known whether non-native speakers who lack the lexical tone phonology will show linear imitation of F0 contours. Addressing this question will deepen our understanding of whether F0 imitation is solely influenced by lexical tone contrasts or also shaped by other sources of non-linearity beyond phonological contrasts. To investigate this, the current study examined the categorization and imitation of a Mandarin flat-falling tonal continuum by both Mandarin speakers and English speakers who were naïve to tonal languages. Imitation distributions were analyzed by comparing two models: a linear regression model, which assumes participants linearly track phonetic cues, and a mixture regression model, which assumes imitation reflects underlying categories. The mixture regression model fit the data better for the Mandarin speakers while the reverse was true for the English speakers, suggesting that Mandarin speakers imitated the F0 contours more categorically than English speakers. However, for both groups, the data was best fit using a weighted combination of both models. For the Mandarin group this result along with additional analyses of duration, F1 and intensity suggest that tone categories involve both phonological and phonetic information and imitation taps both, possibly via hyper- and hypo-articulation. For English participants, the evidence for categorical mediation suggests that imitation is mediated by factors other than lexically contrastive linguistic categories, although the exact nature of the factors is unclear.

母语人士模仿的F0轮廓在两个词汇音调之间呈非线性变化——他们并没有精确地再现所呈现的F0特征，而是将它们聚集到音调类别中，即所谓的对比中介效应。然而，缺乏词汇音系学的非母语人士是否会对F0轮廓表现出线性模仿，这一点尚不清楚。解决这个问题将加深我们对F0模仿是否仅仅受到词汇语调对比的影响，还是也受到语音对比以外的其他非线性来源的影响的理解。为了研究这一点，本研究调查了普通话使用者和英语使用者对声调语言naïve的普通话降调连续体的分类和模仿。通过比较两种模型来分析模仿分布：线性回归模型（假设参与者线性跟踪语音线索）和混合回归模型（假设模仿反映潜在类别）。混合回归模型对说普通话的人更符合数据，而说英语的人则相反，这表明说普通话的人比说英语的人更明确地模仿F0轮廓。然而，对于两组来说，使用两种模型的加权组合来拟合数据是最好的。对于普通话组，这一结果以及对持续时间、F1和强度的额外分析表明，声调类别涉及语音和语音信息，模仿可能通过高发音和低发音来实现。对于英语参与者来说，范畴中介的证据表明，模仿是由词汇对比语言类别以外的因素介导的，尽管这些因素的确切性质尚不清楚。

{"title":"Imitation of F0 tone contours by Mandarin and English speakers is both categorical and continuous","authors":"Wei Zhang , Meghan Clayards , Morgan Sonderegger","doi":"10.1016/j.wocn.2025.101457","DOIUrl":"10.1016/j.wocn.2025.101457","url":null,"abstract":"<div><div>Native speakers imitate F0 contours that vary between two lexical tones non-linearly–they do not precisely reproduce the presented F0 features but instead cluster them toward tonal categories, the so-called contrast mediation effect. However, less is known whether non-native speakers who lack the lexical tone phonology will show linear imitation of F0 contours. Addressing this question will deepen our understanding of whether F0 imitation is solely influenced by lexical tone contrasts or also shaped by other sources of non-linearity beyond phonological contrasts. To investigate this, the current study examined the categorization and imitation of a Mandarin flat-falling tonal continuum by both Mandarin speakers and English speakers who were naïve to tonal languages. Imitation distributions were analyzed by comparing two models: a linear regression model, which assumes participants linearly track phonetic cues, and a mixture regression model, which assumes imitation reflects underlying categories. The mixture regression model fit the data better for the Mandarin speakers while the reverse was true for the English speakers, suggesting that Mandarin speakers imitated the F0 contours more categorically than English speakers. However, for both groups, the data was best fit using a weighted combination of both models. For the Mandarin group this result along with additional analyses of duration, F1 and intensity suggest that tone categories involve both phonological and phonetic information and imitation taps both, possibly via hyper- and hypo-articulation. For English participants, the evidence for categorical mediation suggests that imitation is mediated by factors other than lexically contrastive linguistic categories, although the exact nature of the factors is unclear.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101457"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145473984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

F0 derivatives in the classification of meaningful tonal movements F0阶导数在有意义的调性动作分类中

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-11-01 DOI: 10.1016/j.wocn.2025.101454

Constantijn Kaland

Recent work applied cluster analysis on f0 contours in order to find ‘prototypical’ or ‘underlying’ categories as assumed in intonational phonology. However, it remains to be answered to what extent meaningful f0 variation can indeed be captured using automatic classification of surface realizations. Studies on f0 dynamics have suggested that derivatives (e.g., f0 velocity, acceleration and jerk) closely approximate the meaningful components of f0. The question answered in this study is to what extent f0 derivatives are more informative for cluster analysis than other metrics, such as the (time series) f0 contour they are derived from, a static measure representing it, or other acoustic measures such as intensity and duration. This is tested across two clustering techniques (hierarchical and k-medoids) for three different meaningful features expressed in Dutch noun phrases (of the type ‘blue sofa’): focus type (broad, narrow), focus position (adjective, noun) and phrase position (medial, final). Results show that derivatives are among the most informative acoustic measures, although the best performing cluster analyses are the ones based on multiple acoustic measures. Crucially, cluster analyses reveal that the different meaningful prosodic features each have their own characteristics in terms of acoustics and number of clusters.

最近的工作应用聚类分析的40轮廓，以找到“原型”或“潜在的”类别，假设在语调音系。然而，在多大程度上使用表面实现的自动分类确实可以捕获有意义的f0变化仍然有待回答。对f0动力学的研究表明，f0的导数（例如，f0的速度、加速度和加速度）与f0的有意义分量非常接近。本研究回答的问题是，对于聚类分析，f0导数在多大程度上比其他指标(如它们派生的（时间序列）f0轮廓，代表它的静态测量或其他声学测量（如强度和持续时间）提供更多信息。这是通过两种聚类技术（分层聚类和k-medoids聚类）对荷兰语名词短语（“蓝色沙发”类型）中表达的三种不同意义特征进行的测试：焦点类型（宽、窄）、焦点位置（形容词、名词）和短语位置（中间、结尾）。结果表明，尽管性能最好的聚类分析是基于多个声学测量的聚类分析，但导数是信息量最大的声学测量之一。重要的是，聚类分析表明，不同的有意义的韵律特征在声学和聚类数量方面都有自己的特点。

{"title":"F0 derivatives in the classification of meaningful tonal movements","authors":"Constantijn Kaland","doi":"10.1016/j.wocn.2025.101454","DOIUrl":"10.1016/j.wocn.2025.101454","url":null,"abstract":"<div><div>Recent work applied cluster analysis on f0 contours in order to find ‘prototypical’ or ‘underlying’ categories as assumed in intonational phonology. However, it remains to be answered to what extent meaningful f0 variation can indeed be captured using automatic classification of surface realizations. Studies on f0 dynamics have suggested that derivatives (e.g., f0 velocity, acceleration and jerk) closely approximate the meaningful components of f0. The question answered in this study is to what extent f0 derivatives are more informative for cluster analysis than other metrics, such as the (time series) f0 contour they are derived from, a static measure representing it, or other acoustic measures such as intensity and duration. This is tested across two clustering techniques (hierarchical and k-medoids) for three different meaningful features expressed in Dutch noun phrases (of the type ‘blue sofa’): focus type (broad, narrow), focus position (adjective, noun) and phrase position (medial, final). Results show that derivatives are among the most informative acoustic measures, although the best performing cluster analyses are the ones based on multiple acoustic measures. Crucially, cluster analyses reveal that the different meaningful prosodic features each have their own characteristics in terms of acoustics and number of clusters.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101454"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145473983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How pitch gestures facilitate L2 lexical tone learning: The role of L1–L2 perceptual assimilation in Mandarin speakers’ acquisition of Thai tones 音高手势如何促进二语词汇语调学习：L1-L2知觉同化在普通话使用者泰语语调习得中的作用

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-11-01 DOI: 10.1016/j.wocn.2025.101460

Keke Yu , Jie Zhang , Zilong Li , Xuliang Zhang , Yiyuan He , Li Li , Ruiming Wang

Lexical tone acquisition is a significant yet challenging aspect of learning a tonal language as a second language (L2). Embodied cognition theory offers a promising perspective by highlighting the role of pitch gestures in L2 lexical tone learning. Nevertheless, how pitch gestures facilitate L2 lexical tone acquisition, particularly in relation to native language (L1)–L2 perceptual assimilation patterns remain poorly understood. The present study recruited native Mandarin speakers to learn Thai lexical tones. First, we classified different types of Mandarin-Thai tone assimilation before learning. Next, we employed tone discrimination and identification tasks to investigate how pitch gestures facilitate the learning of Thai lexical tones with varying assimilation patterns. We compared three learning approaches, pitch gesture production, pitch feature observation, and word-picture association. The results revealed three Mandarin-Thai tone assimilation patterns: the Mid and Low Thai tones were assimilated to Mandarin Tone 1; the Falling Thai tone was assimilated to Mandarin Tone 4; the High and Rising Thai tones were assimilated to Mandarin Tone 2. Notably, the pitch gesture production approach enhanced learners’ ability to discriminate between Thai tones assimilated to different Mandarin tones, and identify Thai tones assimilated to Mandarin Tone 1 (Mid/Low) and Tone 4 (Falling). These findings indicate that while embodied experience through pitch gesture production facilitates L2 lexical tone acquisition, its efficacy is modulated by L1–L2 perceptual assimilation patterns. Based on these results, we propose an embodied learning viewpoint that incorporates L1 tonal experience, offering new insights into L2 lexical tone acquisition.

词汇语调习得是声调语言作为第二语言学习的一个重要而又具有挑战性的方面。具身认知理论强调了音高手势在二语词汇语调学习中的作用，为二语词汇语调学习提供了一个有希望的视角。然而，音高手势如何促进第二语言词汇语调习得，特别是与母语(L1) -第二语言感知同化模式的关系，仍然知之甚少。本研究招募了以普通话为母语的人来学习泰语词汇音调。首先，我们在学习前对汉语-泰语声调同化的不同类型进行了分类。接下来，我们采用音调辨别和识别任务来研究音调手势如何促进不同同化模式的泰语词汇音调的学习。我们比较了三种学习方法：音高手势产生、音高特征观察和词-图联想。结果揭示了三种普通话-泰语声调同化模式：中泰语声调和下泰语声调被同化为普通话声调1；泰国降调被普通话同化；泰国的高低声调被同化为普通话声调。值得注意的是，音高手势生成方法增强了学习者区分被不同普通话声调同化的泰语声调的能力，以及识别被普通话声调1（中/低）和声调4（降调）同化的泰语声调的能力。这些发现表明，虽然通过音高手势产生的具身经验促进了二语词汇语调习得，但其效果受到L1-L2知觉同化模式的调节。在此基础上，我们提出了一种结合母语声调经验的具身学习观点，为二语词汇声调习得提供了新的见解。

{"title":"How pitch gestures facilitate L2 lexical tone learning: The role of L1–L2 perceptual assimilation in Mandarin speakers’ acquisition of Thai tones","authors":"Keke Yu , Jie Zhang , Zilong Li , Xuliang Zhang , Yiyuan He , Li Li , Ruiming Wang","doi":"10.1016/j.wocn.2025.101460","DOIUrl":"10.1016/j.wocn.2025.101460","url":null,"abstract":"<div><div>Lexical tone acquisition is a significant yet challenging aspect of learning a tonal language as a second language (L2). Embodied cognition theory offers a promising perspective by highlighting the role of pitch gestures in L2 lexical tone learning. Nevertheless, how pitch gestures facilitate L2 lexical tone acquisition, particularly in relation to native language (L1)–L2 perceptual assimilation patterns remain poorly understood. The present study recruited native Mandarin speakers to learn Thai lexical tones. First, we classified different types of Mandarin-Thai tone assimilation before learning. Next, we employed tone discrimination and identification tasks to investigate how pitch gestures facilitate the learning of Thai lexical tones with varying assimilation patterns. We compared three learning approaches, pitch gesture production, pitch feature observation, and word-picture association. The results revealed three Mandarin-Thai tone assimilation patterns: the Mid and Low Thai tones were assimilated to Mandarin Tone 1; the Falling Thai tone was assimilated to Mandarin Tone 4; the High and Rising Thai tones were assimilated to Mandarin Tone 2. Notably, the pitch gesture production approach enhanced learners’ ability to discriminate between Thai tones assimilated to different Mandarin tones, and identify Thai tones assimilated to Mandarin Tone 1 (Mid/Low) and Tone 4 (Falling). These findings indicate that while embodied experience through pitch gesture production facilitates L2 lexical tone acquisition, its efficacy is modulated by L1–L2 perceptual assimilation patterns. Based on these results, we propose an embodied learning viewpoint that incorporates L1 tonal experience, offering new insights into L2 lexical tone acquisition.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101460"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145578709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bayesian beta regressions with brms in R: A tutorial for phoneticians 在R中使用brms的贝叶斯beta回归：语音专家教程

IF 2.4 1区文学 0 LANGUAGE & LINGUISTICS

Journal of Phonetics

Pub Date : 2025-11-01 DOI: 10.1016/j.wocn.2025.101455

Stefano Coretta , Paul Bürkner

Phonetic research frequently involves analyzing numeric continuous outcome variables, such as durations, frequencies, loudness, and ratios. Another commonly used outcome type is proportions, including measures like the proportion of voicing during closure, gesture amplitude, and nasalance. Despite their bounded nature, proportions are often modeled using Gaussian regression, largely due to the default settings of commonly used statistical functions in R (e.g., lm() and lmer() from lme4). This practice persists in teaching and research, despite the fact that Gaussian models assume unbounded continuous data and may poorly fit proportion data. To address this issue, this tutorial introduces beta regression models, a more appropriate statistical approach for analyzing proportions. The beta distribution provides a flexible framework for modelling continuous data constrained between 0 and 1. The tutorial employs the brms package in R and assumes familiarity with regression modeling but no prior knowledge of Bayesian statistics. The tutorial includes two case studies illustrating the practical implementation of Bayesian beta regression models. Data and code are available athttps://github.com/stefanocoretta/beta-phon.

语音研究经常涉及分析数字连续结果变量，如持续时间、频率、响度和比率。另一种常用的结果类型是比例，包括在结束时发声的比例、手势幅度和鼻子平衡等指标。尽管比例有界，但通常使用高斯回归建模，这主要是由于R中常用统计函数的默认设置（例如，lme4中的lm（）和lmer()）。这种做法在教学和研究中一直存在，尽管高斯模型假设的是无界的连续数据，可能难以拟合比例数据。为了解决这个问题，本教程介绍了beta回归模型，这是一种更适合分析比例的统计方法。beta分布提供了一个灵活的框架，用于建模约束在0和1之间的连续数据。本教程使用R中的brms包，假设您熟悉回归建模，但不具备贝叶斯统计的先验知识。本教程包括两个案例研究，说明贝叶斯beta回归模型的实际实现。数据和代码可从https://github.com/stefanocoretta/beta-phon获得。

{"title":"Bayesian beta regressions with brms in R: A tutorial for phoneticians","authors":"Stefano Coretta , Paul Bürkner","doi":"10.1016/j.wocn.2025.101455","DOIUrl":"10.1016/j.wocn.2025.101455","url":null,"abstract":"<div><div>Phonetic research frequently involves analyzing numeric continuous outcome variables, such as durations, frequencies, loudness, and ratios. Another commonly used outcome type is proportions, including measures like the proportion of voicing during closure, gesture amplitude, and nasalance. Despite their bounded nature, proportions are often modeled using Gaussian regression, largely due to the default settings of commonly used statistical functions in R (e.g., lm() and lmer() from lme4). This practice persists in teaching and research, despite the fact that Gaussian models assume unbounded continuous data and may poorly fit proportion data. To address this issue, this tutorial introduces beta regression models, a more appropriate statistical approach for analyzing proportions. The beta distribution provides a flexible framework for modelling continuous data constrained between 0 and 1. The tutorial employs the brms package in R and assumes familiarity with regression modeling but no prior knowledge of Bayesian statistics. The tutorial includes two case studies illustrating the practical implementation of Bayesian beta regression models. Data and code are available at<span><span>https://github.com/stefanocoretta/beta-phon</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"113 ","pages":"Article 101455"},"PeriodicalIF":2.4,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145528489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0