
Journal of Phonetics: Latest Articles

The role of prior knowledge in second-language learners’ overnight consolidation of Cantonese tones
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-05-10 DOI: 10.1016/j.wocn.2025.101417
Quentin Zhen Qin, Rui Jin, Ruofan Wu
This study examines the role of prior (tonal) knowledge in memory consolidation of non-native tones after an overnight sleep. While memory consolidation is beneficial in learning new sounds in a second language, only new linguistic information consistent with the existing knowledge is often prioritized for consolidation. What remains unclear from the research is whether prior tonal knowledge from a native language (i.e., pitch contour signaling the Mandarin contour-tone system) influences an overnight consolidation of tone learning. The study adopts an overnight design, using Cantonese contour and level tones contrasting in pitch contour and height, for two perceptual learning experiments conducted separately on Mandarin and English-speaking novice learners of Cantonese. The first experiment found that Mandarin-speaking participants showed a stronger effect of consolidation in novel words contrasting in contour tones than in level tones, thanks to their prior knowledge of contour tones. The consolidation effect was predicted by rough estimates of deep-sleep length. Without prior knowledge of tones, English-speaking L2 learners in the second experiment showed an (unexpected) offline improvement for both contour and level tones. Overall, the findings suggest a preferential effect on overnight consolidation of contour tones when the cues contrasting L2-Cantonese tones are consistent with L1-Mandarin prior knowledge.
Journal of Phonetics, Volume 111, Article 101417
Citations: 0
Advancements of phonetics in the 21st century: Quantitative data analysis
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-05-09 DOI: 10.1016/j.wocn.2025.101415
Morgan Sonderegger, Márton Sóskuthy
Phonetic research in the 21st century has relied heavily on quantitative analysis. This article reviews the evolution of common practices and the emergence of newer techniques. Using a detailed literature survey, we show that most work follows a mainstream, which has shifted from ANOVAs to mixed-effects regression models over time. Alongside this mainstream, we highlight the increasing use of a diverse methodological toolbox, especially Bayesian methods and dynamic methods, for which we provide comprehensive reviews. Bayesian methods, as well as frequentist methods beyond linear and logistic regression, offer flexibility in model specification, interpretation, and incorporation of prior knowledge. Dynamic methods, such as GAMs and functional data analysis, capture non-linear patterns in acoustic and articulatory data. Machine learning techniques, such as random forests, expand the questions and types of data phoneticians can analyze. We also discuss the growing importance of open science practices promoting replicability and transparency. We argue that the future lies in a diverse methodological toolbox, with techniques chosen based on research questions and data structure.
Journal of Phonetics, Volume 111, Article 101415
Citations: 0
Mapping to perceptual identification in Mandarin learners of English
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-04-24 DOI: 10.1016/j.wocn.2025.101411
Kenneth de Jong, Yu-Jung Lin, Yen-Chen Hao, Hanyong Park
This paper examines the relationship between cross-language segmental mapping and second language identification accuracy in Taiwan Mandarin speakers learning English, and compares this relationship with that found in previous, parallel research on Korean learners of English. Mapping and identification data were collected for English anterior plosives and non-sibilant fricatives, by means of two parallel identification experiments. Mapping data came from a 17-alternative identification task with Zhuyin Fuhao labels (phonetic script used to annotate Mandarin sounds in Taiwan), and identification data came from a 15-alternative identification task with Roman labels, both applied to the same stimuli. Mapping data were used to generate predictions about the identification performance by estimating what the performance would be, given the use of only the Mandarin categories. Like the previous Korean data, Mandarin speakers exhibited identification rates for plosives that are very close to predicted, indicating that their plosive identification performance was heavily entangled with their Mandarin system, while fricative identification performance was greatly under-predicted by the mapping data. Further analyses of category differentiation measured with d-prime estimates showed that Mandarin speakers’ manner differentiation performance was very well-predicted by the mapping data, while Korean speakers’ laryngeal differentiation was better predicted. Taken together, these results indicate that the second language identification performance and the cross-language mapping into the first language are closely entangled in a single system. The additional second language component appears in a pervasive increment in performance in the second language beyond what is predicted from the first language system, and in more unaccounted-for variance in laryngeal identification than in manner identification.
Journal of Phonetics, Volume 110, Article 101411
Citations: 0
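The d-prime measure mentioned in the abstract above can be illustrated with a small sketch. The abstract does not say how d-prime was estimated from the multi-alternative identification data, so the function below is only an assumption: it computes the textbook two-response sensitivity index, d' = z(hit rate) - z(false-alarm rate), with a log-linear correction that keeps rates away from 0 and 1. The function name `d_prime` and all counts are hypothetical.

```python
from statistics import NormalDist


def d_prime(hits, misses, false_alarms, correct_rejections):
    """Return the sensitivity index d' = z(hit rate) - z(false-alarm rate).

    Assumption (not taken from the paper): a log-linear correction adds 0.5
    to each count so that perfect or empty cells do not give infinite z-scores.
    """
    hit_rate = (hits + 0.5) / (hits + misses + 1.0)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1.0)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(hit_rate) - z(fa_rate)


# Hypothetical counts for one listener on one contrast (e.g. plosive vs. fricative).
observed = d_prime(hits=90, misses=10, false_alarms=10, correct_rejections=90)
chance = d_prime(hits=50, misses=50, false_alarms=50, correct_rejections=50)
print(f"observed d' = {observed:.2f}, chance-level d' = {chance:.2f}")
```

A larger d' means the two categories are better differentiated; comparing d' computed from observed responses with d' computed from mapping-based predictions is one way such a comparison could be set up.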
Contribution of F0 and phonation to tone perception in the Zaiwa language
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-04-23 DOI: 10.1016/j.wocn.2025.101413
Yao Lu, Changwei Liang, Jiangping Kong
Previous research on categorical perception of tone has primarily examined the influence of fundamental frequency (F0), while the role of phonation, though increasingly studied, remains underexplored. This study investigates the role of phonation and how it interacts with F0 cues in tone perception, using the Zaiwa language as a case study. Specifically, we examine the categorical perception of Tone 44 (produced with a pressed voice) and Tone 35 (produced with a modal voice). To achieve this, we first conducted an acoustic analysis of the Zaiwa tone system, which forms the basis for our novel method of speech synthesis. Using this method, we created six tonal continua between Tone 44 and Tone 35 by systematically modifying F0 alone, phonation alone, and both simultaneously. Native Zaiwa speakers then participated in an experiment using the categorical perception paradigm with these synthesized continua. The results indicate that the participants were unable to distinguish the phonemic categories of the two tones when only phonation was modified. While modifying F0 alone allowed for tone distinction, participants’ perception followed a continuous pattern. However, when both F0 and phonation were modified simultaneously, participants accurately identified the phonemic categories of tones and perceived the continuum between the two tones categorically. These findings suggest that both F0 and phonation serve as perceptual cues for distinguishing Tone 44 and Tone 35 in Zaiwa, with F0 as the primary cue and phonation as a secondary cue. However, phonation remains crucial, as its absence weakens the categorical perception of these tones.
Journal of Phonetics, Volume 110, Article 101413
Citations: 0
Corrigendum to “Towards a dynamical account of inter-segmental coordination” [J. Phon. 109 (2025) 101392]
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-04-17 DOI: 10.1016/j.wocn.2025.101414
Shihao Du, Stephan R. Kuberski, Adamantios I. Gafos
Journal of Phonetics, Volume 110, Article 101414
Citations: 0
Contextual and paradigmatic effects on suspended contrast across generations: The case of Cantonese pinjam revisited
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-04-10 DOI: 10.1016/j.wocn.2025.101412
Alan C.L. Yu, Vivian Guo Li, Peggy P.K. Mok
Suspended contrast refers to the phenomenon whereby sound change brings two phonemes into such close approximation that semantic contrast between them is suspended for native speakers of the language, without necessarily leading to complete merger or neutralization. The existence of suspended contrasts not only raises questions about the nature of the phonetics-phonology interface, but also for theories of sound change that assume sound change is biased toward selective maintenance of phonemes that contribute more to distinguishing existing lexical items in usage. Small differences supporting a suspended contrast are expected to disappear quickly given that they do not serve any apparent communicative functions. It remains a question whether a contrast can be suspended for a considerable period of time. This study revisits a case of suspended contrast in Cantonese between the lexical high rising tone and the high rising tone derived through morphological tone change (pinjam). We use an apparent-time approach to investigate the diachronic trajectory of this neutralization by comparing the distribution of this suspended contrast along both F0 and durational dimensions across two generations of Hong Kong Cantonese speakers. While this case of suspended tonal contrast has been in circulation for almost a century, our findings suggest that the distinction might be disappearing among the younger speakers. Only older speakers maintain a distinction between the lexical and derived rising tones, albeit in very restricted tonal contexts. The fact that this suspended tonal contrast exhibits great sensitivity to contextual and morphological influences may help explain the progression of this case of merger-in-progress.
Journal of Phonetics, Volume 110, Article 101412
Citations: 0
Normalization, essentialization, and the erasure of social and linguistic variation
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-04-09 DOI: 10.1016/j.wocn.2025.101409
Santiago Barreda
Linguists investigating the phonetic properties of vowels, e.g. height and frontness, often use normalization algorithms to remove ‘irrelevant’ variation from vowel formant data. The current conception and evaluation of these algorithms focuses on phonemic classification and the removal of ‘anatomical’ variation, an approach which suggests an essentialist perspective on linguistic variation and leads to the erasure and underreporting of linguistic and social information. Instead, it is suggested that for many purposes, researchers need algorithms that correctly represent phonetic information by removing only non-phonetic formant variation. Acoustic variation that does not affect phonetic properties is non-phonetic, making it ‘transparent’ to the linguistic system and incapable of communicating linguistic contrast. Evidence is presented that only the uniform scaling of formant patterns appears to be non-phonetic, indicating that uniform scaling normalization algorithms should be preferred. Finally, given that phonetic properties are products of human psychology that enter into experience only through perception, it is argued that the normalization algorithms used by phoneticians and sociolinguists should be thought of as models of human perception. The change to a perceptual and phonetic, rather than anatomical and phonemic, approach to normalization will promote more reliable and theoretically sound research outcomes, and better aligns with linguistic theory.
Journal of Phonetics, Volume 110, Article 101409
Citations: 0
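To make the idea of uniform scaling normalization in the abstract above more concrete, here is a minimal sketch. The abstract does not name a specific algorithm, so this is an assumption: it uses a single-parameter log-mean approach, in which each speaker's mean log formant value is subtracted from every log formant. That is equivalent to dividing all of a speaker's formants by one speaker-specific scale factor. The function name `uniform_scaling_normalize` and the formant values are hypothetical.

```python
import math


def uniform_scaling_normalize(formants_hz):
    """Normalize one speaker's vowel tokens with a single scale parameter.

    formants_hz: list of (F1, F2) tuples in Hz, all from the same speaker.
    Returns log formants with the speaker's grand mean log formant removed.
    Assumption: one log-mean parameter per speaker (illustrative only).
    """
    logs = [math.log(f) for token in formants_hz for f in token]
    mean_log = sum(logs) / len(logs)  # the speaker's single scaling parameter
    return [tuple(math.log(f) - mean_log for f in token) for token in formants_hz]


# Hypothetical data: speaker B's formants are speaker A's scaled uniformly by 1.2.
speaker_a = [(300.0, 2300.0), (700.0, 1200.0)]
speaker_b = [(f1 * 1.2, f2 * 1.2) for f1, f2 in speaker_a]

norm_a = uniform_scaling_normalize(speaker_a)
norm_b = uniform_scaling_normalize(speaker_b)
for token_a, token_b in zip(norm_a, norm_b):
    print(token_a, token_b)
```

Because the two hypothetical speakers differ only by a uniform scale factor, their normalized values coincide up to floating-point error; any remaining between-speaker difference in real data would be left in place rather than removed.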
The Gesture-Field-Register (GFR) framework for modeling F0 control
IF 1.9 Zone 1 (Literature) 0 LANGUAGE & LINGUISTICS Pub Date: 2025-03-26 DOI: 10.1016/j.wocn.2025.101410
Seung-Eun Kim, Sam Tilsen
In this study, we introduce an F0 modeling framework – which we refer to as the Gesture-Field-Register (GFR) framework – in which F0 production involves joint control of relatively generic intentions and how those intentions are mapped to physical F0 values. Building on Articulatory Phonology (AP) and Task Dynamics (TD), the GFR framework considers F0 gestures to be the fundamental units of F0 control. It further holds (i) that the dynamic target F0 state of a speaker is determined by the blending of F0 gestural targets in a planning field and (ii) that the gestural targets and dynamic targets are represented in normalized values which are converted to F0 in Hz via dynamic control of F0 register. We show how this framework accounts for a variety of empirical F0 patterns, and we present a case study that uses parameter optimization to analyze empirical F0 contours into a time series of gestural activation and register states. In doing so, we demonstrate that the framework allows for gestural targets to be invariant within an utterance, despite the fact that the surface contours are highly variable. Model code and examples for generating and fitting F0 contours are publicly available in Github and OSF repositories. Overall, the GFR framework provides a novel way of conceptualizing and modeling F0 control under AP/TD and further expands the AP/TD by incorporating the mechanisms of a planning field and dynamic register control.
Journal of Phonetics, Volume 110, Article 101410
Citations: 0
Processing pronunciation variation with independently mappable allophones
IF 1.9 Zone 1 Literature 0 LANGUAGE & LINGUISTICS Pub Date: 2025-03-20 DOI: 10.1016/j.wocn.2025.101402
Rachel Soo, Molly Babel
Sound change can present synchronic variation with categorical pronunciation variants. This is the case in Cantonese, where syllable-initial /n/ is merging with /l/, occasionally creating homophones (e.g., lou5 腦 “brain” / 老 “old”) and giving rise to [n]- and [l]-initial pronunciation variants that are allophones. This pronunciation variation offers insight into how variation is processed in spoken word recognition because [n] and [l] in Cantonese are not associated with an orthographic standard. Across four experiments, we examine the perception, recognition, and encoding of Cantonese [n] and [l], and use Bayesian analyses where gradient interpretations are more straightforward. We observe perceptual evidence that these allophones are distinguishable (Exp 2). In recognition (Exp 1) and encoding (Exp 3) paradigms, we find that the [n] and [l] allophones are processed neither equivalently nor distinctly when the targets bear the more common [l]-initial allophone. When the targets bear the [n]-initial allophone (Exp 4), we observe high error rates and somewhat contradictory results. Altogether, the results suggest that [n] and [l] are allophonic variants independently mapped to a phoneme, with connection strengths varying as a function of frequency, such that the more common [l]-initial pronunciation demonstrates an overall recognition advantage.
Citations: 0
The acquisition of Multicultural London English: Child and adolescent diphthong variation in West London
IF 1.9 Zone 1 Literature 0 LANGUAGE & LINGUISTICS Pub Date: 2025-03-01 DOI: 10.1016/j.wocn.2024.101388
Rosamund Oxbury , Matthew Hunt , Kathleen M. McCarthy
This study investigated Multicultural London English (MLE) diphthongs as produced by children and adolescents in the London borough of Ealing, UK. We conducted an acoustic analysis of the diphthongs FACE, PRICE and GOAT in the speech of 24 young people aged 16–24 years and 14 children aged 5–7 years. The results revealed different production patterns between the children and adolescents for some but not all of the diphthong variables. We found that the children’s and adolescents’ diphthongs were similar in the quality of the onset, and similar to the MLE system described in East London, in the London borough of Hackney. However, the children had not acquired monophthongization of the diphthongs, with adolescents producing significantly more monophthongal tokens of PRICE, GOAT and, to a lesser extent, FACE. These findings have implications both for the study of multiethnolects and MLE, and for research on children’s acquisition of sociophonetic variation.
Citations: 0