首页 > 最新文献

Journal of Phonetics最新文献

英文 中文
The contribution of the visual modality to vowel perception in native and non-native speakers 视觉形态对母语和非母语者元音感知的贡献
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2025-01-01 Epub Date: 2024-12-05 DOI: 10.1016/j.wocn.2024.101375
Sinéad M. Rankin, Maria-Josep Solé
This study examines how visual cues impact the intelligibility of foreign-accented speech for native listeners. English CVC words with vowels involving salient (e.g., lip-spreading for /iː/) and non-salient visual cues (neutral lips for /ɪ/) produced by two French speakers and a native English control, were presented to native English listeners who identified the word heard. Tokens were presented in both auditory-only and audiovisual (AV) mode in cafeteria noise at −15 dB SNR. The visual cues analysed were lip spreading, lip rounding, jaw opening and tongue frontness in vowels, as well as lip-rounding in schwa. Visually salient cues improved vowel intelligibility, compared to non-visual cues, but the audiovisual benefit varied across vowel features and speaker groups. The presence of lip-spreading for /iː/ (vs /ɪ/) and jaw-opening for /æ/ (vs /ɪ/) enhanced intelligibility (i.e., larger AV benefit) for both speaker groups. However, compared to the English speaker, lip-rounding in /ɔː/ and /ə/ produced by French speakers (likely accompanied by lip protrusion) had a smaller, or negative AV benefit. These results suggest that the influence of L1 gestures on L2 production may reduce or negatively affect intelligibility. Furthermore, French productions of /ɑː/ exhibited unusually high AV benefits, suggesting an extreme jaw-opening for this vowel in an attempt to distinguish between L2 contrasts (/æ ʌ ɑː/) not present in the L1.
本研究探讨视觉线索如何影响母语听众对外国口音演讲的可理解性。研究人员将两名法语使用者和一名英语母语对照者所产生的带有明显元音(例如,发/i / /时嘴唇张开)和非明显视觉线索(发/ / /时嘴唇中性)的英语CVC单词呈现给母语为英语的听众,让他们识别所听到的单词。在- 15 dB信噪比的自助餐厅噪声中,标记以纯听觉和视听(AV)模式呈现。分析的视觉线索包括元音中的嘴唇张开、嘴唇圆润、下巴张开和舌头正面,以及弱读音中的嘴唇圆润。与非视觉线索相比,视觉上显著的线索提高了元音的可理解性,但在不同的元音特征和说话者群体中,视听上的好处有所不同。对于两个说话群体来说,发/i / / (vs / / /)时嘴唇张开,发/æ/ (vs / / /)时下巴张开,提高了可理解性(即更大的AV优势)。然而,与说英语的人相比,说法语的人在发/[/]和/[/]音时发圆唇(可能伴随着嘴唇突出)对AV的好处更小,甚至是负面的。这些结果表明,母语手势对第二语言产生的影响可能会降低或负面影响可理解性。此外,法语中的/ j /音表现出了不同寻常的高音高优势,这表明这个元音在区分L2对比音(/ j / /)时,需要极大地张开下巴,而L2对比音(/ j / /)在L1中并不存在。
{"title":"The contribution of the visual modality to vowel perception in native and non-native speakers","authors":"Sinéad M. Rankin,&nbsp;Maria-Josep Solé","doi":"10.1016/j.wocn.2024.101375","DOIUrl":"10.1016/j.wocn.2024.101375","url":null,"abstract":"<div><div>This study examines how visual cues impact the intelligibility of foreign-accented speech for native listeners. English CVC words with vowels involving salient (e.g., lip-spreading for /iː/) and non-salient visual cues (neutral lips for /ɪ/) produced by two French speakers and a native English control, were presented to native English listeners who identified the word heard. Tokens were presented in both auditory-only and audiovisual (AV) mode in cafeteria noise at −15 dB SNR. The visual cues analysed were lip spreading, lip rounding, jaw opening and tongue frontness in vowels, as well as lip-rounding in schwa. Visually salient cues improved vowel intelligibility, compared to non-visual cues, but the audiovisual benefit varied across vowel features and speaker groups. The presence of lip-spreading for /iː/ (vs /ɪ/) and jaw-opening for /æ/ (vs /ɪ/) enhanced intelligibility (i.e., larger AV benefit) for both speaker groups. However, compared to the English speaker, lip-rounding in /ɔː/ and /ə/ produced by French speakers<!--> <!-->(likely accompanied by lip protrusion) had a smaller, or negative AV benefit. These results suggest that the influence of L1 gestures on L2 production may reduce or negatively affect intelligibility. Furthermore, French productions of /ɑː/ exhibited unusually high AV benefits, suggesting an extreme jaw-opening for this vowel in an attempt to distinguish between L2 contrasts (/æ ʌ ɑː/) not present in the L1.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"108 ","pages":"Article 101375"},"PeriodicalIF":1.9,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143144890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrast enhancement and the distribution of vowel duration in Japanese 日语中元音时长的对比增强与分布
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2025-01-01 Epub Date: 2024-12-24 DOI: 10.1016/j.wocn.2024.101386
Shin-ichiro Sano, Céleste Guillemot
This paper offers a case study of the role of information in patterns of segmental duration, focusing on the short vs. long vowel contrast in Japanese. We confirmed that the position of vowels in linguistic units (intonation phrase/IP and word) and the length of the units affect vowel duration, but the patterns differ depending on the type of units. The results suggest that at the word level, vowel duration patterns in such a way that cues for vowel contrast are enhanced in salient (earlier) positions and units with less predictability or more lexical competitors (shorter words), while cues are relatively reduced in non-salient (later) positions and units with more predictability or less lexical competitors (longer words). In this way, the variation in vowel duration balances the successful transmission of lexical information and the cost for phonetic implementation, which supports previous findings in communication-based approaches of language.
本文以日语短元音和长元音的对比为例,分析了信息在分音持续时间模式中的作用。我们证实,元音在语言单位(语调、短语/IP和单词)中的位置和单位的长度影响元音的持续时间,但模式因单位类型而异。结果表明,在单词水平上,元音持续时间的模式是这样的:元音对比的线索在显著(较早)位置和可预测性较低或词汇竞争对手较多的单元(较短的单词)中得到增强,而在非显著(较晚)位置和可预测性较强或词汇竞争对手较少的单元(较长单词)中相对减少。通过这种方式,元音持续时间的变化平衡了词汇信息的成功传递和语音实现的成本,这支持了先前基于交际的语言研究方法的发现。
{"title":"Contrast enhancement and the distribution of vowel duration in Japanese","authors":"Shin-ichiro Sano,&nbsp;Céleste Guillemot","doi":"10.1016/j.wocn.2024.101386","DOIUrl":"10.1016/j.wocn.2024.101386","url":null,"abstract":"<div><div>This paper offers a case study of the role of information in patterns of segmental duration, focusing on the short vs. long vowel contrast in Japanese. We confirmed that the position of vowels in linguistic units (intonation phrase/IP and word) and the length of the units affect vowel duration, but the patterns differ depending on the type of units. The results suggest that at the word level, vowel duration patterns in such a way that cues for vowel contrast are enhanced in salient (earlier) positions and units with less predictability or more lexical competitors (shorter words), while cues are relatively reduced in non-salient (later) positions and units with more predictability or less lexical competitors (longer words). In this way, the variation in vowel duration balances the successful transmission of lexical information and the cost for phonetic implementation, which supports previous findings in communication-based approaches of language.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"108 ","pages":"Article 101386"},"PeriodicalIF":1.9,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143144887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coarticulation and coordination in phonological development: Insights from children’s and adults’ production of complex–simplex stop contrasts in Gã 语音发展中的协同发音和协调:来自儿童和成人在Gã中复杂单音停顿对比的见解
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2025-01-01 Epub Date: 2024-12-19 DOI: 10.1016/j.wocn.2024.101378
Felix Kpogo , Charles B. Chang
Achieving adult-like coarticulation, which relies on precise gestural coordination, is known to be a challenging aspect of phonological development. Unique coordination challenges are posed by doubly articulated stops, typologically uncommon complex consonants that show crosslinguistic variation in their acoustic contrast with simplex (singly articulated) consonants. We examined the acoustics and development of complex–simplex stop contrasts between labio-velars (/k͡p/, /ɡ͡b/) and bilabials (/p/, /b/) in Gã (Niger-Congo, Kwa), with special attention to coarticulation with adjacent sonorants. We found that Gã adults mostly produced differences in voice onset time and closure duration to implement these contrasts, and Gã five-year-olds also produced differences in these dimensions. Crucially, however, five-year-olds also produced significant differences in onset formants, which adults did not. These findings provide evidence of age-graded variation in the implementation of complex–simplex stop contrasts in Gã, suggesting that over the course of development there may be a shift away from production of carryover coarticulatory differences toward greater reliance on durational differences. We argue that children’s initial reliance on carryover coarticulation capitalizes on a tendency toward greater consonant–vowel coarticulation as compared to adults, discussing implications for our understanding of how coarticulation develops.
实现成人般的协同发音依赖于精确的手势协调,这是语音发展的一个具有挑战性的方面。独特的协调挑战是由双铰接停止,在类型学上不常见的复杂辅音,表现出跨语言的变化,他们的声学对比单一(单铰接)辅音。我们研究了Gã(尼日尔-刚果语,克瓦语)中阴唇-舌尖(/k /p/, / j /b/)和双唇音(/p/, /b/)之间复杂-单音止音对比的声学和发展,特别注意与相邻的辅音的协同发音。我们发现Gã成年人大多在声音开始时间和关闭持续时间上产生差异来实现这些对比,Gã五岁儿童也在这些维度上产生差异。然而,至关重要的是,五岁儿童在发病谱上也有显著差异,而成年人则没有。这些发现为Gã中复杂-单一停止对比的实现提供了年龄分级差异的证据,表明在发育过程中,可能会从产生结转协同发音差异转向更大程度上依赖于持续差异。我们认为,与成人相比,儿童最初对结转协同发音的依赖利用了更大的辅音-元音协同发音的趋势,并讨论了我们对协同发音如何发展的理解的影响。
{"title":"Coarticulation and coordination in phonological development: Insights from children’s and adults’ production of complex–simplex stop contrasts in Gã","authors":"Felix Kpogo ,&nbsp;Charles B. Chang","doi":"10.1016/j.wocn.2024.101378","DOIUrl":"10.1016/j.wocn.2024.101378","url":null,"abstract":"<div><div>Achieving adult-like coarticulation, which relies on precise gestural coordination, is known to be a challenging aspect of phonological development. Unique coordination challenges are posed by doubly articulated stops, typologically uncommon complex consonants that show crosslinguistic variation in their acoustic contrast with simplex (singly articulated) consonants. We examined the acoustics and development of complex–simplex stop contrasts between labio-velars (/k͡p/, /ɡ͡b/) and bilabials (/p/, /b/) in Gã (Niger-Congo, Kwa), with special attention to coarticulation with adjacent sonorants. We found that Gã adults mostly produced differences in voice onset time and closure duration to implement these contrasts, and Gã five-year-olds also produced differences in these dimensions. Crucially, however, five-year-olds also produced significant differences in onset formants, which adults did not. These findings provide evidence of age-graded variation in the implementation of complex–simplex stop contrasts in Gã, suggesting that over the course of development there may be a shift away from production of carryover coarticulatory differences toward greater reliance on durational differences. We argue that children’s initial reliance on carryover coarticulation capitalizes on a tendency toward greater consonant–vowel coarticulation as compared to adults, discussing implications for our understanding of how coarticulation develops.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"108 ","pages":"Article 101378"},"PeriodicalIF":1.9,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143144894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The relation between perceptual retuning and articulatory restructuring: Individual differences in accommodating a novel phonetic variant 知觉重谐与发音重组之间的关系:适应新语音变体的个体差异
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-09-12 DOI: 10.1016/j.wocn.2024.101352
Patrice Speeter Beddor , Andries W. Coetzee , Ian Calloway , Stephen Tobin , Ruaridh Purse

When language users accommodate a novel phonetic variant, they adjust their perceptual and articulatory spaces in listener- and speaker-specific ways. Motivated by the centrality of accommodation and the perception-production relation to theories of phonetics and sound change, this study tests the hypothesis that individuals who are adept at perceptually retuning for a novel variant will be more accurate imitators of that form. In perceptual eye-tracking and spontaneous imitation ultrasound-imaging tasks, 37 American English participants were exposed to a talker’s novel raised /æ/ before /ɡ/ (bag), and to their familiar unraised /æk/ (back) and /eɪk/ (bake). Consistent with the hypothesis, results showed that the more participants showed perceptual facilitation (i.e., used raised /æ(ɡ)/ to disambiguate back-bag trials), the more they imitated raised /æ(ɡ)/. Perceptual retuning, though, did not predict articulatory restructuring: imitators produced not context-dependent raising, but more general “imitative” raising. For theories of sound change, the findings provide circumscribed support for especially adept perceptual adapters to an innovation having the potential to be strong disseminators of that variant. For theories of accommodation, findings point toward the importance of studying imitation of a targeted variant in the broader context of how talkers and imitators situate that variant in relation to phonetically similar forms.

当语言使用者适应一个新的语音变体时,他们会根据听者和说话者的具体情况调整他们的知觉和发音空间。在语音学和音变理论中,调适和感知-生成关系占据着核心地位,受此激励,本研究对以下假设进行了检验:善于对新变体进行感知调适的个体将更准确地模仿该形式。在知觉眼动跟踪和自发模仿超声波成像任务中,37 名美式英语参与者接触了说话者在 /ɡ/(bag)之前发出的新的上扬 /æ/,以及他们熟悉的未上扬 /æk/(back)和 /eɪk/(bake)。结果表明,参与者表现出的知觉促进越多(即使用提高的/æ(ɡ)/来消除背包试验的歧义),他们模仿提高的/æ(ɡ)/的次数就越多,这与假设是一致的。然而,知觉重谐并不能预测发音重组:模仿者产生的不是依赖于语境的提高,而是更普遍的 "模仿性 "提高。就声音变化理论而言,研究结果提供了有限的支持,即特别擅长感知适应创新的人有可能成为该变体的有力传播者。就调适理论而言,研究结果表明,在研究目标变体的模仿时,必须考虑到说话者和模仿者是如何将该变体与语音相似的形式联系起来的。
{"title":"The relation between perceptual retuning and articulatory restructuring: Individual differences in accommodating a novel phonetic variant","authors":"Patrice Speeter Beddor ,&nbsp;Andries W. Coetzee ,&nbsp;Ian Calloway ,&nbsp;Stephen Tobin ,&nbsp;Ruaridh Purse","doi":"10.1016/j.wocn.2024.101352","DOIUrl":"10.1016/j.wocn.2024.101352","url":null,"abstract":"<div><p>When language users accommodate a novel phonetic variant, they adjust their perceptual and articulatory spaces in listener- and speaker-specific ways. Motivated by the centrality of accommodation and the perception-production relation to theories of phonetics and sound change, this study tests the hypothesis that individuals who are adept at perceptually retuning for a novel variant will be more accurate imitators of that form. In perceptual eye-tracking and spontaneous imitation ultrasound-imaging tasks, 37 American English participants were exposed to a talker’s novel raised /æ/ before /ɡ/ (<em>bag</em>), and to their familiar unraised /æk/ (<em>back</em>) and /eɪk/ (<em>bake</em>). Consistent with the hypothesis, results showed that the more participants showed perceptual facilitation (i.e., used raised /æ(ɡ)/ to disambiguate <em>back-bag</em> trials), the more they imitated raised /æ(ɡ)/. Perceptual retuning, though, did not predict articulatory restructuring: imitators produced not context-dependent raising, but more general “imitative” raising. For theories of sound change, the findings provide circumscribed support for especially adept perceptual adapters to an innovation having the potential to be strong disseminators of that variant. For theories of accommodation, findings point toward the importance of studying imitation of a targeted variant in the broader context of how talkers and imitators situate that variant in relation to phonetically similar forms.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101352"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0095447024000585/pdfft?md5=897dce2fd42f59ca368b5e7dc02d21a1&pid=1-s2.0-S0095447024000585-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142173205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Code-switching experience as a mitigating factor for cross-linguistic phonetic interference 作为跨语言语音干扰缓解因素的代码转换经验
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-09-18 DOI: 10.1016/j.wocn.2024.101356
Daniel J. Olson , Yuhyeon Seo

Abstract

The present study investigates the extent to which code-switching experience modulates short-term cross-linguistic phonetic interference. Three experiments were conducted, each examining a different acoustic parameter in the context of code-switching, a dual language paradigm previously shown to enhance cross-linguistic phonetic interference. Bilinguals’ prior experience with code-switching was assessed using the Bilingual Code-Switching Profile. In Experiment 1, Korean–English bilinguals’ productions of F1 and F2 for the code-switched English vowel /æ/ were compared to monolingual (i.e., non-switched) Korean /e/ and English /æ/. While code-switched English vowels shifted in the direction of monolingual Korean vowels, the results suggest that bilinguals with more code-switching experience exhibited reduced cross-linguistic interference relative to those with less experience. In Experiments 2 and 3, Spanish–English bilinguals’ productions of fricative voicing (Experiment 2) and spirantization of intervocalic voiced stops (Experiment 3) in Spanish and English code-switched tokens were compared to monolingual tokens. Results suggest that participants with greater code-switching experience produced less evidence of cross-linguistic phonetic interference for both fricative voicing and intervocalic spirantization. Collectively, and suggesting a role for executive control mechanisms at the phonetic level, these findings illustrate that code-switching experience serves to mitigate short-term cross-linguistic interference.

摘要 本研究调查了代码转换经验对短期跨语言语音干扰的调节程度。本研究共进行了三项实验,每项实验都在代码转换的背景下考察了不同的声学参数。双语者之前的代码转换经验是通过双语代码转换档案来评估的。在实验 1 中,韩英双语者对代码转换英语元音 /æ/ 的 F1 和 F2 的发音与单语(即无代码转换)韩语 /e/ 和英语 /æ/ 的发音进行了比较。虽然代码转换的英语元音向单语韩语元音的方向移动,但结果表明,相对于经验较少的人来说,代码转换经验较多的二语者表现出的跨语言干扰较少。在实验 2 和 3 中,将西英双语者在西班牙语和英语代码转换标记中的摩擦音发声(实验 2)和声间带音停顿的螺旋化(实验 3)与单语标记进行了比较。结果表明,有较多代码转换经验的参与者在摩擦音发声和声间螺旋化方面产生的跨语言语音干扰证据较少。总之,这些研究结果表明,代码转换经验可以减轻短期的跨语言语音干扰,同时也表明了执行控制机制在语音水平上的作用。
{"title":"Code-switching experience as a mitigating factor for cross-linguistic phonetic interference","authors":"Daniel J. Olson ,&nbsp;Yuhyeon Seo","doi":"10.1016/j.wocn.2024.101356","DOIUrl":"10.1016/j.wocn.2024.101356","url":null,"abstract":"<div><h3>Abstract</h3><p>The present study investigates the extent to which code-switching experience modulates short-term cross-linguistic phonetic interference. Three experiments were conducted, each examining a different acoustic parameter in the context of code-switching, a dual language paradigm previously shown to enhance cross-linguistic phonetic interference. Bilinguals’ prior experience with code-switching was assessed using the Bilingual Code-Switching Profile. In Experiment 1, Korean–English bilinguals’ productions of F1 and F2 for the code-switched English vowel /æ/ were compared to monolingual (i.e., non-switched) Korean /e/ and English /æ/. While code-switched English vowels shifted in the direction of monolingual Korean vowels, the results suggest that bilinguals with more code-switching experience exhibited reduced cross-linguistic interference relative to those with less experience. In Experiments 2 and 3, Spanish–English bilinguals’ productions of fricative voicing (Experiment 2) and spirantization of intervocalic voiced stops (Experiment 3) in Spanish and English code-switched tokens were compared to monolingual tokens. Results suggest that participants with greater code-switching experience produced less evidence of cross-linguistic phonetic interference for both fricative voicing and intervocalic spirantization. Collectively, and suggesting a role for executive control mechanisms at the phonetic level, these findings illustrate that code-switching experience serves to mitigate short-term cross-linguistic interference.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101356"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142241576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relating pronunciation distance metrics to intelligibility across English accents 将发音距离指标与不同英语口音的可懂度联系起来
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-09-17 DOI: 10.1016/j.wocn.2024.101357
Tessa Bent , Malachi Henry , Rachael F. Holt , Holly Lind-Combs

Unfamiliar accents can cause word recognition challenges, particularly in noisy environments, but few studies have incorporated quantitative pronunciation distance metrics to explain intelligibility differences across accents. To address this gap, intelligibility was measured for 18 talkers -- two from each of three first-language, one bilingual, and five second-language accents -- in quiet and two noise conditions. The relations between two edit distance metrics, which quantify phonetic differences from a reference accent, and intelligibility scores were assessed. Intelligibility was quantified through both fuzzy string matching and percent words correct. Both edit distance metrics were significantly related to intelligibility scores; a heuristic edit distance metric was the best predictor of intelligibility for both scoring methods. Further, there were stronger effects of edit distance as the listening condition increased in difficulty. Talker accent also contributed substantially to intelligibility models, but relations between accent and edit distance did not consistently pattern for the two talkers representing each accent. Frequency of production differences in vowels and consonants was negatively correlated with intelligibility, particularly for consonants. Together, these results suggest that significant amounts of variability in intelligibility across accents can be predicted by phonetic differences from the listener’s home accent. However, talker- and accent-specific pronunciation features, including suprasegmental characteristics, must be quantified to fully explain intelligibility across talkers and listening conditions.

不熟悉的口音会给单词识别带来困难,尤其是在嘈杂的环境中,但很少有研究采用定量发音距离指标来解释不同口音之间的可懂度差异。为了弥补这一不足,研究人员在安静和噪音两种环境下测量了 18 位说话者的可懂度,其中三种第一语言口音、一种双语口音和五种第二语言口音各占两位。两个编辑距离指标量化了与参考口音的语音差异,评估了这两个指标与可懂度得分之间的关系。可懂度通过模糊字符串匹配和单词正确率进行量化。两种编辑距离指标都与可懂度得分有显著关系;启发式编辑距离指标是两种评分方法中预测可懂度的最佳指标。此外,随着听力条件难度的增加,编辑距离的影响也越来越大。说话者的口音对可懂度模型也有很大影响,但口音和编辑距离之间的关系在代表每种口音的两个说话者身上并没有一致的模式。元音和辅音的发音差异频率与可懂度呈负相关,尤其是辅音。总之,这些结果表明,不同口音之间的可懂度差异可以通过听者家乡口音的语音差异来预测。然而,要全面解释不同说话者和听力条件下的可懂度,还必须量化说话者和口音的特定发音特征,包括超音段特征。
{"title":"Relating pronunciation distance metrics to intelligibility across English accents","authors":"Tessa Bent ,&nbsp;Malachi Henry ,&nbsp;Rachael F. Holt ,&nbsp;Holly Lind-Combs","doi":"10.1016/j.wocn.2024.101357","DOIUrl":"10.1016/j.wocn.2024.101357","url":null,"abstract":"<div><p>Unfamiliar accents can cause word recognition challenges, particularly in noisy environments, but few studies have incorporated quantitative pronunciation distance metrics to explain intelligibility differences across accents. To address this gap, intelligibility was measured for 18 talkers -- two from each of three first-language, one bilingual, and five second-language accents -- in quiet and two noise conditions. The relations between two edit distance metrics, which quantify phonetic differences from a reference accent, and intelligibility scores were assessed. Intelligibility was quantified through both fuzzy string matching and percent words correct. Both edit distance metrics were significantly related to intelligibility scores; a heuristic edit distance metric was the best predictor of intelligibility for both scoring methods. Further, there were stronger effects of edit distance as the listening condition increased in difficulty. Talker accent also contributed substantially to intelligibility models, but relations between accent and edit distance did not consistently pattern for the two talkers representing each accent. Frequency of production differences in vowels and consonants was negatively correlated with intelligibility, particularly for consonants. Together, these results suggest that significant amounts of variability in intelligibility across accents can be predicted by phonetic differences from the listener’s home accent. However, talker- and accent-specific pronunciation features, including suprasegmental characteristics, must be quantified to fully explain intelligibility across talkers and listening conditions.</p></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101357"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0095447024000639/pdfft?md5=8f68608c6b33b9f8c629933aabc9e62b&pid=1-s2.0-S0095447024000639-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142241716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of word-level structure on oral stop realization in Hawaiian 词级结构对夏威夷语口语停顿实现的影响
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-10-23 DOI: 10.1016/j.wocn.2024.101367
Lisa Davidson , Oiwi Parker Jones
Hawaiian, or ‘Ōlelo Hawaiʻi, is an Eastern Polynesian language spoken on the islands of Hawaiʻi. This study examines oral stops in Hawaiian as produced by speakers on the 1970–80s radio program Ka Leo Hawaiʻi, first, to establish whether the voiceless stops of this generation of Hawaiian speakers were aspirated or unaspirated, and second, to determine whether word-level prosodic structure has an effect on either the implementation of voice onset time (VOT) or closure duration, or how often stops are lenited. Hawaiian has only two primary oral stops /p k/ ([t] is a rare variant in these speakers’ dialects), and the results indicate that /p/ and /k/ are unaspirated for these speakers. Prosodic influences are examined by coding each stop for lexical word position (initial, medial), prosodic word position (initial, medial) and whether it is in a primary stressed, secondary stressed, or unstressed syllable. Results indicate that prosodic word initial position conditions both longer VOT, closure duration, and fewer lenited productions, separately from lexical word position. Moreover, there is an interaction indicating that word initial position leads to longer VOT in unstressed and secondarily stressed syllables, but not in syllables with primary stress. These results are discussed with respect to the prosodic and phonotactic structure of Hawaiian, which may rely on acoustic cues for the disambiguation of prosodic structure and segmentation of lexical items.
夏威夷语,或称 "Ōlelo Hawaiʻi",是一种在夏威夷群岛上使用的东波利尼西亚语。本研究考察了 1970-80 年代广播节目《Ka Leo Hawaiʻi》中的夏威夷语口语停顿,首先确定这一代夏威夷语使用者的无声停顿是有吸气的还是无吸气的,其次确定单词层面的前音结构是否会影响语音起始时间(VOT)或闭合持续时间的实施,或停顿变长的频率。夏威夷语只有两个主要的口语停顿/p k/([t]在这些说话者的方言中是一个罕见的变体),结果表明/p/和/k/对这些说话者来说是不吸气的。通过对每个停顿词的词性词位(初始词位、中间词位)、前置词位(初始词位、中间词位)以及它是在主要重读音节、次要重读音节还是在非重读音节中进行编码,考察了前置词的影响。结果表明,与词性位置不同,前置词初始位置的条件是较长的 VOT、闭合持续时间和较少的连读。此外,还存在交互作用,表明在非重音和次重音音节中,词的初始位置会导致更长的 VOT,而在主重音音节中则不会。这些结果与夏威夷语的拟声和语音战术结构有关,夏威夷语的拟声和语音战术结构可能依赖于声学线索来区分拟声结构和词项的分段。
{"title":"Effects of word-level structure on oral stop realization in Hawaiian","authors":"Lisa Davidson ,&nbsp;Oiwi Parker Jones","doi":"10.1016/j.wocn.2024.101367","DOIUrl":"10.1016/j.wocn.2024.101367","url":null,"abstract":"<div><div>Hawaiian, or ‘Ōlelo Hawaiʻi, is an Eastern Polynesian language spoken on the islands of Hawaiʻi. This study examines oral stops in Hawaiian as produced by speakers on the 1970–80s radio program Ka Leo Hawaiʻi, first, to establish whether the voiceless stops of this generation of Hawaiian speakers were aspirated or unaspirated, and second, to determine whether word-level prosodic structure has an effect on either the implementation of voice onset time (VOT) or closure duration, or how often stops are lenited. Hawaiian has only two primary oral stops /p k/ ([t] is a rare variant in these speakers’ dialects), and the results indicate that /p/ and /k/ are unaspirated for these speakers. Prosodic influences are examined by coding each stop for lexical word position (initial, medial), prosodic word position (initial, medial) and whether it is in a primary stressed, secondary stressed, or unstressed syllable. Results indicate that prosodic word initial position conditions both longer VOT, closure duration, and fewer lenited productions, separately from lexical word position. Moreover, there is an interaction indicating that word initial position leads to longer VOT in unstressed and secondarily stressed syllables, but not in syllables with primary stress. These results are discussed with respect to the prosodic and phonotactic structure of Hawaiian, which may rely on acoustic cues for the disambiguation of prosodic structure and segmentation of lexical items.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101367"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142531424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Formant-based articulatory strategies: Characterisation and inter-speaker variability analysis 基于构音的发音策略:特征描述和说话者之间的差异分析
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-11-08 DOI: 10.1016/j.wocn.2024.101374
Antoine Serrurier, Christiane Neuschaefer-Rube
Vowels are articulatorily characterised by the shape of the vocal tract and acoustically by their three lowest formants. The relationship between formant variations and articulatory variations is well documented. This study addresses the opposite problem: describing the main articulatory variations associated with the variations of single formants. A data-driven modelling-based approach was chosen for this purpose. Midsagittal vocal tract contours from the glottis to the lips for 532 vowels from 41 speakers of three different languages were obtained from MRI data. Corresponding formant values were obtained by acoustic modelling. For each speaker, linear regressions of the contours on the formant values were performed. It led to five articulatory components, characterising the vocal tract variations associated with variations of the first three formants and their differences. Inter-speaker variability was analysed by applying principal components analysis on the components in a second level of modelling. A correlation analysis of the resulting inter-speaker components with morphological features was performed to determine whether a speaker’s strategy could be driven by the morphology. Results show that the palate shape and the vertical pharyngeal height, related to the male–female difference, have a small influence on the speaker’s strategy. Associated Matlab code is publicly available.
元音的发音特征是声道的形状,声学特征是其三个最低的声母。关于声母的声调变化与发音变化之间的关系,已有大量文献记载。本研究解决的是相反的问题:描述与单个声母变化相关的主要发音变化。为此,我们选择了一种基于数据驱动的建模方法。从核磁共振成像数据中获得了三种不同语言的 41 位说话者的 532 个元音从声门到嘴唇的声道中矢状面轮廓。通过声学建模获得了相应的声门值。对每个说话者的声带轮廓和声门值进行了线性回归。结果得出了五个发音成分,描述了与前三个声母变化及其差异相关的声道变化。在第二层建模中,通过对这些成分进行主成分分析,分析了扬声器间的变异性。对所得的说话者间成分与形态特征进行了相关分析,以确定说话者的策略是否由形态驱动。结果表明,与男女差异有关的上颚形状和咽部垂直高度对说话者的策略影响较小。相关的 Matlab 代码已公开。
{"title":"Formant-based articulatory strategies: Characterisation and inter-speaker variability analysis","authors":"Antoine Serrurier,&nbsp;Christiane Neuschaefer-Rube","doi":"10.1016/j.wocn.2024.101374","DOIUrl":"10.1016/j.wocn.2024.101374","url":null,"abstract":"<div><div>Vowels are articulatorily characterised by the shape of the vocal tract and acoustically by their three lowest formants. The relationship between formant variations and articulatory variations is well documented. This study addresses the opposite problem: describing the main articulatory variations associated with the variations of single formants. A data-driven modelling-based approach was chosen for this purpose. Midsagittal vocal tract contours from the glottis to the lips for 532 vowels from 41 speakers of three different languages were obtained from MRI data. Corresponding formant values were obtained by acoustic modelling. For each speaker, linear regressions of the contours on the formant values were performed. It led to five articulatory components, characterising the vocal tract variations associated with variations of the first three formants and their differences. Inter-speaker variability was analysed by applying principal components analysis on the components in a second level of modelling. A correlation analysis of the resulting inter-speaker components with morphological features was performed to determine whether a speaker’s strategy could be driven by the morphology. Results show that the palate shape and the vertical pharyngeal height, related to the male–female difference, have a small influence on the speaker’s strategy. Associated Matlab code is publicly available.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101374"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrigendum to “Being clear about clear speech: Intelligibility of hard-of-hearing-directed, non-native-directed, and casual speech for L1- and L2-English listeners” [J. Phonet. 104 (2024) 101328] “明确清晰的言语:听力困难导向、非母语导向和随意话语对母语和l2英语听者的可理解性”[J]。电话。104 (2024)101328]
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-10-21 DOI: 10.1016/j.wocn.2024.101373
Nicholas B. Aoki, Georgia Zellou
{"title":"Corrigendum to “Being clear about clear speech: Intelligibility of hard-of-hearing-directed, non-native-directed, and casual speech for L1- and L2-English listeners” [J. Phonet. 104 (2024) 101328]","authors":"Nicholas B. Aoki,&nbsp;Georgia Zellou","doi":"10.1016/j.wocn.2024.101373","DOIUrl":"10.1016/j.wocn.2024.101373","url":null,"abstract":"","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101373"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143098481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the target of phonetic convergence: Acoustic and linguistic aspects of pitch accent imitation 语音趋同的目标:音高重音模仿的声学和语言学方面
IF 1.9 1区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-11-01 Epub Date: 2024-11-06 DOI: 10.1016/j.wocn.2024.101372
Kuniko Nielsen , Rebecca Scarborough
Studies in phonetic convergence have shown that speakers imitate phonetic patterns to which they are exposed (e.g., Pardo et al., 2013; Nielsen, 2011). However, it is less clear what aspects of the speech signal speakers are responding to when they change their speech behavior and to what extent: specific acoustic values, linguistically-interpreted targets, or some combination. To explore this issue, we conducted two imitation experiments, one in which participants were exposed to a linguistically-salient manipulated pitch accent realization, and one in which participants were exposed to a linguistically-less-salient overall pitch difference. Phonetic convergence might target acoustic values of the speech signal, leading participants to converge acoustically toward the low f0 and fast speech rate of the model taker. On the other hand, phonetic imitation might target linguistic patterns in the stimuli, leading participants to imitate the enhanced pitch accent in f0 and duration, even if it would result in acoustic divergence.
Our results show that it is both: participants converged to the model talker in linguistically-interpreted targets (pitch accent type, relative rise of the peak f0 on L + H*, and relative duration of the target phrase) and acoustic values (carrier phrase f0 and duration). However, our data indicated that higher-level linguistically-interpreted convergence may be manifested more robustly than low-level convergence. We also observed an asymmetry in convergence robustness between f0 and duration, as well as individual difference in the patterns of convergence, which suggest that there are constraints on convergence related to structural properties in the stimuli as well as speakers’ characteristics in perception and production.
语音趋同研究表明,说话者会模仿他们所接触到的语音模式(例如,Pardo 等人,2013;Nielsen,2011)。然而,目前还不太清楚说话者在改变其说话行为时是在对语音信号的哪些方面做出反应,以及在多大程度上做出反应:特定的声学值、语言解释的目标,还是某些组合。为了探究这个问题,我们进行了两项模仿实验,一项是让受试者接触语言学上有意义的受控音高重音实现,另一项是让受试者接触语言学上无意义的整体音高差异。语音趋同可能以语音信号的声学值为目标,导致参与者在声学上趋向于模范模仿者的低 f0 和快语速。我们的研究结果表明,两者兼而有之:参与者在语言解释目标(音调重音类型、L + H* 峰值 f0 的相对上升和目标短语的相对持续时间)和声学值(载体短语的 f0 和持续时间)上都向模范说话者靠拢。然而,我们的数据表明,较高层次的语言解释趋同可能比低层次的趋同表现得更为明显。我们还观察到,f0 和持续时间之间的收敛稳健性不对称,收敛模式也存在个体差异,这表明收敛受到刺激结构属性以及说话者感知和生产特征的制约。
{"title":"On the target of phonetic convergence: Acoustic and linguistic aspects of pitch accent imitation","authors":"Kuniko Nielsen ,&nbsp;Rebecca Scarborough","doi":"10.1016/j.wocn.2024.101372","DOIUrl":"10.1016/j.wocn.2024.101372","url":null,"abstract":"<div><div>Studies in phonetic convergence have shown that speakers imitate phonetic patterns to which they are exposed (e.g., Pardo et al., 2013; Nielsen, 2011). However, it is less clear what aspects of the speech signal speakers are responding to when they change their speech behavior and to what extent: specific acoustic values, linguistically-interpreted targets, or some combination. To explore this issue, we conducted two imitation experiments, one in which participants were exposed to a linguistically-salient manipulated pitch accent realization, and one in which participants were exposed to a linguistically-less-salient overall pitch difference. Phonetic convergence might target acoustic values of the speech signal, leading participants to converge acoustically toward the low f0 and fast speech rate of the model taker. On the other hand, phonetic imitation might target linguistic patterns in the stimuli, leading participants to imitate the enhanced pitch accent in f0 and duration, even if it would result in acoustic divergence.</div><div>Our results show that it is both: participants converged to the model talker in linguistically-interpreted targets (pitch accent type, relative rise of the peak f0 on L + H*, and relative duration of the target phrase) and acoustic values (carrier phrase f0 and duration). However, our data indicated that higher-level linguistically-interpreted convergence may be manifested more robustly than low-level convergence. We also observed an asymmetry in convergence robustness between f0 and duration, as well as individual difference in the patterns of convergence, which suggest that there are constraints on convergence related to structural properties in the stimuli as well as speakers’ characteristics in perception and production.</div></div>","PeriodicalId":51397,"journal":{"name":"Journal of Phonetics","volume":"107 ","pages":"Article 101372"},"PeriodicalIF":1.9,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142593950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Phonetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1