Pub Date : 2024-08-07DOI: 10.1177/00238309241255319
Jonathan Barnes, Alejna Brugos, Nanette Veilleux, Stefanie Shattuck-Hufnagel
Researchers investigating a broad array of questions in spoken language prosody routinely base their arguments on measurements taken from the F0 contours of representative speech samples. These analyses, however, frequently involve abstracting F0 contours away from the segmental strings that bear them, potentially overlooking in the process the role played by segmental qualities such as sonority or periodicity in the realization of F0 patterns by speakers and their interpretation by listeners. This paper reports the results of two experiments investigating how perception of F0 contours is affected by the segmental string over which those contours are realized. The first focuses on gaps in F0 contours created by voiceless obstruents such as stops and fricatives, while the second investigates F0 intervals spanning lower-sonority voiced segments, such as nasals and voiced fricatives. While these two scenarios might at first seem unrelated, we argue that listeners treat both with a single mechanism in perception, namely, by reducing (potentially to zero) the amount of weight accorded to those portions of the contour for determination of the speaker's intended F0 scaling level. We present an account of both effects within a unified model of F0 scaling perception called TCoG-F, with discussion of its implications for phonetic and phonological intonation research going forward.
{"title":"Segmental Influences on the Perception of High Pitch Accent Scaling in American English.","authors":"Jonathan Barnes, Alejna Brugos, Nanette Veilleux, Stefanie Shattuck-Hufnagel","doi":"10.1177/00238309241255319","DOIUrl":"https://doi.org/10.1177/00238309241255319","url":null,"abstract":"<p><p>Researchers investigating a broad array of questions in spoken language prosody routinely base their arguments on measurements taken from the F0 contours of representative speech samples. These analyses, however, frequently involve abstracting F0 contours away from the segmental strings that bear them, potentially overlooking in the process the role played by segmental qualities such as sonority or periodicity in the realization of F0 patterns by speakers and their interpretation by listeners. This paper reports the results of two experiments investigating how perception of F0 contours is affected by the segmental string over which those contours are realized. The first focuses on gaps in F0 contours created by voiceless obstruents such as stops and fricatives, while the second investigates F0 intervals spanning lower-sonority voiced segments, such as nasals and voiced fricatives. While these two scenarios might at first seem unrelated, we argue that listeners treat both with a single mechanism in perception, namely, by reducing (potentially to zero) the amount of weight accorded to those portions of the contour for determination of the speaker's intended F0 scaling level. We present an account of both effects within a unified model of F0 scaling perception called TCoG-F, with discussion of its implications for phonetic and phonological intonation research going forward.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241255319"},"PeriodicalIF":1.1,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141903521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-07DOI: 10.1177/00238309241256653
Mónica Anna Wagner, Mirjam Broersma, James M McQueen, Roeland van Hout, Kristin Lemhöfer
Research with nonnative speech spans many different linguistic branches and topics. Most studies include one or a few well-known features of a particular accent. However, due to a lack of empirical studies, little is known about how common these features are among nonnative speakers or how uncommon they are among native speakers. Moreover, it remains to be seen whether findings from such studies generalize to lesser-known features. Here, we demonstrate a quantitative approach to study nonnative accent features using Dutch-accented English as an example. By analyzing the phonetic distances between transcriptions of speech samples, this approach can identify the features that best distinguish nonnative from native speech. In addition, we describe a method to test hypotheses about accent features by checking whether the prevalence of the features overall varies between native and nonnative speakers. Furthermore, we include English speakers from the United States and United Kingdom and native Dutch speakers from Belgium and The Netherlands to address the issue of regional accent variability in both the native and target language. We discuss the results concerning three observed features. Overall, the results provide empirical support for some well-known features of Dutch-accented English, but suggest that others may be infrequent among nonnatives or in fact frequent among natives. In addition, the findings reveal potentially new accent features, and factors that may modulate the expression of known features. Our study demonstrates a fruitful approach to study nonnative accent features that has the potential to expand our understanding of the phenomenon of accent.
{"title":"The Case for a Quantitative Approach to the Study of Nonnative Accent Features.","authors":"Mónica Anna Wagner, Mirjam Broersma, James M McQueen, Roeland van Hout, Kristin Lemhöfer","doi":"10.1177/00238309241256653","DOIUrl":"https://doi.org/10.1177/00238309241256653","url":null,"abstract":"<p><p>Research with nonnative speech spans many different linguistic branches and topics. Most studies include one or a few well-known features of a particular accent. However, due to a lack of empirical studies, little is known about how common these features are among nonnative speakers or how uncommon they are among native speakers. Moreover, it remains to be seen whether findings from such studies generalize to lesser-known features. Here, we demonstrate a quantitative approach to study nonnative accent features using Dutch-accented English as an example. By analyzing the phonetic distances between transcriptions of speech samples, this approach can identify the features that best distinguish nonnative from native speech. In addition, we describe a method to test hypotheses about accent features by checking whether the prevalence of the features overall varies between native and nonnative speakers. Furthermore, we include English speakers from the United States and United Kingdom and native Dutch speakers from Belgium and The Netherlands to address the issue of regional accent variability in both the native and target language. We discuss the results concerning three observed features. Overall, the results provide empirical support for some well-known features of Dutch-accented English, but suggest that others may be infrequent among nonnatives or in fact frequent among natives. In addition, the findings reveal potentially new accent features, and factors that may modulate the expression of known features. Our study demonstrates a fruitful approach to study nonnative accent features that has the potential to expand our understanding of the phenomenon of accent.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241256653"},"PeriodicalIF":1.1,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141898936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-31DOI: 10.1177/00238309241264296
Murray J Munro, Tracey M Derwing, Kazuya Saito
This short report describes a longitudinal examination of the acquisition of English-aspirated stops by an initial cohort of 24 adult Slavic-language (Russian, Ukrainian, and Croatian) speakers. All had arrived in Canada with low oral English proficiency, and all were enrolled in the same language instruction program at the outset. Initial bilabial stops in CVCs were recorded at eight testing times: six during the first year of the study, again at year 7, and finally at year 10. Intelligibility was evaluated through a blind listening assessment of the stop productions from the first seven testing times. Voice onset times (VOT) were measured for /p/ from all eight times. Mean /p/ intelligibility improved-mainly during a proposed Window of Maximal Opportunity for L2 speech acquisition-but remained below 100%, even after 7 years. For some speakers, early /p/ productions were minimally aspirated, with VOT increasing over time but remaining intermediate between L1 English and L1 Slavic-Language values at 10 years. However, inter-speaker variability was dramatic, with some speakers showing full intelligibility throughout the study and others showing many unintelligible productions at all times. Individual learning trajectories tended to be non-linear and often non-cumulative. Overall, these findings point to a developmental process that varies considerably from one learner to another. It also demonstrates the serious drawbacks of relying on group means to characterize the process of L2 segmental learning.
{"title":"Aspiring to Aspirate: L2 Acquisition of English Word-Initial /p/ Over 10 Years.","authors":"Murray J Munro, Tracey M Derwing, Kazuya Saito","doi":"10.1177/00238309241264296","DOIUrl":"https://doi.org/10.1177/00238309241264296","url":null,"abstract":"<p><p>This short report describes a longitudinal examination of the acquisition of English-aspirated stops by an initial cohort of 24 adult Slavic-language (Russian, Ukrainian, and Croatian) speakers. All had arrived in Canada with low oral English proficiency, and all were enrolled in the same language instruction program at the outset. Initial bilabial stops in CVCs were recorded at eight testing times: six during the first year of the study, again at year 7, and finally at year 10. Intelligibility was evaluated through a blind listening assessment of the stop productions from the first seven testing times. Voice onset times (VOT) were measured for /p/ from all eight times. Mean /p/ intelligibility improved-mainly during a proposed Window of Maximal Opportunity for L2 speech acquisition-but remained below 100%, even after 7 years. For some speakers, early /p/ productions were minimally aspirated, with VOT increasing over time but remaining intermediate between L1 English and L1 Slavic-Language values at 10 years. However, inter-speaker variability was dramatic, with some speakers showing full intelligibility throughout the study and others showing many unintelligible productions at all times. Individual learning trajectories tended to be non-linear and often non-cumulative. Overall, these findings point to a developmental process that varies considerably from one learner to another. It also demonstrates the serious drawbacks of relying on group means to characterize the process of L2 segmental learning.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241264296"},"PeriodicalIF":1.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-31DOI: 10.1177/00238309241259748
Madeleine Oakley
This study examines how L1 English-L2 French learners use L1 articulatory and acoustic categories to produce L2 vowels that are both similar to and different from their L1 vowels. Previous studies examining the relationship between L1 and L2 sound inventories have found that learners reuse L1 phone categories to produce L2 phones that are perceived as similar, but importantly, there is a lack of articulatory data included in these types of studies, which has reinforced the assumption that vowel categories can be solely represented by their acoustic properties. The present study uses ultrasound tongue imaging data and videos of lip rounding in addition to acoustic data to examine how L1 English-L2 French learners produce the French vowels /i y u e ø o/ compared with their English vowels /i u e o/. The results focus on individual paths to category formation to show how learners articulate L2 vowels, and reveal that they tend to reuse L1 tongue body gestures to produce the French vowels /i u e o/, and lip rounding gestures to produce the round vowels /y u o/. This study demonstrates that transfer of articulatory gestures depends on vowel quality and emphasizes the importance of using articulatory data to inform theories of L2 category formation.
本研究探讨了 L1 英语-L2 法语学习者如何使用 L1 的发音和声学类别来产生与其 L1元音既相似又不同的 L2元音。以前对 L1 和 L2 声音清单之间关系的研究发现,学习者会重复使用 L1 的元音类别来发出被认为相似的 L2 元音,但重要的是,这些类型的研究中缺乏发音数据,这强化了元音类别只能由其声学特性来表示的假设。本研究除声学数据外,还使用了超声波舌头成像数据和嘴唇圆润的视频,来研究第一英语学习者和第二法语学习者如何发出法语元音 /i y u e ø o/,与他们的英语元音 /i u e o/相比。研究结果集中于类别形成的个体路径,以显示学习者如何发音 L2元音,并揭示他们倾向于重复使用 L1 舌体手势来发出法语元音 /i u e o/,以及唇部圆润手势来发出圆元音 /y u o/。这项研究表明,发音手势的迁移取决于元音的质量,并强调了使用发音数据为 L2 类别形成理论提供信息的重要性。
{"title":"The Articulatory and Acoustic Representation of Second-Language French Vowels.","authors":"Madeleine Oakley","doi":"10.1177/00238309241259748","DOIUrl":"https://doi.org/10.1177/00238309241259748","url":null,"abstract":"<p><p>This study examines how L1 English-L2 French learners use L1 articulatory and acoustic categories to produce L2 vowels that are both similar to and different from their L1 vowels. Previous studies examining the relationship between L1 and L2 sound inventories have found that learners reuse L1 phone categories to produce L2 phones that are perceived as similar, but importantly, there is a lack of articulatory data included in these types of studies, which has reinforced the assumption that vowel categories can be solely represented by their acoustic properties. The present study uses ultrasound tongue imaging data and videos of lip rounding in addition to acoustic data to examine how L1 English-L2 French learners produce the French vowels /i y u e ø o/ compared with their English vowels /i u e o/. The results focus on individual paths to category formation to show how learners articulate L2 vowels, and reveal that they tend to reuse L1 tongue body gestures to produce the French vowels /i u e o/, and lip rounding gestures to produce the round vowels /y u o/. This study demonstrates that transfer of articulatory gestures depends on vowel quality and emphasizes the importance of using articulatory data to inform theories of L2 category formation.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241259748"},"PeriodicalIF":1.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141861624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-29DOI: 10.1177/00238309241266864
Xin Yuan, Jing Tang
This study investigated how input modes (reading vs. listening) and learners' perceptual learning style (visual vs. auditory) affected the incidental learning of collocations. A total of 182 college students were first assigned to either a visual or auditory group based on their performance on a perceptual learning style questionnaire. Each style group was subsequently subdivided into three groups who were exposed to a series of texts containing unfamiliar collocation items under one of the input conditions: written input, aural input, or no input. Results of the study indicated that both written and aural input led to gains in collocational knowledge, and aural input was more effective than written input. Furthermore, the study provided empirical evidence that there was a moderating role of perceptual learning style on incidental collocation learning. The auditory learners under aural input showed the highest rate of collocation learning among all treatment subgroups.
{"title":"Incidental Learning of Collocations Under Different Input Modes and the Mediating Role of Perceptual Learning Style.","authors":"Xin Yuan, Jing Tang","doi":"10.1177/00238309241266864","DOIUrl":"https://doi.org/10.1177/00238309241266864","url":null,"abstract":"<p><p>This study investigated how input modes (reading vs. listening) and learners' perceptual learning style (visual vs. auditory) affected the incidental learning of collocations. A total of 182 college students were first assigned to either a visual or auditory group based on their performance on a perceptual learning style questionnaire. Each style group was subsequently subdivided into three groups who were exposed to a series of texts containing unfamiliar collocation items under one of the input conditions: written input, aural input, or no input. Results of the study indicated that both written and aural input led to gains in collocational knowledge, and aural input was more effective than written input. Furthermore, the study provided empirical evidence that there was a moderating role of perceptual learning style on incidental collocation learning. The auditory learners under aural input showed the highest rate of collocation learning among all treatment subgroups.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241266864"},"PeriodicalIF":1.1,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-27DOI: 10.1177/00238309241261702
Ricky K W Chan, Bruce Xiao Wang
Fundamental frequency (F0) has been widely studied and used in the context of speaker discrimination and forensic voice comparison casework, but most previous studies focused on long-term F0 statistics. Lexical tone, the linguistically structured and dynamic aspects of F0, has received much less research attention. A main methodological issue lies on how tonal F0 should be parameterized for the best speaker discrimination performance. This paper compares the speaker discriminatory performance of three approaches with lexical tone modeling: discrete cosine transform (DCT), polynomial curve fitting, and quantitative target approximation (qTA). Results show that using parameters based on DCT and polynomials led to similarly promising performance, whereas those based on qTA generally yielded relatively poor performance. Implications modeling surface tonal F0 and the underlying articulatory processes for speaker discrimination are discussed.
{"title":"Modeling Lexical Tones for Speaker Discrimination.","authors":"Ricky K W Chan, Bruce Xiao Wang","doi":"10.1177/00238309241261702","DOIUrl":"https://doi.org/10.1177/00238309241261702","url":null,"abstract":"<p><p>Fundamental frequency (F0) has been widely studied and used in the context of speaker discrimination and forensic voice comparison casework, but most previous studies focused on long-term F0 statistics. Lexical tone, the linguistically structured and dynamic aspects of F0, has received much less research attention. A main methodological issue lies on how tonal F0 should be parameterized for the best speaker discrimination performance. This paper compares the speaker discriminatory performance of three approaches with lexical tone modeling: discrete cosine transform (DCT), polynomial curve fitting, and quantitative target approximation (qTA). Results show that using parameters based on DCT and polynomials led to similarly promising performance, whereas those based on qTA generally yielded relatively poor performance. Implications modeling surface tonal F0 and the underlying articulatory processes for speaker discrimination are discussed.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241261702"},"PeriodicalIF":1.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1177/00238309241260062
Feier Gao, Chien-Jer Charles Lin
Mandarin tone 3 sandhi refers to the phenomenon whereby a tone 3 syllable changes to a tone 2 when followed by another tone 3. This phonological process creates a deviation between the tonal forms realized at morphemic (/tone3-tone3/) and word ([tone2-tone3]) levels, posing questions in terms of how disyllabic tone 3 sandhi words are represented and accessed. The current study conducted three cross-modal lexical decision priming experiments to investigate this issue. Experiment 1 manipulated the frequencies of the initial morpheme and whole word, showing that the higher initial-character frequency against the whole word gives stronger activation to the underlying representation and the lower frequency of the initial character leads to stronger activation of the surface tone. Experiments 2 and 3 operationalized the relative frequency of the initial tone 3 morpheme's realization as a sandhi tone, finding that the competition between the two tonal realizations also influences how T3 sandhi words are accessed. Specifically, the more frequently the T3 morpheme surfaces as a T2 allomorph, the less activated the underlying representation becomes in the mental lexicon. Our results indicate a complex interplay between morpheme, word, and the associated tonal representations in the mental lexicon and that these factors co-determine the lexical access of tone 3 sandhi.
{"title":"Incorporating Frequency Effects in the Lexical Access of Mandarin Tone 3 Sandhi.","authors":"Feier Gao, Chien-Jer Charles Lin","doi":"10.1177/00238309241260062","DOIUrl":"https://doi.org/10.1177/00238309241260062","url":null,"abstract":"<p><p>Mandarin tone 3 sandhi refers to the phenomenon whereby a tone 3 syllable changes to a tone 2 when followed by another tone 3. This phonological process creates a deviation between the tonal forms realized at morphemic (/tone3-tone3/) and word ([tone2-tone3]) levels, posing questions in terms of how disyllabic tone 3 sandhi words are represented and accessed. The current study conducted three cross-modal lexical decision priming experiments to investigate this issue. Experiment 1 manipulated the frequencies of the initial morpheme and whole word, showing that the higher initial-character frequency against the whole word gives stronger activation to the underlying representation and the lower frequency of the initial character leads to stronger activation of the surface tone. Experiments 2 and 3 operationalized the relative frequency of the initial tone 3 morpheme's realization as a sandhi tone, finding that the competition between the two tonal realizations also influences how T3 sandhi words are accessed. Specifically, the more frequently the T3 morpheme surfaces as a T2 allomorph, the less activated the underlying representation becomes in the mental lexicon. Our results indicate a complex interplay between morpheme, word, and the associated tonal representations in the mental lexicon and that these factors co-determine the lexical access of tone 3 sandhi.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241260062"},"PeriodicalIF":1.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-14DOI: 10.1177/00238309241258162
Ronny Bujok, Antje S Meyer, Hans Rutger Bosker
Human communication is inherently multimodal. Auditory speech, but also visual cues can be used to understand another talker. Most studies of audiovisual speech perception have focused on the perception of speech segments (i.e., speech sounds). However, less is known about the influence of visual information on the perception of suprasegmental aspects of speech like lexical stress. In two experiments, we investigated the influence of different visual cues (e.g., facial articulatory cues and beat gestures) on the audiovisual perception of lexical stress. We presented auditory lexical stress continua of disyllabic Dutch stress pairs together with videos of a speaker producing stress on the first or second syllable (e.g., articulating VOORnaam or voorNAAM). Moreover, we combined and fully crossed the face of the speaker producing lexical stress on either syllable with a gesturing body producing a beat gesture on either the first or second syllable. Results showed that people successfully used visual articulatory cues to stress in muted videos. However, in audiovisual conditions, we were not able to find an effect of visual articulatory cues. In contrast, we found that the temporal alignment of beat gestures with speech robustly influenced participants' perception of lexical stress. These results highlight the importance of considering suprasegmental aspects of language in multimodal contexts.
{"title":"Audiovisual Perception of Lexical Stress: Beat Gestures and Articulatory Cues.","authors":"Ronny Bujok, Antje S Meyer, Hans Rutger Bosker","doi":"10.1177/00238309241258162","DOIUrl":"https://doi.org/10.1177/00238309241258162","url":null,"abstract":"<p><p>Human communication is inherently multimodal. Auditory speech, but also visual cues can be used to understand another talker. Most studies of audiovisual speech perception have focused on the perception of speech segments (i.e., speech sounds). However, less is known about the influence of visual information on the perception of suprasegmental aspects of speech like lexical stress. In two experiments, we investigated the influence of different visual cues (e.g., facial articulatory cues and beat gestures) on the audiovisual perception of lexical stress. We presented auditory lexical stress continua of disyllabic Dutch stress pairs together with videos of a speaker producing stress on the first or second syllable (e.g., articulating <i>VOORnaam</i> or <i>voorNAAM</i>). Moreover, we combined and fully crossed the face of the speaker producing lexical stress on either syllable with a gesturing body producing a beat gesture on either the first or second syllable. Results showed that people successfully used visual articulatory cues to stress in muted videos. However, in audiovisual conditions, we were not able to find an effect of visual articulatory cues. In contrast, we found that the temporal alignment of beat gestures with speech robustly influenced participants' perception of lexical stress. These results highlight the importance of considering suprasegmental aspects of language in multimodal contexts.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241258162"},"PeriodicalIF":1.8,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141321984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-10DOI: 10.1177/00238309241254350
Stephanie Kaucke, Marcel Schlechtweg
Previous research has shown that it is difficult for English speakers to distinguish the front rounded vowels /y/ and /ø/ from the back rounded vowels /u/ and /o/. In this study, we examine the effect of noise on this perceptual difficulty. In an Oddity Discrimination Task, English speakers without any knowledge of German were asked to discriminate between German-sounding pseudowords varying in the vowel both in quiet and in white noise at two signal-to-noise ratios (8 and 0 dB). In test trials, vowels of the same height were contrasted with each other, whereas a contrast with /a/ served as a control trial. Results revealed that a contrast with /a/ remained stable in every listening condition for both high and mid vowels. When contrasting vowels of the same height, however, there was a perceptual shift along the F2 dimension as the noise level increased. Although the /ø/-/o/ and particularly /y/-/u/ contrasts were the most difficult in quiet, accuracy on /i/-/y/ and /e/-/ø/ trials decreased immensely when the speech signal was masked. The German control group showed the same pattern, albeit less severe than the non-native group, suggesting that even in low-level tasks with pseudowords, there is a native advantage in speech perception in noise.
{"title":"English Speakers' Perception of Non-native Vowel Contrasts in Adverse Listening Conditions: A Discrimination Study on the German Front Rounded Vowels /y/ and /ø/.","authors":"Stephanie Kaucke, Marcel Schlechtweg","doi":"10.1177/00238309241254350","DOIUrl":"https://doi.org/10.1177/00238309241254350","url":null,"abstract":"<p><p>Previous research has shown that it is difficult for English speakers to distinguish the front rounded vowels /y/ and /ø/ from the back rounded vowels /u/ and /o/. In this study, we examine the effect of noise on this perceptual difficulty. In an Oddity Discrimination Task, English speakers without any knowledge of German were asked to discriminate between German-sounding pseudowords varying in the vowel both in quiet and in white noise at two signal-to-noise ratios (8 and 0 dB). In test trials, vowels of the same height were contrasted with each other, whereas a contrast with /a/ served as a control trial. Results revealed that a contrast with /a/ remained stable in every listening condition for both high and mid vowels. When contrasting vowels of the same height, however, there was a perceptual shift along the F2 dimension as the noise level increased. Although the /ø/-/o/ and particularly /y/-/u/ contrasts were the most difficult in quiet, accuracy on /i/-/y/ and /e/-/ø/ trials decreased immensely when the speech signal was masked. The German control group showed the same pattern, albeit less severe than the non-native group, suggesting that even in low-level tasks with pseudowords, there is a native advantage in speech perception in noise.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"238309241254350"},"PeriodicalIF":1.8,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141297207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-01Epub Date: 2021-10-20DOI: 10.1177/00238309211050094
Christiane Ulbrich
This paper presents the investigation and analysis of speech accommodation effects in data obtained from Spanish learners of German with varying proficiency levels. The production data were recorded during a collaborative map task of the Spanish learners of German among each other and with a native speaker of German. The map task was designed to target words and phrases with specific segmental and suprasegmental characteristics. These characteristics were derived from contrastive analyses of Spanish and German. The main objectives of the paper were to investigate whether segmental and suprasegmental characteristics of the target language German are affected by phonetic accommodation to varying degrees and whether these differences depend on the proficiency level of the speaker or the interlocutor. The statistical analysis, using regression analyses, revealed inconsistent accommodation effects across learners of different proficiency levels as well as different linguistic phenomena. In line with previous findings the results can best be accounted for by an adaptation of a dynamic system approach.
{"title":"Phonetic Accommodation on the Segmental and the Suprasegmental Level of Speech in Native-Non-Native Collaborative Tasks.","authors":"Christiane Ulbrich","doi":"10.1177/00238309211050094","DOIUrl":"10.1177/00238309211050094","url":null,"abstract":"<p><p>This paper presents the investigation and analysis of speech accommodation effects in data obtained from Spanish learners of German with varying proficiency levels. The production data were recorded during a collaborative map task of the Spanish learners of German among each other and with a native speaker of German. The map task was designed to target words and phrases with specific segmental and suprasegmental characteristics. These characteristics were derived from contrastive analyses of Spanish and German. The main objectives of the paper were to investigate whether segmental and suprasegmental characteristics of the target language German are affected by phonetic accommodation to varying degrees and whether these differences depend on the proficiency level of the speaker or the interlocutor. The statistical analysis, using regression analyses, revealed inconsistent accommodation effects across learners of different proficiency levels as well as different linguistic phenomena. In line with previous findings the results can best be accounted for by an adaptation of a dynamic system approach.</p>","PeriodicalId":51255,"journal":{"name":"Language and Speech","volume":" ","pages":"346-372"},"PeriodicalIF":1.8,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141108/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9213903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}