首页 > 最新文献

Studies in Second Language Acquisition最新文献

英文 中文
Comparing the effectiveness of Duolingo, Classroom instruction, and Classroom + Duolingo instruction conditions on beginner-level French language development 比较多邻国、课堂教学和课堂+多邻国教学条件对初级法语语言发展的效果
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-03-10 DOI: 10.1017/s0272263126101521
YouJin Kim, Caroline Payant, Stephen Skalicky, Yoon Namkung
Over the past decade, there has been growing use of mobile-assisted language learning (MALL) applications such as Duolingo. The effectiveness of MALL applications is thus of great interest among language acquisition researchers and practitioners. This study compared French language development among beginners in three learning conditions: Classroom-Only ( n = 58), Duolingo-Only ( n = 65), and Classroom + Duolingo ( n = 60). The Classroom-Only group completed a standard first-semester curriculum, the Classroom + Duolingo group used Duolingo as supplemental material, and the Duolingo-Only group learned exclusively through the app. All participants completed pretests and posttests measuring overall proficiency, grammar, vocabulary, pragmatics ( tu vs. vous ), and communicative competence over a 16-week period. Linear mixed-effects models revealed that all three groups showed significant improvement across nearly all measures between pretest and posttest, with similar magnitudes of improvement. The only exception was in learning tu vs. vous (pragmatic competence), where the Classroom + Duolingo group showed larger gains than Classroom-Only and Duolingo-Only groups. Results suggest that both traditional classroom instruction and Duolingo are comparably effective for beginning French language learners. Results are discussed in light of mobile app-based language learning with the potential role of learner characteristics.
在过去的十年里,越来越多的人使用移动辅助语言学习(MALL)应用程序,比如Duolingo。因此,MALL应用的有效性是语言习得研究者和实践者非常感兴趣的问题。本研究比较了初学者在三种学习条件下的法语发展情况:只上课堂(n = 58)、只上多邻国(n = 65)和课堂+多邻国(n = 60)。课堂组完成了标准的第一学期课程,课堂+多ingo组使用多ingo作为补充材料,多ingo组只通过应用程序学习。所有参与者都完成了前测和后测,在16周的时间内测量了整体熟练程度、语法、词汇、语用学(tu / vous)和交际能力。线性混合效应模型显示,所有三组在测试前和测试后的几乎所有测量中都有显着改善,改善幅度相似。唯一的例外是在学习能力vs. vous(语用能力)方面,课堂+多邻国组比只使用课堂和只使用多邻国组表现出更大的收益。结果表明,对于初学法语的人来说,传统的课堂教学和多邻国都是相当有效的。从基于移动应用程序的语言学习的角度讨论了学习者特征的潜在作用。
{"title":"Comparing the effectiveness of Duolingo, Classroom instruction, and Classroom + Duolingo instruction conditions on beginner-level French language development","authors":"YouJin Kim, Caroline Payant, Stephen Skalicky, Yoon Namkung","doi":"10.1017/s0272263126101521","DOIUrl":"https://doi.org/10.1017/s0272263126101521","url":null,"abstract":"Over the past decade, there has been growing use of mobile-assisted language learning (MALL) applications such as Duolingo. The effectiveness of MALL applications is thus of great interest among language acquisition researchers and practitioners. This study compared French language development among beginners in three learning conditions: Classroom-Only ( <jats:italic>n</jats:italic> = 58), Duolingo-Only ( <jats:italic>n</jats:italic> = 65), and Classroom + Duolingo ( <jats:italic>n</jats:italic> = 60). The Classroom-Only group completed a standard first-semester curriculum, the Classroom + Duolingo group used Duolingo as supplemental material, and the Duolingo-Only group learned exclusively through the app. All participants completed pretests and posttests measuring overall proficiency, grammar, vocabulary, pragmatics ( <jats:italic>tu</jats:italic> vs. <jats:italic>vous</jats:italic> ), and communicative competence over a 16-week period. Linear mixed-effects models revealed that all three groups showed significant improvement across nearly all measures between pretest and posttest, with similar magnitudes of improvement. The only exception was in learning <jats:italic>tu</jats:italic> vs. <jats:italic>vous</jats:italic> (pragmatic competence), where the Classroom + Duolingo group showed larger gains than Classroom-Only and Duolingo-Only groups. Results suggest that both traditional classroom instruction and Duolingo are comparably effective for beginning French language learners. Results are discussed in light of mobile app-based language learning with the potential role of learner characteristics.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"10 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measurement and structural invariance testing in L2 research: A methodological synthesis 第二语言研究中的测量和结构不变性检验:方法论综合
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-03-10 DOI: 10.1017/s0272263126101624
Ekaterina Sudina, Yazhuo Quan, Hande Ozdemir
Measurement invariance (MI) ensures that a given measure holds the same conceptual meaning for individuals from different groups and across multiple measurement occasions. Structural invariance (SI) is a logical extension of MI that examines whether relationships between latent constructs (e.g., structural paths within the model) hold equally across groups. To examine the status quo of MI and SI in second-language (L2) research, we systematically investigated the extent to which primary studies adhered to best practices in invariance testing and reporting. A total of 4,272 full-text records were screened, and 113 articles (116 independent samples; 147,856 participants) were included. The sample was fully double-coded to ensure accuracy and reliability. The results indicated alarming inconsistencies in how key invariance steps were implemented and reported. We offer empirically grounded recommendations for (a) improving methodological rigor of invariance assessments in the field and (b) contributing to more equitable and interpretable comparisons in multilingual settings.
测量不变性(MI)确保给定的测量对于来自不同群体的个体和跨多个测量场合具有相同的概念含义。结构不变性(SI)是MI的逻辑扩展,它检查潜在构式之间的关系(例如,模型内的结构路径)是否在组间相等。为了研究第二语言(L2)研究中MI和SI的现状,我们系统地调查了初级研究在不变性测试和报告中坚持最佳实践的程度。共筛选了4272篇全文记录,纳入113篇文章(116个独立样本,147856名参与者)。样品是完全双重编码,以确保准确性和可靠性。结果表明,在如何实现和报告关键不变性步骤方面存在惊人的不一致性。我们提供了基于经验的建议:(a)提高该领域不变性评估方法的严谨性,(b)有助于在多语言环境中进行更公平和可解释的比较。
{"title":"Measurement and structural invariance testing in L2 research: A methodological synthesis","authors":"Ekaterina Sudina, Yazhuo Quan, Hande Ozdemir","doi":"10.1017/s0272263126101624","DOIUrl":"https://doi.org/10.1017/s0272263126101624","url":null,"abstract":"Measurement invariance (MI) ensures that a given measure holds the same conceptual meaning for individuals from different groups and across multiple measurement occasions. Structural invariance (SI) is a logical extension of MI that examines whether relationships between latent constructs (e.g., structural paths within the model) hold equally across groups. To examine the status quo of MI and SI in second-language (L2) research, we systematically investigated the extent to which primary studies adhered to best practices in invariance testing and reporting. A total of 4,272 full-text records were screened, and 113 articles (116 independent samples; 147,856 participants) were included. The sample was fully double-coded to ensure accuracy and reliability. The results indicated alarming inconsistencies in how key invariance steps were implemented and reported. We offer empirically grounded recommendations for (a) improving methodological rigor of invariance assessments in the field and (b) contributing to more equitable and interpretable comparisons in multilingual settings.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"17 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Univariate normality checking practices in L2 research: An AI-assisted systematic review 第二语言研究中的单变量正态性检查实践:人工智能辅助的系统回顾
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-25 DOI: 10.1017/s0272263126101600
Vahid Aryadoust, Yichen Jia
Regardless of whether one is analyzing quantitative data from research involving generative artificial intelligence (GenAI) or more classical methods, testing for normality remains a necessary step in statistical analysis. Although over 60 methods have been proposed for assessing univariate normality, previous systematic reviews show that normality testing remains underreported in L2 research. This paper addresses this gap by first reviewing the concept of normality and its role in parametric statistical inference. We then examine 12 normality assessment methods including five graphical and seven analytical methods selected based on their prominence in statistical literature and availability in commonly used software. Each method is explained in terms of its underlying mechanism and sensitivity to specific forms of nonnormality, such as skewness, tail heaviness, and multimodality. In the second part of the study, we review 237 empirical articles published between 2020 and 2025 in ten selected L2-focused Q1 journals, using AI-assisted annotation. Our findings reveal inconsistencies in how graphical tests are reported, a tendency to rely on tests such as the Kolmogorov-Smirnov without explicit attention to sample size constraints, and limited justification provided for critical values of skewness and kurtosis. These results indicate some divergence between recommended statistical practices and the procedures for normality testing reported in the L2 publications examined. The paper concludes with actionable recommendations for selecting and interpreting normality tests in L2 research contexts.
无论分析的是生成式人工智能(GenAI)研究的定量数据还是更经典的方法,正态性检验仍然是统计分析的必要步骤。尽管已经提出了超过60种评估单变量正态性的方法,但之前的系统综述表明,正态性检验在第二语言研究中仍然被低估。本文通过首先回顾正态性的概念及其在参数统计推断中的作用来解决这一差距。然后,我们检查了12种正态性评估方法,包括5种图形和7种分析方法,选择基于它们在统计文献中的突出地位和常用软件的可用性。每种方法都根据其潜在的机制和对特定形式的异常(如偏度、尾重和多模态)的敏感性进行了解释。在研究的第二部分,我们使用人工智能辅助注释,回顾了2020年至2025年间发表在10种l2重点Q1期刊上的237篇实证文章。我们的研究结果揭示了图形检验报告方式的不一致性,依赖诸如Kolmogorov-Smirnov等检验而不明确关注样本量约束的倾向,以及为偏度和峰度临界值提供的有限理由。这些结果表明,在推荐的统计实践和L2出版物中报告的正态性测试程序之间存在一些分歧。本文最后提出了在第二语言研究背景下选择和解释正态性测试的可行建议。
{"title":"Univariate normality checking practices in L2 research: An AI-assisted systematic review","authors":"Vahid Aryadoust, Yichen Jia","doi":"10.1017/s0272263126101600","DOIUrl":"https://doi.org/10.1017/s0272263126101600","url":null,"abstract":"Regardless of whether one is analyzing quantitative data from research involving generative artificial intelligence (GenAI) or more classical methods, testing for normality remains a necessary step in statistical analysis. Although over 60 methods have been proposed for assessing univariate normality, previous systematic reviews show that normality testing remains underreported in L2 research. This paper addresses this gap by first reviewing the concept of normality and its role in parametric statistical inference. We then examine 12 normality assessment methods including five graphical and seven analytical methods selected based on their prominence in statistical literature and availability in commonly used software. Each method is explained in terms of its underlying mechanism and sensitivity to specific forms of nonnormality, such as skewness, tail heaviness, and multimodality. In the second part of the study, we review 237 empirical articles published between 2020 and 2025 in ten selected L2-focused Q1 journals, using AI-assisted annotation. Our findings reveal inconsistencies in how graphical tests are reported, a tendency to rely on tests such as the Kolmogorov-Smirnov without explicit attention to sample size constraints, and limited justification provided for critical values of skewness and kurtosis. These results indicate some divergence between recommended statistical practices and the procedures for normality testing reported in the L2 publications examined. The paper concludes with actionable recommendations for selecting and interpreting normality tests in L2 research contexts.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"342 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147278874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disability and second language learning: Implications for interdisciplinary research in applied linguistics 残疾与第二语言学习:应用语言学跨学科研究的启示
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-20 DOI: 10.1017/s0272263126101594
Caitlin Cornell, Robert Alan Randez
In this commentary contextualizing the complexities at the nexus of disability and applied linguistics (AL), the authors highlight the paucity of conscientious attention to disabled populations in AL research, explore the intricacies of choosing appropriate terminology to describe disability and disabled people, challenge scholars in the field to reflect on and make explicit their emic or etic positionality vis-à-vis disability in their research, and call researchers to consider researching with , rather than merely about , disabled second language learners. The authors (a) illustrate how a collection of emergent research studies illuminates critical considerations at this underresearched interdisciplinary intersection in the field, and (b) demonstrate, via example studies in other areas of AL, how scholars may choose to center the disabled second-language learning experience rather than relegate it to the far corners of the field.
在这篇评论中,作者将残疾和应用语言学(AL)关系的复杂性置于背景中,强调了在AL研究中对残疾人群的认真关注的缺乏,探索了选择适当术语来描述残疾和残疾人的复杂性,挑战该领域的学者在他们的研究中反思并明确他们对-à-vis残疾的主题或逻辑定位。并呼吁研究人员考虑研究残疾第二语言学习者,而不仅仅是研究他们。作者(a)说明了一系列新兴的研究如何阐明了该领域中这个研究不足的跨学科交叉领域的关键考虑因素,(b)通过人工智能其他领域的示例研究表明,学者如何选择将残疾的第二语言学习经验放在中心位置,而不是将其降级到该领域的偏远角落。
{"title":"Disability and second language learning: Implications for interdisciplinary research in applied linguistics","authors":"Caitlin Cornell, Robert Alan Randez","doi":"10.1017/s0272263126101594","DOIUrl":"https://doi.org/10.1017/s0272263126101594","url":null,"abstract":"In this commentary contextualizing the complexities at the nexus of disability and applied linguistics (AL), the authors highlight the paucity of conscientious attention to disabled populations in AL research, explore the intricacies of choosing appropriate terminology to describe disability and disabled people, challenge scholars in the field to reflect on and make explicit their emic or etic positionality vis-à-vis disability in their research, and call researchers to consider researching <jats:italic>with</jats:italic> , rather than merely <jats:italic>about</jats:italic> , disabled second language learners. The authors (a) illustrate how a collection of emergent research studies illuminates critical considerations at this underresearched interdisciplinary intersection in the field, and (b) demonstrate, via example studies in other areas of AL, how scholars may choose to center the disabled second-language learning experience rather than relegate it to the far corners of the field.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"333 1","pages":"1-15"},"PeriodicalIF":4.1,"publicationDate":"2026-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146261162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of initial proficiency in L2 English speaking development of adolescent learners 初级熟练程度在青少年二语学习者口语发展中的作用
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-18 DOI: 10.1017/s0272263126101570
Vanessa De Wilde
The present study investigated how initial proficiency impacted development in adolescent learners’ L2 speaking. The study reports on a longitudinal study with dense measurements with twelve adolescent L2 English learners. The participants were tested every week throughout their first year of secondary school (30 datapoints). The participants’ learning trajectory was modeled using generalized additive mixed models (GAMMs). Results showed large differences in development between various lexical and syntactic measures and between individual learners. The learners’ initial proficiency had a significant impact on their development. Overall, more periods of growth were observed in lexical measures than in syntactic measures. In line with previous results, some stabilization was observed once learners reached a certain proficiency level, but this stabilization was dependent on task type. Closed tasks lead to a ceiling effect in some measures, whereas more open tasks give learners from various proficiency levels opportunities to demonstrate their L2 English speaking skills.
本研究旨在探讨初级熟练程度对青少年第二语言口语发展的影响。本研究报告了一项对12名青少年二语学习者进行密集测量的纵向研究。参与者在中学的第一年每周都接受测试(30个数据点)。采用广义加性混合模型(GAMMs)对被试的学习轨迹进行建模。结果显示,不同词汇和句法测量方法之间以及个体学习者之间的发展存在很大差异。学习者的初始熟练程度对其发展有显著影响。总的来说,在词汇测量中比在句法测量中观察到更多的生长时期。与之前的结果一致,当学习者达到一定的熟练程度时,观察到一些稳定,但这种稳定取决于任务类型。封闭任务在某些方面会导致天花板效应,而更开放的任务则为不同熟练程度的学习者提供了展示第二语言口语技能的机会。
{"title":"The role of initial proficiency in L2 English speaking development of adolescent learners","authors":"Vanessa De Wilde","doi":"10.1017/s0272263126101570","DOIUrl":"https://doi.org/10.1017/s0272263126101570","url":null,"abstract":"The present study investigated how initial proficiency impacted development in adolescent learners’ L2 speaking. The study reports on a longitudinal study with dense measurements with twelve adolescent L2 English learners. The participants were tested every week throughout their first year of secondary school (30 datapoints). The participants’ learning trajectory was modeled using generalized additive mixed models (GAMMs). Results showed large differences in development between various lexical and syntactic measures and between individual learners. The learners’ initial proficiency had a significant impact on their development. Overall, more periods of growth were observed in lexical measures than in syntactic measures. In line with previous results, some stabilization was observed once learners reached a certain proficiency level, but this stabilization was dependent on task type. Closed tasks lead to a ceiling effect in some measures, whereas more open tasks give learners from various proficiency levels opportunities to demonstrate their L2 English speaking skills.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"107 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146215715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effects of audiovisual input on second language learning A meta-analysis 视听输入对第二语言学习的影响:meta分析
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-18 DOI: 10.1017/s0272263126101612
Dru Sutton, Stuart Webb
This meta-analysis investigates the contributions of viewing audiovisual input on second language (L2) learning. We calculated 75 effect sizes from 56 experiments ( n = 1954) and assessed the effects of audiovisual input on language learning using a within-group (pre-post) meta-analytic approach. Fifteen moderator variables were included in the analysis. Results showed that a) there was a small effect of audiovisual input on L2 learning ( g = .89); b) no differences were found between the effects of viewing audiovisual input on different areas of L2 learning (vocabulary, grammar, pronunciation, speaking, listening proficiency); and c) video category had a significant impact on L2 learning with entertainment-focused videos (e.g., TV series, movies, and mixed videos) yielding lower effects than educational videos (e.g., TED Talks, documentaries, and language-focused).
本荟萃分析调查了观看视听输入对第二语言学习的贡献。我们从56个实验(n = 1954)中计算了75个效应量,并使用组内(前后)荟萃分析方法评估了视听输入对语言学习的影响。分析中包括15个调节变量。结果表明:a)视听输入对二语学习的影响较小(g = 0.89);b)观看视听输入对第二语言学习不同领域(词汇、语法、发音、口语、听力熟练程度)的影响没有差异;c)视频类别对第二语言学习有显著影响,以娱乐为重点的视频(如电视剧、电影和混合视频)的效果低于教育视频(如TED演讲、纪录片和语言为重点)。
{"title":"The effects of audiovisual input on second language learning A meta-analysis","authors":"Dru Sutton, Stuart Webb","doi":"10.1017/s0272263126101612","DOIUrl":"https://doi.org/10.1017/s0272263126101612","url":null,"abstract":"This meta-analysis investigates the contributions of viewing audiovisual input on second language (L2) learning. We calculated 75 effect sizes from 56 experiments ( <jats:italic>n</jats:italic> = 1954) and assessed the effects of audiovisual input on language learning using a within-group (pre-post) meta-analytic approach. Fifteen moderator variables were included in the analysis. Results showed that a) there was a small effect of audiovisual input on L2 learning ( <jats:italic>g</jats:italic> = .89); b) no differences were found between the effects of viewing audiovisual input on different areas of L2 learning (vocabulary, grammar, pronunciation, speaking, listening proficiency); and c) video category had a significant impact on L2 learning with entertainment-focused videos (e.g., TV series, movies, and mixed videos) yielding lower effects than educational videos (e.g., TED Talks, documentaries, and language-focused).","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"73 1","pages":"1-25"},"PeriodicalIF":4.1,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146261161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fit-hacking: Questionable research practices in structural equation modeling 拟合黑客:结构方程建模中有问题的研究实践
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-12 DOI: 10.1017/s0272263126101569
Ali H. Al-Hoorie, Yo In’nami, Phil Hiver
Structural equation modeling (SEM) is a powerful and flexible modeling framework for testing complex relationships among observed and latent variables. However, its methodological complexity and analytical flexibility also increase the risk of questionable research practices (QRPs), especially in fields like applied linguistics, where training in advanced statistics may be limited. This article synthesizes the literature on QRPs and applies it to SEM by identifying seven categories of problematic practices: not checking assumptions, not validating a measurement model, not testing competing models, not sufficiently justifying modeling decisions, relying on post hoc model modification, overemphasizing global fit indices, and incomplete or nontransparent reporting. Each practice is described with examples and linked to broader issues in research ethics and transparency. The paper concludes with concrete recommendations for improving the credibility and reproducibility of SEM research, emphasizing the integration of best practices with the principles of open science.
结构方程建模(SEM)是一种强大而灵活的建模框架,用于测试观察变量和潜在变量之间的复杂关系。然而,其方法的复杂性和分析的灵活性也增加了有问题的研究实践(qrp)的风险,特别是在应用语言学等领域,在这些领域,高级统计学的培训可能有限。本文综合了关于qrp的文献,并通过识别7类问题实践将其应用于SEM:不检查假设,不验证测量模型,不测试竞争模型,不充分证明建模决策,依赖于事后模型修改,过分强调全局拟合指数,以及不完整或不透明的报告。每一种实践都用例子来描述,并与研究伦理和透明度方面更广泛的问题联系起来。论文最后提出了提高SEM研究可信度和可重复性的具体建议,强调了将最佳实践与开放科学原则相结合。
{"title":"Fit-hacking: Questionable research practices in structural equation modeling","authors":"Ali H. Al-Hoorie, Yo In’nami, Phil Hiver","doi":"10.1017/s0272263126101569","DOIUrl":"https://doi.org/10.1017/s0272263126101569","url":null,"abstract":"Structural equation modeling (SEM) is a powerful and flexible modeling framework for testing complex relationships among observed and latent variables. However, its methodological complexity and analytical flexibility also increase the risk of questionable research practices (QRPs), especially in fields like applied linguistics, where training in advanced statistics may be limited. This article synthesizes the literature on QRPs and applies it to SEM by identifying seven categories of problematic practices: not checking assumptions, not validating a measurement model, not testing competing models, not sufficiently justifying modeling decisions, relying on post hoc model modification, overemphasizing global fit indices, and incomplete or nontransparent reporting. Each practice is described with examples and linked to broader issues in research ethics and transparency. The paper concludes with concrete recommendations for improving the credibility and reproducibility of SEM research, emphasizing the integration of best practices with the principles of open science.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"abs/2301.03111 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146160328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post hoc score category collapsing for L2 pronunciation research 二语发音研究的分值分类崩溃
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-02-11 DOI: 10.1017/s0272263126101582
Taichi Yamashita
Second language (L2) pronunciation research has measured speech comprehensibility by asking listeners to assess L2 learners’ speaking performance with rating scales. While some studies have provided validity evidence for these rating scales, few studies have examined the extent to which those scales effectively distinguish among L2 speakers. To fill this gap, the present study examines the 9-point scale used in Saito et al. (2020: Annual Review of Applied Linguistics, 40 , 9–25.) and the 100-point scale in Huensch and Nagle (2023: Studies in Second Language Acquisition, 45 (2), 571–585.) from a Rasch measurement perspective and showcases post hoc score category collapsing as a potential countermeasure against suboptimal rating scale functioning. Findings suggested that different score categories represented the same ability level and were therefore interchangeable. Collapsing these score categories yielded shorter but more functional scales without compromising the psychometric qualities of the original scales. These findings suggest that researchers need to empirically refine their scale lengths rather than uncritically following their conventional measurement practices.
第二语言(L2)发音研究通过要求听者用评分量表评估L2学习者的口语表现来衡量言语可理解性。虽然一些研究为这些评定量表提供了有效性证据,但很少有研究调查这些量表在多大程度上有效区分二语使用者。为了填补这一空白,本研究从Rasch测量的角度研究了Saito等人(2020:应用语言学年度评论,40,9 - 25.)使用的9分制和Huensch和Nagle(2023:第二语言习得研究,45(2),571-585 .)使用的100分制,并展示了即时得分类别崩溃作为一种潜在的对策,以防止次优评分量表功能。研究结果表明,不同的分数类别代表相同的能力水平,因此可以互换。将这些分数类别折叠后,产生了更短但更实用的量表,而不会影响原始量表的心理测量质量。这些发现表明,研究人员需要从经验上完善他们的尺度长度,而不是不加批判地遵循他们的传统测量方法。
{"title":"Post hoc score category collapsing for L2 pronunciation research","authors":"Taichi Yamashita","doi":"10.1017/s0272263126101582","DOIUrl":"https://doi.org/10.1017/s0272263126101582","url":null,"abstract":"Second language (L2) pronunciation research has measured speech comprehensibility by asking listeners to assess L2 learners’ speaking performance with rating scales. While some studies have provided validity evidence for these rating scales, few studies have examined the extent to which those scales effectively distinguish among L2 speakers. To fill this gap, the present study examines the 9-point scale used in Saito et al. (2020: <jats:italic>Annual Review of Applied Linguistics, 40</jats:italic> , 9–25.) and the 100-point scale in Huensch and Nagle (2023: <jats:italic>Studies in Second Language Acquisition, 45</jats:italic> (2), 571–585.) from a Rasch measurement perspective and showcases post hoc score category collapsing as a potential countermeasure against suboptimal rating scale functioning. Findings suggested that different score categories represented the same ability level and were therefore interchangeable. Collapsing these score categories yielded shorter but more functional scales without compromising the psychometric qualities of the original scales. These findings suggest that researchers need to empirically refine their scale lengths rather than uncritically following their conventional measurement practices.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"97 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146153482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Does phonetic training benefit word learning? 语音训练对单词学习有好处吗?
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-01-26 DOI: 10.1017/s0272263125101459
Yuxin Ge, Susana Correia, João Dinis Fernandes, Kirsty Hanson, Anabela Rato, Patrick Rebuschat
Recent research has shown that adult learners can rapidly acquire novel words of a foreign language by tracking cross-situational statistics, but learning is substantially reduced when the target words are phonologically similar and contain non-native contrasts. We expand on this research by investigating whether perceptual discrimination training on non-native target contrasts facilitates cross-situational learning of new words (CSWL). Our design combines perceptual training and CSWL to test the transfer of perceptual gains to lexical learning—an approach that integrates methods from L2 speech and statistical learning. In two studies, we tested English-native and Portuguese-native speakers’ learning of 24 Portuguese pseudowords via a CSWL task. In Study 1, we examined baseline learning in both language groups without prior training. In Study 2, English-native speakers were assigned to one of three conditions: phonetic training with an AX discrimination task, phonetic training with an oddity discrimination task, or no phonetic training prior to the CSWL task. Results confirmed that adults can learn non-native words from cross-situational statistics, and that phonological overlap between words decreases learning. Perceptual training improved the discrimination of target contrasts, but this did not transfer to statistical learning of words that contain these contrasts. These findings suggest that phonetic training alone may not be sufficient for vocabulary acquisition, suggesting the need for instructional approaches that integrate phonetic training with more explicit teaching methods or meaning-based practice.
最近的研究表明,成人学习者可以通过跟踪跨情景统计快速习得外语新单词,但当目标单词语音相似且包含非母语对比时,学习速度会大大降低。在此基础上,我们进一步探讨了非母语目标对比的知觉辨别训练是否有助于跨情境新词学习。我们的设计结合了感知训练和CSWL来测试感知增益到词汇学习的转移,这是一种集成了二语语音和统计学习方法的方法。在两项研究中,我们通过CSWL任务测试了英语母语者和葡萄牙语母语者对24个葡萄牙语假词的学习。在研究1中,我们在没有事先培训的情况下检查了两个语言组的基线学习情况。在研究2中,以英语为母语的人被分配到三种条件中的一种:语音训练与AX识别任务,语音训练与奇音识别任务,或在CSWL任务之前没有语音训练。结果证实,成年人可以从跨情景统计中学习非母语词汇,单词之间的语音重叠会减少学习。知觉训练提高了对目标对比的辨别,但这并没有转移到包含这些对比的单词的统计学习上。这些发现表明,仅靠语音训练可能不足以促进词汇习得,这表明需要将语音训练与更明确的教学方法或基于意义的练习相结合的教学方法。
{"title":"Does phonetic training benefit word learning?","authors":"Yuxin Ge, Susana Correia, João Dinis Fernandes, Kirsty Hanson, Anabela Rato, Patrick Rebuschat","doi":"10.1017/s0272263125101459","DOIUrl":"https://doi.org/10.1017/s0272263125101459","url":null,"abstract":"Recent research has shown that adult learners can rapidly acquire novel words of a foreign language by tracking cross-situational statistics, but learning is substantially reduced when the target words are phonologically similar and contain non-native contrasts. We expand on this research by investigating whether perceptual discrimination training on non-native target contrasts facilitates cross-situational learning of new words (CSWL). Our design combines perceptual training and CSWL to test the transfer of perceptual gains to lexical learning—an approach that integrates methods from L2 speech and statistical learning. In two studies, we tested English-native and Portuguese-native speakers’ learning of 24 Portuguese pseudowords via a CSWL task. In Study 1, we examined baseline learning in both language groups without prior training. In Study 2, English-native speakers were assigned to one of three conditions: phonetic training with an AX discrimination task, phonetic training with an oddity discrimination task, or no phonetic training prior to the CSWL task. Results confirmed that adults can learn non-native words from cross-situational statistics, and that phonological overlap between words decreases learning. Perceptual training improved the discrimination of target contrasts, but this did not transfer to statistical learning of words that contain these contrasts. These findings suggest that phonetic training alone may not be sufficient for vocabulary acquisition, suggesting the need for instructional approaches that integrate phonetic training with more explicit teaching methods or meaning-based practice.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"7 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146056129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Written input and the encoding of L2 phonological contrasts: L1 Arabic learners of L2 English 书面输入与二语语音对比编码:母语阿拉伯语学习者的二语英语
IF 4.1 1区 文学 Q1 LINGUISTICS Pub Date : 2026-01-12 DOI: 10.1017/s0272263125101514
Louise Hannah Shepperd, Sam Hellmuth, Leah Roberts
Many adults learn languages with written forms that differ from their first language(s). Empirical research has demonstrated the influential role of written input on developing L2 phonology. However, existing studies are limited by (1) focusing on learning languages that share the same orthographic script, predominantly the Latin alphabet, (2) small sample sizes, and (3) limited consideration of L2 proficiency. This study investigated the influence of Arabic and English written input when lexically encoding the difficult /f-v/ phonological contrast for L1 Arabic-speaking learners of L2 English. A word learning study was completed by 114 L1 Arabic speakers, with varying English proficiency, and 117 L1 English-speaking controls. Mixed-effects modeling of L1 Arabic accuracy revealed an inhibitory effect of any written input when learning words differing by the difficult contrast. Performance improved with increasing L2 proficiency; however, the inhibitory effect of written input for words differing by /f-v/ persisted into high levels of L2 proficiency.
许多成年人学习与母语不同的书面语言。实证研究已经证明了书面输入对二语音韵学发展的影响作用。然而,现有的研究受到以下因素的限制:(1)专注于学习具有相同正字法的语言,主要是拉丁字母;(2)样本量小;(3)对第二语言熟练程度的考虑有限。本研究考察了阿拉伯语和英语书面输入对母语阿拉伯语的二语英语学习者进行难/f-v/音位对比词汇编码时的影响。114名英语水平不同的L1阿拉伯语使用者和117名L1英语对照者完成了单词学习研究。母语阿拉伯语准确度的混合效应模型揭示了任何书面输入在学习困难对比不同的单词时的抑制效应。表现随第二语言熟练程度的提高而改善;然而,书面输入不同于/f-v/的单词的抑制效应持续到高水平的二语熟练程度。
{"title":"Written input and the encoding of L2 phonological contrasts: L1 Arabic learners of L2 English","authors":"Louise Hannah Shepperd, Sam Hellmuth, Leah Roberts","doi":"10.1017/s0272263125101514","DOIUrl":"https://doi.org/10.1017/s0272263125101514","url":null,"abstract":"Many adults learn languages with written forms that differ from their first language(s). Empirical research has demonstrated the influential role of written input on developing L2 phonology. However, existing studies are limited by (1) focusing on learning languages that share the same orthographic script, predominantly the Latin alphabet, (2) small sample sizes, and (3) limited consideration of L2 proficiency. This study investigated the influence of Arabic and English written input when lexically encoding the difficult /f-v/ phonological contrast for L1 Arabic-speaking learners of L2 English. A word learning study was completed by 114 L1 Arabic speakers, with varying English proficiency, and 117 L1 English-speaking controls. Mixed-effects modeling of L1 Arabic accuracy revealed an inhibitory effect of any written input when learning words differing by the difficult contrast. Performance improved with increasing L2 proficiency; however, the inhibitory effect of written input for words differing by /f-v/ persisted into high levels of L2 proficiency.","PeriodicalId":22008,"journal":{"name":"Studies in Second Language Acquisition","volume":"15 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145950000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Studies in Second Language Acquisition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1