L2 English speaking syntactic complexity: Data preprocessing issues, reliability of automated analysis, and the effects of proficiency, L1 background, and topic

Minjin Kim, Xiaofei Lu
The Modern Language Journal. Published 2024-02-07. DOI: 10.1111/modl.12907 (https://doi.org/10.1111/modl.12907)

Abstract

The effects of learner‐ and task‐related variables on second language (L2) writing syntactic complexity (SC) have been extensively investigated. However, previous research has rarely assessed the reliability of computational tools for analyzing the SC of L2 spoken production, and we know less about the effects of such variables on L2 speaking SC. Using data from the International Corpus Network of Asian Learners of English, this study explores data preprocessing issues for preparing L2 English speech samples for automated SC analysis, evaluates the reliability of the L2 Syntactic Complexity Analyzer on preprocessed L2 English speech samples, and examines the effects of proficiency, first language (L1) background, and topic on L2 speaking SC. Our manual analysis of 30 random speech samples identified several issues that can be addressed through preprocessing to improve the accuracy of automated SC analysis. Results from multiple linear mixed‐effects models revealed significant effects of proficiency, L1 background, and topic on the mean length of clause, the number of complex AS‐units per AS‐unit, and the number of dependent clauses and complex nominals per clause in L2 learners’ spoken production. Our findings have useful implications for L2 speaking pedagogy and assessment as well as future L2 speaking SC research.
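The four indices named in the abstract are all ratios of structure counts, which a short sketch can make concrete. The Python below is illustrative only: the `SampleCounts` class, the `sc_indices` function, and the counts are hypothetical and are not taken from the study or from the L2 Syntactic Complexity Analyzer. It assumes each index is the plain ratio its name implies; the paper's exact operationalization may differ.

```python
from dataclasses import dataclass


@dataclass
class SampleCounts:
    """Hypothetical structure counts for one preprocessed speech sample."""
    words: int
    clauses: int
    dependent_clauses: int
    complex_nominals: int
    as_units: int
    complex_as_units: int


def sc_indices(c: SampleCounts) -> dict[str, float]:
    """Compute the four ratio indices, assuming each is a simple quotient."""
    if c.clauses == 0 or c.as_units == 0:
        raise ValueError("need at least one clause and one AS-unit")
    return {
        "mean_length_of_clause": c.words / c.clauses,
        "complex_as_units_per_as_unit": c.complex_as_units / c.as_units,
        "dependent_clauses_per_clause": c.dependent_clauses / c.clauses,
        "complex_nominals_per_clause": c.complex_nominals / c.clauses,
    }


# Invented counts, for illustration only.
sample = SampleCounts(
    words=120,
    clauses=16,
    dependent_clauses=4,
    complex_nominals=8,
    as_units=10,
    complex_as_units=3,
)
indices = sc_indices(sample)
for name, value in indices.items():
    print(f"{name}: {value:.2f}")
```

Because every index divides by a clause or AS-unit count, errors in segmenting speech into those units affect several measures at once, which is one reason preprocessing quality matters for automated analysis.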
