A NLP-based stylometric approach for tracking the evolution of L1 written language competence

IF 1.7 Q2 EDUCATION & EDUCATIONAL RESEARCH Journal of Writing Research Pub Date : 2021-05-01 DOI:10.17239/JOWR-2021.13.01.03
Alessio Miaschi, D. Brunato, F. Dell’Orletta
{"title":"A NLP-based stylometric approach for tracking the evolution of L1 written language competence","authors":"Alessio Miaschi, D. Brunato, F. Dell’Orletta","doi":"10.17239/JOWR-2021.13.01.03","DOIUrl":null,"url":null,"abstract":": In this study we present a Natural Language Processing (NLP)-based stylometric approach for tracking the evolution of written language competence in Italian L1 learners. The approach relies on a wide set of linguistically motivated features capturing stylistic aspects of a text, which were extracted from students’ essays contained in CItA (Corpus Italiano di Apprendenti L1), the first longitudinal corpus of texts written by Italian L1 learners enrolled in the first and second year of lower secondary school. We address the problem of modeling written language development as a supervised classification task consisting in predicting the chronological order of essays written by the same student at different temporal spans. The promising results obtained in several classification scenarios allow us to conclude that it is possible to automatically model the highly relevant changes affecting written language evolution across time, as well as identifying which features are more predictive of this process. In the last part of the article, we focus the attention on the possible influence of background variables on language learning and we present preliminary results of a pilot study aiming at understanding how the observed developmental patterns are affected by information related to the school environment of the student","PeriodicalId":45632,"journal":{"name":"Journal of Writing Research","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Writing Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17239/JOWR-2021.13.01.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 10

Abstract

: In this study we present a Natural Language Processing (NLP)-based stylometric approach for tracking the evolution of written language competence in Italian L1 learners. The approach relies on a wide set of linguistically motivated features capturing stylistic aspects of a text, which were extracted from students’ essays contained in CItA (Corpus Italiano di Apprendenti L1), the first longitudinal corpus of texts written by Italian L1 learners enrolled in the first and second year of lower secondary school. We address the problem of modeling written language development as a supervised classification task consisting in predicting the chronological order of essays written by the same student at different temporal spans. The promising results obtained in several classification scenarios allow us to conclude that it is possible to automatically model the highly relevant changes affecting written language evolution across time, as well as identifying which features are more predictive of this process. In the last part of the article, we focus the attention on the possible influence of background variables on language learning and we present preliminary results of a pilot study aiming at understanding how the observed developmental patterns are affected by information related to the school environment of the student
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于NLP的风格计量法追踪一级书面语言能力的演变
:在这项研究中,我们提出了一种基于自然语言处理(NLP)的风格测量方法,用于跟踪意大利L1学习者书面语言能力的演变。该方法依赖于捕捉文本风格方面的一系列语言动机特征,这些特征是从CItA(Corpus Italiano di Apprendenti L1)中包含的学生论文中提取的,CItA是意大利一年级和二年级学生撰写的第一个纵向文本语料库。我们将书面语言发展建模问题作为一项有监督的分类任务来解决,该任务包括预测同一学生在不同时间跨度写的文章的时间顺序。在几个分类场景中获得的有希望的结果使我们能够得出结论,可以自动对影响书面语言随时间演变的高度相关的变化进行建模,并确定哪些特征更能预测这一过程。在文章的最后一部分,我们将注意力集中在背景变量对语言学习的可能影响上,并介绍了一项试点研究的初步结果,该研究旨在了解观察到的发展模式如何受到与学生学校环境相关的信息的影响
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Writing Research
Journal of Writing Research EDUCATION & EDUCATIONAL RESEARCH-
CiteScore
7.20
自引率
4.90%
发文量
16
审稿时长
40 weeks
期刊介绍: The Journal of Writing Research is an international peer reviewed journal that publishes high quality theoretical, empirical, and review papers covering the broad spectrum of writing research. The Journal primarily publishes papers that describe scientific studies of the processes by which writing is produced or the means by which writing can be effectively taught. The journal is inherently cross-disciplinary, publishing original research in the different domains of writing research. The Journal of Writing Research is an open access journal (no reader fee - no author fee).
期刊最新文献
Book review | Technology in second language writing: Advances in composing, translation, writing pedagogy and data-driven learning Fleshing out your text: How elaboration and contextualization moves differentially predict writing quality Thinking outside the box: Senior scientists’ metacognitive strategy knowledge and self-regulation of writing for science communication Synthesis Writing in Science Orientation Classes: An Instructional Design Studio Advancing Civics-specific Disciplinary Writing in the Elementary Grades issue
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1