Applied Corpus Linguistics: Latest Articles

Book Reviews
Pub Date : 2023-03-01 DOI: 10.1016/j.acorp.2023.100055
Rickey Lu
Citations: 0
Hack your corpus analysis: How AI can assist corpus linguists deal with messy social media data
Pub Date : 2023-01-01 DOI: 10.1016/j.acorp.2023.100067
Michele Zappavigna
Applied Corpus Linguistics, vol. 3, no. 3. Citations: 1
Book review: Vander Viana (2023) Teaching English with corpora: A resource book
Pub Date : 2023-01-01 DOI: 10.1016/j.acorp.2023.100061
Larissa Goulart (Assistant Professor of Linguistics)
Applied Corpus Linguistics, vol. 3, no. 3. Citations: 0
The gap between intentions and reality: Reasons for EAP writers’ non-use of corpora
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100032
Maggie Charles

Over the last three decades, extensive research has been devoted to EAP students’ use of corpora for academic writing. However, corpus use has usually been ascertained immediately post-course; data on long-term use is sparse and little attention has been paid to those who give up using corpora. This study investigates the extent of corpus non-use and students’ reasons for discontinuing the practice in the long term. It draws on data from two questionnaires: (1) immediate post-course (ImmPQ); (2) delayed post-course (DelPQ) completed a year later. Participants were 182 graduates who took a six-week course during which they built and consulted do-it-yourself corpora in their own field. Results from ImmPQ showed that most students (63%) used their corpus regularly (≥ 1/week), but one year later DelPQ revealed that regular use had decreased to 36%. Although 87% of respondents to ImmPQ stated their intention to use their corpus in the future, DelPQ reported a total of 37% of non-users. There were 86 mentions of reasons for non-use; the most prevalent were: not doing any academic writing (29%), the use of other tools (20%), time issues and corpus issues (10% each). It is argued that students’ scarcity of time is a possible underlying cause of much non-use and the study suggests some ways in which long-term corpus take-up could be increased.

Applied Corpus Linguistics, vol. 2, no. 3, Article 100032. Citations: 2
Usable Amharic text corpus for natural language processing applications
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100033
Michael Melese Woldeyohannis, Million Meshesha

In this paper, we describe the preparation of a usable Amharic text corpus for different Natural Language Processing (NLP) applications. Natural language applications, such as document classification, topic modeling, machine translation, speech recognition, and others, suffer greatly from a lack of digital resources. This is especially true for Amharic, a resource-constrained, morphologically rich, and complex language. In response to this, a total of 67,739 Amharic news documents in 8 different categories were collected from online sources. The collected corpus passes through a number of pre-processing steps, including data cleaning, text normalization, and punctuation correction. To validate the usability of the collected corpora from different domains, a baseline document classification experiment was conducted. Experimental results show that an accuracy of 84.53% is achieved using deep learning in the absence of linguistic information. The findings indicate that it is possible to use the prepared corpora for different natural language applications in the absence of linguistic resources such as a stemmer and dictionary, despite the complexity of the Amharic language. We are further working towards Amharic news document classification by incorporating language-independent stop-word detection, stemming, and unsupervised morphological segmentation of Amharic documents.
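
The abstract names three pre-processing steps (data cleaning, text normalization, punctuation correction) but does not say how each was carried out. The following is only a minimal sketch of what such a pipeline might look like. The HTML and URL stripping, the small homophone map, and the rule that turns a doubled Ethiopic wordspace into a full stop are all assumptions made for illustration, not the authors' procedure, and every function name here is hypothetical.

```python
import re
import unicodedata

# Assumed, deliberately tiny homophone map (base forms only); a real
# normalizer would need to cover every vowel order of each character.
HOMOPHONES = str.maketrans({"ሐ": "ሀ", "ኀ": "ሀ", "ሠ": "ሰ", "ዐ": "አ", "ፀ": "ጸ"})


def clean(text: str) -> str:
    """Data cleaning: strip HTML tags and URLs left over from web scraping."""
    text = re.sub(r"<[^>]+>", " ", text)
    return re.sub(r"https?://\S+", " ", text)


def normalize(text: str) -> str:
    """Text normalization: NFC form, homophone folding, whitespace collapse."""
    text = unicodedata.normalize("NFC", text)
    text = text.translate(HOMOPHONES)
    return re.sub(r"\s+", " ", text).strip()


def fix_punctuation(text: str) -> str:
    """Punctuation correction: doubled wordspace to full stop, no space before marks."""
    text = text.replace("፡፡", "።")
    return re.sub(r"\s+([።፣፤])", r"\1", text)


def preprocess(text: str) -> str:
    """Run the three steps in the order the abstract lists them."""
    return fix_punctuation(normalize(clean(text)))
```

Calling `preprocess` on a scraped string such as `"<b>ሠላም</b> ፡፡"` would run the three steps in sequence; the order matters here because whitespace is collapsed before the punctuation rules are applied.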

Applied Corpus Linguistics, vol. 2, no. 3, Article 100033. Citations: 0
Replication as a means of assessing corpus representativeness and the generalizability of specialized word lists
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100027
Don Miller

Considerable energy has gone into designing lists of words that are salient in discourse domains of varying breadth. Over the past two decades, most efforts in designing and validating corpus-based frequency lists have focused on three areas: corpus compilation, item selection criteria, and coverage-based demonstrations of list robustness. As a result, modern corpora are now often much larger and better balanced; the application of additional dispersion statistics allows for better targeting of items with desired distributions; and contemporary lexical frequency lists are proving increasingly efficient, providing ever higher coverage of target texts or achieving such coverage with fewer words. However, despite these important advances, relatively minimal attention has been paid to word list reliability—the extent to which lists can be generalized to the wider discourse domain that has been represented by the corpora upon which they are based. This study begins to address this gap, demonstrating via two word list development case studies (one for Environmental Science and one for Applied Linguistics) that adding iterative reliability analysis—via methodological replication with corpora of increasing size and comparison of items on resulting lists—can be used to: 1) inform corpus design beyond what Biber (1991) terms “situational” parameters, allowing us to see whether corpora are adequately representative of lexical distributions in target discourse domains; and 2) provide valuable insight into the degree of generalizability of word lists we have developed.
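
The iterative reliability analysis described above (rebuilding a word list from corpora of increasing size and comparing the items on the resulting lists) can be sketched roughly as follows. This is an illustration under stated assumptions, not the study's method: the abstract does not give the item selection criteria, so the sketch ranks by raw frequency only, and it uses Jaccard overlap as one possible comparison measure. The names `word_list`, `overlap`, and `stability_curve` are hypothetical.

```python
from collections import Counter


def word_list(texts, top_n):
    """Build a frequency-ranked word list from whitespace-tokenized texts."""
    counts = Counter(w for t in texts for w in t.lower().split())
    # Sort by descending frequency, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {w for w, _ in ranked[:top_n]}


def overlap(a, b):
    """Jaccard overlap between two word lists (1.0 means identical lists)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def stability_curve(texts, sizes, top_n):
    """Overlap between lists built from successively larger slices of the corpus."""
    lists = [word_list(texts[:n], top_n) for n in sizes]
    return [overlap(x, y) for x, y in zip(lists, lists[1:])]
```

If the overlap between successive lists stays high as the corpus grows, that would suggest the list is stabilizing; persistently low overlap would suggest the corpus is not yet large enough to represent lexical distributions in the target domain.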

Applied Corpus Linguistics, vol. 2, no. 3, Article 100027. Citations: 1
Corpus-aided EAP writing workshops to support international scholarly publication
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100029
Ana Frankenberg-Garcia, Paula Tavares Pinto, Ana Eliza Pereira Bocorny, Simone Sarmento

Writing for international scholarly publication is hard, and arguably harder for researchers with English as an additional language. English teachers could help them, but most teachers have little or no experience of research writing or the specialized languages researchers use. This study trialled and evaluated workshops for Brazilian researchers and English teachers learning together to use corpora and corpus-based tools to develop autonomy in writing and teaching academic English writing for scholarly publication.

Applied Corpus Linguistics, vol. 2, no. 3, Article 100029. Citations: 1
Review of Durrant, Brenchley, and McCallum (2021) Understanding development and proficiency in writing: Quantitative corpus linguistic approaches
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100024
Ashleigh Cox
Applied Corpus Linguistics, vol. 2, no. 3, Article 100024. Citations: 1
Teaching, learning, and researching with corpora
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100025
Tove Larsson, Shelley Staples, Jesse Egbert
Applied Corpus Linguistics, vol. 2, no. 3, Article 100025. Citations: 0
Principled pattern curation to guide data-driven learning design
Pub Date : 2022-12-01 DOI: 10.1016/j.acorp.2022.100028
Anne O'Keeffe, Geraldine Mark

Insights from corpus linguistics (CL) have informed language learning and materials design, among many other areas. An important nexus between CL and language learning is the use of Data-Driven Learning (DDL), which draws on the use of corpus data in the classroom and which brings opportunities for inductive language discovery.

Within the ethos of DDL, learners are encouraged to discover patterns of language and, in so doing, foster more complex cognitive processes such as making inferences. While many studies on DDL concur on the success of this approach, it is still perceived as a marginal practice. Its success so far has been largely limited to intermediate to advanced level learners in higher education settings (Boulton and Cobb 2017). This paper aims to offer guiding principles for how DDL might have wider application across all levels (not just at Intermediate and above) and to set out exemplars for their application at different levels of proficiency. Based on insights from second language acquisition (SLA) and learner corpus research (LCR), the focus of this paper will be on identifying principles for the curation of language patterns that are differentiated for stage of learning. In particular, we are keen to build on recent and important work which looks at SLA through the lens of the usage-based (UB) models (that is, models that view language as being acquired through the use of and exposure to language).

Applied Corpus Linguistics, vol. 2, no. 3, Article 100028. Citations: 3