首页 > 最新文献

Corpus Linguistics and Linguistic Theory最新文献

英文 中文
Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas 学习者语料库研究:批判性评估和路线图,为语言学习服务(SLA)研究议程做出(更多)贡献
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-05-13 DOI: 10.1515/cllt-2024-0014
Magali Paquot
Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.
在过去的十年中,学习者语料库作为第二语言习得(SLA)研究中的宝贵数据源已得到认可。这一发展可归功于学习者语料库研究(LCR)的重大进展。然而,仍有大量工作要做。本文强调了保持学习者语料库在 SLA 中的相关性所必须解决的关键问题。尤其是,我着重强调了对更多样化的学习者语料库的需求,强调了详细元数据的重要性,并提倡多因素研究设计。然后,我重温了目前关于母语使用者在 LCR 中的作用的争论,并提出了解决这一棘手问题的切实可行的方案。最后,我还重新讨论了改进定量方法和统计的必要性,并认为强有力的定量分析的重要性怎么强调都不为过。总之,我设想了一个雄心勃勃的学习者语料库编纂项目,该项目将遵循 FAIR 原则,目标是进一步提高 LCR 的研究质量。
{"title":"Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas","authors":"Magali Paquot","doi":"10.1515/cllt-2024-0014","DOIUrl":"https://doi.org/10.1515/cllt-2024-0014","url":null,"abstract":"Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus linguistics and the social sciences 语料库语言学与社会科学
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-25 DOI: 10.1515/cllt-2024-0036
Tony McEnery, Gavin Brookes
Corpus linguistics, with its methodological orientation towards the empirical analysis of language based on large text collections, has the potential to offer significant tools for addressing real-world problems across various social science domains, including climate change, criminology, healthcare and policy making. Despite this potential, the integration of corpus linguistics into social science disciplines (beyond linguistics) remains hampered by fundamental differences in epistemology, definitions and methodological approaches. This article explores the relationship between corpus linguistics and the social sciences. It is argued that epistemology, or the theory of knowledge, represents a primary barrier to integration, with much corpus linguistics research aligning with positivist and naturalist epistemologies. By contrast, many social science disciplines embrace more interpretive, conventionalist approaches that account for the dynamic nature of social phenomena. Considering the role of naturalism and conventionalism within both corpus linguistics and the social sciences, this article illustrates how these epistemological stances are likely to influence the acceptance and use of corpus methods in social science research. Despite the challenges, areas of convergence (e.g. shared use of data processing tools and the acknowledgement of the central role of language in social processes) provide opportunities for cross-disciplinary collaboration. As means to bridge the epistemological divide, this article advocates for a critical realist approach and concludes by calling on users of corpus linguistic methods to be reflexive and transparent about their epistemological stances when reporting their research.
语料库语言学在方法论上以基于大型文本集的语言实证分析为导向,有可能为解决气候变化、犯罪学、医疗保健和政策制定等各种社会科学领域的现实问题提供重要工具。尽管有这样的潜力,语料库语言学与社会科学学科(除语言学之外)的整合仍然受到认识论、定义和方法论上的根本差异的阻碍。本文探讨语料库语言学与社会科学之间的关系。文章认为,认识论或知识理论是融合的主要障碍,许多语料库语言学研究与实证主义和自然主义认识论相一致。与此相反,许多社会科学学科则采用解释性更强的传统主义方法来解释社会现象的动态性质。考虑到自然主义和传统主义在语料库语言学和社会科学中的作用,本文阐述了这些认识论立场可能如何影响社会科学研究对语料库方法的接受和使用。尽管存在挑战,但有一些共同点(如共同使用数据处理工具和承认语言在社会进程中的核心作用)为跨学科合作提供了机会。作为弥合认识论鸿沟的手段,本文提倡批判现实主义方法,并在最后呼吁语料库语言学方法的使用者在报告其研究时对其认识论立场保持自省和透明。
{"title":"Corpus linguistics and the social sciences","authors":"Tony McEnery, Gavin Brookes","doi":"10.1515/cllt-2024-0036","DOIUrl":"https://doi.org/10.1515/cllt-2024-0036","url":null,"abstract":"\u0000 Corpus linguistics, with its methodological orientation towards the empirical analysis of language based on large text collections, has the potential to offer significant tools for addressing real-world problems across various social science domains, including climate change, criminology, healthcare and policy making. Despite this potential, the integration of corpus linguistics into social science disciplines (beyond linguistics) remains hampered by fundamental differences in epistemology, definitions and methodological approaches. This article explores the relationship between corpus linguistics and the social sciences. It is argued that epistemology, or the theory of knowledge, represents a primary barrier to integration, with much corpus linguistics research aligning with positivist and naturalist epistemologies. By contrast, many social science disciplines embrace more interpretive, conventionalist approaches that account for the dynamic nature of social phenomena. Considering the role of naturalism and conventionalism within both corpus linguistics and the social sciences, this article illustrates how these epistemological stances are likely to influence the acceptance and use of corpus methods in social science research. Despite the challenges, areas of convergence (e.g. shared use of data processing tools and the acknowledgement of the central role of language in social processes) provide opportunities for cross-disciplinary collaboration. As means to bridge the epistemological divide, this article advocates for a critical realist approach and concludes by calling on users of corpus linguistic methods to be reflexive and transparent about their epistemological stances when reporting their research.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140656127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis 科学文章中长名词复合词的分布特性:基于均匀信息密度假说的研究
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0028
John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen
Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.
名词性化合物是科学文章中常用的一种结构。尽管它们很常见,但人们对它们在科学文章中的分布却知之甚少。根据 "均匀信息密度假说"(Uniform Information Density hypothesis),即说话者以恒定的速度传递信息,避免信息传递的高峰和低谷,我们预测名词性复词应集中在科技文章的末尾,在其前面有有助于理解的辅助文字,并在首次使用后经常重复出现。在本文中,我们通过对生物学、经济学和语言学领域的科学论文语料库进行定量和定性分析,对上述预测进行了研究。虽然我们的调查没有为上述第一和第三项预测揭示明确的结论,但却为第二项预测提供了支持性证据,从而推进了我们对数控系统使用和说话者在传递信息时所作选择的理解。
{"title":"The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis","authors":"John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen","doi":"10.1515/cllt-2023-0028","DOIUrl":"https://doi.org/10.1515/cllt-2023-0028","url":null,"abstract":"Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus-based discourse analysis: from meta-reflection to accountability 基于语料库的话语分析:从元反思到问责制
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0104
Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee
Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.
近年来,在基于语料库的话语分析中,对数据和方法的反思日益增多。在本文中,我们首先总结了此类反思中出现的一些问题(包括三角测量、客观性/主体性、复制、透明度、反身性、一致性等概念)。然后,我们引入一个新的 "问责制 "框架,用于基于语料库的话语分析(或许还包括其他分析)。我们将这种问责制概念化为一种多层面现象,涵盖研究过程的各个方面。在本文的第二部分,我们将这一框架与一项新的跨机构倡议--澳大利亚文本分析平台(ATAP)--联系起来,该倡议旨在解决框架中的一小部分问题,即通过 Jupyter 笔记本实现分析的透明化。我们将引文工具作为 ATAP 笔记本的范例进行介绍,该笔记本与基于语料库的话语分析尤为相关。我们将思考该笔记本如何促进分析透明度方面的问责制,并使用一组不同的语料库说明其主要应用。
{"title":"Corpus-based discourse analysis: from meta-reflection to accountability","authors":"Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee","doi":"10.1515/cllt-2023-0104","DOIUrl":"https://doi.org/10.1515/cllt-2023-0104","url":null,"abstract":"Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The counting principle makes number words unique 计数原理让数词独一无二
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-29 DOI: 10.1515/cllt-2023-0105
Mira Ariel, Natalia Levshina
Following Ariel (2021. Why it’s hard to construct ad hoc number concepts. In Caterina Mauri, Ilaria Fiorentini, & Eugenio Goria (eds.), Building categories in interaction: Linguistic resources at work, 439–462. Amsterdam: John Benjamins), we argue that number words manifest distinct distributional patterns from open-class lexical items. When modified, open-class words typically take selectors (as in kinda table), which select a subset of their potential denotations (e.g., “nonprototypical table”). They are typically not modified by loosening operators (e.g., approximately), since even if bare, typical lexemes can broaden their interpretation (e.g., table referring to a rock used as a table). Number words, on the other hand, have a single, precise meaning and denotation and cannot take a selector, which would need to select a subset of their (single) denotation (??kinda seven). However, they are often overtly broadened (approximately seven), creating a range of values around N. First, we extend Ariel’s empirical examination to the larger COCA and to Hebrew (HeTenTen). Second, we propose that open-class and number words belong to sparse versus dense lexical domains, respectively, because the former exhibit prototypicality effects, but the latter do not. Third, we further support the contrast between sparse and dense domains by reference to: synchronic word2vec models of sparse and dense lexemes, which testify to their differential distributions, numeral use in noncounting communities, and different renewal rates for the two lexical types.
继阿里尔(2021.为什么难以构建特设数字概念?见 Caterina Mauri, Ilaria Fiorentini, & Eugenio Goria (eds.), Building categories in interaction:Linguistic resources at work, 439-462.阿姆斯特丹:John Benjamins),我们认为数词表现出与开放类词项不同的分布模式。当被修改时,开放类词汇通常会使用选择器(如 kinda table),选择其潜在指称的一个子集(如 "非原型表")。它们通常不会被松散运算符(如 "大约")修饰,因为即使是裸词,典型词素也可以扩大它们的释义范围(如 "桌子 "指的是用作桌子的石头)。首先,我们将 Ariel 的实证研究扩展到更大的 COCA 和希伯来语(HeTenTen)。其次,我们提出开放类词和数字词分别属于稀疏词域和密集词域,因为前者表现出原型效应,而后者则没有。第三,我们进一步支持稀疏词域和密集词域之间的对比,我们参考了稀疏词域和密集词域的同步 word2vec 模型,这些模型证明了稀疏词域和密集词域的不同分布,数字词在非计数社区中的使用,以及这两类词的不同更新率。
{"title":"The counting principle makes number words unique","authors":"Mira Ariel, Natalia Levshina","doi":"10.1515/cllt-2023-0105","DOIUrl":"https://doi.org/10.1515/cllt-2023-0105","url":null,"abstract":"\u0000 Following Ariel (2021. Why it’s hard to construct ad hoc number concepts. In Caterina Mauri, Ilaria Fiorentini, & Eugenio Goria (eds.), Building categories in interaction: Linguistic resources at work, 439–462. Amsterdam: John Benjamins), we argue that number words manifest distinct distributional patterns from open-class lexical items. When modified, open-class words typically take selectors (as in kinda table), which select a subset of their potential denotations (e.g., “nonprototypical table”). They are typically not modified by loosening operators (e.g., approximately), since even if bare, typical lexemes can broaden their interpretation (e.g., table referring to a rock used as a table). Number words, on the other hand, have a single, precise meaning and denotation and cannot take a selector, which would need to select a subset of their (single) denotation (??kinda seven). However, they are often overtly broadened (approximately seven), creating a range of values around N. First, we extend Ariel’s empirical examination to the larger COCA and to Hebrew (HeTenTen). Second, we propose that open-class and number words belong to sparse versus dense lexical domains, respectively, because the former exhibit prototypicality effects, but the latter do not. Third, we further support the contrast between sparse and dense domains by reference to: synchronic word2vec models of sparse and dense lexemes, which testify to their differential distributions, numeral use in noncounting communities, and different renewal rates for the two lexical types.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140365928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A collostructional approach to Japanese noun-modifying clause construction use and acquisition: a learner corpus study 日语名词修饰从句结构使用和习得的共构方法:学习者语料库研究
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-22 DOI: 10.1515/cllt-2024-0020
Nicole C. De Los Reyes, Ute Römer-Barron
Japanese features a general noun-modifying clause construction (NMCC) with a more versatile range of semantic and pragmatic interpretations than equivalent constructions in other languages. Motivated by the learning challenge NMCCs pose to Japanese as a foreign language (JFL) learners, this article examines speech data from the International Corpus of Japanese as a Second Language (I-JAS) to compare learner use of NMCCs against a large L1 Japanese corpus. Instances of the construction from both corpora were analyzed to identify high-frequency part-of-speech categories and subcategories in the modifying clause predicate and head noun slots. A simple collexeme analysis was then employed to identify strongly attracted and repelled lexical items among those identified in realizations of the construction. Taken together, findings from these analyses revealed an important connection between the semantic weight of head nouns in NMCCs and the idiomaticity of the construction, with learner productions demonstrating a tendency toward heavy head nouns. This study lays the groundwork for future research seeking to explore the NMCC at different levels of granularity and to improve its treatment in JFL pedagogical materials.
日语中的一般名词修饰从句结构(NMCC)与其他语言中的同等结构相比,具有更多的语义和语用解释。由于 NMCC 给日语作为外语(JFL)的学习者带来了学习上的挑战,本文研究了日语作为第二语言的国际语料库(I-JAS)中的语音数据,将学习者使用 NMCC 的情况与大型 L1 日语语料库进行了比较。文章分析了两个语料库中的非修饰性从句结构实例,以确定修饰性从句谓语和头名词位置中的高频语篇类别和子类别。然后,我们采用简单的同义词分析,在该结构的实现过程中识别出强烈吸引和排斥的词项。综合来看,这些分析结果表明,NMCCs 中头名词的语义量与结构的习语性之间存在重要联系,学习者的作品倾向于使用重头名词。本研究为今后的研究奠定了基础,以便在不同的粒度水平上探索 NMCC,并改进 JFL 教学材料中对 NMCC 的处理。
{"title":"A collostructional approach to Japanese noun-modifying clause construction use and acquisition: a learner corpus study","authors":"Nicole C. De Los Reyes, Ute Römer-Barron","doi":"10.1515/cllt-2024-0020","DOIUrl":"https://doi.org/10.1515/cllt-2024-0020","url":null,"abstract":"Japanese features a general noun-modifying clause construction (NMCC) with a more versatile range of semantic and pragmatic interpretations than equivalent constructions in other languages. Motivated by the learning challenge NMCCs pose to Japanese as a foreign language (JFL) learners, this article examines speech data from the International Corpus of Japanese as a Second Language (I-JAS) to compare learner use of NMCCs against a large L1 Japanese corpus. Instances of the construction from both corpora were analyzed to identify high-frequency part-of-speech categories and subcategories in the modifying clause predicate and head noun slots. A simple collexeme analysis was then employed to identify strongly attracted and repelled lexical items among those identified in realizations of the construction. Taken together, findings from these analyses revealed an important connection between the semantic weight of head nouns in NMCCs and the idiomaticity of the construction, with learner productions demonstrating a tendency toward heavy head nouns. This study lays the groundwork for future research seeking to explore the NMCC at different levels of granularity and to improve its treatment in JFL pedagogical materials.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140196917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus linguistics meets historical linguistics and construction grammar: how far have we come, and where do we go from here? 语料库语言学与历史语言学和结构语法的结合:我们走了多远,又将何去何从?
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-22 DOI: 10.1515/cllt-2024-0009
Martin Hilpert
This paper aims to give an overview of corpus-based research that investigates processes of language change from the theoretical perspective of Construction Grammar. Starting in the early 2000s, a dynamic community of researchers has come together in order to contribute to this effort. Among the different lines of work that have characterized this enterprise, this paper discusses the respective roles of qualitative approaches, diachronic collostructional analysis, multivariate techniques, distributional semantic models, and analyses of network structure. The paper tries to contextualize these approaches and to offer pointers for future research.
本文旨在从构式语法的理论视角出发,概述基于语料库的语言变化过程研究。从 2000 年代初开始,一个充满活力的研究群体汇聚在一起,为这项工作做出了贡献。在这项事业的不同研究方向中,本文讨论了定性研究方法、对时构词法分析、多元技术、分布语义模型和网络结构分析各自的作用。本文试图对这些方法进行背景分析,并为今后的研究提供参考。
{"title":"Corpus linguistics meets historical linguistics and construction grammar: how far have we come, and where do we go from here?","authors":"Martin Hilpert","doi":"10.1515/cllt-2024-0009","DOIUrl":"https://doi.org/10.1515/cllt-2024-0009","url":null,"abstract":"This paper aims to give an overview of corpus-based research that investigates processes of language change from the theoretical perspective of Construction Grammar. Starting in the early 2000s, a dynamic community of researchers has come together in order to contribute to this effort. Among the different lines of work that have characterized this enterprise, this paper discusses the respective roles of qualitative approaches, diachronic collostructional analysis, multivariate techniques, distributional semantic models, and analyses of network structure. The paper tries to contextualize these approaches and to offer pointers for future research.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140196830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer of collostructions: the case of causative constructions 同位语结构的转移:因果关系结构的案例
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-19 DOI: 10.1515/cllt-2024-0023
Gaëtanelle Gilquin
In an attempt to identify possible cases of collostructional transfer in the use of the causative construction [X make Y Vinf] by French-speaking learners of English, two types of analyses are combined in this study. First, a contrastive collostructional analysis compares the verbs occurring in the [Vinf] slot of the English construction and its French equivalent, [X faire Vinf Y]. Second, a contrastive interlanguage collostructional analysis compares the verbs used in the [Vinf] slot of [X make Y Vinf] by native speakers of English, French-speaking learners of English and learners of English from other mother tongue backgrounds. The aim is to identify verbs that are more distinctive of [X faire Vinf Y] than of [X make Y Vinf] and that are also more likely to be used by French-speaking learners of English than by other populations, as these verbs could be potential cases of collostructional preferences transferred by learners from French to English. The results suggest that learners might transfer verbs expressing a change of state or location and some individual verbs like discover from the French to the English causative construction. Their dispreference for copular verbs (other than be) could also be the result of transfer effects.
为了确定法语英语学习者在使用因果结构[X make Y Vinf]时可能出现的同位语结构转换情况,本研究结合了两种类型的分析。首先,对比性对位分析比较了出现在英语结构[Vinf]槽中的动词及其法语对等结构[X faire Vinf Y]。其次,对比性语际搭配分析比较了英语母语者、法语英语学习者和其他母语背景的英语学习者在[X make Y Vinf]的[Vinf]槽中使用的动词。目的是找出[X faire Vinf Y]比[X make Y Vinf]更独特的动词,而且法语英语学习者比其他人群更有可能使用这些动词,因为这些动词可能是学习者从法语转移到英语的同构偏好的潜在案例。研究结果表明,学习者可能会将表示状态或位置变化的动词以及一些单个动词(如 "发现")从法语因果结构转移到英语因果结构中。他们对共轭动词(be 除外)的偏爱也可能是迁移效应的结果。
{"title":"Transfer of collostructions: the case of causative constructions","authors":"Gaëtanelle Gilquin","doi":"10.1515/cllt-2024-0023","DOIUrl":"https://doi.org/10.1515/cllt-2024-0023","url":null,"abstract":"In an attempt to identify possible cases of collostructional transfer in the use of the causative construction [X <jats:sc> <jats:italic>make</jats:italic> </jats:sc> Y V<jats:sub>inf</jats:sub>] by French-speaking learners of English, two types of analyses are combined in this study. First, a contrastive collostructional analysis compares the verbs occurring in the [V<jats:sub>inf</jats:sub>] slot of the English construction and its French equivalent, [X <jats:sc> <jats:italic>faire</jats:italic> </jats:sc> V<jats:sub>inf</jats:sub> Y]. Second, a contrastive interlanguage collostructional analysis compares the verbs used in the [V<jats:sub>inf</jats:sub>] slot of [X <jats:sc> <jats:italic>make</jats:italic> </jats:sc> Y V<jats:sub>inf</jats:sub>] by native speakers of English, French-speaking learners of English and learners of English from other mother tongue backgrounds. The aim is to identify verbs that are more distinctive of [X <jats:sc> <jats:italic>faire</jats:italic> </jats:sc> V<jats:sub>inf</jats:sub> Y] than of [X <jats:sc> <jats:italic>make</jats:italic> </jats:sc> Y V<jats:sub>inf</jats:sub>] and that are also more likely to be used by French-speaking learners of English than by other populations, as these verbs could be potential cases of collostructional preferences transferred by learners from French to English. The results suggest that learners might transfer verbs expressing a change of state or location and some individual verbs like <jats:italic>discover</jats:italic> from the French to the English causative construction. Their dispreference for copular verbs (other than <jats:italic>be</jats:italic>) could also be the result of transfer effects.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140167239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting N waiting to happen: word, construction, and corpus choices in a collostructional analysis 重新审视 "等待发生的 N":共结构分析中的词语、结构和语料选择
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-11 DOI: 10.1515/cllt-2024-0019
John Newman
In undertaking any collostructional analysis, a researcher must make decisions concerning the properties of words, constructions, and corpora. Each of these crucial aspects of the analysis can be dealt with in alternative ways: words can be investigated as either lemmas or inflected forms; a construction can be characterized in alternative ways (reliance on semantics or syntax or some combination thereof, the span of the construction, etc.); the choice of corpus (or corpora) will be influenced by whether a researcher has an interest in different genres and varieties, whether the study is synchronic or diachronic, etc. I review various ways in which a researcher’s decisions about words, constructions, and corpora are relevant to a corpus-based study of N waiting to happen, referencing throughout the collostructional analysis of this construction by Stefanowitsch and Gries. The approach adopted here can be seen as supplementing Stefanowitsch and Gries’ original collostructional analysis. It illustrates how multifarious the results of a corpus-based study of constructions can be and serves as a reminder that no one corpus-based measure can possibly answer all the questions linguists might reasonably ask about a construction.
在进行任何共结构分析时,研究人员都必须就词语、结构和语料的属性做出决定。分析中的每一个关键方面都可以用不同的方式来处理:词可以作为词素或转折形式来研究;构式可以用不同的方式来描述(依赖语义或句法或它们的某种组合、构式的跨度等);语料库(或多个语料库)的选择会受到研究者是否对不同体裁和变体感兴趣、研究是同步的还是异步的等因素的影响。我回顾了研究者对词语、构式和语料库的决定与基于语料库的 "N waiting to happen "研究相关的各种方式,并在整个过程中参考了 Stefanowitsch 和 Gries 对这种构式进行的构式分析。这里采用的方法可以看作是对 Stefanowitsch 和 Gries 最初的拼合结构分析的补充。它说明了以语料库为基础的构式研究结果是多么的丰富多彩,同时也提醒我们,没有一种以语料库为基础的测量方法可以回答语言学家可能会对构式提出的所有合理问题。
{"title":"Revisiting N waiting to happen: word, construction, and corpus choices in a collostructional analysis","authors":"John Newman","doi":"10.1515/cllt-2024-0019","DOIUrl":"https://doi.org/10.1515/cllt-2024-0019","url":null,"abstract":"In undertaking any collostructional analysis, a researcher must make decisions concerning the properties of words, constructions, and corpora. Each of these crucial aspects of the analysis can be dealt with in alternative ways: words can be investigated as either lemmas or inflected forms; a construction can be characterized in alternative ways (reliance on semantics or syntax or some combination thereof, the span of the construction, etc.); the choice of corpus (or corpora) will be influenced by whether a researcher has an interest in different genres and varieties, whether the study is synchronic or diachronic, etc. I review various ways in which a researcher’s decisions about words, constructions, and corpora are relevant to a corpus-based study of N <jats:italic>waiting to happen</jats:italic>, referencing throughout the collostructional analysis of this construction by Stefanowitsch and Gries. The approach adopted here can be seen as supplementing Stefanowitsch and Gries’ original collostructional analysis. It illustrates how multifarious the results of a corpus-based study of constructions can be and serves as a reminder that no one corpus-based measure can possibly answer all the questions linguists might reasonably ask about a construction.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140116832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Well, maybe you shouldn’t go around shaving poodles: collostructional semantic and discursive prosody in the go (a)round Ving and go (a)round and V constructions 好吧,也许你不应该到处给贵宾犬剃毛:go (a)round Ving 和 go (a)round and V 结构中的拼合语义和话语前体
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-03-07 DOI: 10.1515/cllt-2024-0018
Kim Ebensgaard Jensen
This article presents a corpus-based study of the go (a)round Ving- and go (a)round and V-constructions in American English. More specifically, it addresses the possibility of the constructions serving as pragmatic markers of stance through the collocational phenomenon of semantic prosody. It is argued that the notions of internal and external constructional properties from the early days of construction grammar as well as the corpus-linguistic idea of association patterns would be beneficial to usage-based construction grammatical descriptions of phenomena such as semantic prosody. Drawing on a 248,145,425-word portion of the Corpus of Contemporary American English, both simple collexeme analysis and distinctive collexeme analysis are applied to generate output that feeds into semantic-prosodic analysis. Moreover, standard distinctive collexeme analysis and multiple distinctive collexeme analysis are applied at the level of semantic prosodies in the collexemic fields (i.e., distinctive semantic-prosodic analysis), at the level of verbal category colligations (i.e., distinctive colligational analysis), and at the level of speech act functions of usage-events of the two constructions (i.e., distinctive speech act analysis) as a type of trial balloon. The purpose is to expand semantic-prosodic analysis from focusing merely on lexemes to exploring how other linguistic and pragmatic phenomena may be at play.
本文以语料库为基础,对美国英语中的 go (a)round Ving- 和 go (a)round and V- 结构进行了研究。更具体地说,文章探讨了这些构式通过语义前置的搭配现象作为语用标记的可能性。该研究认为,早期构式语法中的内部和外部构式属性概念以及语料库语言学中的关联模式概念将有利于基于用法的构式语法对语义拟声等现象的描述。利用《当代美国英语语料库》(Corpus of Contemporary American English)中的 248,145,425 个单词,简单的词组分析和独特的词组分析都被应用到了语义拟声分析中。此外,作为一种试验气球,在语义前体分析(即独特的语义前体分析)、动词类别搭配分析(即独特的搭配分析)和两个结构的用法事件的言语行为功能分析(即独特的言语行为分析)层面上,还应用了标准的独特词组分析和多重独特词组分析。其目的是将语义-韵律分析从仅仅关注词素扩展到探索其他语言和语用现象如何发挥作用。
{"title":"Well, maybe you shouldn’t go around shaving poodles: collostructional semantic and discursive prosody in the go (a)round Ving and go (a)round and V constructions","authors":"Kim Ebensgaard Jensen","doi":"10.1515/cllt-2024-0018","DOIUrl":"https://doi.org/10.1515/cllt-2024-0018","url":null,"abstract":"\u0000 This article presents a corpus-based study of the go (a)round Ving- and go (a)round and V-constructions in American English. More specifically, it addresses the possibility of the constructions serving as pragmatic markers of stance through the collocational phenomenon of semantic prosody. It is argued that the notions of internal and external constructional properties from the early days of construction grammar as well as the corpus-linguistic idea of association patterns would be beneficial to usage-based construction grammatical descriptions of phenomena such as semantic prosody. Drawing on a 248,145,425-word portion of the Corpus of Contemporary American English, both simple collexeme analysis and distinctive collexeme analysis are applied to generate output that feeds into semantic-prosodic analysis. Moreover, standard distinctive collexeme analysis and multiple distinctive collexeme analysis are applied at the level of semantic prosodies in the collexemic fields (i.e., distinctive semantic-prosodic analysis), at the level of verbal category colligations (i.e., distinctive colligational analysis), and at the level of speech act functions of usage-events of the two constructions (i.e., distinctive speech act analysis) as a type of trial balloon. The purpose is to expand semantic-prosodic analysis from focusing merely on lexemes to exploring how other linguistic and pragmatic phenomena may be at play.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140077122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Corpus Linguistics and Linguistic Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1