首页 > 最新文献

Corpus Linguistics and Linguistic Theory最新文献

英文 中文
Expressing smells in (American) English 用(美式)英语表达气味
IF 1 2区 文学 N/A LANGUAGE & LINGUISTICS Pub Date : 2024-07-16 DOI: 10.1515/cllt-2024-0055
D. Schönefeld
The paper reports on a study of the usage of smell verbs over the last 200 years by speakers of American English. The focus is on how the expression of smell changes over time and what this reveals about the way speakers conceptualize and assess smells. The study is based on usage data from the COHA (Corpus of Historical American English). They were quantitatively analysed employing the methods of simple and (multiple) distinctive collexeme analysis. The results of our investigations indicate both a general increase over time in the usage of smell-verb constructions and a noticeable diversification of the smell vocabulary used by American English speakers. Moreover, the results of the collexeme analyses reveal more detailed aspects of the types of smell descriptors people use in smell talk. Reflecting what kinds of smell emitters are most typically and especially closely associated with the individual smell-verb constructions at particular times, they are informative about the sources of smells that are salient enough in our culture and (well-)known enough in the speech community to be used as functional smell descriptors and how these may change over time.
{"title":"Expressing smells in (American) English","authors":"D. Schönefeld","doi":"10.1515/cllt-2024-0055","DOIUrl":"https://doi.org/10.1515/cllt-2024-0055","url":null,"abstract":"\u0000 The paper reports on a study of the usage of smell verbs over the last 200 years by speakers of American English. The focus is on how the expression of smell changes over time and what this reveals about the way speakers conceptualize and assess smells. The study is based on usage data from the COHA (Corpus of Historical American English). They were quantitatively analysed employing the methods of simple and (multiple) distinctive collexeme analysis. The results of our investigations indicate both a general increase over time in the usage of smell-verb constructions and a noticeable diversification of the smell vocabulary used by American English speakers. Moreover, the results of the collexeme analyses reveal more detailed aspects of the types of smell descriptors people use in smell talk. Reflecting what kinds of smell emitters are most typically and especially closely associated with the individual smell-verb constructions at particular times, they are informative about the sources of smells that are salient enough in our culture and (well-)known enough in the speech community to be used as functional smell descriptors and how these may change over time.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141643396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A radically usage-based, collostructional approach to assessing the differences between negative modal contractions and their parent forms 从根本上以用法为基础,采用同构方法来评估否定语气缩略词与其母语形式之间的差异
IF 1 2区 文学 N/A LANGUAGE & LINGUISTICS Pub Date : 2024-07-16 DOI: 10.1515/cllt-2024-0051
R. Daugs, David Lorenz
Starting from the premise that English negative modal contractions constitute partly variable patterns of associations that include both the preceding subject and the following verb infinitive, the study sets out to investigate distributional differences between can’t, shouldn’t, and won’t and their corresponding uncontracted parent forms. Given that some configurations are assumed to correlate with specific modal meanings (e.g. inanimate subjects and stative verbs > ‘epistemic prediction’; first person subjects > ‘(un)willingness’ or ‘commissive modality’), roughly 200,000 trigrams from COCA are submitted to distinctive covarying collexeme analysis in order to uncover if these contractions and their full forms are conventionalized and entrenched differentially enough to merit their separate treatment on both conceptual and methodological grounds. The results point to probabilistic tendencies, suggesting a cline where won’t and can’t appear to be more emancipated from their respective full-form analogue than shouldn’t. Furthermore, the study showcases how collostructional methods can be applied fruitfully to case studies embedded in Schmid’s (Schmid, Hans-Jörg. 2020. The dynamics of the linguistic system: Usage, conventionalization, and entrenchment. Oxford: Oxford University Press) Entrenchment and Conventionalization Model.
本研究以英语否定式情态缩约构成部分可变的关联模式(包括前面的主语和后面的动词不定式)为前提,着手研究 can't、should't 和 won't 与其相应的未缩约母形式之间的分布差异。鉴于某些构式被认为与特定的情态意义相关(例如,无生命主语和情态动词 > "认识论预测";第一人称主语 > "(不)愿意 "或 "委婉情态"),研究人员将 COCA 中的约 20 万个三元组提交给独特的共变词组分析,以揭示这些缩略词及其完整形式是否被传统化并根深蒂固,以至于在概念和方法论上都值得单独处理。研究结果表明,won't 和 can't 似乎比 shouldn't 更能从各自的全形类似词中解放出来。此外,本研究还展示了如何将同构方法卓有成效地应用于施密德(Schmid, Hans-Jörg.2020.语言系统的动态:使用、常规化和巩固。Oxford:牛津大学出版社)的 "巩固和常规化模式"。
{"title":"A radically usage-based, collostructional approach to assessing the differences between negative modal contractions and their parent forms","authors":"R. Daugs, David Lorenz","doi":"10.1515/cllt-2024-0051","DOIUrl":"https://doi.org/10.1515/cllt-2024-0051","url":null,"abstract":"\u0000 Starting from the premise that English negative modal contractions constitute partly variable patterns of associations that include both the preceding subject and the following verb infinitive, the study sets out to investigate distributional differences between can’t, shouldn’t, and won’t and their corresponding uncontracted parent forms. Given that some configurations are assumed to correlate with specific modal meanings (e.g. inanimate subjects and stative verbs > ‘epistemic prediction’; first person subjects > ‘(un)willingness’ or ‘commissive modality’), roughly 200,000 trigrams from COCA are submitted to distinctive covarying collexeme analysis in order to uncover if these contractions and their full forms are conventionalized and entrenched differentially enough to merit their separate treatment on both conceptual and methodological grounds. The results point to probabilistic tendencies, suggesting a cline where won’t and can’t appear to be more emancipated from their respective full-form analogue than shouldn’t. Furthermore, the study showcases how collostructional methods can be applied fruitfully to case studies embedded in Schmid’s (Schmid, Hans-Jörg. 2020. The dynamics of the linguistic system: Usage, conventionalization, and entrenchment. Oxford: Oxford University Press) Entrenchment and Conventionalization Model.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141643362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cognitive and sociolectal constraints on the theme-recipient alternation: evidence from Mandarin 主题-受话人交替的认知和社会选择制约因素:来自普通话的证据
IF 1 2区 文学 N/A LANGUAGE & LINGUISTICS Pub Date : 2024-07-05 DOI: 10.1515/cllt-2023-0127
Yi Li
Abstract We explore the cognitive and sociolectal constraints that probabilistically regulate the theme-recipient (or “dative”) alternation in modern varieties of Mandarin and how these constraints interact with each other. Based on an extensively annotated corpus dataset and regression modeling, we found that the probabilistic grammar that shapes the theme-recipient alternation is fundamentally stable across regional and genre varieties of Mandarin. This general stability notwithstanding, significant variation regarding the importance of cognitive constraints across different sociolectal constraints is detected. Crucially, the analysis revealed that recipient syntactic complexity has a much greater effect in Taiwan Mandarin than in Mainland Mandarin. The effect of theme concreteness is also found to be significantly reduced in telephone conversations compared to broadcast news. Corpus-based findings were cross-validated using a psycholinguistic rating task experiment. While the results of the two approaches demonstrate substantial overlap, they also exhibit diverging patterns at the level of interaction between regional variety and recipient complexity, potentially indicating nuanced differences between the two approaches. The findings provide evidence that interactional patterns between cognitive and sociolectal constraints on probabilistic grammatical alternations may be shared across languages, despite their distinct socio-cultural factors that shape variation in human interaction.
摘要 我们探讨了在现代普通话变体中对主题-受话人(或 "助动词")交替进行概率调节的认知和社会选择制约因素,以及这些制约因素之间是如何相互作用的。基于广泛注释的语料库数据集和回归建模,我们发现形成主题-受话人交替的概率语法在不同地区和体裁的普通话变体中是基本稳定的。尽管具有这种普遍稳定性,但在不同的社会选择制约因素中,认知制约因素的重要性仍存在显著差异。最重要的是,分析表明,受话者句法复杂性在台湾普通话中的影响远远大于在大陆普通话中的影响。此外,还发现与广播新闻相比,电话交谈中主题具体性的影响明显减弱。基于语料库的研究结果通过心理语言学评级任务实验进行了交叉验证。虽然两种方法的结果有很大的重叠,但它们在区域多样性和收件人复杂性之间的交互层面上也表现出不同的模式,这可能表明两种方法之间存在细微差别。这些发现提供了证据,表明尽管不同的社会文化因素决定了人际交往中的差异,但认知和社会选择对概率语法交替的制约之间的互动模式可能是跨语言共享的。
{"title":"Cognitive and sociolectal constraints on the theme-recipient alternation: evidence from Mandarin","authors":"Yi Li","doi":"10.1515/cllt-2023-0127","DOIUrl":"https://doi.org/10.1515/cllt-2023-0127","url":null,"abstract":"Abstract We explore the cognitive and sociolectal constraints that probabilistically regulate the theme-recipient (or “dative”) alternation in modern varieties of Mandarin and how these constraints interact with each other. Based on an extensively annotated corpus dataset and regression modeling, we found that the probabilistic grammar that shapes the theme-recipient alternation is fundamentally stable across regional and genre varieties of Mandarin. This general stability notwithstanding, significant variation regarding the importance of cognitive constraints across different sociolectal constraints is detected. Crucially, the analysis revealed that recipient syntactic complexity has a much greater effect in Taiwan Mandarin than in Mainland Mandarin. The effect of theme concreteness is also found to be significantly reduced in telephone conversations compared to broadcast news. Corpus-based findings were cross-validated using a psycholinguistic rating task experiment. While the results of the two approaches demonstrate substantial overlap, they also exhibit diverging patterns at the level of interaction between regional variety and recipient complexity, potentially indicating nuanced differences between the two approaches. The findings provide evidence that interactional patterns between cognitive and sociolectal constraints on probabilistic grammatical alternations may be shared across languages, despite their distinct socio-cultural factors that shape variation in human interaction.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141673853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLLT ‘versus’ Corp ora and IJCL: a (half serious) keyness analysis CLLT "与 "Corp ora 和 IJCL:(半认真的)关键性分析
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-05-27 DOI: 10.1515/cllt-2024-0050
Stefanie Wulff, Stefan Th. Gries
In this introduction to the special issue celebrating CLLT’s 20th anniversary, we look back and forward in time. To look back, we present the results of a (tongue-in-cheek) corpus-linguistic analysis of about 10 years worth of data of research published in CLLT, IJCL, and Corpora in order to distill the “essence” of CLLT for the reader. As an added bonus, we use the opportunity to discuss ways to improve established ways of performing keyness analyses. To look forward, we asked six (teams of) researchers who all have shaped corpus linguistics and thus the journal to give us their take on what the most significant developments in the field have been, and where they see the most impactful opportunities and challenges arise. This introduction briefly summarizes their contributions.
在这篇庆祝 CLLT 20 周年特刊的导言中,我们回顾过去,展望未来。在回顾过去时,我们介绍了对十年来发表在《CLLT》、《IJCL》和《Corpora》上的研究数据进行语料库语言学分析的结果,以便为读者提炼出 CLLT 的 "精髓"。作为额外的收获,我们还借此机会讨论了如何改进现有的关键性分析方法。为了展望未来,我们邀请了六位(团队)研究人员,他们都曾塑造过语料库语言学,因此也塑造了该期刊,让我们听听他们对该领域最重要发展的看法,以及他们认为最具影响力的机遇和挑战在哪里出现。本引言简要概述了他们的贡献。
{"title":"CLLT ‘versus’ Corp\u0000 ora and IJCL: a (half serious) keyness analysis","authors":"Stefanie Wulff, Stefan Th. Gries","doi":"10.1515/cllt-2024-0050","DOIUrl":"https://doi.org/10.1515/cllt-2024-0050","url":null,"abstract":"\u0000 In this introduction to the special issue celebrating CLLT’s 20th anniversary, we look back and forward in time. To look back, we present the results of a (tongue-in-cheek) corpus-linguistic analysis of about 10 years worth of data of research published in CLLT, IJCL, and Corpora in order to distill the “essence” of CLLT for the reader. As an added bonus, we use the opportunity to discuss ways to improve established ways of performing keyness analyses. To look forward, we asked six (teams of) researchers who all have shaped corpus linguistics and thus the journal to give us their take on what the most significant developments in the field have been, and where they see the most impactful opportunities and challenges arise. This introduction briefly summarizes their contributions.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141098301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer five ways: applications of multiple distinctive collexeme analysis to the dative alternation in Mandarin Chinese 五种转移方式:在普通话的助动词交替中应用多重独特词素分析法
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-05-25 DOI: 10.1515/cllt-2024-0033
Shengyu Liao, Stefan Th. Gries, Stefanie Wulff
The dative alternation has been extensively studied in the world’s languages, and the meanings of the verbs participating in the alternation have been shown to play a key role in determining its argument realization options. The present paper presents a multiple distinctive collexeme analysis approach to the dative alternation in Mandarin Chinese, which involves a choice of one of five functionally similar alternants, and it does so by also discussing several ways to improve how this has been done statistically in most previous analyses. Linguistically, we identify the core semantic differences of the five constructions based on which verbs statistically prefer to occur in which pattern, focusing on semantic potential and direction of transfer. Methodologically, this study contributes to the slowly growing body of studies that use collexeme strengths that are not only less related to frequency than the traditional methods (i.e., association is measured in a less diluted way) and that are directional (i.e., we can focus on one direction of association from the verb to the construction).
对世界语言中的助动词交替进行了广泛的研究,参与交替的动词的意义被证明在决定其论据实现选择方面起着关键作用。普通话中的助词交替涉及从五个功能相似的交替词中选择一个的问题,本文针对普通话中的助词交替提出了一种多重独特的词库分析方法,同时还讨论了改进以往大多数分析中统计方法的几种途径。在语言学上,我们根据哪些动词在统计上更倾向于出现在哪种模式中来确定这五种结构的核心语义差异,重点关注语义潜能和转移方向。在方法论上,本研究为逐渐增多的研究机构做出了贡献,这些研究使用的词组强度不仅与频率的关系不如传统方法(即关联的测量方法不那么稀释),而且具有方向性(即我们可以关注从动词到结构的一个关联方向)。
{"title":"Transfer five ways: applications of multiple distinctive collexeme analysis to the dative alternation in Mandarin Chinese","authors":"Shengyu Liao, Stefan Th. Gries, Stefanie Wulff","doi":"10.1515/cllt-2024-0033","DOIUrl":"https://doi.org/10.1515/cllt-2024-0033","url":null,"abstract":"The dative alternation has been extensively studied in the world’s languages, and the meanings of the verbs participating in the alternation have been shown to play a key role in determining its argument realization options. The present paper presents a multiple distinctive collexeme analysis approach to the dative alternation in Mandarin Chinese, which involves a choice of one of five functionally similar alternants, and it does so by also discussing several ways to improve how this has been done statistically in most previous analyses. Linguistically, we identify the core semantic differences of the five constructions based on which verbs statistically prefer to occur in which pattern, focusing on semantic potential and direction of transfer. Methodologically, this study contributes to the slowly growing body of studies that use collexeme strengths that are not only less related to frequency than the traditional methods (i.e., association is measured in a less diluted way) and that are directional (i.e., we can focus on one direction of association from the verb to the construction).","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141153761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Register and the dual nature of functional correspondence: accounting for text-linguistic variation between registers, within registers, and without registers 语域与功能对应的双重性质:解释语域之间、语域之内和语域之外的文本语言差异
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-05-18 DOI: 10.1515/cllt-2024-0011
Jesse Egbert, Douglas Biber, Daniel Keller, Marianna Gracheva
During the past 20 years, corpus linguistic research on register variation has yielded important theoretical advances. The first part of this paper discusses these advances and the cumulative body of research that has produced them. In the second part of the paper, we focus on the goals of research on register variation. The traditional goal of the text-linguistic (TxtLx) approach to linguistic variation has been to describe registers and patterns of register variation: describing the linguistic and situational characteristics of registers. In this paper, we explore a related, but distinct, text-linguistic goal: to account for all linguistic variation among texts. Because the TxtLx framework assumes the importance of functional correspondence between linguistic characteristics and situational characteristics, it is reasonable to assume that in addition to register, we can use situational parameters coded continuously at the level of individual texts as additional predictors of text-linguistic variation. We describe the results of an empirical study to show that using both register categories and text-level situational parameters as predictors results in a more comprehensive and explanatory model of text-linguistic variation. In the conclusion we discuss the future of corpus-based register studies, focusing on unanswered questions related to theoretical claims about register.
过去 20 年间,语料库语言学对语域变异的研究取得了重要的理论进展。本文第一部分讨论了这些进展以及产生这些进展的累积研究成果。在本文的第二部分,我们将重点讨论语域变异研究的目标。文本语言学(TxtLx)方法研究语言变异的传统目标是描述语域和语域变异模式:描述语域的语言和情景特征。在本文中,我们将探讨一个相关但不同的文本语言学目标:解释文本之间的所有语言变异。由于 TxtLx 框架假定语言特征和情景特征之间的功能对应关系非常重要,因此我们有理由认为,除了语域之外,我们还可以使用在单篇文本层面上连续编码的情景参数作为文本语言变异的额外预测因素。我们描述了一项实证研究的结果,以说明同时使用语域类别和文本层面的情景参数作为预测因子,可以得到一个更全面、更能解释文本语言变异的模型。在结论部分,我们讨论了基于语料库的语域研究的未来,重点是与语域理论主张相关的未决问题。
{"title":"Register and the dual nature of functional correspondence: accounting for text-linguistic variation between registers, within registers, and without registers","authors":"Jesse Egbert, Douglas Biber, Daniel Keller, Marianna Gracheva","doi":"10.1515/cllt-2024-0011","DOIUrl":"https://doi.org/10.1515/cllt-2024-0011","url":null,"abstract":"During the past 20 years, corpus linguistic research on register variation has yielded important theoretical advances. The first part of this paper discusses these advances and the cumulative body of research that has produced them. In the second part of the paper, we focus on the goals of research on register variation. The traditional goal of the text-linguistic (TxtLx) approach to linguistic variation has been to describe registers and patterns of register variation: describing the linguistic and situational characteristics of registers. In this paper, we explore a related, but distinct, text-linguistic goal: to account for all linguistic variation among texts. Because the TxtLx framework assumes the importance of <jats:italic>functional correspondence</jats:italic> between linguistic characteristics and situational characteristics, it is reasonable to assume that in addition to register, we can use situational parameters coded continuously at the level of individual texts as additional predictors of text-linguistic variation. We describe the results of an empirical study to show that using both register categories and text-level situational parameters as predictors results in a more comprehensive and explanatory model of text-linguistic variation. In the conclusion we discuss the future of corpus-based register studies, focusing on unanswered questions related to theoretical claims about register.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas 学习者语料库研究:批判性评估和路线图,为语言学习服务(SLA)研究议程做出(更多)贡献
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-05-13 DOI: 10.1515/cllt-2024-0014
Magali Paquot
Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.
在过去的十年中,学习者语料库作为第二语言习得(SLA)研究中的宝贵数据源已得到认可。这一发展可归功于学习者语料库研究(LCR)的重大进展。然而,仍有大量工作要做。本文强调了保持学习者语料库在 SLA 中的相关性所必须解决的关键问题。尤其是,我着重强调了对更多样化的学习者语料库的需求,强调了详细元数据的重要性,并提倡多因素研究设计。然后,我重温了目前关于母语使用者在 LCR 中的作用的争论,并提出了解决这一棘手问题的切实可行的方案。最后,我还重新讨论了改进定量方法和统计的必要性,并认为强有力的定量分析的重要性怎么强调都不为过。总之,我设想了一个雄心勃勃的学习者语料库编纂项目,该项目将遵循 FAIR 原则,目标是进一步提高 LCR 的研究质量。
{"title":"Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas","authors":"Magali Paquot","doi":"10.1515/cllt-2024-0014","DOIUrl":"https://doi.org/10.1515/cllt-2024-0014","url":null,"abstract":"Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus linguistics and the social sciences 语料库语言学与社会科学
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-25 DOI: 10.1515/cllt-2024-0036
Tony McEnery, Gavin Brookes
Corpus linguistics, with its methodological orientation towards the empirical analysis of language based on large text collections, has the potential to offer significant tools for addressing real-world problems across various social science domains, including climate change, criminology, healthcare and policy making. Despite this potential, the integration of corpus linguistics into social science disciplines (beyond linguistics) remains hampered by fundamental differences in epistemology, definitions and methodological approaches. This article explores the relationship between corpus linguistics and the social sciences. It is argued that epistemology, or the theory of knowledge, represents a primary barrier to integration, with much corpus linguistics research aligning with positivist and naturalist epistemologies. By contrast, many social science disciplines embrace more interpretive, conventionalist approaches that account for the dynamic nature of social phenomena. Considering the role of naturalism and conventionalism within both corpus linguistics and the social sciences, this article illustrates how these epistemological stances are likely to influence the acceptance and use of corpus methods in social science research. Despite the challenges, areas of convergence (e.g. shared use of data processing tools and the acknowledgement of the central role of language in social processes) provide opportunities for cross-disciplinary collaboration. As means to bridge the epistemological divide, this article advocates for a critical realist approach and concludes by calling on users of corpus linguistic methods to be reflexive and transparent about their epistemological stances when reporting their research.
语料库语言学在方法论上以基于大型文本集的语言实证分析为导向,有可能为解决气候变化、犯罪学、医疗保健和政策制定等各种社会科学领域的现实问题提供重要工具。尽管有这样的潜力,语料库语言学与社会科学学科(除语言学之外)的整合仍然受到认识论、定义和方法论上的根本差异的阻碍。本文探讨语料库语言学与社会科学之间的关系。文章认为,认识论或知识理论是融合的主要障碍,许多语料库语言学研究与实证主义和自然主义认识论相一致。与此相反,许多社会科学学科则采用解释性更强的传统主义方法来解释社会现象的动态性质。考虑到自然主义和传统主义在语料库语言学和社会科学中的作用,本文阐述了这些认识论立场可能如何影响社会科学研究对语料库方法的接受和使用。尽管存在挑战,但有一些共同点(如共同使用数据处理工具和承认语言在社会进程中的核心作用)为跨学科合作提供了机会。作为弥合认识论鸿沟的手段,本文提倡批判现实主义方法,并在最后呼吁语料库语言学方法的使用者在报告其研究时对其认识论立场保持自省和透明。
{"title":"Corpus linguistics and the social sciences","authors":"Tony McEnery, Gavin Brookes","doi":"10.1515/cllt-2024-0036","DOIUrl":"https://doi.org/10.1515/cllt-2024-0036","url":null,"abstract":"\u0000 Corpus linguistics, with its methodological orientation towards the empirical analysis of language based on large text collections, has the potential to offer significant tools for addressing real-world problems across various social science domains, including climate change, criminology, healthcare and policy making. Despite this potential, the integration of corpus linguistics into social science disciplines (beyond linguistics) remains hampered by fundamental differences in epistemology, definitions and methodological approaches. This article explores the relationship between corpus linguistics and the social sciences. It is argued that epistemology, or the theory of knowledge, represents a primary barrier to integration, with much corpus linguistics research aligning with positivist and naturalist epistemologies. By contrast, many social science disciplines embrace more interpretive, conventionalist approaches that account for the dynamic nature of social phenomena. Considering the role of naturalism and conventionalism within both corpus linguistics and the social sciences, this article illustrates how these epistemological stances are likely to influence the acceptance and use of corpus methods in social science research. Despite the challenges, areas of convergence (e.g. shared use of data processing tools and the acknowledgement of the central role of language in social processes) provide opportunities for cross-disciplinary collaboration. As means to bridge the epistemological divide, this article advocates for a critical realist approach and concludes by calling on users of corpus linguistic methods to be reflexive and transparent about their epistemological stances when reporting their research.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140656127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis 科学文章中长名词复合词的分布特性:基于均匀信息密度假说的研究
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0028
John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen
Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.
名词性化合物是科学文章中常用的一种结构。尽管它们很常见,但人们对它们在科学文章中的分布却知之甚少。根据 "均匀信息密度假说"(Uniform Information Density hypothesis),即说话者以恒定的速度传递信息,避免信息传递的高峰和低谷,我们预测名词性复词应集中在科技文章的末尾,在其前面有有助于理解的辅助文字,并在首次使用后经常重复出现。在本文中,我们通过对生物学、经济学和语言学领域的科学论文语料库进行定量和定性分析,对上述预测进行了研究。虽然我们的调查没有为上述第一和第三项预测揭示明确的结论,但却为第二项预测提供了支持性证据,从而推进了我们对数控系统使用和说话者在传递信息时所作选择的理解。
{"title":"The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis","authors":"John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen","doi":"10.1515/cllt-2023-0028","DOIUrl":"https://doi.org/10.1515/cllt-2023-0028","url":null,"abstract":"Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus-based discourse analysis: from meta-reflection to accountability 基于语料库的话语分析:从元反思到问责制
IF 1.6 2区 文学 Q1 Arts and Humanities Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0104
Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee
Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.
近年来,在基于语料库的话语分析中,对数据和方法的反思日益增多。在本文中,我们首先总结了此类反思中出现的一些问题(包括三角测量、客观性/主体性、复制、透明度、反身性、一致性等概念)。然后,我们引入一个新的 "问责制 "框架,用于基于语料库的话语分析(或许还包括其他分析)。我们将这种问责制概念化为一种多层面现象,涵盖研究过程的各个方面。在本文的第二部分,我们将这一框架与一项新的跨机构倡议--澳大利亚文本分析平台(ATAP)--联系起来,该倡议旨在解决框架中的一小部分问题,即通过 Jupyter 笔记本实现分析的透明化。我们将引文工具作为 ATAP 笔记本的范例进行介绍,该笔记本与基于语料库的话语分析尤为相关。我们将思考该笔记本如何促进分析透明度方面的问责制,并使用一组不同的语料库说明其主要应用。
{"title":"Corpus-based discourse analysis: from meta-reflection to accountability","authors":"Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee","doi":"10.1515/cllt-2023-0104","DOIUrl":"https://doi.org/10.1515/cllt-2023-0104","url":null,"abstract":"Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Corpus Linguistics and Linguistic Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1