首页 > 最新文献

Corpus Linguistics and Linguistic Theory最新文献

英文 中文
The red dress is cute: why subjective adjectives are more often predicative 红裙子很可爱:为什么主观形容词更常用作谓语?
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-09-05 DOI: 10.1515/cllt-2024-0044
Lelia Glass
Which adjectives tend to occur as attributive (the cute/red dress) versus predicative (the dress is cute/red) and why? Building on findings from Wiegand et al. (2013. Predicative adjectives: An unsupervised criterion to extract subjective adjectives. In Lucy Vanderwende, Hal DauméIII & Katrin Kirchhoff (eds.), Proceedings of the 2013 conference of the North American chapter of the Association for Computational Linguistics : Human language technologies (NAACL-HLT), 534–539. Atlanta, GA: Association for Computational Linguistics) and Vartiainen (2013. Subjectivity, indefiniteness and semantic change. English Language and Linguistics 17(1). 157–179), this paper argues that subjective adjectives such as cute tend to be placed in predicative position not just because they often describe discourse-new information, but because this position serves to foreground information that the hearer may disagree with. This claim is supported using data from the Corpus of Contemporary American English (Davies, Mark. 2008. The corpus of contemporary American English: One billion words, 1990-present. Available at: https://www.english-corpora.org/coca/) combined with human annotations for subjectivity from Scontras et al. (2017. Subjectivity predicts adjective ordering preferences. Open Mind 1(1). 53–66) et seq.; and data from image captions versus descriptions (for seeing versus low-vision people) from the National Gallery of Art. A production experiment manipulates the discourse context to further show that adjectives tend to be placed in predicative position when they express controversial information. Overall, this paper explores how the lexical semantics of adjectives shapes the pragmatic contexts in which they tend to be used, which in turn shapes the syntax of the sentences using them.
哪些形容词倾向于作为属性词(可爱/红色连衣裙)出现,哪些形容词倾向于作为谓词(连衣裙很可爱/红色)出现,为什么?以 Wiegand 等人(2013.谓语形容词:提取主观形容词的无监督标准。见 Lucy Vanderwende、Hal DauméIII & Katrin Kirchhoff(编辑),《计算语言学协会北美分会 2013 年会议论文集:人类语言技术》(NAACL-HLT),534-539 页。亚特兰大,佐治亚州:计算语言学协会)和 Vartiainen(2013 年。主观性、不确定性和语义变化。英语语言和语言学 17(1).157-179),本文认为,诸如可爱之类的主观形容词往往被置于谓语位置,这不仅是因为它们经常描述话语新信息,还因为这一位置有助于突出听者可能不同意的信息。本文使用《当代美国英语语料库》(Corpus of Contemporary American English)中的数据(Davies, Mark.2008.The corpus of contemporary American English:The corpus of contemporary American English: One billion words, 1990-present.Available at: https://www.english-corpora.org/coca/)结合 Scontras 等人(2017.主观性预测形容词排序偏好。Open Mind 1(1).53-66) et seq.;以及来自美国国家美术馆的图片说明与描述数据(针对视力好的人与视力差的人)。一个制作实验操纵了话语语境,进一步表明形容词在表达有争议的信息时倾向于被置于谓语位置。总之,本文探讨了形容词的词汇语义是如何塑造形容词倾向于使用的语用语境的,而语用语境又是如何塑造使用形容词的句子的语法的。
{"title":"The red dress is cute: why subjective adjectives are more often predicative","authors":"Lelia Glass","doi":"10.1515/cllt-2024-0044","DOIUrl":"https://doi.org/10.1515/cllt-2024-0044","url":null,"abstract":"Which adjectives tend to occur as attributive (<jats:italic>the cute/red dress</jats:italic>) versus predicative (<jats:italic>the dress is cute/red</jats:italic>) and why? Building on findings from Wiegand et al. (2013. Predicative adjectives: An unsupervised criterion to extract subjective adjectives. In Lucy Vanderwende, Hal DauméIII &amp; Katrin Kirchhoff (eds.), <jats:italic>Proceedings of the 2013 conference of the North American chapter of the </jats:italic> <jats:italic>Association for Computational Linguistics</jats:italic> <jats:italic>: Human language technologies (NAACL-HLT)</jats:italic>, 534–539. Atlanta, GA: Association for Computational Linguistics) and Vartiainen (2013. Subjectivity, indefiniteness and semantic change. <jats:italic>English Language and Linguistics</jats:italic> 17(1). 157–179), this paper argues that subjective adjectives such as <jats:italic>cute</jats:italic> tend to be placed in predicative position not just because they often describe discourse-new information, but because this position serves to foreground information that the hearer may disagree with. This claim is supported using data from the Corpus of Contemporary American English (Davies, Mark. 2008. <jats:italic>The corpus of contemporary American English: One billion words, 1990-present</jats:italic>. Available at: <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" ext-link-type=\"uri\" xlink:href=\"https://www.english-corpora.org/coca/\">https://www.english-corpora.org/coca/</jats:ext-link>) combined with human annotations for subjectivity from Scontras et al. (2017. Subjectivity predicts adjective ordering preferences. <jats:italic>Open Mind</jats:italic> 1(1). 53–66) <jats:italic>et seq.</jats:italic>; and data from image captions versus descriptions (for seeing versus low-vision people) from the National Gallery of Art. A production experiment manipulates the discourse context to further show that adjectives tend to be placed in predicative position when they express controversial information. Overall, this paper explores how the lexical semantics of adjectives shapes the pragmatic contexts in which they tend to be used, which in turn shapes the syntax of the sentences using them.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"20 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A corpus-based study on semantic and cognitive features of bei sentences in Mandarin Chinese 基于语料库的普通话bei 句语义和认知特征研究
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-09-05 DOI: 10.1515/cllt-2024-0031
Yonghui Xie, Ruochen Niu, Haitao Liu
Bei sentences in Mandarin Chinese with SOV word order have attracted extensive interest. However, their semantic features lacked quantitative evidence and their cognitive features received insufficient attention. Therefore, the current study aims to quantitatively investigate the semantic and cognitive features through the analysis of nine annotated factors in a corpus. The results regarding bei sentences show that (i) subjects exhibit a tendency to be definite and animate; non-adversative verbs have gained popularity over time, and intransitive verbs are capable of taking objects; (ii) subject relations tend to be long, implying heavy cognitive load, whereas the dependencies governed by subjects are often short, suggesting light cognitive load; and (iii) certain semantic factors significantly impact cognitive factors; for instance, animate subjects tend to govern shorter dependencies. Overall, our study provides empirical support for the semantic features of bei sentences and reveals their cognitive features using dependency distance.
汉语普通话中带有 SOV 词序的 Bei 句子引起了广泛关注。然而,其语义特征缺乏量化证据,认知特征也未得到足够关注。因此,本研究旨在通过分析语料库中的九个注释因素,对其语义和认知特征进行定量研究。有关 bei 句子的研究结果表明:(i) 主语表现出确定和有生命的倾向;随着时间的推移,非谓语动词越来越受欢迎,而不及物动词则可以带宾语;(ii) 主语关系往往较长,这意味着认知负荷较重,而主语所支配的从属关系往往较短,这意味着认知负荷较轻;(iii) 某些语义因素对认知因素有显著影响,例如,有生命的主语往往支配较短的从属关系。总之,我们的研究为 bei 句子的语义特征提供了实证支持,并利用依存距离揭示了其认知特征。
{"title":"A corpus-based study on semantic and cognitive features of bei sentences in Mandarin Chinese","authors":"Yonghui Xie, Ruochen Niu, Haitao Liu","doi":"10.1515/cllt-2024-0031","DOIUrl":"https://doi.org/10.1515/cllt-2024-0031","url":null,"abstract":"<jats:italic>Bei</jats:italic> sentences in Mandarin Chinese with SOV word order have attracted extensive interest. However, their semantic features lacked quantitative evidence and their cognitive features received insufficient attention. Therefore, the current study aims to quantitatively investigate the semantic and cognitive features through the analysis of nine annotated factors in a corpus. The results regarding <jats:italic>bei</jats:italic> sentences show that (i) subjects exhibit a tendency to be definite and animate; non-adversative verbs have gained popularity over time, and intransitive verbs are capable of taking objects; (ii) subject relations tend to be long, implying heavy cognitive load, whereas the dependencies governed by subjects are often short, suggesting light cognitive load; and (iii) certain semantic factors significantly impact cognitive factors; for instance, animate subjects tend to govern shorter dependencies. Overall, our study provides empirical support for the semantic features of <jats:italic>bei</jats:italic> sentences and reveals their cognitive features using dependency distance.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"10 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Verb influence on French wh-placement: a parallel corpus study 动词对法语wh-placement的影响:平行语料库研究
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-09-03 DOI: 10.1515/cllt-2024-0001
Jan Fliessbach, Johanna Rockstroh
Our study investigates the effect of French verb lemmata on the preverbal (QV) or postverbal (VQ) positioning of interrogative forms equivalent to English ‘what’ (que, quoi, and related forms) within a French–Spanish parallel corpus of subtitles. We highlight and illustrate the corpus’s utility for studying less frequent verbs in combination with specific wh-forms. Our findings suggest that less frequent French verbs exhibit weaker associations with QV compared to their more frequent counterparts. A post-hoc study using Spanish translations reveals that French verbs correlated with QV often denote observable actions involving directly accessible Q-referents. We hypothesise that queries concerning ‘situationally accessible’ referents are predominantly utilised for non-standard, evaluative, or challenging questions, which are typically QV in French.
我们的研究调查了法语动词词性对法语-西班牙语平行字幕语料库中相当于英语 "什么"(que、quoi 和相关形式)的疑问句前置(QV)或后置(VQ)定位的影响。我们强调并说明了该语料库在研究频率较低的动词与特定wh-forms组合时的实用性。我们的研究结果表明,与频率较高的动词相比,频率较低的法语动词与 QV 的关联较弱。使用西班牙语翻译进行的事后研究显示,与 QV 相关的法语动词通常表示涉及可直接访问的 Q 指涉的可观察行为。我们假设,涉及 "情景可及 "参照物的询问主要用于非标准、评价性或挑战性问题,而这些问题在法语中通常是 QV。
{"title":"Verb influence on French wh-placement: a parallel corpus study","authors":"Jan Fliessbach, Johanna Rockstroh","doi":"10.1515/cllt-2024-0001","DOIUrl":"https://doi.org/10.1515/cllt-2024-0001","url":null,"abstract":"Our study investigates the effect of French verb lemmata on the preverbal (QV) or postverbal (VQ) positioning of interrogative forms equivalent to English ‘what’ (<jats:italic>que</jats:italic>, <jats:italic>quoi</jats:italic>, and related forms) within a French–Spanish parallel corpus of subtitles. We highlight and illustrate the corpus’s utility for studying less frequent verbs in combination with specific <jats:italic>wh</jats:italic>-forms. Our findings suggest that less frequent French verbs exhibit weaker associations with QV compared to their more frequent counterparts. A post-hoc study using Spanish translations reveals that French verbs correlated with QV often denote observable actions involving directly accessible Q-referents. We hypothesise that queries concerning ‘situationally accessible’ referents are predominantly utilised for non-standard, evaluative, or challenging questions, which are typically QV in French.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"23 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Idiosyncratic entrenchment: tracing change in constructional schematicity with nested random effects 痴人说梦:利用嵌套随机效应追踪建构图式的变化
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-08-22 DOI: 10.1515/cllt-2023-0092
Svetlana Vetchinnikova
Usage-based constructionist approaches see language as an inventory of constructions at different levels of schematicity learned from the input. If so, personal constructicons should vary as a function of usage. Repeated use and chunking/entrenchment of concrete instances should lead to reanalysis of their internal structure and change in the level of schematicity. This paper exploits the reduction probability of is in it is as a diagnostic of reanalysis in a 1.75-million-word diachronic corpus of a single blogger over 8 years. All instances of it is/it’s (n = 10,929) were annotated at the constructional and lexical levels. A multilevel logistic regression model showed significant fixed effects of constructional entropy and construction-to-word association on reduction probability. Importantly, there remained substantial variation across lexical types of constructions in the extent to which they associated or became associated with reduction over time, suggesting idiosyncratic entrenchment and potential reanalysis as a function of usage.
以使用为基础的建构主义方法认为,语言是从输入中学习到的不同层次结构的建构库。如果是这样的话,个人构词法应该随着使用而变化。具体实例的重复使用和分块/堑壕化应导致对其内部结构的重新分析和图式化水平的变化。本文利用 "is "在 "it is "中的还原概率,对一位博主 8 年来 175 万字的双时态语料库进行重新分析。所有 it is/it's 实例(n = 10,929)都在构词和词汇层面进行了注释。多层次逻辑回归模型显示,构词熵和构词与词关联对还原概率有显著的固定效应。重要的是,随着时间的推移,不同词性类型的构词与缩减的关联或关联程度仍存在很大差异,这表明随着使用情况的变化,构词的特异性固着和潜在的重新分析也会发生变化。
{"title":"Idiosyncratic entrenchment: tracing change in constructional schematicity with nested random effects","authors":"Svetlana Vetchinnikova","doi":"10.1515/cllt-2023-0092","DOIUrl":"https://doi.org/10.1515/cllt-2023-0092","url":null,"abstract":"Usage-based constructionist approaches see language as an inventory of constructions at different levels of schematicity learned from the input. If so, personal constructicons should vary as a function of usage. Repeated use and chunking/entrenchment of concrete instances should lead to reanalysis of their internal structure and change in the level of schematicity. This paper exploits the reduction probability of <jats:italic>is</jats:italic> in <jats:italic>it is</jats:italic> as a diagnostic of reanalysis in a 1.75-million-word diachronic corpus of a single blogger over 8 years. All instances of <jats:italic>it is/it’s</jats:italic> (n = 10,929) were annotated at the constructional and lexical levels. A multilevel logistic regression model showed significant fixed effects of constructional entropy and construction-to-word association on reduction probability. Importantly, there remained substantial variation across lexical types of constructions in the extent to which they associated or became associated with reduction over time, suggesting idiosyncratic entrenchment and potential reanalysis as a function of usage.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"44 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer five ways: applications of multiple distinctive collexeme analysis to the dative alternation in Mandarin Chinese 五种转移方式:在普通话的助动词交替中应用多重独特词素分析法
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-05-25 DOI: 10.1515/cllt-2024-0033
Shengyu Liao, Stefan Th. Gries, Stefanie Wulff
The dative alternation has been extensively studied in the world’s languages, and the meanings of the verbs participating in the alternation have been shown to play a key role in determining its argument realization options. The present paper presents a multiple distinctive collexeme analysis approach to the dative alternation in Mandarin Chinese, which involves a choice of one of five functionally similar alternants, and it does so by also discussing several ways to improve how this has been done statistically in most previous analyses. Linguistically, we identify the core semantic differences of the five constructions based on which verbs statistically prefer to occur in which pattern, focusing on semantic potential and direction of transfer. Methodologically, this study contributes to the slowly growing body of studies that use collexeme strengths that are not only less related to frequency than the traditional methods (i.e., association is measured in a less diluted way) and that are directional (i.e., we can focus on one direction of association from the verb to the construction).
对世界语言中的助动词交替进行了广泛的研究,参与交替的动词的意义被证明在决定其论据实现选择方面起着关键作用。普通话中的助词交替涉及从五个功能相似的交替词中选择一个的问题,本文针对普通话中的助词交替提出了一种多重独特的词库分析方法,同时还讨论了改进以往大多数分析中统计方法的几种途径。在语言学上,我们根据哪些动词在统计上更倾向于出现在哪种模式中来确定这五种结构的核心语义差异,重点关注语义潜能和转移方向。在方法论上,本研究为逐渐增多的研究机构做出了贡献,这些研究使用的词组强度不仅与频率的关系不如传统方法(即关联的测量方法不那么稀释),而且具有方向性(即我们可以关注从动词到结构的一个关联方向)。
{"title":"Transfer five ways: applications of multiple distinctive collexeme analysis to the dative alternation in Mandarin Chinese","authors":"Shengyu Liao, Stefan Th. Gries, Stefanie Wulff","doi":"10.1515/cllt-2024-0033","DOIUrl":"https://doi.org/10.1515/cllt-2024-0033","url":null,"abstract":"The dative alternation has been extensively studied in the world’s languages, and the meanings of the verbs participating in the alternation have been shown to play a key role in determining its argument realization options. The present paper presents a multiple distinctive collexeme analysis approach to the dative alternation in Mandarin Chinese, which involves a choice of one of five functionally similar alternants, and it does so by also discussing several ways to improve how this has been done statistically in most previous analyses. Linguistically, we identify the core semantic differences of the five constructions based on which verbs statistically prefer to occur in which pattern, focusing on semantic potential and direction of transfer. Methodologically, this study contributes to the slowly growing body of studies that use collexeme strengths that are not only less related to frequency than the traditional methods (i.e., association is measured in a less diluted way) and that are directional (i.e., we can focus on one direction of association from the verb to the construction).","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"94 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141153761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Register and the dual nature of functional correspondence: accounting for text-linguistic variation between registers, within registers, and without registers 语域与功能对应的双重性质:解释语域之间、语域之内和语域之外的文本语言差异
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-05-18 DOI: 10.1515/cllt-2024-0011
Jesse Egbert, Douglas Biber, Daniel Keller, Marianna Gracheva
During the past 20 years, corpus linguistic research on register variation has yielded important theoretical advances. The first part of this paper discusses these advances and the cumulative body of research that has produced them. In the second part of the paper, we focus on the goals of research on register variation. The traditional goal of the text-linguistic (TxtLx) approach to linguistic variation has been to describe registers and patterns of register variation: describing the linguistic and situational characteristics of registers. In this paper, we explore a related, but distinct, text-linguistic goal: to account for all linguistic variation among texts. Because the TxtLx framework assumes the importance of functional correspondence between linguistic characteristics and situational characteristics, it is reasonable to assume that in addition to register, we can use situational parameters coded continuously at the level of individual texts as additional predictors of text-linguistic variation. We describe the results of an empirical study to show that using both register categories and text-level situational parameters as predictors results in a more comprehensive and explanatory model of text-linguistic variation. In the conclusion we discuss the future of corpus-based register studies, focusing on unanswered questions related to theoretical claims about register.
过去 20 年间,语料库语言学对语域变异的研究取得了重要的理论进展。本文第一部分讨论了这些进展以及产生这些进展的累积研究成果。在本文的第二部分,我们将重点讨论语域变异研究的目标。文本语言学(TxtLx)方法研究语言变异的传统目标是描述语域和语域变异模式:描述语域的语言和情景特征。在本文中,我们将探讨一个相关但不同的文本语言学目标:解释文本之间的所有语言变异。由于 TxtLx 框架假定语言特征和情景特征之间的功能对应关系非常重要,因此我们有理由认为,除了语域之外,我们还可以使用在单篇文本层面上连续编码的情景参数作为文本语言变异的额外预测因素。我们描述了一项实证研究的结果,以说明同时使用语域类别和文本层面的情景参数作为预测因子,可以得到一个更全面、更能解释文本语言变异的模型。在结论部分,我们讨论了基于语料库的语域研究的未来,重点是与语域理论主张相关的未决问题。
{"title":"Register and the dual nature of functional correspondence: accounting for text-linguistic variation between registers, within registers, and without registers","authors":"Jesse Egbert, Douglas Biber, Daniel Keller, Marianna Gracheva","doi":"10.1515/cllt-2024-0011","DOIUrl":"https://doi.org/10.1515/cllt-2024-0011","url":null,"abstract":"During the past 20 years, corpus linguistic research on register variation has yielded important theoretical advances. The first part of this paper discusses these advances and the cumulative body of research that has produced them. In the second part of the paper, we focus on the goals of research on register variation. The traditional goal of the text-linguistic (TxtLx) approach to linguistic variation has been to describe registers and patterns of register variation: describing the linguistic and situational characteristics of registers. In this paper, we explore a related, but distinct, text-linguistic goal: to account for all linguistic variation among texts. Because the TxtLx framework assumes the importance of <jats:italic>functional correspondence</jats:italic> between linguistic characteristics and situational characteristics, it is reasonable to assume that in addition to register, we can use situational parameters coded continuously at the level of individual texts as additional predictors of text-linguistic variation. We describe the results of an empirical study to show that using both register categories and text-level situational parameters as predictors results in a more comprehensive and explanatory model of text-linguistic variation. In the conclusion we discuss the future of corpus-based register studies, focusing on unanswered questions related to theoretical claims about register.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"38 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas 学习者语料库研究:批判性评估和路线图,为语言学习服务(SLA)研究议程做出(更多)贡献
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-05-13 DOI: 10.1515/cllt-2024-0014
Magali Paquot
Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.
在过去的十年中,学习者语料库作为第二语言习得(SLA)研究中的宝贵数据源已得到认可。这一发展可归功于学习者语料库研究(LCR)的重大进展。然而,仍有大量工作要做。本文强调了保持学习者语料库在 SLA 中的相关性所必须解决的关键问题。尤其是,我着重强调了对更多样化的学习者语料库的需求,强调了详细元数据的重要性,并提倡多因素研究设计。然后,我重温了目前关于母语使用者在 LCR 中的作用的争论,并提出了解决这一棘手问题的切实可行的方案。最后,我还重新讨论了改进定量方法和统计的必要性,并认为强有力的定量分析的重要性怎么强调都不为过。总之,我设想了一个雄心勃勃的学习者语料库编纂项目,该项目将遵循 FAIR 原则,目标是进一步提高 LCR 的研究质量。
{"title":"Learner corpus research: a critical appraisal and roadmap for contributing (more) to SLA research agendas","authors":"Magali Paquot","doi":"10.1515/cllt-2024-0014","DOIUrl":"https://doi.org/10.1515/cllt-2024-0014","url":null,"abstract":"Over the past decade, learner corpora have gained recognition as valuable data sources in Second Language Acquisition (SLA) research. This development can be attributed to significant progress in Learner Corpus Research (LCR). However, there is still substantial work to be done. This article highlights key issues essential for sustaining the relevance of learner corpora in SLA. More particularly, I focus on the need for more diverse types of learner corpora, stress the importance of detailed metadata, and advocate for multifactorial study designs. I then revisit ongoing debates regarding the role of the native speaker in LCR and propose a practical solution to address this thorny issue. Finally, I also readdress the need for improvement in the quantitative methods and statistics, arguing that the importance of robust quantitative analysis cannot be overstated. In conclusion, I envision an ambitious learner corpus compilation project that adheres to the FAIR principles, with the goal of further elevating study quality in LCR.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"129 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis 科学文章中长名词复合词的分布特性:基于均匀信息密度假说的研究
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0028
John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen
Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.
名词性化合物是科学文章中常用的一种结构。尽管它们很常见,但人们对它们在科学文章中的分布却知之甚少。根据 "均匀信息密度假说"(Uniform Information Density hypothesis),即说话者以恒定的速度传递信息,避免信息传递的高峰和低谷,我们预测名词性复词应集中在科技文章的末尾,在其前面有有助于理解的辅助文字,并在首次使用后经常重复出现。在本文中,我们通过对生物学、经济学和语言学领域的科学论文语料库进行定量和定性分析,对上述预测进行了研究。虽然我们的调查没有为上述第一和第三项预测揭示明确的结论,但却为第二项预测提供了支持性证据,从而推进了我们对数控系统使用和说话者在传递信息时所作选择的理解。
{"title":"The distributional properties of long nominal compounds in scientific articles: an investigation based on the uniform information density hypothesis","authors":"John Gamboa, Kristina Braun, Juhani Järvikivi, Shanley E. M. Allen","doi":"10.1515/cllt-2023-0028","DOIUrl":"https://doi.org/10.1515/cllt-2023-0028","url":null,"abstract":"Nominal compounds are a structure commonly used in scientific texts. Despite their commonality, very little is known about how they are distributed in scientific articles. Based on the Uniform Information Density hypothesis, which states that speakers communicate information at a constant rate, avoiding peaks and troughs of information transmission, we predict that nominal compounds should cluster toward the end of scientific texts, be preceded by supporting text that facilitates their understanding, and be repeated often after their first use. In this paper, we examine these predictions through a quantitative and a qualitative analysis of a corpus of scientific papers from the fields of Biology, Economics and Linguistics. While our investigation did not reveal definitive findings for the first and third predictions above, it did produce supporting evidence in favor of our second prediction, thus advancing our understanding of NC use and the choices speakers make when transmitting information.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"56 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpus-based discourse analysis: from meta-reflection to accountability 基于语料库的话语分析:从元反思到问责制
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-04-16 DOI: 10.1515/cllt-2023-0104
Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee
Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.
近年来,在基于语料库的话语分析中,对数据和方法的反思日益增多。在本文中,我们首先总结了此类反思中出现的一些问题(包括三角测量、客观性/主体性、复制、透明度、反身性、一致性等概念)。然后,我们引入一个新的 "问责制 "框架,用于基于语料库的话语分析(或许还包括其他分析)。我们将这种问责制概念化为一种多层面现象,涵盖研究过程的各个方面。在本文的第二部分,我们将这一框架与一项新的跨机构倡议--澳大利亚文本分析平台(ATAP)--联系起来,该倡议旨在解决框架中的一小部分问题,即通过 Jupyter 笔记本实现分析的透明化。我们将引文工具作为 ATAP 笔记本的范例进行介绍,该笔记本与基于语料库的话语分析尤为相关。我们将思考该笔记本如何促进分析透明度方面的问责制,并使用一组不同的语料库说明其主要应用。
{"title":"Corpus-based discourse analysis: from meta-reflection to accountability","authors":"Monika Bednarek, Martin Schweinberger, Kelvin K. H. Lee","doi":"10.1515/cllt-2023-0104","DOIUrl":"https://doi.org/10.1515/cllt-2023-0104","url":null,"abstract":"Recent years have seen an increase in data and method reflection in corpus-based discourse analysis. In this article, we first take stock of some of the issues arising from such reflection (covering concepts such as triangulation, objectivity/subjectivity, replication, transparency, reflexivity, consistency). We then introduce a new ‘accountability’ framework for use in corpus-based discourse analysis (and perhaps beyond). We conceptualise such accountability as a multi-faceted phenomenon, covering various aspects of the research process. In the second part of this article, we then link this framework to a new cross-institutional initiative – the Australian Text Analytics Platform (ATAP) – which aims to address a small part of the framework, namely the transparency of analyses through Jupyter notebooks. We introduce the Quotation Tool as an example ATAP notebook of particular relevance to corpus-based discourse analysis. We reflect on how this notebook fosters accountability in relation to transparency of analysis and illustrate key applications using a set of different corpora.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"25 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140609022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A collostructional approach to Japanese noun-modifying clause construction use and acquisition: a learner corpus study 日语名词修饰从句结构使用和习得的共构方法:学习者语料库研究
IF 1.6 2区 文学 0 LANGUAGE & LINGUISTICS Pub Date : 2024-03-22 DOI: 10.1515/cllt-2024-0020
Nicole C. De Los Reyes, Ute Römer-Barron
Japanese features a general noun-modifying clause construction (NMCC) with a more versatile range of semantic and pragmatic interpretations than equivalent constructions in other languages. Motivated by the learning challenge NMCCs pose to Japanese as a foreign language (JFL) learners, this article examines speech data from the International Corpus of Japanese as a Second Language (I-JAS) to compare learner use of NMCCs against a large L1 Japanese corpus. Instances of the construction from both corpora were analyzed to identify high-frequency part-of-speech categories and subcategories in the modifying clause predicate and head noun slots. A simple collexeme analysis was then employed to identify strongly attracted and repelled lexical items among those identified in realizations of the construction. Taken together, findings from these analyses revealed an important connection between the semantic weight of head nouns in NMCCs and the idiomaticity of the construction, with learner productions demonstrating a tendency toward heavy head nouns. This study lays the groundwork for future research seeking to explore the NMCC at different levels of granularity and to improve its treatment in JFL pedagogical materials.
日语中的一般名词修饰从句结构(NMCC)与其他语言中的同等结构相比,具有更多的语义和语用解释。由于 NMCC 给日语作为外语(JFL)的学习者带来了学习上的挑战,本文研究了日语作为第二语言的国际语料库(I-JAS)中的语音数据,将学习者使用 NMCC 的情况与大型 L1 日语语料库进行了比较。文章分析了两个语料库中的非修饰性从句结构实例,以确定修饰性从句谓语和头名词位置中的高频语篇类别和子类别。然后,我们采用简单的同义词分析,在该结构的实现过程中识别出强烈吸引和排斥的词项。综合来看,这些分析结果表明,NMCCs 中头名词的语义量与结构的习语性之间存在重要联系,学习者的作品倾向于使用重头名词。本研究为今后的研究奠定了基础,以便在不同的粒度水平上探索 NMCC,并改进 JFL 教学材料中对 NMCC 的处理。
{"title":"A collostructional approach to Japanese noun-modifying clause construction use and acquisition: a learner corpus study","authors":"Nicole C. De Los Reyes, Ute Römer-Barron","doi":"10.1515/cllt-2024-0020","DOIUrl":"https://doi.org/10.1515/cllt-2024-0020","url":null,"abstract":"Japanese features a general noun-modifying clause construction (NMCC) with a more versatile range of semantic and pragmatic interpretations than equivalent constructions in other languages. Motivated by the learning challenge NMCCs pose to Japanese as a foreign language (JFL) learners, this article examines speech data from the International Corpus of Japanese as a Second Language (I-JAS) to compare learner use of NMCCs against a large L1 Japanese corpus. Instances of the construction from both corpora were analyzed to identify high-frequency part-of-speech categories and subcategories in the modifying clause predicate and head noun slots. A simple collexeme analysis was then employed to identify strongly attracted and repelled lexical items among those identified in realizations of the construction. Taken together, findings from these analyses revealed an important connection between the semantic weight of head nouns in NMCCs and the idiomaticity of the construction, with learner productions demonstrating a tendency toward heavy head nouns. This study lays the groundwork for future research seeking to explore the NMCC at different levels of granularity and to improve its treatment in JFL pedagogical materials.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"30 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140196917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Corpus Linguistics and Linguistic Theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1