Corpus Linguistics and Linguistic Theory最新文献

英文中文

Linguistic variation within registers: granularity in textual units and situational parameters 语域内的语言变异:文本单位和情景参数的粒度

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-10-18 DOI: 10.1515/cllt-2022-0034

Jesse Egbert, M. Gracheva

Abstract Register studies have focused on accounting for linguistic variation between culturally recognized register categories. This comparative approach to register has consistently demonstrated that culturally recognized register categories can predict language variation at all linguistic levels. Nevertheless, it has also been shown by previous research that even the most well-established register categories have substantial internal linguistic variation. We propose that at least some of this unexplained variance could be the result of how a text is defined, as well as whether and how researchers account for situational variables within registers. We present four case studies that explore the extent to which linguistic variation within registers is influenced by the definition of the textual unit and the situational parameters. We show that the functional correspondence between situation and language use exists even within register categories and discuss the theoretical and methodological implications of these findings for register research.

摘要语域研究的重点是解释文化认可的语域类别之间的语言差异。这种语域的比较方法一贯表明，文化认可的语域类别可以预测所有语言层面的语言变化。尽管如此，先前的研究也表明，即使是最成熟的语域类别也有很大的内部语言差异。我们认为，这种无法解释的差异至少有一部分可能是文本如何定义的结果，以及研究人员是否以及如何解释寄存器中的情境变量。我们提出了四个案例研究，探讨语域内的语言变异在多大程度上受到语篇单位定义和情景参数的影响。我们表明，情况和语言使用之间的功能对应关系甚至存在于语域类别中，并讨论了这些发现对语域研究的理论和方法意义。

引用次数: 2

Parts of speech and the placement of Targets in the corpus of languages in northwestern Iran 伊朗西北部语言语料库中的词性和目标位置

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-10-12 DOI: 10.1515/cllt-2022-0001

H. Asadpour

Abstract This study applies a corpus-based quantitative approach to the word order typology and linguistic theories about word order in several genetically unrelated language varieties in northwestern Iran, such as Mukri Kurdish, Northeastern Kurdish and Armenian (Indo-European), Jewish Neo-Aramaic (Semitic), and Azeri Turkic (Turkish). Despite the difference in the default position of the direct object, the existing corpora of published and personal field data of narrative free speech demonstrate that these languages share the clause-final position of Targets predominantly (e.g., physical and metaphorical goals, recipients, addressees, and resultant-states) in their word order. Yet, Targets are more flexible in Mukri Kurdish, Northeastern Neo-Aramaic, and Azeri Turkic, whereas they are less flexible in Armenian and Northeastern Kurdish. Among various factors relevant to the placement of Targets, morphosyntactic features such as parts of speech exhibit constraints and clear preferences in the pre- and postverbal placement of Targets.

摘要本研究将基于语料库的定量方法应用于伊朗西北部几种基因无关的语言变体的语序类型学和语序语言学理论，如穆克里库尔德语、东北库尔德语和亚美尼亚语（印欧语）、犹太新阿拉米语（闪米特语）和阿塞拜疆突厥语（土耳其语）。尽管直接宾语的默认位置存在差异，但现有的叙事自由言论的已发表和个人领域数据语料库表明，这些语言在语序中主要共享目标的从句最终位置（例如，物理和隐喻目标、接受者、收件人和结果状态）。然而，目标在穆克里库尔德语、东北新阿拉姆语和阿塞拜疆突厥语中更灵活，而在亚美尼亚语和东北库尔德语中则不那么灵活。在与目标位置相关的各种因素中，词性等形态句法特征在目标的语前和语后位置上表现出约束和明显的偏好。

引用次数: 2

Alternation phenomena and language proficiency: the genitive alternation in the spoken language of EFL learners 交替现象与语言能力：英语学习者口语中的属格交替

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-10-04 DOI: 10.1515/cllt-2021-0078

Tanguy Dubois, M. Paquot, Benedikt Szmrecsanyi

Abstract This study investigates how foreign language proficiency, which previous corpus-based research on alternation phenomena has largely ignored, influences the choice of genitive variant (the tail of the dog/the dog’s tail) in learners of English as a Foreign Language. The data stems from the Trinity Lancaster Corpus, a three-million-word corpus featuring spoken language from low-intermediate to advanced learners of English from several L1 backgrounds. The collected genitive observations were annotated for various constraints such as the length, animacy, definiteness and discourse status of the constituents and then analyzed via mixed-effects logistic regression. The results show that although native speakers and learners are remarkably similar, low-proficiency learners are less sensitive to possessor definiteness and possessor animacy, the latter of which is otherwise the strongest constraint of the genitive alternation.

摘要本研究调查了外语水平如何影响作为外语的英语学习者的属格变体（狗的尾巴/狗的尾巴）的选择，而以前基于语料库的关于交替现象的研究在很大程度上忽略了这一点。这些数据来源于三一兰开斯特语料库，这是一个300万字的语料库，以来自不同L1背景的低中级到高级英语学习者的口语为特色。对收集到的属格观察结果进行注释，以考虑成分的长度、动物性、确定性和话语状态等各种约束，然后通过混合效应逻辑回归进行分析。结果表明，尽管母语使用者和学习者非常相似，但低水平的学习者对拥有者明确性和拥有者动物性的敏感性较低，后者在其他方面是属格交替的最强约束。

引用次数: 1

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-09-20 DOI: 10.1515/cllt-2021-0090

Haipeng Li, Jonathan Dunn, A. Nini

Abstract This paper measures the stability of cross-linguistic register variation. A register is a variety of a language that is associated with extra-linguistic context. The relationship between a register and its context is functional: the linguistic features that make up a register are motivated by the needs and constraints of the communicative situation. This view hypothesizes that register should be universal, so that we expect a stable relationship between the extra-linguistic context that defines a register and the sets of linguistic features which the register contains. In this paper, the universality and robustness of register variation is tested by comparing variation within versus between register-specific corpora in 60 languages using corpora produced in comparable communicative situations: tweets and Wikipedia articles. Our findings confirm the prediction that register variation is, in fact, universal.

摘要本文对跨语言语域变异的稳定性进行了测量。语域是一种语言的变体，它与语言外语境有关。语域与其语境之间的关系是功能性的:构成语域的语言特征是由交际情境的需要和约束所激发的。这一观点假设语域应该是普遍的，因此我们期望在定义语域的语言外语境和语域包含的语言特征集之间存在稳定的关系。在本文中，通过比较60种语言中特定于语域的语料库内部和之间的变化，使用在可比较的交际情境中产生的语料库:tweet和Wikipedia文章，来测试语域变化的普遍性和鲁棒性。我们的研究结果证实了注册变异实际上是普遍存在的预测。

引用次数: 1

Comparing the functional range of English to be to German sein: a test of the boundary permeability hypothesis 比较英语to be和德语sein的功能范围:边界渗透性假设的检验

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-09-19 DOI: 10.1515/cllt-2022-0015

T. Berg

Abstract The boundary permeability hypothesis views language as a system of categories which are more or less rigidly separated from one another. In previous work on grammatical categories, English was characterized as a soft boundary and German as a strict boundary language. This project presents a test of the prediction that English to be has a wider functional range than its German counterpart sein. The range of these verbs is determined on the basis of a bidirectional translation study. English to be is translated by a larger variety of lexical verbs in German while sein is translated by a smaller number of lexical verbs in English. In fact, the translation equivalents of sein form a subset of those of to be. Moreover, to be plays a much larger role in constructions such as there is and it is than sein does. The boundary permeability hypothesis views this structural difference and the wider semantic range of to be as two sides of the same coin. It is suggested that English is a more speaker-friendly language than German.

摘要边界渗透性假说将语言视为一个或多或少严格分离的范畴系统。在以前的语法范畴研究中，英语被定性为软边界语言，德语被定性为严格边界语言。该项目测试了英语的功能范围比德语的sein更广的预测。这些动词的范围是在双向翻译研究的基础上确定的。待翻译的英语由大量德语词汇动词翻译，而sein则由少量英语词汇动词翻译。事实上，sein的翻译等价物形成了to be的子集。此外，to be在诸如“存在”和“现在”之类的结构中扮演着比sein更大的角色。边界渗透性假说将这种结构差异和更广泛的语义范围视为一枚硬币的两面。有人认为英语比德语更适合讲英语。

引用次数: 0

In search of lost space 寻找失去的空间

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-08-10 DOI: 10.1515/cllt-2022-0007

Daniel Rojas Plata, Noé Alejandro Castro Sánchez

Abstract This paper analyzes the semantics of the French and Spanish prepositions en from the point of view of diachrony. We focus on the spatial variation of their usage in both languages. Our methodological approach is based on a corpus analysis. We draw on comparable ancient texts and analyze some semantic categories that are fundamental for the spatial description. Our results reveal a fairly active evolution of prepositions with increases and decreases of usage where we did not expect them.

摘要本文从历时的角度分析了法语和西班牙语介词的语义。我们关注的是它们在两种语言中使用的空间差异。我们的方法是基于语料库分析。我们借鉴了可比较的古代文本，并分析了空间描述的基本语义范畴。我们的研究结果显示，介词在我们没有预料到的地方，随着使用的增加和减少，出现了相当活跃的演变。

引用次数: 0

An improved test of the constant rate hypothesis: late Modern American English possessive have 恒率假设的改进检验:晚期现代美式英语所有格have

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-06-20 DOI: 10.1515/cllt-2021-0038

Richard Zimmermann

Abstract The Constant Rate Hypothesis (CRH) predicts that a linguistic innovation should spread at identical rates of change in all grammatical contexts in which it is used (Kroch 1989. Reflexes of grammar in patterns of language change. Language Variation and Change 1(3). 199–244). Weaknesses in previous tests of the CRH are identified. A new study is conducted that improves upon them. It utilizes a syntactic change in late Modern American English possessive have, which altered its realization in the grammar-theoretically related contexts negation, inversion, VP-adjunction and VP-ellipsis. Data sets are collected from the Corpus of Historical American English (Davies 2010. The corpus of historical American English: 400 million words, 1810-2009. http://corpus.byu.edu/coha/ (accessed 10 September 2016)) and analyzed with mixed-effects logistic regression models. The statistical analysis reveals that it is indeed plausible to assume that each of the contexts innovates the use of possessive have at the same speed. Implications of the findings for the CRH are discussed.

恒定速率假设(CRH)预测一种语言创新在使用它的所有语法语境中应该以相同的变化速率传播(Kroch 1989)。语言模式中的语法反射变化。语言变异与变化1(3)。199 - 244)。指出了先前CRH测试中的弱点。一项新的研究在此基础上进行了改进。它利用了现代晚期美国英语所有格have的句法变化，改变了它在语法理论相关语境中的实现方式，即否定、倒装、副格修饰和副格省略。数据集收集自美国历史英语语料库(Davies 2010)。美国历史英语语料库:4亿字，1810-2009。http://corpus.byu.edu/coha/(2016年9月10日访问))，并使用混合效应逻辑回归模型进行分析。统计分析表明，假设每一种语境都以同样的速度革新所有格have的用法确实是合理的。本文还讨论了研究结果对CRH的影响。

引用次数: 1

Towards a dynamic behavioral profile of the Mandarin Chinese temperature term re: a diachronic semasiological approach 汉语温度词的动态行为特征研究——历时语义学方法

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-03-04 DOI: 10.1515/cllt-2021-0046

Meili Liu

Abstract This study adopts a corpus-based behavioral profile approach, combining multifactorial usage-feature analysis with frequency-based quantitative analysis, to investigate the diachronic semasiological variation of the Mandarin Chinese temperature term 热 re ‘hot’. The result shows a dynamic behavioral profile, i.e., both the usage patterns and the semasiological structural weight of senses have been constantly shifting. The semasiology of re has been becoming more and more diversified over time. Methodologically, this study extends the traditional behavioral profile approach—hierarchical agglomerative cluster analysis by applying multiple correspondence analysis and corroborating its validity in accounting for and visualizing the multifactorial nature of semasiological change of lexical items. Theoretically, the present study not only corroborates basic assumptions of usage-based cognitive semantics (e.g., non-discreteness, non-equality of senses, and bodily experience) but also complements it by demonstrating that sociocultural factors also play an important role in semasiological boundary variation of a lexical item.

摘要本研究采用基于语料库的行为谱方法，将多因素使用特征分析与基于频率的定量分析相结合，研究汉语温度词的历时语义变化热你很热。结果显示了一个动态的行为特征，即感官的使用模式和语义结构权重都在不断变化。随着时间的推移，re的语义越来越多样化。在方法论上，本研究扩展了传统的行为图谱方法——层次凝聚聚类分析，应用了多重对应分析，并证实了其在解释和可视化词汇语义变化的多因素性质方面的有效性。从理论上讲，本研究不仅证实了基于使用的认知语义的基本假设（如非离散性、感官和身体体验的不平等），而且通过证明社会文化因素在词汇的语义边界变化中也起着重要作用来补充这一假设。

{"title":"Towards a dynamic behavioral profile of the Mandarin Chinese temperature term re: a diachronic semasiological approach","authors":"Meili Liu","doi":"10.1515/cllt-2021-0046","DOIUrl":"https://doi.org/10.1515/cllt-2021-0046","url":null,"abstract":"Abstract This study adopts a corpus-based behavioral profile approach, combining multifactorial usage-feature analysis with frequency-based quantitative analysis, to investigate the diachronic semasiological variation of the Mandarin Chinese temperature term 热 re ‘hot’. The result shows a dynamic behavioral profile, i.e., both the usage patterns and the semasiological structural weight of senses have been constantly shifting. The semasiology of re has been becoming more and more diversified over time. Methodologically, this study extends the traditional behavioral profile approach—hierarchical agglomerative cluster analysis by applying multiple correspondence analysis and corroborating its validity in accounting for and visualizing the multifactorial nature of semasiological change of lexical items. Theoretically, the present study not only corroborates basic assumptions of usage-based cognitive semantics (e.g., non-discreteness, non-equality of senses, and bodily experience) but also complements it by demonstrating that sociocultural factors also play an important role in semasiological boundary variation of a lexical item.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"19 1","pages":"289 - 321"},"PeriodicalIF":1.6,"publicationDate":"2022-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45941183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

The distribution of /w/ and /ʍ/ in Scottish Standard English 苏格兰标准英语中/w/和/音/的分布

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-02-08 DOI: 10.1515/cllt-2021-0052

Zeyu Li, Ulrike Gut

Abstract The Scottish English phoneme inventory is generally claimed to have a /ʍ/-/w/ contrast, although several studies have suggested that this historical contrast is weakening for Scottish English speakers in the urban areas of Glasgow, Edinburgh and Aberdeen. Little is known about whether the /ʍ/-/w/ contrast is maintained in supraregional Scottish Standard English (SSE). This study sets out to explore, based on the phonemically transcribed ICE-Scotland corpus, the distribution of [ʍ] and [w] in SSE, their acoustic properties and potentially influencing social and language-internal factors. A total of 1,241 tokens were extracted from the corpus, together with a matching number of tokens, and the median of harmonicity was measured. The results show that [ʍ] and [w] produced for words beginning with are acoustically distinct from [w] produced for words beginning with . [ʍ] is relatively frequent in SSE, but most speakers use both [ʍ] and [w] interchangeably for and some never use [ʍ]. The realisation of as [ʍ] is determined by preceding phonetic context and speaker gender.

苏格兰英语的音素库通常被认为具有/音/-/音/音的对比，尽管一些研究表明，在格拉斯哥、爱丁堡和阿伯丁等城市地区的苏格兰英语使用者中，这种历史对比正在减弱。在跨区域的苏格兰标准英语(SSE)中，/音节/-/w/的对比是否保持，我们知之甚少。基于ICE-Scotland语料库的语音转录，本研究旨在探索SSE中[j]和[j]的分布、声学特性和潜在的影响社会和语言内部因素。从语料库中提取了1241个标记，并给出了匹配的标记数，测量了和谐度的中位数。结果表明，以…开头的单词产生的[j]和[w]在声学上与以…开头的单词产生的[w]不同。[j]在SSE中出现的频率相对较高，但大多数说话者将[j]和[j]互换使用，有些人从不使用[j]。as[音]的实现是由前面的语音语境和说话人的性别决定的。

引用次数: 2

Inferring case paradigms in Koalib with computational classifiers 用计算分类器推断Koalib中的事例范式

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS

Corpus Linguistics and Linguistic Theory

Pub Date : 2022-01-20 DOI: 10.1515/cllt-2021-0028

Nicolas Quint, Marc Allassonnière-Tang

Abstract The object case inflection in Koalib (Niger-Congo) represents complex patterns that involve phoneme position, syllable structure, and tonal pattern. Few attempts have been made with qualitative and quantitative approaches to identify the rules of the object case paradigms in Koalib. In the current study, information on phonemes, tones, and syllables are automatically extracted from a Koalib sample of 2,677 lexemes. The data is then fed to decision-tree-based classifiers to predict the object case paradigms and extract the interactive patterns between the variables. The results improve the predicting accuracy of existing studies and identify the case paradigms predicted by linguistic hypotheses. New case paradigms are also found by the computational classifiers and explained from a linguistic perspective. Our work demonstrates that the combination of linguistic theoretical knowledge with machine learning techniques can become one of the methodological approaches for linguistic analyses.

摘要Koalib（尼日尔-刚果）的宾语变格表示复杂的模式，包括音位、音节结构和音调模式。很少有人尝试用定性和定量的方法来确定Koalib中的客体案例范式的规则。在目前的研究中，音位、音调和音节的信息是从2677个单词的Koalib样本中自动提取的。然后将数据馈送到基于决策树的分类器，以预测对象-案例范式并提取变量之间的交互模式。这些结果提高了现有研究的预测准确性，并确定了语言学假设预测的案例范式。计算分类器也发现了新的事例范式，并从语言学的角度进行了解释。我们的工作表明，将语言学理论知识与机器学习技术相结合可以成为语言学分析的方法论方法之一。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Corpus Linguistics and Linguistic Theory

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀