The present study offers new insights into how the cognitive-semantic analysis of adjectival deontic modality in the mediatised register of the fatwa can be methodologically enhanced at both quantitative and qualitative levels. Drawing on the force-dynamics model originated by Talmy (1981, 1988) and developed by Sweetser (1990), the study investigates adjectivally modal expressions of obligation and permission in an electronic corpus of fatwas (353,293 words across 1,440 texts). The research data were processed with the corpus tool Wmatrix (Rayson, 2003) in order to calculate the relevant modal keywords and generate their concordances; further, an interactive register analysis of tenor in fatwa discourse is provided in a way that (i) facilitates the concordance reading of the adjectival keywords of deontic modality and (ii) examines the force dynamics underlying these keywords in terms of their modally interactive meanings. The study reaches three main findings. First, the specialised corpus of fatwas contains five keywords of adjectival deontic modality: obligatory, obliged, permissible, impermissible and forbidden. Second, the force dynamics of obligatory, obliged and permissible reveals an enacting, positive-compulsion force with attitudinal variations of objective and subjective meanings towards real-world content (themes) and participants (questioner and questionee) in the mediatised register of the fatwa. Third, complementary to the second finding, the force dynamics of impermissible and forbidden reveals a set of debarring, negative-restriction barriers of various forms, viz. personal, collective, generic and topical, in the same fatwa register.
Youssef, A. ‘The force dynamics of adjectival deontic modality in the mediatised register of the fatwa: a corpus cognitive–semantic analysis’, Corpora, 2021-04-01. doi:10.3366/COR.2021.0207
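Wmatrix calculates keyness with the log-likelihood statistic (Rayson and Garside, 2000). As a rough illustration of how a modal keyword would surface from the fatwa corpus against a reference corpus, here is a minimal sketch; the frequencies and reference-corpus size are invented for the example:

```python
import math

def log_likelihood(freq_study, size_study, freq_ref, size_ref):
    """Log-likelihood keyness (Rayson and Garside, 2000): how strongly a
    word's frequency in the study corpus diverges from a reference corpus."""
    combined = freq_study + freq_ref
    total = size_study + size_ref
    # Expected frequencies under equal relative frequency in both corpora
    expected_study = size_study * combined / total
    expected_ref = size_ref * combined / total
    ll = 0.0
    for observed, expected in ((freq_study, expected_study),
                               (freq_ref, expected_ref)):
        if observed > 0:
            ll += observed * math.log(observed / expected)
    return 2 * ll

# Invented figures: a word occurring 120 times in the 353,293-word fatwa
# corpus versus 5 times in a 1,000,000-word reference corpus
ll = log_likelihood(120, 353_293, 5, 1_000_000)
print(round(ll, 2))  # far above the 15.13 threshold for p < 0.0001
```

Words whose score clears the chosen significance threshold are reported as keywords; their concordances are then read qualitatively, as in the study.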
Goulart, L. Review of Römer, Cortes and Friginal (eds). 2020. Advances in Corpus-Based Research on Academic Writing: Effects of Discipline, Register, and Writing Expertise. Amsterdam and Philadelphia: John Benjamins. Corpora, 2021-04-01. doi:10.3366/COR.2021.0212
With the increasing availability of large corpora, quantitative corpus analysis is becoming more and more popular as a method for doing linguistic research. This paper uses CESAR, a new research tool that makes it possible to search syntactically annotated corpora without extensive programming knowledge, to study the subjectivity patterns of four Dutch causal connectives. Analysing a large set of causal relations marked by four of the most frequent Dutch causal connectives (daarom, dus, omdat and want), the case study aims to corroborate the subjectivity hypothesis established on the basis of smaller-scale studies that used manual annotation. The automatic analysis of the subjectivity patterns of Dutch causal connectives illustrates the usability of CESAR in particular and the feasibility of automatic coherence analysis in general. In addition, it generates new insights into the subjectivity patterns of daarom, dus, omdat and want.
Hoek, J., T. Sanders and W. Spooren. ‘Automatic coherence analysis of Dutch: testing the subjectivity hypothesis on a larger scale’, Corpora, 2021-04-01. doi:10.3366/COR.2021.0211
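CESAR itself queries syntactically annotated corpora; as a much cruder stand-in, the initial retrieval step of pulling every instance of the four connectives, before any subjectivity coding, can be sketched with a plain surface search. The mini-corpus below is invented:

```python
import re

# Hypothetical Dutch mini-corpus; a surface search like this does not
# replicate CESAR's syntactic queries, only the retrieval of candidates.
sentences = [
    "Het regende, dus de wedstrijd werd afgelast.",
    "Ze bleef thuis omdat ze ziek was.",
    "Hij is vast moe, want hij geeuwt de hele tijd.",
    "Daarom nam ze de trein.",
]

CONNECTIVES = ("daarom", "dus", "omdat", "want")

def find_connectives(sents, connectives=CONNECTIVES):
    """Return (connective, sentence) pairs for every match, as a first
    retrieval step before subjectivity annotation."""
    pattern = re.compile(r"\b(" + "|".join(connectives) + r")\b",
                         re.IGNORECASE)
    hits = []
    for sent in sents:
        for match in pattern.finditer(sent):
            hits.append((match.group(1).lower(), sent))
    return hits

for connective, sent in find_connectives(sentences):
    print(f"{connective:>7}: {sent}")
```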
Contrary to the idea, widespread for at least a hundred years, that women differ substantially from men when they express themselves in English-speaking contexts (e.g., Jespersen, 1922; and Steadman, 1935), empirical studies have shown that these differences are often minimal and are not due to gender alone (e.g., Eckert, 2008; and Baker, 2014). This also frequently applies to the way they swear, despite certain preferences which have been documented in empirical studies. With the growing impact that social media now have on our everyday lives, these platforms represent a unique opportunity to study vast quantities of written data. This paper, based on a corpus of about one million tweets, attempts to delve deeper into the analysis of gendered swearword habits. First, the goal is to show that, even if there are certain gendered preferences in the choice of swearwords, women and men frequently display similar patterns in using them, thus reinforcing the idea that they are not so linguistically different. Secondly, the paper provides insights into how collocational networks can be used to achieve this, and thus how focussing on differences can be one way to spot similarities across two sub-corpora.
Gauthier, M. ‘“Eww wtf, what a dumb bitch”: a case study of similitudes inside gender-specific swearing patterns on Twitter’, Corpora, 2021-04-01. doi:10.3366/COR.2021.0208
Recent lexical approaches to the identification of language ideologies focus on the application of quantitative corpus-linguistic techniques to large data sets as a way to minimise researcher inference and ensure more objective sampling methods, replicability of analytical procedures, and a higher degree of generalisability (Fitzsimmons-Doolan, 2014; Subtirelu, 2015; Vessey, 2017; Wright and Brooks, 2019; and McEntee-Atalianis and Vessey, 2020). Based on two comprehensive, specialised research (11.6 million words) and comparator (22.4 million words) newspaper corpora, this study examines the effectiveness of multivariate and univariate statistical techniques, and proposes a three-step approach whereby corpus linguistics and critical discourse analysis are combined to identify (1) thematic and (2) ideological discourses (cf. ‘d’/‘D’ discourses; Gee, 2010), and (3) language ideologies. In contrast to recent contributions, it is argued that item frequency is not necessarily a reliable or effective indicator of language ideologies but, rather, of language-related discourses which can be examined for implicit and explicit language-ideological content. The combination of multivariate and univariate statistical techniques and the three-step approach is shown to be a highly effective methodological solution for synchronic and diachronic language ideology and discourse research based on topically/discursively heterogeneous corpora.
Ajšić, A. ‘Capturing Herder: a three-step approach to the identification of language ideologies using corpus linguistics and critical discourse analysis’, Corpora, 2021-04-01. doi:10.3366/COR.2021.0209
Research on lexical bundles has shed much light on disciplinary influences on the employment of these multi-word expressions in academic discourse, particularly in research articles. Little work, however, has been done on how research paradigms may impact on lexical bundles in academic discourse. This study aims to investigate the extent to which lexical bundles vary in quantitative, qualitative and mixed-methods research articles across two disciplines. All four-word lexical bundles were extracted from a specially built corpus of research articles and were analysed for their linguistic structures and discourse functions. The data analyses revealed marked structural and functional variation between different research paradigms and disciplines. Across paradigms, the quantitative articles differed from the qualitative articles by employing significantly more verb phrase bundles and participant-orientated functions, whereas the qualitative articles employed significantly more prepositional phrase bundles and text-orientated functions. Across disciplines, the mixed-methods articles in education employed significantly more noun phrase bundles and research-orientated functions, whereas the mixed-methods articles in psychology used more prepositional bundles and text-orientated functions. These paradigmatic and disciplinary differences in lexical bundles are explained by examining the underlying perceptions of knowledge and knowledge-making practices in different research paradigms and disciplines.
Cao, F. ‘A comparative study of lexical bundles across paradigms and disciplines’, Corpora, 2021-04-01. doi:10.3366/COR.2021.0210
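The extraction step described above, pulling every contiguous four-word sequence that recurs above frequency and dispersion thresholds, can be sketched as follows. The thresholds and two-text corpus are invented for illustration; published bundle studies typically use normalised cut-offs such as twenty occurrences per million words:

```python
from collections import Counter

def four_word_bundles(texts, min_freq=2, min_texts=2):
    """Contiguous four-word sequences kept only if they recur at least
    min_freq times overall and appear in at least min_texts texts
    (the dispersion condition screens out author-specific phrases)."""
    freq = Counter()
    dispersion = Counter()
    for text in texts:
        tokens = text.lower().split()
        seen = set()
        for i in range(len(tokens) - 3):
            gram = tuple(tokens[i:i + 4])
            freq[gram] += 1
            seen.add(gram)
        dispersion.update(seen)  # one count per text, however often it recurs
    return {gram: count for gram, count in freq.items()
            if count >= min_freq and dispersion[gram] >= min_texts}

# Invented two-text corpus for illustration
texts = [
    "on the other hand the results of the study are clear",
    "the results of the study suggest on the other hand caution",
]
for gram, count in four_word_bundles(texts).items():
    print(" ".join(gram), count)
```

The surviving bundles would then be hand-coded for structure (noun phrase, verb phrase, prepositional phrase) and discourse function, as in the study.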
Jiang, Z. Review of Tao. 2018. Russian–Chinese Parallel Corpus-based Research on Translational Texts about Humanities and Social Sciences. Beijing: Science Press. Corpora, 2021-04-01. doi:10.3366/COR.2021.0213
The paper presents a two-part forensic linguistic analysis of an historic collection of abuse letters, sent to individuals in the public eye and individuals’ private homes between 2007 and 2009. We employ the technique of structural topic modelling (stm) to identify distinctions in the core topics of the letters, gauging the value of this relatively under-used methodology in forensic linguistics. Four key topics were identified in the letters, ‘Politics A’ and ‘B’, ‘Healthcare’ and ‘Immigration’, and their coherence, correlation and shifts in topic were evaluated. Following the stm, a qualitative corpus linguistic analysis was undertaken, coding concordance lines according to topic, with the reliability between coders tested. This coding demonstrated that various connected statements within the same topic tend to gain or lose prevalence over time, and ultimately confirmed the consistency of content within the four topics identified through stm throughout the letter series. The discussion and conclusions to the paper reflect on the findings and also consider the utility of these methodologies for linguistics and forensic linguistics in particular. The study demonstrates real value in revisiting a forensic linguistic dataset such as this to test and develop methodologies for the field.
Busso, L., M. Petykó, S. Atkins and T. D. Grant. ‘Operation Heron: latent topic changes in an abusive letter series’, Corpora, 2021-03-26. doi:10.3366/cor.2022.0255
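The abstract reports testing reliability between the coders of the concordance lines without naming a statistic; Cohen's kappa is one standard choice for two coders, sketched here on invented topic codes for ten lines:

```python
from collections import Counter

def cohens_kappa(coder_a, coder_b):
    """Cohen's kappa: agreement between two coders, corrected for the
    agreement expected by chance given each coder's label distribution.
    (Shown as an illustration; the article may use a different measure.)"""
    assert len(coder_a) == len(coder_b)
    n = len(coder_a)
    observed = sum(x == y for x, y in zip(coder_a, coder_b)) / n
    counts_a, counts_b = Counter(coder_a), Counter(coder_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2
    return (observed - expected) / (1 - expected)

# Invented topic codes assigned to ten concordance lines by two coders
a = ["politics", "politics", "healthcare", "immigration", "politics",
     "healthcare", "politics", "immigration", "healthcare", "politics"]
b = ["politics", "politics", "healthcare", "immigration", "healthcare",
     "healthcare", "politics", "immigration", "healthcare", "politics"]
print(cohens_kappa(a, b))
```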
This paper describes the construction and annotation of the Late Latin Charter Treebank, a set of three dependency treebanks (llct1, llct2 and llct3) which together contain 1,261 Early Medieval Latin documentary texts (i.e., original charters) written in Italy between AD 714 and 1000 (about 594,000 tokens). The paper focusses on matters which a linguistically or philologically inclined user of llct needs to know: the criteria on which the charters were selected, the special characteristics of the annotation types utilised, and the geographical and chronological distribution of the data. In addition to normal queries on forms, lemmas, morphology and syntax, complex philological research settings are enabled by the textual annotation layer of llct, which indicates abbreviated and damaged words, as well as the formulaic and non-formulaic passages of each charter.
Korkiakangas, T. ‘Late Latin Charter Treebank: contents and annotation’, Corpora, 2021-01-01. doi:10.3366/cor.2021.0217
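LLCT is available through Universal Dependencies in the CoNLL-U format, so the "normal queries on forms, lemmas, morphology and syntax" reduce to filtering the ten standard columns. A minimal reader, applied to an invented Latin snippet (not taken from LLCT itself):

```python
def read_conllu(text):
    """Parse CoNLL-U token lines into dicts keyed by the ten standard
    fields, skipping comment lines and blank sentence separators."""
    fields = ("id", "form", "lemma", "upos", "xpos",
              "feats", "head", "deprel", "deps", "misc")
    tokens = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        tokens.append(dict(zip(fields, line.split("\t"))))
    return tokens

# Invented three-token snippet in CoNLL-U layout (illustration only)
sample = (
    "# text = manifestus sum ego\n"
    "1\tmanifestus\tmanifestus\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_\n"
    "2\tsum\tsum\tAUX\t_\tMood=Ind|Number=Sing|Person=1|Tense=Pres\t1\tcop\t_\t_\n"
    "3\tego\tego\tPRON\t_\tCase=Nom|Number=Sing|Person=1\t1\tnsubj\t_\t_\n"
)

tokens = read_conllu(sample)
# Morphological query: every nominative form in the snippet
print([t["form"] for t in tokens if "Case=Nom" in t["feats"]])
```

The textual annotation layer the paper describes (abbreviations, damage, formulaic passages) would appear in additional attributes alongside these columns.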