Corpora最新文献

英文中文

Review: Le Bruyn and Paquot (eds). 2021. Learner Corpus Research Meets Second Language Acquisition 评论：Le Bruyn和Paquot（编辑）。2021.学习者语料库研究与第二语言习得

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-08-01 DOI: 10.3366/cor.2022.0259

W. Crawford

引用次数: 0

The representation of migrants in Spanish judicial decisions: using corpus data to refute hate speech 移民在西班牙司法判决中的代表性：利用语料库数据驳斥仇恨言论

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-08-01 DOI: 10.3366/cor.2022.0253

María José Marín Pérez, Á. Almela

The phenomenon of immigration and its depiction in media texts have been examined profusely within the field of corpus-based discourse analysis ( Gabrielatos and Baker, 2008 ; Baker et al., 2013 ; and Blinder and Allen, 2016 ). This research seeks to present it as reflected in a corpus of 600 judicial decisions issued by Spanish courts in the years 2016 and 2017. This analysis was motivated by the rise of extreme right-wing parties in Europe in recent years. Such parties dehumanise immigrants and portray them as a threat to the welfare state. On first examination, the results appear to dissociate immigration and crime since a considerable percentage of the keywords obtained (about 20 percent) revolves around three major topoi (namely, ‘family’, ‘territory/access’ and ‘legal punishment’) and there is no evidence of any major offences or crimes amongst the top-ranking lexicon. The study of the collocate networks of the keywords within the category ‘legal punishment’ confirms our initial perception; in fact, out of twenty-one collocates, only the word delito (‘crime’) itself collocates with terms referring to typified crimes such as violencia (‘violence’). In parallel, the data were triangulated using the text-classification software UMUTextStats ( García-Díaz et al., 2018 ). The results of this second analysis also confirm our initial observations.

移民现象及其在媒体文本中的描述在基于语料库的话语分析领域得到了广泛的研究(Gabrielatos和Baker, 2008;Baker et al.， 2013;Blinder and Allen, 2016)。本研究试图从西班牙法院在2016年和2017年发布的600个司法判决语料库中反映出这一点。这种分析是受到近年来欧洲极右翼政党崛起的推动。这些政党贬低移民的人性，把他们描绘成福利国家的威胁。在第一次检查中，结果似乎将移民和犯罪分离开来，因为获得的关键词中有相当大比例(约20%)围绕三个主要主题(即“家庭”，“领土/访问”和“法律惩罚”)，并且在排名靠前的词汇中没有任何重大违法或犯罪的证据。对“法律处罚”范畴内关键词搭配网络的研究证实了我们的初步认知;事实上，在21种搭配中，只有delito(“犯罪”)这个词本身与典型犯罪(如violencia(“暴力”))的术语搭配。同时，使用文本分类软件UMUTextStats对数据进行三角测量(García-Díaz et al.， 2018)。第二次分析的结果也证实了我们最初的观察。

{"title":"The representation of migrants in Spanish judicial decisions: using corpus data to refute hate speech","authors":"María José Marín Pérez, Á. Almela","doi":"10.3366/cor.2022.0253","DOIUrl":"https://doi.org/10.3366/cor.2022.0253","url":null,"abstract":"The phenomenon of immigration and its depiction in media texts have been examined profusely within the field of corpus-based discourse analysis ( Gabrielatos and Baker, 2008 ; Baker et al., 2013 ; and Blinder and Allen, 2016 ). This research seeks to present it as reflected in a corpus of 600 judicial decisions issued by Spanish courts in the years 2016 and 2017. This analysis was motivated by the rise of extreme right-wing parties in Europe in recent years. Such parties dehumanise immigrants and portray them as a threat to the welfare state. On first examination, the results appear to dissociate immigration and crime since a considerable percentage of the keywords obtained (about 20 percent) revolves around three major topoi (namely, ‘family’, ‘territory/access’ and ‘legal punishment’) and there is no evidence of any major offences or crimes amongst the top-ranking lexicon. The study of the collocate networks of the keywords within the category ‘legal punishment’ confirms our initial perception; in fact, out of twenty-one collocates, only the word delito (‘crime’) itself collocates with terms referring to typified crimes such as violencia (‘violence’). In parallel, the data were triangulated using the text-classification software UMUTextStats ( García-Díaz et al., 2018 ). The results of this second analysis also confirm our initial observations.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42757298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Reporting local cultures in the globalised world: how indigenised can English be in the free world? 报道全球化世界中的本土文化：英语在自由世界中的本土化程度如何？

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-08-01 DOI: 10.3366/cor.2022.0254

S. Ali, P. Thompson

This paper presents a corpus-based analysis of English newspaper reportage in two South Asian countries, Pakistan (where English was introduced through colonisation) and Afghanistan (which has not been colonised), and their comparison with British newspaper reportage. The objective of this study is to analyse linguistic variation between the cultural press reportage (cpr) of the selected countries and to see which variety of English, Pakistani or Afghan, resembles British English the most. To achieve this objective, three English newspapers from each country were selected for the compilation of a specialised corpus which was analysed with reference to the five textual dimensions introduced by Biber (1988 , 2006 ). This research is significant as no previous study has attempted to find the differences and similarities between the Englishes used in a formerly colonised country and a country that was never part of the British Empire. The comparison indicates that Pakistani cpr is close to British cpr, while Afghan cpr is different. In terms of Biber’s five textual dimensions, Afghan cpr is more informational, narrative, explicit and abstract, and less non-argumentative in comparison with British and Pakistani cpr.

本文对南亚两个国家巴基斯坦（英语是通过殖民引入的）和阿富汗（尚未被殖民）的英语报纸报道进行了语料库分析，并将其与英国报纸报道进行比较。本研究的目的是分析选定国家的文化新闻报道（cpr）之间的语言差异，并了解巴基斯坦语或阿富汗语中哪种英语最像英国英语。为了实现这一目标，从每个国家挑选了三家英文报纸来编辑一个专门的语料库，并参考Biber（19882006）提出的五个文本维度对其进行了分析。这项研究意义重大，因为之前没有任何研究试图发现在一个以前被殖民的国家和一个从未属于大英帝国的国家使用的英语之间的差异和相似之处。比较表明，巴基斯坦的cpr与英国的cpr接近，而阿富汗的cpr则不同。就比伯的五个文本维度而言，与英国和巴基斯坦的cpr相比，阿富汗的cpr更具信息性、叙事性、明确性和抽象性，而非争论性更少。

{"title":"Reporting local cultures in the globalised world: how indigenised can English be in the free world?","authors":"S. Ali, P. Thompson","doi":"10.3366/cor.2022.0254","DOIUrl":"https://doi.org/10.3366/cor.2022.0254","url":null,"abstract":"This paper presents a corpus-based analysis of English newspaper reportage in two South Asian countries, Pakistan (where English was introduced through colonisation) and Afghanistan (which has not been colonised), and their comparison with British newspaper reportage. The objective of this study is to analyse linguistic variation between the cultural press reportage (cpr) of the selected countries and to see which variety of English, Pakistani or Afghan, resembles British English the most. To achieve this objective, three English newspapers from each country were selected for the compilation of a specialised corpus which was analysed with reference to the five textual dimensions introduced by Biber (1988 , 2006 ). This research is significant as no previous study has attempted to find the differences and similarities between the Englishes used in a formerly colonised country and a country that was never part of the British Empire. The comparison indicates that Pakistani cpr is close to British cpr, while Afghan cpr is different. In terms of Biber’s five textual dimensions, Afghan cpr is more informational, narrative, explicit and abstract, and less non-argumentative in comparison with British and Pakistani cpr.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49301474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Review: Jonsson and Larsson (eds). 2020. Voices Past and Present – Studies of Involved, Speech-related and Spoken Texts: In Honor of Merja Kytö 综述:Jonsson和Larsson主编。2020. 过去和现在的声音-涉及的，言语相关和口语文本的研究:在Merja的荣誉Kytö

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-08-01 DOI: 10.3366/cor.2022.0258

Mariana Centanin Bertho

引用次数: 0

The representation of gendered social actors across five manosphere communities on Reddit Reddit上五个庄园社区中性别社会行为者的代表性

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-08-01 DOI: 10.3366/cor.2022.0257

A. Krendel, M. McGlashan, Veronika Koller

This paper investigates the representation of gendered social actors in a specialised corpus of 10.9 million words, collected from five Reddit communities associated with the so-called ‘manosphere’: incels (involuntary celibates), Men Going Their Own Way (male separatists), pick-up artists, men’s rights activists, and a group dedicated to wider discussions of ‘red pill’ philosophy. Thirty-four gendered social actor terms were identified as key-key-words across the manosphere corpora. Both male and female social actors are referenced using relational terms, while the latter are also referenced using derogatory terms and the former are referenced using terms for kinship and in-group identification. We then analysed the consistent collocates ( Baker et al., 2008 ) of the four most frequent gendered social actor terms ( women, girls, men and guys), to establish the topics, descriptions and actions associated with the social actors across the five groups. Gendered social actors were constructed in essentialist dichotomies, with women and girls, although objectified and passivated in dating/sexual contexts, being represented as violent towards male social actors and as holding a privileged position over men in wider society. The anti-feminist ideology reflected in manosphere discourse can be seen as a more extreme version of mainstream discourse. To the extent that manosphere discourse spreads beyond dedicated forums and websites, its views will be re-imported into the mainstream, leading to a wider radicalisation.

这篇论文调查了性别社会行为者在一个1090万字的专业语料库中的表现，这些语料库收集自Reddit上五个与所谓的“管理圈”相关的社区:incels(非自愿的独身者)，Men Going Their Own Way(男性分离主义者)，pick-妹艺人，男性权利活动家，以及一个致力于更广泛讨论“红色药丸”哲学的团体。34个性别社会行为者术语被确定为跨管理圈语料库的关键字。男性和女性社会行动者都使用关系术语，而后者也使用贬义术语，而前者则使用亲属关系和群体内认同术语。然后，我们分析了四个最常见的性别社会行动者术语(妇女、女孩、男人和男孩)的一致搭配(Baker et al.， 2008)，以建立与五个群体的社会行动者相关的主题、描述和行动。性别社会行为者是在本质主义的二分法中构建的，尽管妇女和女孩在约会/性环境中被客观化和钝化，但她们被描述为对男性社会行为者有暴力行为，并且在更广泛的社会中比男性拥有特权地位。庄园话语所反映的反女权主义意识形态，可以看作是主流话语的一种更为极端的版本。在某种程度上，庄园话语传播到专门的论坛和网站之外，它的观点将被重新引入主流，导致更广泛的激进化。

{"title":"The representation of gendered social actors across five manosphere communities on Reddit","authors":"A. Krendel, M. McGlashan, Veronika Koller","doi":"10.3366/cor.2022.0257","DOIUrl":"https://doi.org/10.3366/cor.2022.0257","url":null,"abstract":"This paper investigates the representation of gendered social actors in a specialised corpus of 10.9 million words, collected from five Reddit communities associated with the so-called ‘manosphere’: incels (involuntary celibates), Men Going Their Own Way (male separatists), pick-up artists, men’s rights activists, and a group dedicated to wider discussions of ‘red pill’ philosophy. Thirty-four gendered social actor terms were identified as key-key-words across the manosphere corpora. Both male and female social actors are referenced using relational terms, while the latter are also referenced using derogatory terms and the former are referenced using terms for kinship and in-group identification. We then analysed the consistent collocates ( Baker et al., 2008 ) of the four most frequent gendered social actor terms ( women, girls, men and guys), to establish the topics, descriptions and actions associated with the social actors across the five groups. Gendered social actors were constructed in essentialist dichotomies, with women and girls, although objectified and passivated in dating/sexual contexts, being represented as violent towards male social actors and as holding a privileged position over men in wider society. The anti-feminist ideology reflected in manosphere discourse can be seen as a more extreme version of mainstream discourse. To the extent that manosphere discourse spreads beyond dedicated forums and websites, its views will be re-imported into the mainstream, leading to a wider radicalisation.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41748683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Apparently identical verbs can be represented differently: comparing L1–L2 inflection with contingency-based measure ΔP 显然相同的动词可以用不同的方式表示：将L1–L2屈折与基于偶然性的测度ΔP进行比较

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-04-01 DOI: 10.3366/cor.2022.0236

Stefano Rastelli, Akira Murakami

We propose a method that models unidirectional, contingency-based association scale ΔP in order to analyse the different degrees of morpheme productivity in apparently identical L1–L2 inflected pairs. The method has the potential to uncover differences in how in L1–L2 inflected items are represented by L2 learners and native speakers. Such differences are at risk of remaining invisible if one considers only frequency, distribution and rank of predicates.

我们提出了一种方法，建模单向的，基于权变的联想量表ΔP，以分析在表面上相同的L1-L2屈折对中不同程度的语素生产力。该方法有可能揭示L2学习者和母语人士在表达L1-L2屈折词方面的差异。如果只考虑谓词的频率、分布和等级，这种差异就有可能不可见。

引用次数: 1

Emotional engagement in Thai and Japanese insurance advertising: corpus-based keyword analysis 泰国和日本保险广告中的情感投入:基于语料库的关键词分析

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-04-01 DOI: 10.3366/cor.2022.0235

Chavalin Svetanant, Brian Ballsun-Stanton, Attapol T. Rutherford

Advertisements demonstrate patterns of communication that are imagined to be acceptable to the communities at which they are aimed, serving as cultural artefacts that provide insights into shared cultural interpretations and social interactions. Drawing on techniques from corpus linguistics, text linguistics and the Appraisal framework, we conducted an analysis of salient keywords in a collection of Thai and Japanese tv commercials (tvcs) for insurance products in order to identify statistically significant keywords and examine emotional engagement. We reveal how specific keywords and communicative strategies used in the persuasive discourse of the insurance products reflect Thai and Japanese socio-cultural preferences and values. We found that while Thai tvcs demonstrate a higher audience engagement through the use of metadiscourse markers such as engagement markers and emphatics under the theme of moral and family values, Japanese tvcs highlight information-dense content-words, including the extensive use of statistics involving themes of security, health and value propositions.

广告展示了被想象为其所针对的社区所接受的交流模式，作为文化工艺品，提供了对共享文化解释和社会互动的见解。利用语料库语言学、文本语言学和评价框架的技术，我们对泰国和日本保险产品电视广告(tvcs)中的显著关键词进行了分析，以确定具有统计意义的关键词并检验情感投入。我们揭示了保险产品说服性话语中使用的特定关键词和交际策略如何反映泰国和日本的社会文化偏好和价值观。我们发现，虽然泰国电视节目通过使用元话语标记(如参与标记和道德和家庭价值观主题下的强调)显示出更高的观众参与度，但日本电视节目强调信息密集的内容词，包括广泛使用涉及安全、健康和价值主张主题的统计数据。

引用次数: 0

Front matter 前页

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-04-01 DOI: 10.3366/cor.2022.0232

引用次数: 0

Making use of transcription data from qualitative research within a corpus-linguistic paradigm: issues, experiences and recommendations 在语料库-语言学范式中利用定性研究的转录数据:问题、经验和建议

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-04-01 DOI: 10.3366/cor.2022.0237

Luke C. Collins, A. Hardie

In this paper, we reflect on the process of re-operationalising transcript data generated in an ethnographic study for the purposes of corpus analysis. We present a corpus of patient–provider interactions in the context of Emergency Departments in hospitals in Australia, to discuss the process through which ethnographic transcripts were manipulated to generate a searchable corpus. We refer to the types of corpus analysis that this conversion enables, facilitated by the rich metadata collected alongside the transcribed audio recordings, augmenting the findings of prior qualitative analyses. Subsequently, we offer guidance for spoken data transcription, intended to ‘future proof’ such data for subsequent reformatting for corpus linguistic analysis.

在本文中，我们反思了将人种学研究中生成的文字记录数据重新操作以进行语料库分析的过程。我们在澳大利亚医院急诊科的背景下展示了一个患者-提供者互动的语料库，以讨论民族志转录本被操纵以生成可搜索语料库的过程。我们提到了这种转换所能实现的语料库分析类型，这得益于与转录的录音一起收集的丰富元数据，增强了先前定性分析的结果。随后，我们为口语数据的转录提供了指导，旨在“未来证明”这些数据，以便随后重新格式化以进行语料库语言分析。

引用次数: 2

Review: Liu. 2020. Corpus-Assisted Translation Teaching: Issues and Challenges 评审：刘。2020.语料库辅助翻译教学：问题与挑战

IF 0.5 Q3 LINGUISTICS

Corpora

Pub Date : 2022-04-01 DOI: 10.3366/cor.2022.0240

Ting-Feng Liu, M. Afzaal

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Corpora

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀