This paper investigates the representation of gendered social actors in a specialised corpus of 10.9 million words, collected from five Reddit communities associated with the so-called ‘manosphere’: incels (involuntary celibates), Men Going Their Own Way (male separatists), pick-up artists, men’s rights activists, and a group dedicated to wider discussions of ‘red pill’ philosophy. Thirty-four gendered social actor terms were identified as key-key-words across the manosphere corpora. Both male and female social actors are referenced using relational terms, while the latter are also referenced using derogatory terms and the former are referenced using terms for kinship and in-group identification. We then analysed the consistent collocates ( Baker et al., 2008 ) of the four most frequent gendered social actor terms ( women, girls, men and guys), to establish the topics, descriptions and actions associated with the social actors across the five groups. Gendered social actors were constructed in essentialist dichotomies, with women and girls, although objectified and passivated in dating/sexual contexts, being represented as violent towards male social actors and as holding a privileged position over men in wider society. The anti-feminist ideology reflected in manosphere discourse can be seen as a more extreme version of mainstream discourse. To the extent that manosphere discourse spreads beyond dedicated forums and websites, its views will be re-imported into the mainstream, leading to a wider radicalisation.
这篇论文调查了性别社会行为者在一个1090万字的专业语料库中的表现,这些语料库收集自Reddit上五个与所谓的“管理圈”相关的社区:incels(非自愿的独身者),Men Going Their Own Way(男性分离主义者),pick-妹艺人,男性权利活动家,以及一个致力于更广泛讨论“红色药丸”哲学的团体。34个性别社会行为者术语被确定为跨管理圈语料库的关键字。男性和女性社会行动者都使用关系术语,而后者也使用贬义术语,而前者则使用亲属关系和群体内认同术语。然后,我们分析了四个最常见的性别社会行动者术语(妇女、女孩、男人和男孩)的一致搭配(Baker et al., 2008),以建立与五个群体的社会行动者相关的主题、描述和行动。性别社会行为者是在本质主义的二分法中构建的,尽管妇女和女孩在约会/性环境中被客观化和钝化,但她们被描述为对男性社会行为者有暴力行为,并且在更广泛的社会中比男性拥有特权地位。庄园话语所反映的反女权主义意识形态,可以看作是主流话语的一种更为极端的版本。在某种程度上,庄园话语传播到专门的论坛和网站之外,它的观点将被重新引入主流,导致更广泛的激进化。
{"title":"The representation of gendered social actors across five manosphere communities on Reddit","authors":"A. Krendel, M. McGlashan, Veronika Koller","doi":"10.3366/cor.2022.0257","DOIUrl":"https://doi.org/10.3366/cor.2022.0257","url":null,"abstract":"This paper investigates the representation of gendered social actors in a specialised corpus of 10.9 million words, collected from five Reddit communities associated with the so-called ‘manosphere’: incels (involuntary celibates), Men Going Their Own Way (male separatists), pick-up artists, men’s rights activists, and a group dedicated to wider discussions of ‘red pill’ philosophy. Thirty-four gendered social actor terms were identified as key-key-words across the manosphere corpora. Both male and female social actors are referenced using relational terms, while the latter are also referenced using derogatory terms and the former are referenced using terms for kinship and in-group identification. We then analysed the consistent collocates ( Baker et al., 2008 ) of the four most frequent gendered social actor terms ( women, girls, men and guys), to establish the topics, descriptions and actions associated with the social actors across the five groups. Gendered social actors were constructed in essentialist dichotomies, with women and girls, although objectified and passivated in dating/sexual contexts, being represented as violent towards male social actors and as holding a privileged position over men in wider society. The anti-feminist ideology reflected in manosphere discourse can be seen as a more extreme version of mainstream discourse. To the extent that manosphere discourse spreads beyond dedicated forums and websites, its views will be re-imported into the mainstream, leading to a wider radicalisation.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41748683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Review: Jonsson and Larsson (eds). 2020. Voices Past and Present – Studies of Involved, Speech-related and Spoken Texts: In Honor of Merja Kytö","authors":"Mariana Centanin Bertho","doi":"10.3366/cor.2022.0258","DOIUrl":"https://doi.org/10.3366/cor.2022.0258","url":null,"abstract":"","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44083644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We propose a method that models unidirectional, contingency-based association scale ΔP in order to analyse the different degrees of morpheme productivity in apparently identical L1–L2 inflected pairs. The method has the potential to uncover differences in how in L1–L2 inflected items are represented by L2 learners and native speakers. Such differences are at risk of remaining invisible if one considers only frequency, distribution and rank of predicates.
{"title":"Apparently identical verbs can be represented differently: comparing L1–L2 inflection with contingency-based measure ΔP","authors":"Stefano Rastelli, Akira Murakami","doi":"10.3366/cor.2022.0236","DOIUrl":"https://doi.org/10.3366/cor.2022.0236","url":null,"abstract":"We propose a method that models unidirectional, contingency-based association scale ΔP in order to analyse the different degrees of morpheme productivity in apparently identical L1–L2 inflected pairs. The method has the potential to uncover differences in how in L1–L2 inflected items are represented by L2 learners and native speakers. Such differences are at risk of remaining invisible if one considers only frequency, distribution and rank of predicates.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45522688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chavalin Svetanant, Brian Ballsun-Stanton, Attapol T. Rutherford
Advertisements demonstrate patterns of communication that are imagined to be acceptable to the communities at which they are aimed, serving as cultural artefacts that provide insights into shared cultural interpretations and social interactions. Drawing on techniques from corpus linguistics, text linguistics and the Appraisal framework, we conducted an analysis of salient keywords in a collection of Thai and Japanese tv commercials (tvcs) for insurance products in order to identify statistically significant keywords and examine emotional engagement. We reveal how specific keywords and communicative strategies used in the persuasive discourse of the insurance products reflect Thai and Japanese socio-cultural preferences and values. We found that while Thai tvcs demonstrate a higher audience engagement through the use of metadiscourse markers such as engagement markers and emphatics under the theme of moral and family values, Japanese tvcs highlight information-dense content-words, including the extensive use of statistics involving themes of security, health and value propositions.
{"title":"Emotional engagement in Thai and Japanese insurance advertising: corpus-based keyword analysis","authors":"Chavalin Svetanant, Brian Ballsun-Stanton, Attapol T. Rutherford","doi":"10.3366/cor.2022.0235","DOIUrl":"https://doi.org/10.3366/cor.2022.0235","url":null,"abstract":"Advertisements demonstrate patterns of communication that are imagined to be acceptable to the communities at which they are aimed, serving as cultural artefacts that provide insights into shared cultural interpretations and social interactions. Drawing on techniques from corpus linguistics, text linguistics and the Appraisal framework, we conducted an analysis of salient keywords in a collection of Thai and Japanese tv commercials (tvcs) for insurance products in order to identify statistically significant keywords and examine emotional engagement. We reveal how specific keywords and communicative strategies used in the persuasive discourse of the insurance products reflect Thai and Japanese socio-cultural preferences and values. We found that while Thai tvcs demonstrate a higher audience engagement through the use of metadiscourse markers such as engagement markers and emphatics under the theme of moral and family values, Japanese tvcs highlight information-dense content-words, including the extensive use of statistics involving themes of security, health and value propositions.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48756969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, we reflect on the process of re-operationalising transcript data generated in an ethnographic study for the purposes of corpus analysis. We present a corpus of patient–provider interactions in the context of Emergency Departments in hospitals in Australia, to discuss the process through which ethnographic transcripts were manipulated to generate a searchable corpus. We refer to the types of corpus analysis that this conversion enables, facilitated by the rich metadata collected alongside the transcribed audio recordings, augmenting the findings of prior qualitative analyses. Subsequently, we offer guidance for spoken data transcription, intended to ‘future proof’ such data for subsequent reformatting for corpus linguistic analysis.
{"title":"Making use of transcription data from qualitative research within a corpus-linguistic paradigm: issues, experiences and recommendations","authors":"Luke C. Collins, A. Hardie","doi":"10.3366/cor.2022.0237","DOIUrl":"https://doi.org/10.3366/cor.2022.0237","url":null,"abstract":"In this paper, we reflect on the process of re-operationalising transcript data generated in an ethnographic study for the purposes of corpus analysis. We present a corpus of patient–provider interactions in the context of Emergency Departments in hospitals in Australia, to discuss the process through which ethnographic transcripts were manipulated to generate a searchable corpus. We refer to the types of corpus analysis that this conversion enables, facilitated by the rich metadata collected alongside the transcribed audio recordings, augmenting the findings of prior qualitative analyses. Subsequently, we offer guidance for spoken data transcription, intended to ‘future proof’ such data for subsequent reformatting for corpus linguistic analysis.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45330680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This study investigates ( i) whether there has been a shift towards increased statistical focus in corpus linguistic research articles, and, if so, ( ii) whether this has had any repercussions for the attention paid to linguistic description. We investigate this through an analysis of the relative focus on statistical reporting versus linguistic description in the way the results are reported and discussed in research articles published in four major corpus linguistics journals in 2009 and 2019. The results display a marked change: in 2009, a clear majority of the articles exhibit a preference for linguistic description over statistical reporting; in 2019, the exact opposite is true. The number of different statistical techniques employed has also gone up. Whilst the increased statistical focus may reflect increased methodological sophistication, our results show that it has come at a cost: a diminished focus on linguistic description, evident, for example, through fewer text excerpts and linguistic examples, which appears to be symptomatic of increasing distance from the language that is the object of study. We discuss these shifts and suggest some ways of employing sophisticated statistical techniques without sacrificing a focus on language.
{"title":"On the status of statistical reporting versus linguistic description in corpus linguistics: a ten-year perspective","authors":"Tove Larsson, Jesse Egbert, D. Biber","doi":"10.3366/cor.2022.0238","DOIUrl":"https://doi.org/10.3366/cor.2022.0238","url":null,"abstract":"This study investigates ( i) whether there has been a shift towards increased statistical focus in corpus linguistic research articles, and, if so, ( ii) whether this has had any repercussions for the attention paid to linguistic description. We investigate this through an analysis of the relative focus on statistical reporting versus linguistic description in the way the results are reported and discussed in research articles published in four major corpus linguistics journals in 2009 and 2019. The results display a marked change: in 2009, a clear majority of the articles exhibit a preference for linguistic description over statistical reporting; in 2019, the exact opposite is true. The number of different statistical techniques employed has also gone up. Whilst the increased statistical focus may reflect increased methodological sophistication, our results show that it has come at a cost: a diminished focus on linguistic description, evident, for example, through fewer text excerpts and linguistic examples, which appears to be symptomatic of increasing distance from the language that is the object of study. We discuss these shifts and suggest some ways of employing sophisticated statistical techniques without sacrificing a focus on language.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44253043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a corpus-driven Sinclairian analysis of five high-frequency Slovene verbs covering the lexical paradigm ‘to express orally’ in combination with their premodifying adverbs of manner. One of the main goals of the paper is to establish how frequent the phenomenon of semantic prosody actually is among high-frequency lexical items (here, adv-v pairs). A methodology aiming to provide an answer to this question has been proposed featuring the top-down approach (i.e., in order of decreasing frequency of occurrence). It involves setting up the widest possible parameters of searching for so-called ‘extended units of meaning’ and their semantic prosody amongst the most frequent lexical patterns in a language. A total of twenty-six adv-v pairs have been examined. Results indicate a strong correlation between the frequency of multi-word lexical items and their tendency to develop semantic prosodies: high-frequency collocations are thus more likely to have semantic prosodies compared to their lower-frequency counterparts. Overall, results also corroborate the trend of semantic prosody to be found with mainly negative meanings and to a lesser extent in neutral meanings, while no positive semantic prosody has been determined in this study.
{"title":"Semantic prosody of Slovene adverb–verb collocations: introducing the top-down approach","authors":"P. Jurko","doi":"10.3366/cor.2022.0234","DOIUrl":"https://doi.org/10.3366/cor.2022.0234","url":null,"abstract":"This paper presents a corpus-driven Sinclairian analysis of five high-frequency Slovene verbs covering the lexical paradigm ‘to express orally’ in combination with their premodifying adverbs of manner. One of the main goals of the paper is to establish how frequent the phenomenon of semantic prosody actually is among high-frequency lexical items (here, adv-v pairs). A methodology aiming to provide an answer to this question has been proposed featuring the top-down approach (i.e., in order of decreasing frequency of occurrence). It involves setting up the widest possible parameters of searching for so-called ‘extended units of meaning’ and their semantic prosody amongst the most frequent lexical patterns in a language. A total of twenty-six adv-v pairs have been examined. Results indicate a strong correlation between the frequency of multi-word lexical items and their tendency to develop semantic prosodies: high-frequency collocations are thus more likely to have semantic prosodies compared to their lower-frequency counterparts. Overall, results also corroborate the trend of semantic prosody to be found with mainly negative meanings and to a lesser extent in neutral meanings, while no positive semantic prosody has been determined in this study.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42993874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Review: Pastor and Colson (eds). 2020. Computational Phraseology","authors":"Joe Geluso","doi":"10.3366/cor.2022.0239","DOIUrl":"https://doi.org/10.3366/cor.2022.0239","url":null,"abstract":"","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48768673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}