Acquiring English collocations poses a major challenge for second language (L2) learners. It has been well noted that even advanced L2 English learners have difficulty using basic verb + noun collocations. Among the factors that make it difficult to acquire L2 collocations, the influence of learners’ first language (L1) has been repeatedly pointed out in the literature. As learners’ L1 and target language (L2) use data can help us to examine L1 influence effectively, in this study, we used a bilingual essay corpus, in which the same individuals ( n = 524) produced L1 and L2 essays on the same topic, to investigate the relationship between Japanese efl (English as a foreign language) learners’ L1 and their English collocation use in written essays. We also referred to essays written by native speakers of English on the same topic as a reference corpus ( n = 200). The proficiency level of 524 individuals in the learner corpus data was divided into three levels according to the Common European Framework of Reference (cefr): A2 (Waystage), B1 (Threshold) and B2 (Vantage). We focussed on the use of ‘ make + noun’ collocations as the target structure in this study and extracted them from the two corpora. Results suggest that, although the differences between the learner levels were not found to be statistically significant, the learners underused make + noun collocations with less variation, compared with native speakers of English. The pedagogical implications of this finding are discussed in terms of materials and syllabus development for efl learners.
{"title":"Exploring the use of make + noun collocations by Japanese EFL learners through a bilingual essay corpus","authors":"Ryo Sawaguchi, Atsushi Mizumoto","doi":"10.3366/cor.2022.0247","DOIUrl":"https://doi.org/10.3366/cor.2022.0247","url":null,"abstract":"Acquiring English collocations poses a major challenge for second language (L2) learners. It has been well noted that even advanced L2 English learners have difficulty using basic verb + noun collocations. Among the factors that make it difficult to acquire L2 collocations, the influence of learners’ first language (L1) has been repeatedly pointed out in the literature. As learners’ L1 and target language (L2) use data can help us to examine L1 influence effectively, in this study, we used a bilingual essay corpus, in which the same individuals ( n = 524) produced L1 and L2 essays on the same topic, to investigate the relationship between Japanese efl (English as a foreign language) learners’ L1 and their English collocation use in written essays. We also referred to essays written by native speakers of English on the same topic as a reference corpus ( n = 200). The proficiency level of 524 individuals in the learner corpus data was divided into three levels according to the Common European Framework of Reference (cefr): A2 (Waystage), B1 (Threshold) and B2 (Vantage). We focussed on the use of ‘ make + noun’ collocations as the target structure in this study and extracted them from the two corpora. Results suggest that, although the differences between the learner levels were not found to be statistically significant, the learners underused make + noun collocations with less variation, compared with native speakers of English. The pedagogical implications of this finding are discussed in terms of materials and syllabus development for efl learners.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47139315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In English, a relative clause (rc) follows the head noun phrase (np). Conversely, the rc in Chinese precedes the head np directly before it or is separated by a determiner phrase (dp). This is uncommon in the subject-verb-object order language and results in two types of constructions. For Chinese as foreign language (cfl) learners, this construction alternation is complex to acquire and is dependent on many factors. This study aims to explore the underlying factors influencing cfl learners’ outer modifier nominal (omn)/inner modifier nominal (imn) alternation and whether they have multiple interactions. A multifactorial exploration of the significant predictors of the omn/imn alternation in Chinese interlanguage data, taken from the International Chinese Interlanguage Corpus, was conducted. Conditional inference trees and random forests were used to analyse the data. The predictors studied consist of head np animacy (HeadNPAnimacy), head np length (HeadNPLength), grammatical roles of head nps in matrix clauses (HeadNPMatRole), grammatical roles of head nps in rcs (HeadNPRelRole), length of rcs (RCLength), types of native languages (NLType), and learners’ Chinese proficiency (CHProficiency). Examinations of omn/imn alternation show predictors’ significant effects in descending order of their effect size: HeadNPRelRole, HeadNPMatRole, RCLength, NLType, and HeadNPAnimacy.
{"title":"The ordering of relative clauses and determiner phrases in Chinese interlanguage: a multifactorial study","authors":"Jiajin Xu, Zhao Liu","doi":"10.3366/cor.2022.0246","DOIUrl":"https://doi.org/10.3366/cor.2022.0246","url":null,"abstract":"In English, a relative clause (rc) follows the head noun phrase (np). Conversely, the rc in Chinese precedes the head np directly before it or is separated by a determiner phrase (dp). This is uncommon in the subject-verb-object order language and results in two types of constructions. For Chinese as foreign language (cfl) learners, this construction alternation is complex to acquire and is dependent on many factors. This study aims to explore the underlying factors influencing cfl learners’ outer modifier nominal (omn)/inner modifier nominal (imn) alternation and whether they have multiple interactions. A multifactorial exploration of the significant predictors of the omn/imn alternation in Chinese interlanguage data, taken from the International Chinese Interlanguage Corpus, was conducted. Conditional inference trees and random forests were used to analyse the data. The predictors studied consist of head np animacy (HeadNPAnimacy), head np length (HeadNPLength), grammatical roles of head nps in matrix clauses (HeadNPMatRole), grammatical roles of head nps in rcs (HeadNPRelRole), length of rcs (RCLength), types of native languages (NLType), and learners’ Chinese proficiency (CHProficiency). Examinations of omn/imn alternation show predictors’ significant effects in descending order of their effect size: HeadNPRelRole, HeadNPMatRole, RCLength, NLType, and HeadNPAnimacy.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43631286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
As a field of research closely connected with second language acquisition, teaching and learning, learner corpus research (lcr) has garnered interest among language teachers and researchers in Hong Kong, where English is one of the two official languages (alongside Chinese) and also one of the chief mediums of instruction in education. In view of this unique situation, this paper provides a comprehensive overview of lcr within different teaching contexts in Hong Kong and identifies some major research trends and issues. Through this survey of the development of lcr in the region, we find that great advances have been made over the past three decades. Specifically, the object of analysis has shifted from cherry-picked, isolated textual features to operationalised parameters such as metadiscourse markers, lexical diversity, and syntactic complexity to study learners’ language output. Despite the progress that has been achieved so far, there remain a number of important questions for lcr in the context of Hong Kong. In particular, some researchers tend to broadly apply the term ‘learner corpus’ even to the language output of expert-level L2 speakers. Yet, whether this group of speakers can be treated as L2 learners, and their language output as a learner corpus, remains contested. In addition, existing learner corpora are also limited in their scope by genre, with the majority being compiled from letters and essay writings. This paper concludes with suggestions on how these limitations can be addressed in future research.
{"title":"Learner corpus research in Hong Kong: past, present and future","authors":"Kanglong Liu, Joyce Oiwun Cheung, Nan Zhao","doi":"10.3366/cor.2022.0248","DOIUrl":"https://doi.org/10.3366/cor.2022.0248","url":null,"abstract":"As a field of research closely connected with second language acquisition, teaching and learning, learner corpus research (lcr) has garnered interest among language teachers and researchers in Hong Kong, where English is one of the two official languages (alongside Chinese) and also one of the chief mediums of instruction in education. In view of this unique situation, this paper provides a comprehensive overview of lcr within different teaching contexts in Hong Kong and identifies some major research trends and issues. Through this survey of the development of lcr in the region, we find that great advances have been made over the past three decades. Specifically, the object of analysis has shifted from cherry-picked, isolated textual features to operationalised parameters such as metadiscourse markers, lexical diversity, and syntactic complexity to study learners’ language output. Despite the progress that has been achieved so far, there remain a number of important questions for lcr in the context of Hong Kong. In particular, some researchers tend to broadly apply the term ‘learner corpus’ even to the language output of expert-level L2 speakers. Yet, whether this group of speakers can be treated as L2 learners, and their language output as a learner corpus, remains contested. In addition, existing learner corpora are also limited in their scope by genre, with the majority being compiled from letters and essay writings. This paper concludes with suggestions on how these limitations can be addressed in future research.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42994625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Introduction to the special issue on learner corpora research in the Asia Pacific Region","authors":"Chae Kwan Jung","doi":"10.3366/cor.2022.0243","DOIUrl":"https://doi.org/10.3366/cor.2022.0243","url":null,"abstract":"","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43828610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Corpus linguistics has firmly established itself as a major area of research within linguistics. Arguably, one of the most practical applications of corpora and corpus linguistics has been in the area of second language (L2) acquisition research. Emerging from the integration of the fields of corpus linguistics and second language acquisition, learner corpus research has greatly enhanced our understanding of how language learners acquire and use their L2 ( Granger, 2002 ). Since its inception in the 1980s ( Granger et al., 2015 ), learner corpus research has increasingly attracted scholarly attention from around the world. This paper highlights the state of learner corpus research in New Zealand, focussing, in particular, on lexical and syntactic aspects of learner language. In doing so, it reviews the learner corpus studies carried out to date by New Zealand-based researchers, describing the results and implications of such research in the context of L2 education, and discussing the current status and future prospects of learner corpus research in New Zealand.
语料库语言学已经成为语言学的一个主要研究领域。可以说,语料库和语料库语言学最实际的应用之一是在第二语言习得研究领域。语料库研究融合了语料库语言学和第二语言习得领域,极大地增强了我们对语言学习者如何习得和使用二语的理解(Granger,2002)。自20世纪80年代成立以来(Granger et al.,2015),学习者语料库研究越来越受到世界各地学术界的关注。本文重点介绍了新西兰学习者语料库的研究现状,特别关注学习者语言的词汇和句法方面。在此过程中,它回顾了新西兰研究人员迄今为止进行的学习者语料库研究,描述了此类研究在二语教育背景下的结果和意义,并讨论了新西兰学习者语料库的研究现状和未来前景。
{"title":"Learner corpus research in New Zealand","authors":"A. Siyanova-Chanturia, J. Parkinson, Taha Omidian","doi":"10.3366/cor.2022.0250","DOIUrl":"https://doi.org/10.3366/cor.2022.0250","url":null,"abstract":"Corpus linguistics has firmly established itself as a major area of research within linguistics. Arguably, one of the most practical applications of corpora and corpus linguistics has been in the area of second language (L2) acquisition research. Emerging from the integration of the fields of corpus linguistics and second language acquisition, learner corpus research has greatly enhanced our understanding of how language learners acquire and use their L2 ( Granger, 2002 ). Since its inception in the 1980s ( Granger et al., 2015 ), learner corpus research has increasingly attracted scholarly attention from around the world. This paper highlights the state of learner corpus research in New Zealand, focussing, in particular, on lexical and syntactic aspects of learner language. In doing so, it reviews the learner corpus studies carried out to date by New Zealand-based researchers, describing the results and implications of such research in the context of L2 education, and discussing the current status and future prospects of learner corpus research in New Zealand.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42939932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The polysemous adverb just is frequently used by efl learners, but many learners are still unaware of how just should be used. The aim of this study is to examine how frequently different meanings of the adverb just are employed by native speakers and Taiwanese efl learners in their essays and to identify the differences in the lexico-grammatical patterns. Drawing data from one native-speaker corpus and two Taiwanese efl learner corpora, we investigated ( i) the overall frequencies of just, ( ii) the frequencies of just by meaning categories, and ( iii) the lexico-grammatical patterns of the different meanings of just, as well as their semantic and syntactic features. Results showed that the overall frequencies of just were similar in the native speaker and learner corpora, but there was a smaller variety of the use of adverbial just in the learner corpora. By examining the lexico-grammatical patterns, we found that the meanings of the adverbial just were induced in the following patterns: first, when it modified different syntactic structures; secondly, when it co-occurred with specific contextual clues; and, thirdly, when it interacted with particular tense/aspect of a verb. In addition, semantic features and lexical choices had a pivotal role in determining whether the use of a particular sense of just was acceptable in a sentence. By providing corpus-based teaching material for the uses of adverbial just, it is hoped that our study will shed light on the perplexing issue of adverb acquisition.
{"title":"A corpus-based study of native speakers’ and Taiwanese EFL learners’ use of the adverb just","authors":"Yifan Lin, Siaw-Fong Chung","doi":"10.3366/cor.2022.0249","DOIUrl":"https://doi.org/10.3366/cor.2022.0249","url":null,"abstract":"The polysemous adverb just is frequently used by efl learners, but many learners are still unaware of how just should be used. The aim of this study is to examine how frequently different meanings of the adverb just are employed by native speakers and Taiwanese efl learners in their essays and to identify the differences in the lexico-grammatical patterns. Drawing data from one native-speaker corpus and two Taiwanese efl learner corpora, we investigated ( i) the overall frequencies of just, ( ii) the frequencies of just by meaning categories, and ( iii) the lexico-grammatical patterns of the different meanings of just, as well as their semantic and syntactic features. Results showed that the overall frequencies of just were similar in the native speaker and learner corpora, but there was a smaller variety of the use of adverbial just in the learner corpora. By examining the lexico-grammatical patterns, we found that the meanings of the adverbial just were induced in the following patterns: first, when it modified different syntactic structures; secondly, when it co-occurred with specific contextual clues; and, thirdly, when it interacted with particular tense/aspect of a verb. In addition, semantic features and lexical choices had a pivotal role in determining whether the use of a particular sense of just was acceptable in a sentence. By providing corpus-based teaching material for the uses of adverbial just, it is hoped that our study will shed light on the perplexing issue of adverb acquisition.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42762114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The interest in the exploitation of corpora in the study of Korean L2 learners’ use of English has risen dramatically over the past two decades, leading to the compilation of learner corpora and to numerous empirical investigations into Korean learners’ use of English. This paper will give an overview of the compilation and characteristics of English learner corpora in Korea and will also provide an analysis of the recent trends in learner corpus research. It was not until the mid-2000s that Korean academics started to compile English learner corpora, such as the snu Korean-speaking English Learner Corpus (skelc), the Yonsei English Learner Corpus (yelc), the Gachon Learner Corpus (glc), the Neungyule Interlanguage Corpus of Korean Learners of English (nickle), the efl Teacher Corpus (etc), the Korean English Learners’ Spoken Corpus (kelsc) and the ets Corpus of Non-native Written English (TOEFL11). There have also been a growing number of learner corpus-based studies that used the existing learner corpora as well as self-compiled corpus data. All the learner corpus-based research articles published in two Korean academic journals ( English Teaching and Korean Journal of Applied Linguistics) will be reviewed and analysed in terms of research topics and areas, data types, analysis methods and corpus compilation practices. Finally, this paper will suggest some future directions for learner corpus compilation and research in Korea.
{"title":"English learner corpora and research in Korea","authors":"Heokseung Kwon","doi":"10.3366/cor.2022.0244","DOIUrl":"https://doi.org/10.3366/cor.2022.0244","url":null,"abstract":"The interest in the exploitation of corpora in the study of Korean L2 learners’ use of English has risen dramatically over the past two decades, leading to the compilation of learner corpora and to numerous empirical investigations into Korean learners’ use of English. This paper will give an overview of the compilation and characteristics of English learner corpora in Korea and will also provide an analysis of the recent trends in learner corpus research. It was not until the mid-2000s that Korean academics started to compile English learner corpora, such as the snu Korean-speaking English Learner Corpus (skelc), the Yonsei English Learner Corpus (yelc), the Gachon Learner Corpus (glc), the Neungyule Interlanguage Corpus of Korean Learners of English (nickle), the efl Teacher Corpus (etc), the Korean English Learners’ Spoken Corpus (kelsc) and the ets Corpus of Non-native Written English (TOEFL11). There have also been a growing number of learner corpus-based studies that used the existing learner corpora as well as self-compiled corpus data. All the learner corpus-based research articles published in two Korean academic journals ( English Teaching and Korean Journal of Applied Linguistics) will be reviewed and analysed in terms of research topics and areas, data types, analysis methods and corpus compilation practices. Finally, this paper will suggest some future directions for learner corpus compilation and research in Korea.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42376331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Review: Le Bruyn and Paquot (eds). 2021. Learner Corpus Research Meets Second Language Acquisition","authors":"W. Crawford","doi":"10.3366/cor.2022.0259","DOIUrl":"https://doi.org/10.3366/cor.2022.0259","url":null,"abstract":"","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49557698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The phenomenon of immigration and its depiction in media texts have been examined profusely within the field of corpus-based discourse analysis ( Gabrielatos and Baker, 2008 ; Baker et al., 2013 ; and Blinder and Allen, 2016 ). This research seeks to present it as reflected in a corpus of 600 judicial decisions issued by Spanish courts in the years 2016 and 2017. This analysis was motivated by the rise of extreme right-wing parties in Europe in recent years. Such parties dehumanise immigrants and portray them as a threat to the welfare state. On first examination, the results appear to dissociate immigration and crime since a considerable percentage of the keywords obtained (about 20 percent) revolves around three major topoi (namely, ‘family’, ‘territory/access’ and ‘legal punishment’) and there is no evidence of any major offences or crimes amongst the top-ranking lexicon. The study of the collocate networks of the keywords within the category ‘legal punishment’ confirms our initial perception; in fact, out of twenty-one collocates, only the word delito (‘crime’) itself collocates with terms referring to typified crimes such as violencia (‘violence’). In parallel, the data were triangulated using the text-classification software UMUTextStats ( García-Díaz et al., 2018 ). The results of this second analysis also confirm our initial observations.
移民现象及其在媒体文本中的描述在基于语料库的话语分析领域得到了广泛的研究(Gabrielatos和Baker, 2008;Baker et al., 2013;Blinder and Allen, 2016)。本研究试图从西班牙法院在2016年和2017年发布的600个司法判决语料库中反映出这一点。这种分析是受到近年来欧洲极右翼政党崛起的推动。这些政党贬低移民的人性,把他们描绘成福利国家的威胁。在第一次检查中,结果似乎将移民和犯罪分离开来,因为获得的关键词中有相当大比例(约20%)围绕三个主要主题(即“家庭”,“领土/访问”和“法律惩罚”),并且在排名靠前的词汇中没有任何重大违法或犯罪的证据。对“法律处罚”范畴内关键词搭配网络的研究证实了我们的初步认知;事实上,在21种搭配中,只有delito(“犯罪”)这个词本身与典型犯罪(如violencia(“暴力”))的术语搭配。同时,使用文本分类软件UMUTextStats对数据进行三角测量(García-Díaz et al., 2018)。第二次分析的结果也证实了我们最初的观察。
{"title":"The representation of migrants in Spanish judicial decisions: using corpus data to refute hate speech","authors":"María José Marín Pérez, Á. Almela","doi":"10.3366/cor.2022.0253","DOIUrl":"https://doi.org/10.3366/cor.2022.0253","url":null,"abstract":"The phenomenon of immigration and its depiction in media texts have been examined profusely within the field of corpus-based discourse analysis ( Gabrielatos and Baker, 2008 ; Baker et al., 2013 ; and Blinder and Allen, 2016 ). This research seeks to present it as reflected in a corpus of 600 judicial decisions issued by Spanish courts in the years 2016 and 2017. This analysis was motivated by the rise of extreme right-wing parties in Europe in recent years. Such parties dehumanise immigrants and portray them as a threat to the welfare state. On first examination, the results appear to dissociate immigration and crime since a considerable percentage of the keywords obtained (about 20 percent) revolves around three major topoi (namely, ‘family’, ‘territory/access’ and ‘legal punishment’) and there is no evidence of any major offences or crimes amongst the top-ranking lexicon. The study of the collocate networks of the keywords within the category ‘legal punishment’ confirms our initial perception; in fact, out of twenty-one collocates, only the word delito (‘crime’) itself collocates with terms referring to typified crimes such as violencia (‘violence’). In parallel, the data were triangulated using the text-classification software UMUTextStats ( García-Díaz et al., 2018 ). The results of this second analysis also confirm our initial observations.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42757298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a corpus-based analysis of English newspaper reportage in two South Asian countries, Pakistan (where English was introduced through colonisation) and Afghanistan (which has not been colonised), and their comparison with British newspaper reportage. The objective of this study is to analyse linguistic variation between the cultural press reportage (cpr) of the selected countries and to see which variety of English, Pakistani or Afghan, resembles British English the most. To achieve this objective, three English newspapers from each country were selected for the compilation of a specialised corpus which was analysed with reference to the five textual dimensions introduced by Biber (1988 , 2006 ). This research is significant as no previous study has attempted to find the differences and similarities between the Englishes used in a formerly colonised country and a country that was never part of the British Empire. The comparison indicates that Pakistani cpr is close to British cpr, while Afghan cpr is different. In terms of Biber’s five textual dimensions, Afghan cpr is more informational, narrative, explicit and abstract, and less non-argumentative in comparison with British and Pakistani cpr.
{"title":"Reporting local cultures in the globalised world: how indigenised can English be in the free world?","authors":"S. Ali, P. Thompson","doi":"10.3366/cor.2022.0254","DOIUrl":"https://doi.org/10.3366/cor.2022.0254","url":null,"abstract":"This paper presents a corpus-based analysis of English newspaper reportage in two South Asian countries, Pakistan (where English was introduced through colonisation) and Afghanistan (which has not been colonised), and their comparison with British newspaper reportage. The objective of this study is to analyse linguistic variation between the cultural press reportage (cpr) of the selected countries and to see which variety of English, Pakistani or Afghan, resembles British English the most. To achieve this objective, three English newspapers from each country were selected for the compilation of a specialised corpus which was analysed with reference to the five textual dimensions introduced by Biber (1988 , 2006 ). This research is significant as no previous study has attempted to find the differences and similarities between the Englishes used in a formerly colonised country and a country that was never part of the British Empire. The comparison indicates that Pakistani cpr is close to British cpr, while Afghan cpr is different. In terms of Biber’s five textual dimensions, Afghan cpr is more informational, narrative, explicit and abstract, and less non-argumentative in comparison with British and Pakistani cpr.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49301474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}