{"title":"Capturing Herder: a three-step approach to the identification of language ideologies using corpus linguistics and critical discourse analysis","authors":"Adnan Ajšić","doi":"10.3366/COR.2021.0209","DOIUrl":null,"url":null,"abstract":"Recent lexical approaches to the identification of language ideologies focus on the application of quantitative corpus-linguistic techniques to large data sets as a way to minimise researcher inference and ensure more objective sampling methods, replicability of analytical procedures, and a higher degree of generalisability ( Fitzsimmons-Doolan, 2014 ; Subtirelu, 2015 ; Vessey, 2017 ; Wright and Brooks, 2019 ; and McEntee-Atalianis and Vessey, 2020 ). Based on two comprehensive, specialised research (11.6 million words) and comparator (22.4 million words) newspaper corpora, this study offers an examination of the effectiveness of the multivariate and univariate statistical techniques, and proposes a three-step approach whereby corpus linguistics and critical discourse analysis are combined to identify ( 1) thematic and ( 2) ideological discourses (cf. ‘d’/’D’ discourses; Gee, 2010 ), and ( 3) language ideologies. In contrast to recent contributions, it is argued that item frequency is not necessarily a reliable or effective indicator of language ideologies but, rather, of language-related discourses which can be examined for implicit and explicit language-ideological content. A combination of multivariate and univariate statistical techniques, and the three-step approach are shown to be a highly effective methodological solution for synchronic and diachronic language ideology and discourse research based on topically/discursively heterogeneous corpora.","PeriodicalId":44933,"journal":{"name":"Corpora","volume":null,"pages":null},"PeriodicalIF":0.8000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Corpora","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3366/COR.2021.0209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"LINGUISTICS","Score":null,"Total":0}
引用次数: 2
Abstract
Recent lexical approaches to the identification of language ideologies focus on the application of quantitative corpus-linguistic techniques to large data sets as a way to minimise researcher inference and ensure more objective sampling methods, replicability of analytical procedures, and a higher degree of generalisability ( Fitzsimmons-Doolan, 2014 ; Subtirelu, 2015 ; Vessey, 2017 ; Wright and Brooks, 2019 ; and McEntee-Atalianis and Vessey, 2020 ). Based on two comprehensive, specialised research (11.6 million words) and comparator (22.4 million words) newspaper corpora, this study offers an examination of the effectiveness of the multivariate and univariate statistical techniques, and proposes a three-step approach whereby corpus linguistics and critical discourse analysis are combined to identify ( 1) thematic and ( 2) ideological discourses (cf. ‘d’/’D’ discourses; Gee, 2010 ), and ( 3) language ideologies. In contrast to recent contributions, it is argued that item frequency is not necessarily a reliable or effective indicator of language ideologies but, rather, of language-related discourses which can be examined for implicit and explicit language-ideological content. A combination of multivariate and univariate statistical techniques, and the three-step approach are shown to be a highly effective methodological solution for synchronic and diachronic language ideology and discourse research based on topically/discursively heterogeneous corpora.