Teodora Vuković, Anastasia Escher, Barbara Sonnenhauser
{"title":"Degrees of non-standardness","authors":"Teodora Vuković, Anastasia Escher, Barbara Sonnenhauser","doi":"10.1075/ijcl.20014.vuk","DOIUrl":null,"url":null,"abstract":"\n A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples\n exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and\n allows the extraction of maximally non-standard manifestations of the dialect, which may then be sampled and used for the study of\n language change and variation. The focus is on a non-standard Torlak variety, which has undergone considerable change under the\n influence of standard Serbian. The degree of variation is assessed by measuring the frequencies of five distinguishing linguistic\n features: accent position, dative reflexive si, auxiliary omission in the compound perfect, the post-positive\n article, and analytic case marking in the indirect object and possessive. Locations subject to the greatest and least influence of\n the standard are revealed using hierarchical clustering. A positive correlation between the frequencies of occurrence reveals\n which non-standard feature is the best predictor of the others.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2022-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Corpus Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/ijcl.20014.vuk","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples
exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and
allows the extraction of maximally non-standard manifestations of the dialect, which may then be sampled and used for the study of
language change and variation. The focus is on a non-standard Torlak variety, which has undergone considerable change under the
influence of standard Serbian. The degree of variation is assessed by measuring the frequencies of five distinguishing linguistic
features: accent position, dative reflexive si, auxiliary omission in the compound perfect, the post-positive
article, and analytic case marking in the indirect object and possessive. Locations subject to the greatest and least influence of
the standard are revealed using hierarchical clustering. A positive correlation between the frequencies of occurrence reveals
which non-standard feature is the best predictor of the others.
期刊介绍:
The International Journal of Corpus Linguistics (IJCL) publishes original research covering methodological, applied and theoretical work in any area of corpus linguistics. Through its focus on empirical language research, IJCL provides a forum for the presentation of new findings and innovative approaches in any area of linguistics (e.g. lexicology, grammar, discourse analysis, stylistics, sociolinguistics, morphology, contrastive linguistics), applied linguistics (e.g. language teaching, forensic linguistics), and translation studies. Based on its interest in corpus methodology, IJCL also invites contributions on the interface between corpus and computational linguistics.