整体优势:统一定量建模,深入洞察(社会)语言变异,减少偏见

IF 0.9 0 LANGUAGE & LINGUISTICS Languages Pub Date : 2024-05-16 DOI:10.3390/languages9050182
Wilkinson Daniel Wong Gonzales
{"title":"整体优势:统一定量建模,深入洞察(社会)语言变异,减少偏见","authors":"Wilkinson Daniel Wong Gonzales","doi":"10.3390/languages9050182","DOIUrl":null,"url":null,"abstract":"What happens when recognized and diverse conditioning factors of linguistic variation are omitted from analysis and/or are not analyzed under a single analytical procedure? This paper explores the consequences of such a choice on data interpretation and, consequently, (socio)linguistic theorization. Utilizing Twitter-style English in the Philippines (EngPH) as a case study, I employ the Twitter Corpus of Philippine Englishes (TCOPE) primarily to investigate and elucidate variations in three morphosyntactic variables that have been previously examined using a piecemeal approach. I propose a holistic quantitative approach that incorporates documented linguistic, social, diachronic, and stylistic factors in a unified analysis. The paper illustrates the impacts of adopting this holistic approach through two statistical procedures: Bayesian regression modeling and Boruta feature selection with random forest modeling. In contrast to earlier research findings, my overall results reveal biases in non-unified quantitative analyses, where the confidence in the effects of certain factors diminishes in light of others during analysis. The adoption of a unified analysis or modeling also enhances the resolution at which variations have been examined in EngPH. For instance, it highlights that presumed ‘universals’, such as the hierarchy of linguistic > stylistic > diachronic > social factors in explaining variation in some domains, is contingent on the specific variable under examination. Overall, I argue that unified analyses reduce data distortion and introduce more nuanced interpretations and insights that are critical for establishing a well-grounded empirical theory of EngPH variation and language variation as a whole.","PeriodicalId":52329,"journal":{"name":"Languages","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Holistic Advantage: Unified Quantitative Modeling for Less-Biased, In-Depth Insights into (Socio)Linguistic Variation\",\"authors\":\"Wilkinson Daniel Wong Gonzales\",\"doi\":\"10.3390/languages9050182\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"What happens when recognized and diverse conditioning factors of linguistic variation are omitted from analysis and/or are not analyzed under a single analytical procedure? This paper explores the consequences of such a choice on data interpretation and, consequently, (socio)linguistic theorization. Utilizing Twitter-style English in the Philippines (EngPH) as a case study, I employ the Twitter Corpus of Philippine Englishes (TCOPE) primarily to investigate and elucidate variations in three morphosyntactic variables that have been previously examined using a piecemeal approach. I propose a holistic quantitative approach that incorporates documented linguistic, social, diachronic, and stylistic factors in a unified analysis. The paper illustrates the impacts of adopting this holistic approach through two statistical procedures: Bayesian regression modeling and Boruta feature selection with random forest modeling. In contrast to earlier research findings, my overall results reveal biases in non-unified quantitative analyses, where the confidence in the effects of certain factors diminishes in light of others during analysis. The adoption of a unified analysis or modeling also enhances the resolution at which variations have been examined in EngPH. For instance, it highlights that presumed ‘universals’, such as the hierarchy of linguistic > stylistic > diachronic > social factors in explaining variation in some domains, is contingent on the specific variable under examination. Overall, I argue that unified analyses reduce data distortion and introduce more nuanced interpretations and insights that are critical for establishing a well-grounded empirical theory of EngPH variation and language variation as a whole.\",\"PeriodicalId\":52329,\"journal\":{\"name\":\"Languages\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Languages\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/languages9050182\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Languages","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/languages9050182","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

摘要

如果在分析中忽略了公认的、多种多样的语言变异条件因素,并且/或者不按照单一的分析程序进行分析,会发生什么情况?本文探讨了这种选择对数据解释以及(社会)语言理论化的影响。以菲律宾的推特式英语(EngPH)作为案例研究,我主要利用菲律宾英语推特语料库(TCOPE)来调查和阐明三个形态句法变量的变化,这些变量以前曾用零散的方法进行过研究。我提出了一种整体定量方法,将记录的语言、社会、非同步和文体因素纳入统一分析中。本文通过两种统计程序说明了采用这种整体方法的影响:贝叶斯回归建模和博鲁塔特征选择与随机森林建模。与之前的研究结果不同,我的总体研究结果揭示了非统一定量分析中的偏差,即在分析过程中,某些因素的影响会随着其他因素的影响而降低可信度。采用统一的分析或建模方法还能提高 EngPH 中研究变化的分辨率。例如,它强调了假定的 "普遍性",如在解释某些领域的变异时,语言因素>文体因素>异时因素>社会因素的层次结构,取决于所研究的具体变量。总之,我认为统一分析减少了数据失真,并引入了更细致入微的解释和见解,这对于建立关于英语语言变异和整个语言变异的基础扎实的实证理论至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Holistic Advantage: Unified Quantitative Modeling for Less-Biased, In-Depth Insights into (Socio)Linguistic Variation
What happens when recognized and diverse conditioning factors of linguistic variation are omitted from analysis and/or are not analyzed under a single analytical procedure? This paper explores the consequences of such a choice on data interpretation and, consequently, (socio)linguistic theorization. Utilizing Twitter-style English in the Philippines (EngPH) as a case study, I employ the Twitter Corpus of Philippine Englishes (TCOPE) primarily to investigate and elucidate variations in three morphosyntactic variables that have been previously examined using a piecemeal approach. I propose a holistic quantitative approach that incorporates documented linguistic, social, diachronic, and stylistic factors in a unified analysis. The paper illustrates the impacts of adopting this holistic approach through two statistical procedures: Bayesian regression modeling and Boruta feature selection with random forest modeling. In contrast to earlier research findings, my overall results reveal biases in non-unified quantitative analyses, where the confidence in the effects of certain factors diminishes in light of others during analysis. The adoption of a unified analysis or modeling also enhances the resolution at which variations have been examined in EngPH. For instance, it highlights that presumed ‘universals’, such as the hierarchy of linguistic > stylistic > diachronic > social factors in explaining variation in some domains, is contingent on the specific variable under examination. Overall, I argue that unified analyses reduce data distortion and introduce more nuanced interpretations and insights that are critical for establishing a well-grounded empirical theory of EngPH variation and language variation as a whole.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Languages
Languages Arts and Humanities-Language and Linguistics
CiteScore
1.40
自引率
22.20%
发文量
282
审稿时长
11 weeks
期刊最新文献
A Systematic Review of Empirical Mobile-Assisted Pronunciation Studies through a Perception–Production Lens Amplifying Parental Views about Language Choice When Raising Multilingual Children: Towards a Family-Centered Approach in Professional Contexts Causal Relations and Cohesive Strategies in the Narratives of Heritage Speakers of Russian in Their Two Languages Mind the Gap! Null Modals (and Other Functional Verbs) in Finite Complementation in Italo-Greek Chilean Spanish Intonational Plateaus and Their Pragmatic Functions: A Case of Contact with Mapudungun
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1