Better, Faster, Stronger: Using Machine Learning to Analyse South African Police-recorded Protest Data

South African Review of Sociology (IF 0.5; JCR Q4, Sociology); Pub Date: 2022-01-01; DOI: 10.1080/21528586.2021.1982762
M. Bekker
Journal Article. South African Review of Sociology, volume (as listed) 67 1, pages 4 - 23. Not open access. Source: Semantic Scholar. https://doi.org/10.1080/21528586.2021.1982762
Citations: 2

Abstract

Protest Event Analysis (PEA) has long been an important tool for the quantitative analysis of protests, and its potential power has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. When this protocol was applied to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”, a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, together with an updated view of post-apartheid protest showing that protests tend to be more common but less prone to violence than previous theories concluded, speak to the potential for this protocol to unearth novel insights on even bigger data sets.
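The abstract does not spell out how the protocol works, so the following is only a minimal sketch of what keyword-enumerated recoding could look like, assuming that each free-text event description is matched against a locally defined ontology of terms and that each term carries a numeric weight. The keyword list, the weights, the `code_event` helper and the rule of taking the highest matched weight as the "tumult" score are all hypothetical illustrations, not the paper's actual ontology or scoring method.

```python
import re
from dataclasses import dataclass

# Hypothetical ontology: local term -> numeric weight.
# Terms and weights are illustrative assumptions, not taken from the paper.
TUMULT_KEYWORDS = {
    "toyi-toyi": 1,
    "barricade": 2,
    "burning tyres": 2,
    "stone throwing": 3,
}


@dataclass
class CodedEvent:
    record_id: int
    matched_terms: list
    tumult_score: int


def code_event(record_id, description, ontology=TUMULT_KEYWORDS):
    """Match one event description against the ontology and enumerate it."""
    text = description.lower()
    # A leading word boundary only, so simple suffixes ("barricaded") still match.
    matched = [
        term for term in ontology
        if re.search(r"\b" + re.escape(term), text)
    ]
    # Assumed scoring rule: the event takes the highest weight among its matches.
    score = max((ontology[term] for term in matched), default=0)
    return CodedEvent(record_id, matched, score)


# Made-up records standing in for rows of an event database.
records = [
    (1, "Crowd gathered peacefully and handed over a memorandum"),
    (2, "Protesters barricaded the road with burning tyres"),
    (3, "Toyi-toyi outside municipal offices"),
]
coded = [code_event(rid, desc) for rid, desc in records]
for event in coded:
    print(event.record_id, event.matched_terms, event.tumult_score)
```

Keeping the ontology as plain data is what would let a coder add local terms and vernaculars without touching the matching logic. A real deployment would also need to handle spelling variants and negation, which this sketch ignores.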
Source journal metrics: CiteScore 0.90; self-citation rate 25.00%; articles published 26.