更好，更快，更强:使用机器学习分析南非警方记录的抗议数据

IF 0.6 Q4 SOCIOLOGY South African Review of Sociology Pub Date : 2022-01-01 DOI:10.1080/21528586.2021.1982762

M. Bekker

{"title":"更好，更快，更强:使用机器学习分析南非警方记录的抗议数据","authors":"M. Bekker","doi":"10.1080/21528586.2021.1982762","DOIUrl":null,"url":null,"abstract":"ABSTRACT A long-important tool for quantitative analysis of protests, the potential power of Protest Event Analysis (PEA) has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. Applying this protocol to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”—a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, as well as an updated view of post-apartheid protest, showing that protests tend to be more common but less prone to violence than previous theories concluded, speaks to the potential for this protocol to unearth novel insights on even bigger data sets.","PeriodicalId":44730,"journal":{"name":"South African Review of Sociology","volume":"67 1","pages":"4 - 23"},"PeriodicalIF":0.6000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Better, Faster, Stronger: Using Machine Learning to Analyse South African Police-recorded Protest Data\",\"authors\":\"M. Bekker\",\"doi\":\"10.1080/21528586.2021.1982762\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT A long-important tool for quantitative analysis of protests, the potential power of Protest Event Analysis (PEA) has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. Applying this protocol to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”—a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, as well as an updated view of post-apartheid protest, showing that protests tend to be more common but less prone to violence than previous theories concluded, speaks to the potential for this protocol to unearth novel insights on even bigger data sets.\",\"PeriodicalId\":44730,\"journal\":{\"name\":\"South African Review of Sociology\",\"volume\":\"67 1\",\"pages\":\"4 - 23\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"South African Review of Sociology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/21528586.2021.1982762\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"SOCIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"South African Review of Sociology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/21528586.2021.1982762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"SOCIOLOGY","Score":null,"Total":0}

引用次数: 2

摘要

作为长期以来重要的抗议定量分析工具，随着机器学习技术的兴起和大数据的无处不在，抗议事件分析(PEA)的潜在力量只会增加。与当代自然语言编程创新相比，PEA编码器还具有一个优势，即可定制，以个性化本体的形式表达本地适当的术语和方言。因此，有必要开发一个标准流程来部署可以利用本地资源的机器学习工具。本文介绍了这样一个工具，创新了抽象指标的计算方法。“机器学习抗议事件分析关键字枚举重编码”是一种协议，它使PEA编码器能够读取和分类大型“事件数据库”，并将局部术语和抽象指标纳入分析。将这一协议应用到南非警方记录的人群事件数据库中的15万份记录中，抗议事件可以按照“骚乱”的程度进行单独评级——这是迄今为止传统PEA方法所无法做到的。估计人群规模的创新，以及对后种族隔离抗议的更新看法，表明抗议活动往往比以前的理论结论更常见，但更不容易发生暴力，说明该协议有可能在更大的数据集上挖掘出新的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Better, Faster, Stronger: Using Machine Learning to Analyse South African Police-recorded Protest Data

ABSTRACT A long-important tool for quantitative analysis of protests, the potential power of Protest Event Analysis (PEA) has only increased with the rise of Machine Learning technologies and the ubiquity of big data. PEA coders also present an advantage over contemporary Natural Language Programming innovations by being customisable to incorporate locally appropriate terms and vernaculars, expressed as personalised ontologies. As such, there is a need to develop a standard process for deploying machine learning tools that can draw on the local. This paper introduces such a tool, innovating the numeration of abstract indicators. “Machine Learning Protest Event Analysis Keyword Enumerated Recoding” is a protocol that enables PEA coders to read and classify large “event databases”, incorporating local terms and abstract indicators into the analysis. Applying this protocol to 150,000 records in a police-recorded database of crowd events in South Africa, protest events could be individually rated by levels of “tumult”—a feat hitherto inhibited by conventional PEA methods. Innovations in estimating crowd sizes, as well as an updated view of post-apartheid protest, showing that protests tend to be more common but less prone to violence than previous theories concluded, speaks to the potential for this protocol to unearth novel insights on even bigger data sets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

South African Review of Sociology SOCIOLOGY-

CiteScore

0.90

自引率

25.00%

发文量