预测政客的不当行为：来自哥伦比亚的证据

IF 1.8 Q3 PUBLIC ADMINISTRATION Data & policy Pub Date : 2022-11-14 DOI:10.1017/dap.2022.35

Jorge Gallego, M. Prem, Juan F. Vargas

{"title":"预测政客的不当行为：来自哥伦比亚的证据","authors":"Jorge Gallego, M. Prem, Juan F. Vargas","doi":"10.1017/dap.2022.35","DOIUrl":null,"url":null,"abstract":"Abstract Corruption has pervasive effects on economic development and the well-being of the population. Despite being crucial and necessary, fighting corruption is not an easy task because it is a difficult phenomenon to measure and detect. However, recent advances in the field of artificial intelligence may help in this quest. In this article, we propose the use of machine-learning models to predict municipality-level corruption in a developing country. Using data from disciplinary prosecutions conducted by an anti-corruption agency in Colombia, we trained four canonical models (Random Forests, Gradient Boosting Machine, Lasso, and Neural Networks), and ensemble their predictions, to predict whether or not a mayor will commit acts of corruption. Our models achieve acceptable levels of performance, based on metrics such as the precision and the area under the receiver-operating characteristic curve, demonstrating that these tools are useful in predicting where misbehavior is most likely to occur. Moreover, our feature-importance analysis shows us which groups of variables are most important in predicting corruption.","PeriodicalId":93427,"journal":{"name":"Data & policy","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2022-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting politicians’ misconduct: Evidence from Colombia\",\"authors\":\"Jorge Gallego, M. Prem, Juan F. Vargas\",\"doi\":\"10.1017/dap.2022.35\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Corruption has pervasive effects on economic development and the well-being of the population. Despite being crucial and necessary, fighting corruption is not an easy task because it is a difficult phenomenon to measure and detect. However, recent advances in the field of artificial intelligence may help in this quest. In this article, we propose the use of machine-learning models to predict municipality-level corruption in a developing country. Using data from disciplinary prosecutions conducted by an anti-corruption agency in Colombia, we trained four canonical models (Random Forests, Gradient Boosting Machine, Lasso, and Neural Networks), and ensemble their predictions, to predict whether or not a mayor will commit acts of corruption. Our models achieve acceptable levels of performance, based on metrics such as the precision and the area under the receiver-operating characteristic curve, demonstrating that these tools are useful in predicting where misbehavior is most likely to occur. Moreover, our feature-importance analysis shows us which groups of variables are most important in predicting corruption.\",\"PeriodicalId\":93427,\"journal\":{\"name\":\"Data & policy\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2022-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data & policy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1017/dap.2022.35\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PUBLIC ADMINISTRATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data & policy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/dap.2022.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PUBLIC ADMINISTRATION","Score":null,"Total":0}

引用次数: 0

摘要

摘要腐败对经济发展和人民福祉具有普遍影响。尽管打击腐败是至关重要和必要的，但它并不是一项容易的任务，因为它是一种难以衡量和发现的现象。然而，人工智能领域的最新进展可能有助于这一探索。在本文中，我们建议使用机器学习模型来预测发展中国家的市级腐败。利用哥伦比亚一家反腐败机构进行的纪律起诉的数据，我们训练了四个典型模型（随机森林、梯度提升机、拉索和神经网络），并综合了它们的预测，以预测市长是否会犯下腐败行为。基于精度和接收器工作特性曲线下的面积等指标，我们的模型实现了可接受的性能水平，证明这些工具在预测最有可能发生不当行为的地方是有用的。此外，我们的特征重要性分析向我们展示了哪些变量组在预测腐败方面最重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Predicting politicians’ misconduct: Evidence from Colombia

Abstract Corruption has pervasive effects on economic development and the well-being of the population. Despite being crucial and necessary, fighting corruption is not an easy task because it is a difficult phenomenon to measure and detect. However, recent advances in the field of artificial intelligence may help in this quest. In this article, we propose the use of machine-learning models to predict municipality-level corruption in a developing country. Using data from disciplinary prosecutions conducted by an anti-corruption agency in Colombia, we trained four canonical models (Random Forests, Gradient Boosting Machine, Lasso, and Neural Networks), and ensemble their predictions, to predict whether or not a mayor will commit acts of corruption. Our models achieve acceptable levels of performance, based on metrics such as the precision and the area under the receiver-operating characteristic curve, demonstrating that these tools are useful in predicting where misbehavior is most likely to occur. Moreover, our feature-importance analysis shows us which groups of variables are most important in predicting corruption.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Data & policy

CiteScore

3.10

自引率

0.00%

发文量

审稿时长

12 weeks