预测中间城市的犯罪:哥伦比亚布卡拉曼加的“机器学习”模型

IF 0.3 Q4 POLITICAL SCIENCE URVIO-Revista Latinoamericana de Estudios de Seguridad Pub Date : 2022-09-30 DOI:10.17141/urvio.34.2022.5395

Juan David Gelvez Ferreira, María-Paula Nieto-Rodríguez, Carlos A. Rocha-Ruíz

{"title":"预测中间城市的犯罪:哥伦比亚布卡拉曼加的“机器学习”模型","authors":"Juan David Gelvez Ferreira, María-Paula Nieto-Rodríguez, Carlos A. Rocha-Ruíz","doi":"10.17141/urvio.34.2022.5395","DOIUrl":null,"url":null,"abstract":"El uso de tecnología para prevenir el crimen es una práctica cada vez más frecuente. Sin embargo, la evidencia se ha concentrado en ciudades principales, que cuentan con gran cantidad de datos y mejores capacidades locales. El objetivo de esta investigación es presentar los resultados de un modelo de “machine learning” para predecir el delito en Bucaramanga, una ciudad intermedia de Colombia. Se utilizó el procesamiento de señales para grafos y una adaptación al caso del modelo de vectorización de texto TF-IDF. Se identificó que los mejores resultados en la predicción del crimen se dieron con modelos espaciales de grafos por semanas. Además, encontramos evidencia de que existen diversas dificultades de predicción, en dependencia de la periodicidad del modelo. La mejor opción posible (con los datos disponibles) es una periodicidad semanal. El mejor modelo encontrado es un KNN de clasificación, que alcanza un 59 % de exhaustividad(recall) y más de 60 % de exactitud (accuracy.). Concluimosque los modelos de predicción del delito constituyen una herramienta útil para construir estrategias de prevención en ciudades principales; sin embargo, existen limitaciones para su aplicación en ciudades intermedias, que cuentan con poca información. \nAbstract \nThe use of technology to prevent and respond to citizen security challenges is increasingly frequent. However, empirical evidence has been concentrated in major cities with large amounts of data and local authorities' strong capacities. Therefore, this investigation aims to capture a series of policy recommendations based on a machine learning crime prediction model in an intermediate city in Colombia, Bucaramanga (department of Santander). The model used signal processing for graphs and an adaptation of the TF-IDF text vectorization model to the space-time case, for each of the cities’ neighborhoods. The results show that the best crime prediction outcomes were obtained when using the models with spatial relationships of graphs by weeks. Evidence of the difficulty in predictions based on the periodicity of the model is found. The best possible prediction (with available data) is weekly prediction. In addition, the best model found was a KNN classification model, reaching 59 % of recall and more than 60 % of accuracy. We concluded that crime prediction models are a helpful tool for constructing prevention strategies in major cities; however, there are limitations to its application in intermediate cities and rural areas in Colombia, which have little statistical information and few technical capabilities.","PeriodicalId":41674,"journal":{"name":"URVIO-Revista Latinoamericana de Estudios de Seguridad","volume":"159 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Prediciendo el crimen en ciudades intermedias: un modelo de “machine learning” en Bucaramanga, Colombia\",\"authors\":\"Juan David Gelvez Ferreira, María-Paula Nieto-Rodríguez, Carlos A. Rocha-Ruíz\",\"doi\":\"10.17141/urvio.34.2022.5395\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"El uso de tecnología para prevenir el crimen es una práctica cada vez más frecuente. Sin embargo, la evidencia se ha concentrado en ciudades principales, que cuentan con gran cantidad de datos y mejores capacidades locales. El objetivo de esta investigación es presentar los resultados de un modelo de “machine learning” para predecir el delito en Bucaramanga, una ciudad intermedia de Colombia. Se utilizó el procesamiento de señales para grafos y una adaptación al caso del modelo de vectorización de texto TF-IDF. Se identificó que los mejores resultados en la predicción del crimen se dieron con modelos espaciales de grafos por semanas. Además, encontramos evidencia de que existen diversas dificultades de predicción, en dependencia de la periodicidad del modelo. La mejor opción posible (con los datos disponibles) es una periodicidad semanal. El mejor modelo encontrado es un KNN de clasificación, que alcanza un 59 % de exhaustividad(recall) y más de 60 % de exactitud (accuracy.). Concluimosque los modelos de predicción del delito constituyen una herramienta útil para construir estrategias de prevención en ciudades principales; sin embargo, existen limitaciones para su aplicación en ciudades intermedias, que cuentan con poca información. \\nAbstract \\nThe use of technology to prevent and respond to citizen security challenges is increasingly frequent. However, empirical evidence has been concentrated in major cities with large amounts of data and local authorities' strong capacities. Therefore, this investigation aims to capture a series of policy recommendations based on a machine learning crime prediction model in an intermediate city in Colombia, Bucaramanga (department of Santander). The model used signal processing for graphs and an adaptation of the TF-IDF text vectorization model to the space-time case, for each of the cities’ neighborhoods. The results show that the best crime prediction outcomes were obtained when using the models with spatial relationships of graphs by weeks. Evidence of the difficulty in predictions based on the periodicity of the model is found. The best possible prediction (with available data) is weekly prediction. In addition, the best model found was a KNN classification model, reaching 59 % of recall and more than 60 % of accuracy. We concluded that crime prediction models are a helpful tool for constructing prevention strategies in major cities; however, there are limitations to its application in intermediate cities and rural areas in Colombia, which have little statistical information and few technical capabilities.\",\"PeriodicalId\":41674,\"journal\":{\"name\":\"URVIO-Revista Latinoamericana de Estudios de Seguridad\",\"volume\":\"159 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2022-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"URVIO-Revista Latinoamericana de Estudios de Seguridad\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17141/urvio.34.2022.5395\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"POLITICAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"URVIO-Revista Latinoamericana de Estudios de Seguridad","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17141/urvio.34.2022.5395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}

引用次数: 2

摘要

使用技术预防犯罪是一种越来越普遍的做法。然而，证据集中在主要城市，这些城市拥有丰富的数据和更好的地方能力。这项研究的目的是提出一个“机器学习”模型的结果，以预测哥伦比亚中部城市布卡拉曼加的犯罪。本研究的目的是评估在墨西哥和拉丁美洲进行的一项研究的结果。结果表明，空间图模型在预测犯罪方面的效果最好。此外，我们发现有证据表明，根据模型的周期性，存在一些预测困难。最好的选择(在数据可用的情况下)是每周。发现的最佳模型是KNN分类，达到59%的召回和超过60%的准确度。我们的结论是，犯罪预测模型是在主要城市建立预防策略的有用工具;然而，它在信息匮乏的中等城市的应用存在局限性。利用技术预防和应对公民安全挑战的情况越来越普遍。然而，经验证据集中在数据量大、地方当局能力强的主要城市。因此，这项调查的目的是在哥伦比亚布卡拉曼加(桑坦德省)的一个中间城市，根据机器学习犯罪预测模型，提出一系列政策建议。该模型用于图形的信号处理和TF-IDF文本矢量化模型对每个城市社区的时空情况的适应。结果表明，以周为单位使用具有图形空间关系的模型可以获得最佳的犯罪预测结果。《滑道的证据转换based on the periodicity of the model is found。The best可能prediction (with data)是《prediction提供。此外，发现的最佳模型是KNN分类模型，召回率达到59%，准确率超过60%。我们认为犯罪预测模型是在主要城市制定预防战略的有用工具;然而，它在哥伦比亚的中等城市和农村地区的应用存在局限性，这些地区统计资料少，技术能力少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Prediciendo el crimen en ciudades intermedias: un modelo de “machine learning” en Bucaramanga, Colombia

El uso de tecnología para prevenir el crimen es una práctica cada vez más frecuente. Sin embargo, la evidencia se ha concentrado en ciudades principales, que cuentan con gran cantidad de datos y mejores capacidades locales. El objetivo de esta investigación es presentar los resultados de un modelo de “machine learning” para predecir el delito en Bucaramanga, una ciudad intermedia de Colombia. Se utilizó el procesamiento de señales para grafos y una adaptación al caso del modelo de vectorización de texto TF-IDF. Se identificó que los mejores resultados en la predicción del crimen se dieron con modelos espaciales de grafos por semanas. Además, encontramos evidencia de que existen diversas dificultades de predicción, en dependencia de la periodicidad del modelo. La mejor opción posible (con los datos disponibles) es una periodicidad semanal. El mejor modelo encontrado es un KNN de clasificación, que alcanza un 59 % de exhaustividad(recall) y más de 60 % de exactitud (accuracy.). Concluimosque los modelos de predicción del delito constituyen una herramienta útil para construir estrategias de prevención en ciudades principales; sin embargo, existen limitaciones para su aplicación en ciudades intermedias, que cuentan con poca información. Abstract The use of technology to prevent and respond to citizen security challenges is increasingly frequent. However, empirical evidence has been concentrated in major cities with large amounts of data and local authorities' strong capacities. Therefore, this investigation aims to capture a series of policy recommendations based on a machine learning crime prediction model in an intermediate city in Colombia, Bucaramanga (department of Santander). The model used signal processing for graphs and an adaptation of the TF-IDF text vectorization model to the space-time case, for each of the cities’ neighborhoods. The results show that the best crime prediction outcomes were obtained when using the models with spatial relationships of graphs by weeks. Evidence of the difficulty in predictions based on the periodicity of the model is found. The best possible prediction (with available data) is weekly prediction. In addition, the best model found was a KNN classification model, reaching 59 % of recall and more than 60 % of accuracy. We concluded that crime prediction models are a helpful tool for constructing prevention strategies in major cities; however, there are limitations to its application in intermediate cities and rural areas in Colombia, which have little statistical information and few technical capabilities.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

URVIO-Revista Latinoamericana de Estudios de Seguridad POLITICAL SCIENCE-

自引率

0.00%

发文量