Abrar A. Almuhanna, Marwa M. Alrehili, Samah H. Alsubhi, Liyakathunisa Syed
{"title":"利用空间数据分析预测纽约市社区犯罪","authors":"Abrar A. Almuhanna, Marwa M. Alrehili, Samah H. Alsubhi, Liyakathunisa Syed","doi":"10.1109/CAIDA51941.2021.9425120","DOIUrl":null,"url":null,"abstract":"Crimes prediction is one of the most important topics in recent years that aim to protect people’s lives. These analytical studies for criminal hotspots are frequently demanded by law enforcement agencies hence, there is a huge requirement and demand for enhanced geographic information systems and innovative spatial data mining techniques in order to enhance crime detections and better protect their communities. In this paper, we propose a methodology to predict Spatio-temporal criminal patterns within the New York City neighbourhoods using a dataset from 2006 until 2019 with 2.2M criminal records for 25 different crimes type. In order to achieve the study objectives, the methodology passes through several stages until the final results are reached, starting with the visualization analysis of Spatio-temporal New York crime data which is important in decision-making, followed by, applying three different classifiers namely; Support Vector Machine (SVM), Random Forest (RF), and XGboost classifiers. After analysis, it is illustrated that XGboost has predicted the highest number of correct classifications out of 25 different crime types it has predicted 22 types of crime accurately, whereas Random Forest has predicted 21 types of crime accurately and SVM predicted accurately 17 types of crimes with lowest accuracy. Hence XGBoost outperformed all other models and can be considered for detection of crimes in the neighborhood.","PeriodicalId":272573,"journal":{"name":"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Prediction of Crime in Neighbourhoods of New York City using Spatial Data Analysis\",\"authors\":\"Abrar A. Almuhanna, Marwa M. Alrehili, Samah H. Alsubhi, Liyakathunisa Syed\",\"doi\":\"10.1109/CAIDA51941.2021.9425120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crimes prediction is one of the most important topics in recent years that aim to protect people’s lives. These analytical studies for criminal hotspots are frequently demanded by law enforcement agencies hence, there is a huge requirement and demand for enhanced geographic information systems and innovative spatial data mining techniques in order to enhance crime detections and better protect their communities. In this paper, we propose a methodology to predict Spatio-temporal criminal patterns within the New York City neighbourhoods using a dataset from 2006 until 2019 with 2.2M criminal records for 25 different crimes type. In order to achieve the study objectives, the methodology passes through several stages until the final results are reached, starting with the visualization analysis of Spatio-temporal New York crime data which is important in decision-making, followed by, applying three different classifiers namely; Support Vector Machine (SVM), Random Forest (RF), and XGboost classifiers. After analysis, it is illustrated that XGboost has predicted the highest number of correct classifications out of 25 different crime types it has predicted 22 types of crime accurately, whereas Random Forest has predicted 21 types of crime accurately and SVM predicted accurately 17 types of crimes with lowest accuracy. Hence XGBoost outperformed all other models and can be considered for detection of crimes in the neighborhood.\",\"PeriodicalId\":272573,\"journal\":{\"name\":\"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAIDA51941.2021.9425120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAIDA51941.2021.9425120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Crime in Neighbourhoods of New York City using Spatial Data Analysis
Crimes prediction is one of the most important topics in recent years that aim to protect people’s lives. These analytical studies for criminal hotspots are frequently demanded by law enforcement agencies hence, there is a huge requirement and demand for enhanced geographic information systems and innovative spatial data mining techniques in order to enhance crime detections and better protect their communities. In this paper, we propose a methodology to predict Spatio-temporal criminal patterns within the New York City neighbourhoods using a dataset from 2006 until 2019 with 2.2M criminal records for 25 different crimes type. In order to achieve the study objectives, the methodology passes through several stages until the final results are reached, starting with the visualization analysis of Spatio-temporal New York crime data which is important in decision-making, followed by, applying three different classifiers namely; Support Vector Machine (SVM), Random Forest (RF), and XGboost classifiers. After analysis, it is illustrated that XGboost has predicted the highest number of correct classifications out of 25 different crime types it has predicted 22 types of crime accurately, whereas Random Forest has predicted 21 types of crime accurately and SVM predicted accurately 17 types of crimes with lowest accuracy. Hence XGBoost outperformed all other models and can be considered for detection of crimes in the neighborhood.