D. Wallace, E. Helderop, Anthony Grubesic, Jason Walker, X. Liu, R. Wei, Yirong Zhou, Connor Stewart
{"title":"一个两步的过程,以增加成功的地理编码在公开可用的警察拦截数据","authors":"D. Wallace, E. Helderop, Anthony Grubesic, Jason Walker, X. Liu, R. Wei, Yirong Zhou, Connor Stewart","doi":"10.1080/15614263.2023.2181169","DOIUrl":null,"url":null,"abstract":"ABSTRACT Many police departments are meeting calls for transparency by releasing publicly accessible data. High-quality address locations are critical for successful and accurate geocoding, though the content and quality of that data can drastically vary across datasets. In this study, we showcase a two-step geocoding process that helps convert low-quality address locations into geo-locatable addresses using traditional geocoding and Jaro-Winkler edit distance methods with police stop data from the San Diego Police Department. For reference, only 83% of stops were geocoded when using traditional geocoding methods. By employing the Jaro-Winkler edit distance to clean the stop address strings, we were able to geocode 99% of stops. We further discuss data creation practices and solutions for data quality-related issues for police departments and researchers when using publicly available policing data.","PeriodicalId":47167,"journal":{"name":"Police Practice and Research","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2023-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A two-step process to increase successful geocoding in publicly available police stop data\",\"authors\":\"D. Wallace, E. Helderop, Anthony Grubesic, Jason Walker, X. Liu, R. Wei, Yirong Zhou, Connor Stewart\",\"doi\":\"10.1080/15614263.2023.2181169\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Many police departments are meeting calls for transparency by releasing publicly accessible data. High-quality address locations are critical for successful and accurate geocoding, though the content and quality of that data can drastically vary across datasets. In this study, we showcase a two-step geocoding process that helps convert low-quality address locations into geo-locatable addresses using traditional geocoding and Jaro-Winkler edit distance methods with police stop data from the San Diego Police Department. For reference, only 83% of stops were geocoded when using traditional geocoding methods. By employing the Jaro-Winkler edit distance to clean the stop address strings, we were able to geocode 99% of stops. We further discuss data creation practices and solutions for data quality-related issues for police departments and researchers when using publicly available policing data.\",\"PeriodicalId\":47167,\"journal\":{\"name\":\"Police Practice and Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2023-02-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Police Practice and Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/15614263.2023.2181169\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CRIMINOLOGY & PENOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Police Practice and Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/15614263.2023.2181169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CRIMINOLOGY & PENOLOGY","Score":null,"Total":0}
A two-step process to increase successful geocoding in publicly available police stop data
ABSTRACT Many police departments are meeting calls for transparency by releasing publicly accessible data. High-quality address locations are critical for successful and accurate geocoding, though the content and quality of that data can drastically vary across datasets. In this study, we showcase a two-step geocoding process that helps convert low-quality address locations into geo-locatable addresses using traditional geocoding and Jaro-Winkler edit distance methods with police stop data from the San Diego Police Department. For reference, only 83% of stops were geocoded when using traditional geocoding methods. By employing the Jaro-Winkler edit distance to clean the stop address strings, we were able to geocode 99% of stops. We further discuss data creation practices and solutions for data quality-related issues for police departments and researchers when using publicly available policing data.
期刊介绍:
Police Practice and Research is a peer-reviewed journal that presents current and innovative police research as well as operational and administrative practices from around the world. Articles and reports are sought from practitioners, researchers and others interested in developments in policing, analysis of public order, and the state of safety as it affects the quality of life everywhere. Police Practice and Research seeks to bridge the gap in knowledge that exists regarding who the police are, what they do, and how they maintain order, administer laws, and serve their communities. Attention will also be focused on specific organizational information about the police in different countries or regions. There will be periodic special issues devoted to a particular country or continent.