Developing a data pipeline to improve accessibility and utilization of Charlottesville's Open Data Portal

L. Beane, Elena Gillis, Raf Alvarado, C. Wylie
Published in: 2019 Systems and Information Engineering Design Symposium (SIEDS)
Publication date: 2019-04-26
DOI: 10.1109/SIEDS.2019.8735653 (https://doi.org/10.1109/SIEDS.2019.8735653)

Abstract

To improve democratic engagement between the people and the government, the city of Charlottesville proposed an online portal containing data from city departments that is public by nature. The portal was intended to ease access to data pertinent to ongoing policy debates in the city and to encourage informed public participation in the policy-making process. These efforts, while successful at the start, have gradually stagnated, and the portal's end objective has not been reached. In this paper we identify possible reasons for this stagnation: inconsistent formatting of the datasets, variables that are not human-readable, and limited data with disproportionate representation across the city departments. We then propose a data pipeline that serves as a tool to extract utility from the data. It does so by converting the datasets into a consistent format, merging them, and allowing for the creation of simple visualizations. The pipeline acts as a link between the raw data published by the government units and the city by increasing the data's interpretability and legibility and by outputting results that are easily related to the policy issues at hand. We demonstrate this by analyzing crime and real estate datasets and relating our findings to the affordable housing debate.
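
The abstract does not describe how the pipeline is implemented, so the following is only a minimal sketch of the three stages it names (convert to a consistent format, merge, visualize), using the Python standard library. The sample data, the column names, the `neighborhood` join key, and every function name are hypothetical and are not taken from the paper or from the Charlottesville portal.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data: columns and values are invented for illustration.
CRIME_CSV = """Offense,BlockNumber,NEIGHBORHOOD
Larceny,100,Belmont
Vandalism,200,BELMONT
Assault,300,Fifeville
"""

REAL_ESTATE_CSV = """neighborhood,Assessed Value
Belmont,310000
Fifeville,250000
Belmont,290000
"""


def normalize_header(name):
    """Convert a column header to lowercase, underscore-separated form."""
    return "_".join(name.strip().lower().split())


def load_rows(text):
    """Parse CSV text into dicts with normalized headers and trimmed values."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {normalize_header(key): value.strip() for key, value in row.items()}
        for row in reader
    ]


def merge_by_neighborhood(crime_rows, estate_rows):
    """Join incident counts with the mean assessed value per neighborhood."""
    # Title-case the key so "BELMONT" and "Belmont" land in the same group.
    incidents = Counter(row["neighborhood"].title() for row in crime_rows)
    values = {}
    for row in estate_rows:
        key = row["neighborhood"].title()
        values.setdefault(key, []).append(float(row["assessed_value"]))
    merged = []
    # Keep only neighborhoods present in both datasets (an inner join).
    for hood in sorted(incidents.keys() & values.keys()):
        merged.append({
            "neighborhood": hood,
            "incidents": incidents[hood],
            "mean_assessed_value": sum(values[hood]) / len(values[hood]),
        })
    return merged


def text_bar_chart(rows, label, value):
    """Render one '#' per unit as a minimal plain-text bar chart."""
    return "\n".join(f"{row[label]:<12} {'#' * int(row[value])}" for row in rows)


merged = merge_by_neighborhood(load_rows(CRIME_CSV), load_rows(REAL_ESTATE_CSV))
print(text_bar_chart(merged, "neighborhood", "incidents"))
```

The inner join is one possible design choice: it drops neighborhoods that appear in only one dataset, which keeps the merged table complete but can hide gaps in departmental coverage. A pipeline meant to expose such gaps might instead keep unmatched rows and mark the missing fields.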