{"title":"Linked Data: A Framework for Publishing FiveStar Open Government Data","authors":"Bassel Al-khatib, Ali A. Ali","doi":"10.5815/ijitcs.2021.06.01","DOIUrl":null,"url":null,"abstract":"With the increased adoption of open government initiatives around the world, a huge amount of governmental raw datasets was released. However, the data was published in heterogeneous formats and vocabularies and in many cases in bad quality due to inconsistency, messy, and maybe incorrectness as it has been collected by practicalities within the source organization, which makes it inefficient for reusing and integrating it for serving citizens and third-party apps. This research introduces the LDOG (Linked Data for Open Government) experimental framework, which aims to provide a modular architecture that can be integrated into the open government hierarchy, allowing huge amounts of data to be gathered in a fine-grained manner from source and directly publishing them as linked data based on Tim Berners lee’s five-star deployment scheme with a validation layer using SHACL, which results in high quality data. The general idea is to model the hierarchy of government and classify government organizations into two types, the modeling organizations at higher levels and data source organizations at lower levels. Modeling organization’s experts in linked data have the responsibility to design data templates, ontologies, SHACL shapes, and linkage specifications. whereas non-experts can be incorporated in data source organizations to utilize their knowledge in data to do mapping, reconciliation, and correcting data. This approach lowers the needed experts that represent a problem of linked data adoption. To test the functionality of our framework in action, we developed the LDOG platform which utilizes the different modules of the framework to power a set of user interfaces that can be used to publish government datasets. we used this platform to convert some of UAE's government datasets into linked data. Finally, on top of the converted data, we built a proof-of-concept app to show the power of five-star linked data for integrating datasets from disparate organizations and to promote the governments' adoption. Our work has defined a clear path to integrate the linked data into open governments and solid steps to publishing and enhancing it in a fine-grained and practical manner with a lower number of experts in linked data, It extends SHACL to define data shapes and convert CSV to RDF.","PeriodicalId":130361,"journal":{"name":"International Journal of Information Technology and Computer Science","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5815/ijitcs.2021.06.01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the increased adoption of open government initiatives around the world, a huge amount of governmental raw datasets was released. However, the data was published in heterogeneous formats and vocabularies and in many cases in bad quality due to inconsistency, messy, and maybe incorrectness as it has been collected by practicalities within the source organization, which makes it inefficient for reusing and integrating it for serving citizens and third-party apps. This research introduces the LDOG (Linked Data for Open Government) experimental framework, which aims to provide a modular architecture that can be integrated into the open government hierarchy, allowing huge amounts of data to be gathered in a fine-grained manner from source and directly publishing them as linked data based on Tim Berners lee’s five-star deployment scheme with a validation layer using SHACL, which results in high quality data. The general idea is to model the hierarchy of government and classify government organizations into two types, the modeling organizations at higher levels and data source organizations at lower levels. Modeling organization’s experts in linked data have the responsibility to design data templates, ontologies, SHACL shapes, and linkage specifications. whereas non-experts can be incorporated in data source organizations to utilize their knowledge in data to do mapping, reconciliation, and correcting data. This approach lowers the needed experts that represent a problem of linked data adoption. To test the functionality of our framework in action, we developed the LDOG platform which utilizes the different modules of the framework to power a set of user interfaces that can be used to publish government datasets. we used this platform to convert some of UAE's government datasets into linked data. Finally, on top of the converted data, we built a proof-of-concept app to show the power of five-star linked data for integrating datasets from disparate organizations and to promote the governments' adoption. Our work has defined a clear path to integrate the linked data into open governments and solid steps to publishing and enhancing it in a fine-grained and practical manner with a lower number of experts in linked data, It extends SHACL to define data shapes and convert CSV to RDF.
随着世界各地越来越多地采用开放政府的举措,大量的政府原始数据集被发布。然而,数据以异构格式和词汇表发布,并且在许多情况下由于不一致、混乱和可能不正确而导致质量差,因为它是由源组织内部的实用性收集的,这使得为服务公民和第三方应用程序重用和集成它的效率低下。本研究引入了LDOG (Linked Data for Open Government)实验框架,该框架旨在提供一个可以集成到开放政府层级的模块化架构,允许从源头以细粒度的方式收集大量数据,并基于Tim Berners lee的五星部署方案,使用SHACL进行验证层,直接将其作为链接数据发布,从而获得高质量的数据。总体思路是对政府的层次结构进行建模,并将政府组织分为两类,即高层的建模组织和低层的数据源组织。关联数据建模组织的专家有责任设计数据模板、本体、SHACL形状和链接规范。而非专家可以加入数据源组织,利用他们在数据方面的知识进行映射、协调和纠正数据。这种方法降低了代表关联数据采用问题的所需专家。为了测试我们框架的实际功能,我们开发了LDOG平台,该平台利用框架的不同模块来支持一组可用于发布政府数据集的用户界面。我们使用这个平台将阿联酋的一些政府数据集转换为关联数据。最后,在转换数据的基础上,我们建立了一个概念验证应用程序,以展示五星关联数据在整合来自不同组织的数据集方面的力量,并促进政府的采用。我们的工作定义了一条清晰的路径,将关联数据集成到开放的政府中,并采取坚实的步骤,以细粒度和实用的方式与较少数量的关联数据专家一起发布和增强关联数据。它扩展了SHACL来定义数据形状并将CSV转换为RDF。