Introducing Context and Context-awareness in Data Integration: Identifying the Problem and a Preliminary Case Study on Informed Consent

C. Debruyne
{"title":"Introducing Context and Context-awareness in Data Integration: Identifying the Problem and a Preliminary Case Study on Informed Consent","authors":"C. Debruyne","doi":"10.1145/3428757.3429116","DOIUrl":null,"url":null,"abstract":"Data integration is the process of selecting, preprocessing, and transforming data from heterogeneous sources in data-driven projects. This process also requires the most time, effort, resources. Data integration is such an involved process due to the many informed decisions one has to make. These decisions are influenced by the complex context of a data-driven project. We argue that using said context could facilitate the decision-making processes and even automate some integration steps. However, the problem we identify in this paper is that the context of a data-driven project is tacit and, therefore, not easily accessible by humans and certainly not by software agents. From the SotA, however, we observe that current models represent the context in crude and simplistic terms. These context models are furthermore built for specific tasks or application domains such as query optimization or a smart home. The current state of affairs is thus is not fit for intelligent data integration. Next to identifying the problem, we postulate that solving this problem requires two steps: formalizing context and using that context for building context-aware agents. We illustrate this notion of \"context-aware data integration\" with preliminary results obtained with a use case in the domain of GDPR, more specifically the generation of datasets that takes into account informed consent.","PeriodicalId":212557,"journal":{"name":"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3428757.3429116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Data integration is the process of selecting, preprocessing, and transforming data from heterogeneous sources in data-driven projects. This process also requires the most time, effort, resources. Data integration is such an involved process due to the many informed decisions one has to make. These decisions are influenced by the complex context of a data-driven project. We argue that using said context could facilitate the decision-making processes and even automate some integration steps. However, the problem we identify in this paper is that the context of a data-driven project is tacit and, therefore, not easily accessible by humans and certainly not by software agents. From the SotA, however, we observe that current models represent the context in crude and simplistic terms. These context models are furthermore built for specific tasks or application domains such as query optimization or a smart home. The current state of affairs is thus is not fit for intelligent data integration. Next to identifying the problem, we postulate that solving this problem requires two steps: formalizing context and using that context for building context-aware agents. We illustrate this notion of "context-aware data integration" with preliminary results obtained with a use case in the domain of GDPR, more specifically the generation of datasets that takes into account informed consent.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在数据集成中引入上下文和上下文感知:识别问题和知情同意的初步案例研究
数据集成是在数据驱动的项目中从异构源选择、预处理和转换数据的过程。这个过程也需要最多的时间、精力和资源。由于必须做出许多明智的决策,数据集成是一个非常复杂的过程。这些决策受到数据驱动项目的复杂环境的影响。我们认为,使用上述上下文可以促进决策过程,甚至自动化一些集成步骤。然而,我们在本文中确定的问题是,数据驱动项目的上下文是隐性的,因此,不容易被人类访问,当然也不容易被软件代理访问。然而,从SotA中,我们观察到当前的模型以粗糙和简单的术语表示上下文。这些上下文模型是为特定任务或应用程序领域(如查询优化或智能家居)进一步构建的。因此,目前的现状并不适合智能数据集成。在确定问题之后,我们假设解决这个问题需要两个步骤:形式化上下文并使用该上下文构建上下文感知代理。我们通过GDPR领域的一个用例获得的初步结果来说明“上下文感知数据集成”的概念,更具体地说,是考虑到知情同意的数据集的生成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Tailored Graph Embeddings for Entity Alignment on Historical Data CommunityCare A Comparison of Two Database Partitioning Approaches that Support Taxonomy-Based Query Answering Prediction of Cesarean Childbirth using Ensemble Machine Learning Methods Interoperability of Semantically-Enabled Web Services on the WoT: Challenges and Prospects
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1