Semantic Web: latest articles, page 6

Understanding the structure of knowledge graphs with ABSTAT profiles

Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2023-03-09 DOI: 10.3233/sw-223181

Blerina Spahiu, Matteo Palmonari, Renzo Arturo Alva Principe, Anisa Rula

While there has been a trend in the last decades for publishing large-scale and highly-interconnected Knowledge Graphs (KGs), their users often get overwhelmed by the task of understanding their content as a result of their size and complexity. Data profiling approaches have been proposed to summarize large KGs into concise and meaningful representations, so that they can be better explored, processed, and managed. Profiles based on schema patterns represent each triple in a KG with its schema-level counterpart, thus covering the entire KG with profiles of considerable size. In this paper, we provide empirical evidence that profiles based on schema patterns, if explored with suitable mechanisms, can be useful to help users understand the content of big and complex KGs. ABSTAT provides concise pattern-based profiles and comes with faceted interfaces for profile exploration. Using this tool we present a user study based on query completion tasks. We demonstrate that users who look at ABSTAT profiles formulate their queries better and faster than users browsing the ontology of the KGs. The latter is a pretty strong baseline considering that many KGs do not even come with a specific ontology to be explored by the users. To the best of our knowledge, this is the first attempt to investigate the impact of profiling techniques on tasks related to knowledge graph understanding with a user study.
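The idea of replacing each triple with its schema-level counterpart can be sketched as follows. This is a simplified illustration and not ABSTAT's actual algorithm: the toy triples, the `TYPES` lookup, the `Thing` fallback, and the function name `schema_patterns` are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical toy knowledge graph: (subject, predicate, object) triples.
TRIPLES = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("alice", "worksFor", "acme"),
    ("carol", "worksFor", "acme"),
]

# Hypothetical type assertions: entity -> type.
TYPES = {
    "alice": "Person",
    "bob": "Person",
    "carol": "Person",
    "acme": "Company",
}


def schema_patterns(triples, types, default="Thing"):
    """Count (subject type, predicate, object type) patterns.

    Each instance-level triple is replaced by its schema-level
    counterpart; untyped entities fall back to `default`.
    """
    counts = Counter()
    for s, p, o in triples:
        counts[(types.get(s, default), p, types.get(o, default))] += 1
    return counts


profile = schema_patterns(TRIPLES, TYPES)
for pattern, occurrences in profile.most_common():
    print(pattern, occurrences)
```

Every triple contributes to exactly one pattern, so the pattern counts sum to the number of triples, while the number of distinct patterns is typically far smaller than the graph itself.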



Citations: 0

Helio: A framework for implementing the life cycle of knowledge graphs

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2023-01-12 DOI: 10.3233/sw-233224

Andrea Cimmino, R. García-Castro

Building and publishing knowledge graphs (KG) as Linked Data, either on the Web or in private companies, has become a relevant and crucial process in many domains. This process requires that users perform a wide number of tasks conforming to the life cycle of a KG, and these tasks usually involve different unrelated research topics, such as RDF materialisation or link discovery. There is already a large corpus of tools and methods designed to perform these tasks; however, the lack of one tool that gathers them all leads practitioners to develop ad-hoc pipelines that are not generic and, thus, non-reusable. As a result, building and publishing a KG is becoming a complex and resource-consuming process. In this paper, a generic framework called Helio is presented. The framework aims to cover a set of requirements elicited from the KG life cycle and provide a tool capable of performing the different tasks required to build and publish KGs. As a result, Helio aims at providing users with the means for reducing the effort required to perform this process and, also, Helio aims to prevent the development of ad-hoc pipelines. Furthermore, the Helio framework has been applied in many different contexts, from European projects to research work.



Citations: 4

An ontological approach for representing declarative mapping languages

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-12-29 DOI: 10.3233/sw-223224

Ana Iglesias-Molina, Andrea Cimmino, E. Ruckhaus, David Chaves-Fraga, R. García-Castro, Óscar Corcho

Knowledge Graphs are currently created using an assortment of techniques and tools: ad hoc code in a programming language, database export scripts, OpenRefine transformations, mapping languages, etc. Focusing on the latter, the wide variety of use cases, data peculiarities, and potential uses has had a substantial impact on how mappings have been created, extended, and applied. As a result, a large number of languages and their associated tools have been created. In this paper, we present the Conceptual Mapping ontology, which is designed to represent the features and characteristics of existing declarative mapping languages to construct Knowledge Graphs. This ontology is built upon the requirements extracted from experts' experience; a thorough analysis of the features and capabilities of current mapping languages, presented as a comparative framework; and the languages’ limitations, discussed by the community and denoted as Mapping Challenges. The ontology is evaluated to ensure that it meets these requirements and has no inconsistencies, pitfalls or modelling errors, and is publicly available online along with its documentation and related resources.



Citations: 3

A systematic overview of data federation systems

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-12-06 DOI: 10.3233/sw-223201

Zhenzhen Gu, F. Corcoglioniti, D. Lanti, A. Mosca, Guohui Xiao, Jingliu Xiong, D. Calvanese

Data federation addresses the problem of uniformly accessing multiple, possibly heterogeneous data sources, by mapping them into a unified schema, such as an RDF(S)/OWL ontology or a relational schema, and by supporting the execution of queries, like SPARQL or SQL queries, over that unified schema. Data explosion in volume and variety has made data federation increasingly popular in many application domains. Hence, many data federation systems have been developed in industry and academia, and it has become challenging for users to select suitable systems to achieve their objectives. In order to systematically analyze and compare these systems, we propose an evaluation framework comprising four dimensions: (i) federation capabilities, i.e., query language, data source, and federation techniques; (ii) data security, i.e., authentication, authorization, auditing, encryption, and data masking; (iii) interface, i.e., graphical interface, command line interface, and application programming interface; and (iv) development, i.e., main development language, deployment, commercial support, open source, and release. Using this framework, we thoroughly studied 51 data federation systems from the Semantic Web and Database communities. This paper shares the results of our investigation and aims to provide reference material and insights for users, developers and researchers selecting or further developing data federation systems.



Citations: 4

The OneGraph vision: Challenges of breaking the graph model lock-in

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-30 DOI: 10.3233/sw-223273

O. Lassila, Michael Schmidt, O. Hartig, B. Bebee, Dave Bechberger, Willem Broekema, Ankesh Khandelwal, K. Lawrence, Carlos-Manuel López-Enríquez, Ronak Sharda, B. Thompson

Amazon Neptune is a graph database service that supports two graph models: W3C’s Resource Description Framework (RDF) and Labeled Property Graphs (LPG). Customers choose one or the other model. This choice determines which data modeling features can be used and – perhaps more importantly – which query languages are available. The choice between the two technology stacks is difficult and time consuming. It requires consideration of data modeling aspects, query language features, their adequacy for current and future use cases, as well as developer knowledge. Even in cases where customers evaluate the pros and cons and make a conscious choice that fits their use case, over time we often see requirements from new use cases emerge that could be addressed more easily with a different data model or query language. It is therefore highly desirable that the choice of the query language can be made without consideration of what graph model is chosen and can be easily revised or complemented at a later point. To this end, we advocate and explore the idea of OneGraph (“1G” for short), a single, unified graph data model that embraces both RDF and LPGs. The goal of 1G is to achieve interoperability at both data level, by supporting the co-existence of RDF and LPG in the same database, as well as query level, by enabling queries and updates over the unified data model with a query language of choice. In this paper, we sketch our vision and investigate technical challenges towards a unification of the two graph data models.
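To make the gap between the two models concrete, the sketch below expresses one LPG edge that carries a property as RDF triples, using standard RDF reification. This is only one of several possible encodings and is not the unification proposed in the paper; the `LpgEdge` class, the `edge_to_rdf` function, and the example identifiers are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class LpgEdge:
    """A labeled property graph edge: endpoints, a label, and key/value properties."""
    source: str
    label: str
    target: str
    properties: dict = field(default_factory=dict)


def edge_to_rdf(edge, edge_id):
    """Translate one LPG edge into RDF triples.

    The edge itself maps to a single triple. A plain RDF triple cannot
    carry properties, so edge properties are attached to a reified
    statement node identified by `edge_id`.
    """
    triples = [(edge.source, edge.label, edge.target)]
    if edge.properties:
        triples += [
            (edge_id, "rdf:type", "rdf:Statement"),
            (edge_id, "rdf:subject", edge.source),
            (edge_id, "rdf:predicate", edge.label),
            (edge_id, "rdf:object", edge.target),
        ]
        triples += [(edge_id, key, value) for key, value in edge.properties.items()]
    return triples


edge = LpgEdge(":alice", ":knows", ":bob", {":since": 2020})
rdf = edge_to_rdf(edge, "_:e1")
for triple in rdf:
    print(triple)
```

A single edge with one property expands to six triples under this encoding, which hints at why the choice of graph model affects both data modeling and the queries written against it.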



Citations: 3

LSQ 2.0: A linked dataset of SPARQL query logs

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-29 DOI: 10.3233/sw-223015

Claus Stadler, Muhammad Saleem, Qaiser Mehmood, C. Buil-Aranda, M. Dumontier, A. Hogan, Axel-Cyrille Ngonga Ngomo

We present the Linked SPARQL Queries (LSQ) dataset, which currently describes 43.95 million executions of 11.56 million unique SPARQL queries extracted from the logs of 27 different endpoints. The LSQ dataset provides RDF descriptions of each such query, which are indexed in a public LSQ endpoint, allowing interested parties to find queries with the characteristics they require. We begin by describing the use cases envisaged for the LSQ dataset, which include applications for research on common features of queries, for building custom benchmarks, and for designing user interfaces. We then discuss how LSQ has been used in practice since the release of four initial SPARQL logs in 2015. We discuss the model and vocabulary that we use to represent these queries in RDF. We then provide a brief overview of the 27 endpoints from which we extracted queries in terms of the domain to which they pertain and the data they contain. We provide statistics on the queries included from each log, including the number of query executions, unique queries, as well as distributions of queries for a variety of selected characteristics. We finally discuss how the LSQ dataset is hosted and how it can be accessed and leveraged by interested parties for their use cases.



Citations: 6

Creating occupant-centered digital twins using the Occupant Feedback Ontology implemented in a smartwatch app

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-08 DOI: 10.3233/sw-223254

Alex Donkers, B. de Vries, Dujuan Yang

Occupant feedback enables building managers to improve occupants’ health, comfort, and satisfaction. However, acquiring continuous occupant feedback and integrating this feedback with other building information is challenging. This paper presents a scalable method to acquire continuous occupant feedback and directly integrate this with other building information. Semantic web technologies were applied to solve data interoperability issues. The Occupant Feedback Ontology was developed to describe feedback semantically. Next to this, a smartwatch app – Mintal – was developed to acquire continuous feedback on indoor environmental quality. The app gathers location, medical information, and answers on short micro surveys. Mintal applied the Occupant Feedback Ontology to directly integrate the feedback with linked building data. A case study was performed to evaluate this method. A semantic digital twin was created by integrating linked building data, sensor data, and occupant feedback. Results from SPARQL queries gave more insight into an occupant’s perceived comfort levels in the Open Flat. The case study shows how integrating feedback with building information allows for more occupant-centric decision support tools. The approach presented in this paper can be used in a wide range of use cases, both within and without the architecture, building, and construction domain.



Citations: 3

Food process ontology requirements

IF 3; Zone 3, Computer Science; Q2, COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-04 DOI: 10.3233/sw-223096

Damion M. Dooley, Magalie Weber, Liliana Ibanescu, Matthew Lange, L. Chan, L. Soldatova, Chen Yang, Robert Warren, C. Shimizu, H. Mcginty, W. Hsiao

People often value the sensual, celebratory, and health aspects of food, but behind this experience exist many other value-laden agricultural production, distribution, manufacturing, and physiological processes that support or undermine a healthy population and a sustainable future. The complexity of such processes is evident in both everyday food preparation of recipes and in industrial food manufacturing, packaging and storage, each of which depends critically on human or machine agents, chemical or organismal ingredient references, and the explicit instructions and implicit procedures held in formulations or recipes. An integrated ontology landscape does not yet exist to cover all the entities at work in this farm-to-fork journey. It seems necessary to construct such a vision by reusing expert-curated fit-to-purpose ontology subdomains and their relationship, material, and more abstract organization and role entities. The challenge is to make this merger be, by analogy, one language, rather than nouns and verbs from a dozen or more dialects which cannot be used directly in statements about some aspect of the farm-to-fork journey without expensive translation or substantial dialect education in order to understand a particular text or domain of knowledge. This work focuses on the ontology components – object and data properties and annotations – needed to model food processes or more general process modelling within the context of the Open Biological and Biomedical Ontology Foundry and congruent ontologies. Ideally these components can be brought together in a general process ontology that can be specialized not only for the food domain but for carrying out other protocols as well. Many operations involved in food identification, preparation, transportation and storage – shaking, boiling, mixing, freezing, labeling, shipping – are actually common to activities from manufacturing and laboratory work to local or home food preparation.

Citations: 6

ImageSchemaNet: A Framester graph for embodied commonsense knowledge

IF 3, Zone 3 (Computer Science), Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-03 DOI: 10.3233/sw-223084

Stefano De Giorgis, Aldo Gangemi, Dagmar Gromann

Commonsense knowledge is a broad and challenging area of research which investigates our understanding of the world as well as human assumptions about reality. Deriving directly from the subjective perception of the external world, it is intrinsically intertwined with embodied cognition. Commonsense reasoning is linked to human sense-making, pattern recognition and knowledge framing abilities. This work presents a new resource that formalizes the cognitive theory of image schemas. Image schemas are dynamic conceptual building blocks that originate from our sensorimotor interactions with the physical world and enable our sense-making cognitive activity to assign coherence and structure to the entities, events and situations we experience every day. ImageSchemaNet is an ontology that aligns pre-existing resources, such as FrameNet, VerbNet, WordNet and MetaNet from the Framester hub, to image schema theory. This article describes an empirical application of ImageSchemaNet, combined with semantic parsers, to the task of annotating natural language sentences with image schemas.

Citations: 2

Evaluating the usability of a semantic environmental health data framework: Approach and study

IF 3, Zone 3 (Computer Science), Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Semantic Web

Pub Date : 2022-11-03 DOI: 10.3233/sw-223212

Albert Navarro-Gallinad, F. Orlandi, Jennifer Scott, Mark Little, D. O’Sullivan

Environmental exposures transported across air, land and water can affect our health, making us more susceptible to developing a disease. Researchers therefore face the complex task of integrating environmental exposures and linking them to health events with the relevant spatiotemporal and health context for individuals or populations. We present a usability evaluation approach and study of a semantic framework (i.e., Knowledge Graph, Methodology and User Interface) to enable Health Data Researchers (HDR) to link particular health events with environmental data for rare disease research. The usability study includes 17 HDRs with expertise in health data related to Anti-Neutrophil Cytoplasmic Antibody (ANCA)-associated vasculitis (AAV) in Ireland and Kawasaki Disease in Japan, and with no previous practical experience in using Semantic Web (SW) technologies. The evaluation results are promising: they indicate that the framework is useful in allowing researchers themselves to link health and environmental data whilst hiding the complexities of SW technologies. As a result of this work, we also discuss the limitations of the approach together with its applicability to other domains. Beyond the direct impact on environmental health studies, the description of the evaluation approach can guide researchers in making SW technologies more accessible to domain experts through usability studies.

Citations: 0