{"title":"EMPWR 平台:知识图谱生命周期的数据和知识驱动流程","authors":"Hong Yung Yip, Amit Sheth","doi":"10.1109/mic.2023.3339858","DOIUrl":null,"url":null,"abstract":"The unparalleled volume of data generated has heightened the need for approaches that can consume these data in a scalable and automated fashion. Although modern data-driven, deep-learning-based systems are cost-efficient and can learn complex patterns, they are black boxes in nature, and the underlying input data highly dictate their world model. Knowledge graphs (KGs), as one such technology, have surfaced as a compelling approach for using structured knowledge representation to support the integration of knowledge from diverse sources and formats. We present Empower (EMPWR), a comprehensive KG development and lifecycle support platform that uses a broad variety of techniques from symbolic and modern data-driven systems. We discuss the sets of system design guiding principles used to develop EMPWR, its system architectures, and workflow components. We illustrate some of EMPWR’s abilities by describing a process of creating and maintaining a KG for the pharmaceuticals domain.","PeriodicalId":13121,"journal":{"name":"IEEE Internet Computing","volume":"31 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The EMPWR Platform: Data and Knowledge-Driven Processes for the Knowledge Graph Lifecycle\",\"authors\":\"Hong Yung Yip, Amit Sheth\",\"doi\":\"10.1109/mic.2023.3339858\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The unparalleled volume of data generated has heightened the need for approaches that can consume these data in a scalable and automated fashion. Although modern data-driven, deep-learning-based systems are cost-efficient and can learn complex patterns, they are black boxes in nature, and the underlying input data highly dictate their world model. Knowledge graphs (KGs), as one such technology, have surfaced as a compelling approach for using structured knowledge representation to support the integration of knowledge from diverse sources and formats. We present Empower (EMPWR), a comprehensive KG development and lifecycle support platform that uses a broad variety of techniques from symbolic and modern data-driven systems. We discuss the sets of system design guiding principles used to develop EMPWR, its system architectures, and workflow components. We illustrate some of EMPWR’s abilities by describing a process of creating and maintaining a KG for the pharmaceuticals domain.\",\"PeriodicalId\":13121,\"journal\":{\"name\":\"IEEE Internet Computing\",\"volume\":\"31 1\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-02-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/mic.2023.3339858\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/mic.2023.3339858","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
摘要
由于产生的数据量无与伦比,人们更加需要能够以可扩展和自动化的方式使用这些数据的方法。虽然基于深度学习的现代数据驱动系统具有成本效益,并能学习复杂的模式,但它们本质上是黑盒子,底层输入数据在很大程度上决定了它们的世界模型。知识图谱(KG)就是这样一种技术,它是使用结构化知识表示法来支持整合来自不同来源和格式的知识的一种引人注目的方法。我们介绍了 Empower (EMPWR),这是一个全面的知识图谱开发和生命周期支持平台,采用了符号系统和现代数据驱动系统的多种技术。我们讨论了用于开发 EMPWR 的系统设计指导原则、系统架构和工作流组件。我们通过描述创建和维护制药领域 KG 的过程来说明 EMPWR 的一些能力。
The EMPWR Platform: Data and Knowledge-Driven Processes for the Knowledge Graph Lifecycle
The unparalleled volume of data generated has heightened the need for approaches that can consume these data in a scalable and automated fashion. Although modern data-driven, deep-learning-based systems are cost-efficient and can learn complex patterns, they are black boxes in nature, and the underlying input data highly dictate their world model. Knowledge graphs (KGs), as one such technology, have surfaced as a compelling approach for using structured knowledge representation to support the integration of knowledge from diverse sources and formats. We present Empower (EMPWR), a comprehensive KG development and lifecycle support platform that uses a broad variety of techniques from symbolic and modern data-driven systems. We discuss the sets of system design guiding principles used to develop EMPWR, its system architectures, and workflow components. We illustrate some of EMPWR’s abilities by describing a process of creating and maintaining a KG for the pharmaceuticals domain.
期刊介绍:
This magazine provides a journal-quality evaluation and review of Internet-based computer applications and enabling technologies. It also provides a source of information as well as a forum for both users and developers. The focus of the magazine is on Internet services using WWW, agents, and similar technologies. This does not include traditional software concerns such as object-oriented or structured programming, or Common Object Request Broker Architecture (CORBA) or Object Linking and Embedding (OLE) standards. The magazine may, however, treat the intersection of these software technologies with the Web or agents. For instance, the linking of ORBs and Web servers or the conversion of KQML messages to object requests are relevant technologies for this magazine. An article strictly about CORBA would not be. This magazine is not focused on intelligent systems. Techniques for encoding knowledge or breakthroughs in neural net technologies are outside its scope, as would be an article on the efficacy of a particular expert system. Internet Computing focuses on technologies and applications that allow practitioners to leverage off services to be found on the Internet.