Semi-Automatic Ontology Construction from HTML Documents: A conversion of Text-formed Information into OWL 2

International Journal of Contents Pub Date : 2016-06-28 DOI:10.5392/IJOC.2016.12.2.024

Changjae Im, Do wan Kim

引用次数: 0

Abstract

Ontology is known to be one of the most important technologies in achieving semantic web. It is critical as it represents the knowledge in a machine readable state. World Wide Web Consortium (W3C) has been contributing to the development of ontology for the last several years. However, the recommendation of W3C left out HTML despite the massive amount of information it contains. Also, it is difficult and time consuming to keep up with all the technologies especially in the case of constructing ontology. Thus, we propose a module and methods that reuse HTML documents, extract necessary information from HTML tags and mapping it to OWL 2. We will be combining two kinds of approaches which will be the structural refinement for making an ontology skeleton and linguistic approach for adding detailed information onto the skeleton.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于HTML文档的半自动本体构建:文本形式信息到OWL的转换2

本体是实现语义网最重要的技术之一。它是至关重要的，因为它以机器可读的状态表示知识。在过去的几年里，万维网联盟(W3C)一直在为本体的发展做出贡献。然而，尽管HTML包含了大量的信息，W3C的推荐还是忽略了它。同时，在构建本体的过程中，要跟上各种技术的发展是非常困难和耗时的。因此，我们提出了一个重用HTML文档、从HTML标记中提取必要信息并将其映射到owl2的模块和方法。我们将结合两种方法，一种是构建本体骨架的结构改进方法，另一种是在骨架上添加详细信息的语言方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Contents

自引率

0.00%

发文量

审稿时长

8 weeks