Distributed service-oriented architecture for information extraction system "Semanta"

5th International Conference on Intelligent Systems Design and Applications (ISDA'05) Pub Date : 2005-09-08 DOI:10.1109/ISDA.2005.39

Lukasz Jastrzebski, Maciej Piasecki, Grzegorz Strzelecki

引用次数: 1

Abstract

Our objective is to provide a flexible, scalable, distributed architecture that assures a high performance for information extraction (IE) systems working in Internet. The architecture is based on both the general paradigm of the service-oriented architecture, client-server approach and strong separation of concerns between storage and processing components. An experimental IE system, named Semanta, utilising the proposed architecture is also presented. In the following document, we describe five main Semanta services, which are Web user interface (WebUI), Web crawler service (WCS), parsing service (PS), IE service and manager

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向服务的分布式信息抽取系统“Semanta”体系结构

我们的目标是提供一个灵活的、可扩展的、分布式的体系结构，以保证在Internet上工作的信息提取(IE)系统的高性能。该体系结构基于面向服务的体系结构的通用范例、客户机-服务器方法以及存储和处理组件之间的强烈关注点分离。本文还介绍了一个基于该架构的实验性IE系统Semanta。在下面的文档中，我们描述了五个主要的语义服务，它们是Web用户界面(Web ui)、Web爬虫服务(WCS)、解析服务(PS)、IE服务和管理器

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

5th International Conference on Intelligent Systems Design and Applications (ISDA'05)

自引率

0.00%

发文量