A Structured Text ADT for Object-Relational Databases

Theory Pract. Object Syst. Pub Date : 1998-10-12 DOI:10.1002/(SICI)1096-9942(1998)4:4<227::AID-TAPO3>3.3.CO;2-L

L. Brown, M. Consens, I. Davis, C. R. Palmer, Frank Wm. Tompa

{"title":"A Structured Text ADT for Object-Relational Databases","authors":"L. Brown, M. Consens, I. Davis, C. R. Palmer, Frank Wm. Tompa","doi":"10.1002/(SICI)1096-9942(1998)4:4<227::AID-TAPO3>3.3.CO;2-L","DOIUrl":null,"url":null,"abstract":"There is a growing need to develop tools that are able to retrieve relevant textual information rapidly, to present textual information in a meaningful way, and to integrate textual information with related data retrieved from other sources. These tools are critical to support applications within corporate intranets and across the rapidly evolving World Wide Web. This paper introduces a framework for modelling structured text and presents a small set of operations that may be applied against such models. Using these operations structured text may be selected, marked, fragmented, and transformed into relations for use in relational and object-oriented database systems. The extended functionality has been accepted for inclusion within the SQL/MM standard, and a prototype database engine has been implemented to support SQL with the proposed extensions. This prototype serves as a proof of concept intended to address industrial concerns, and it demonstrates the power of the proposed abstract data type for structured text. 1. The challenge Database technology is essential to the operation of conventional business enterprises, and it is becoming increasingly important in the development of distributed information systems. However, most database systems, and in particular relational database systems, provide few facilities for effectively managing the vast body of electronic information embedded within text. Many customers require that large texts be searched both vertically, with respect to their internal structure, and horizontally, with respect to their textual content [Wei85]. Texts often need to be fragmented at appropriate structural boundaries. Sometimes selected text needs to be extracted as separate units, but often the appropriate context surrounding selected text must be recovered, and thus the selected text needs to be marked in some manner, so that it can be subsequently located within a potentially much larger context.","PeriodicalId":293061,"journal":{"name":"Theory Pract. Object Syst.","volume":"201 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theory Pract. Object Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/(SICI)1096-9942(1998)4:4<227::AID-TAPO3>3.3.CO;2-L","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

There is a growing need to develop tools that are able to retrieve relevant textual information rapidly, to present textual information in a meaningful way, and to integrate textual information with related data retrieved from other sources. These tools are critical to support applications within corporate intranets and across the rapidly evolving World Wide Web. This paper introduces a framework for modelling structured text and presents a small set of operations that may be applied against such models. Using these operations structured text may be selected, marked, fragmented, and transformed into relations for use in relational and object-oriented database systems. The extended functionality has been accepted for inclusion within the SQL/MM standard, and a prototype database engine has been implemented to support SQL with the proposed extensions. This prototype serves as a proof of concept intended to address industrial concerns, and it demonstrates the power of the proposed abstract data type for structured text. 1. The challenge Database technology is essential to the operation of conventional business enterprises, and it is becoming increasingly important in the development of distributed information systems. However, most database systems, and in particular relational database systems, provide few facilities for effectively managing the vast body of electronic information embedded within text. Many customers require that large texts be searched both vertically, with respect to their internal structure, and horizontally, with respect to their textual content [Wei85]. Texts often need to be fragmented at appropriate structural boundaries. Sometimes selected text needs to be extracted as separate units, but often the appropriate context surrounding selected text must be recovered, and thus the selected text needs to be marked in some manner, so that it can be subsequently located within a potentially much larger context.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

对象关系数据库的结构化文本ADT

越来越需要开发能够快速检索相关文本信息的工具，以有意义的方式呈现文本信息，并将文本信息与从其他来源检索的相关数据集成在一起。这些工具对于支持企业内部网和快速发展的万维网中的应用程序至关重要。本文介绍了一个结构化文本建模的框架，并提出了一组可以应用于此类模型的操作。使用这些操作，可以选择、标记、分割结构化文本，并将其转换为关系和面向对象数据库系统中使用的关系。扩展的功能已经被接受包含在SQL/MM标准中，并且已经实现了一个原型数据库引擎来支持SQL和提议的扩展。这个原型作为概念的证明，旨在解决工业问题，并演示了所建议的用于结构化文本的抽象数据类型的强大功能。1. 数据库技术对传统商业企业的运作至关重要，在分布式信息系统的发展中也变得越来越重要。然而，大多数数据库系统，特别是关系数据库系统，几乎没有提供有效管理嵌入文本中的大量电子信息的工具。许多客户要求对大型文本进行纵向搜索(针对其内部结构)和横向搜索(针对其文本内容)[Wei85]。文本通常需要在适当的结构边界上进行分割。有时需要将所选文本作为单独的单元提取出来，但是通常必须恢复所选文本周围的适当上下文，因此需要以某种方式标记所选文本，以便随后可以将其定位在可能更大的上下文中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Theory Pract. Object Syst.

自引率

0.00%

发文量

期刊最新文献

The Electronic Library Project: SGML Document Management System Based on ODBMS A Performance Study of Object Database Management Systems Building CORBA Applications with an Object Database System Object Management for a Visual Data Analysis Tool In the Trenches with ObjectStore