面向对象数据库中的无模式XML文档管理

International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II Pub Date : 2005-04-04 DOI:10.1109/ITCC.2005.250

S. M. Chung, Sheeya B. Jesurajaiah

{"title":"面向对象数据库中的无模式XML文档管理","authors":"S. M. Chung, Sheeya B. Jesurajaiah","doi":"10.1109/ITCC.2005.250","DOIUrl":null,"url":null,"abstract":"XML has emerged as the universal format for data exchange and is being widely used in technical documentation of manufacturing companies, book publishing, and Web publishing. It is also being used in contexts that are mission-critical, such as banking and e-commerce. One of the main reasons for the popularity of XML is its semi-structured nature and the flexibility of its component parts. Schemaless XML documents, which do not have a schema or Document Type Definition (DTD), have the highest level of flexibility, as they are not bound to any kind of structure. However, support for schemaless XML documents in the major relational and object-oriented databases is limited to storing each of them as a CLOB (Character Large Object). This kind of storage requires the usage of associative search criteria to query the document, and does not support piecewise update and multiple document queries. In this research, we developed a middleware layer over an Oracle DBMS to support the storage, retrieval, query and update of schemaless XML documents. We also propose an algorithm by which we can store the logical structure of the XML document in addition to storing its content. The final part of this paper presents an algorithm that can rebuild XML documents by using the stored logical structures and the content of the XML documents. The rebuilt XML document is a well-formed document and is similar to the original document.","PeriodicalId":326887,"journal":{"name":"International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II","volume":"11 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Schemaless XML document management in object-oriented databases\",\"authors\":\"S. M. Chung, Sheeya B. Jesurajaiah\",\"doi\":\"10.1109/ITCC.2005.250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"XML has emerged as the universal format for data exchange and is being widely used in technical documentation of manufacturing companies, book publishing, and Web publishing. It is also being used in contexts that are mission-critical, such as banking and e-commerce. One of the main reasons for the popularity of XML is its semi-structured nature and the flexibility of its component parts. Schemaless XML documents, which do not have a schema or Document Type Definition (DTD), have the highest level of flexibility, as they are not bound to any kind of structure. However, support for schemaless XML documents in the major relational and object-oriented databases is limited to storing each of them as a CLOB (Character Large Object). This kind of storage requires the usage of associative search criteria to query the document, and does not support piecewise update and multiple document queries. In this research, we developed a middleware layer over an Oracle DBMS to support the storage, retrieval, query and update of schemaless XML documents. We also propose an algorithm by which we can store the logical structure of the XML document in addition to storing its content. The final part of this paper presents an algorithm that can rebuild XML documents by using the stored logical structures and the content of the XML documents. The rebuilt XML document is a well-formed document and is similar to the original document.\",\"PeriodicalId\":326887,\"journal\":{\"name\":\"International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II\",\"volume\":\"11 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITCC.2005.250\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITCC.2005.250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

XML已经成为数据交换的通用格式，并被广泛用于制造公司的技术文档、图书出版和Web出版。它也被用于关键任务的环境中，例如银行和电子商务。XML流行的主要原因之一是它的半结构化特性和组成部分的灵活性。没有模式或文档类型定义(Document Type Definition, DTD)的无模式XML文档具有最高级别的灵活性，因为它们不绑定到任何类型的结构。但是，在主要的关系数据库和面向对象数据库中对无模式XML文档的支持仅限于将它们存储为CLOB(字符大对象)。这种存储需要使用关联搜索条件来查询文档，不支持分段更新和多个文档查询。在本研究中，我们在Oracle DBMS之上开发了一个中间件层来支持无模式XML文档的存储、检索、查询和更新。我们还提出了一种算法，通过该算法除了存储XML文档的内容外，还可以存储XML文档的逻辑结构。最后，本文提出了一种利用存储的逻辑结构和XML文档的内容来重建XML文档的算法。重建的XML文档是一个格式良好的文档，与原始文档类似。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Schemaless XML document management in object-oriented databases

XML has emerged as the universal format for data exchange and is being widely used in technical documentation of manufacturing companies, book publishing, and Web publishing. It is also being used in contexts that are mission-critical, such as banking and e-commerce. One of the main reasons for the popularity of XML is its semi-structured nature and the flexibility of its component parts. Schemaless XML documents, which do not have a schema or Document Type Definition (DTD), have the highest level of flexibility, as they are not bound to any kind of structure. However, support for schemaless XML documents in the major relational and object-oriented databases is limited to storing each of them as a CLOB (Character Large Object). This kind of storage requires the usage of associative search criteria to query the document, and does not support piecewise update and multiple document queries. In this research, we developed a middleware layer over an Oracle DBMS to support the storage, retrieval, query and update of schemaless XML documents. We also propose an algorithm by which we can store the logical structure of the XML document in addition to storing its content. The final part of this paper presents an algorithm that can rebuild XML documents by using the stored logical structures and the content of the XML documents. The rebuilt XML document is a well-formed document and is similar to the original document.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II

自引率

0.00%

发文量

期刊最新文献

Component metadata management and publication for the grid A mathematical investigation on retrieval performance evaluation measures of information retrieval algorithm Single-parameter blackjack betting systems inspired by scatter search A time-series biclustering algorithm for revealing co-regulated genes A methodology for evaluating agent toolkits