Authors: S. M. Chung, Sheeya B. Jesurajaiah
Published in: International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II
Publication date: 2005-04-04
DOI: 10.1109/ITCC.2005.250 (https://doi.org/10.1109/ITCC.2005.250)
Citations: 11
Schemaless XML document management in object-oriented databases
XML has emerged as the universal format for data exchange and is widely used in the technical documentation of manufacturing companies, in book publishing, and in Web publishing. It is also used in mission-critical contexts such as banking and e-commerce. One of the main reasons for XML's popularity is its semi-structured nature and the flexibility of its component parts. Schemaless XML documents, which have neither a schema nor a Document Type Definition (DTD), offer the highest level of flexibility because they are not bound to any predefined structure. However, support for schemaless XML documents in the major relational and object-oriented databases is limited to storing each document as a CLOB (Character Large Object). This kind of storage requires associative search criteria to query a document, and it supports neither piecewise updates nor queries across multiple documents. In this research, we developed a middleware layer over an Oracle DBMS to support the storage, retrieval, querying, and updating of schemaless XML documents. We also propose an algorithm that stores the logical structure of an XML document in addition to its content. The final part of this paper presents an algorithm that rebuilds XML documents from their stored logical structures and content. The rebuilt XML document is well-formed and similar to the original document.
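The abstract does not spell out the storage and rebuild algorithms, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each element is stored as one record holding its structure (parent and sibling position) alongside its content (tag, text, attributes). The names `NodeRecord`, `shred`, and `rebuild` are hypothetical, and an in-memory list stands in for the database tables that a middleware layer would manage.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class NodeRecord:
    """Hypothetical record: one element's structure plus its content."""
    node_id: int
    parent_id: Optional[int]  # None marks the document root
    position: int             # order among siblings
    tag: str
    text: str                 # text before the first child element
    tail: str                 # text after the element's closing tag
    attributes: dict = field(default_factory=dict)


def shred(xml_text: str) -> list:
    """Flatten a document into records; no schema or DTD is consulted."""
    records = []

    def visit(elem, parent_id, position):
        node_id = len(records)
        records.append(NodeRecord(node_id, parent_id, position, elem.tag,
                                  elem.text or "", elem.tail or "",
                                  dict(elem.attrib)))
        for i, child in enumerate(elem):
            visit(child, node_id, i)

    visit(ET.fromstring(xml_text), None, 0)
    return records


def rebuild(records: list) -> str:
    """Reassemble a well-formed document from the stored records."""
    elements = {}
    for rec in records:
        elem = ET.Element(rec.tag, rec.attributes)
        elem.text = rec.text or None
        elem.tail = rec.tail or None
        elements[rec.node_id] = elem

    root = None
    # Attaching in sibling-position order restores the original child order.
    for rec in sorted(records, key=lambda r: r.position):
        if rec.parent_id is None:
            root = elements[rec.node_id]
        else:
            elements[rec.parent_id].append(elements[rec.node_id])
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    source = '<order id="7"><item>pen</item><item>ink</item></order>'
    stored = shred(source)
    stored[2].text = "paper"  # piecewise update of a single element
    print(rebuild(stored))
```

Because each element is a separate record, one element can be updated without rewriting the whole document, which CLOB storage does not allow. In this sketch the rebuilt document is well-formed but not necessarily identical to the input: `ElementTree` drops comments, processing instructions, and the XML declaration during parsing.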