OTS: An Optimized Method for Efficient Processing of XML Twig Patterns
Author: Yingfei Gao
Published in: 2012 Second International Conference on Intelligent System Design and Engineering Application, 2012-01-06
DOI: 10.1109/ISDEA.2012.483 (https://doi.org/10.1109/ISDEA.2012.483)
As a core operation in twig query processing, finding all occurrences of a twig pattern in an XML document has attracted much attention. Although existing methods are efficient for queries with ancestor-descendant edges, our analysis shows that they all suffer from redundant CPU cost. Moreover, a large number of redundant path solutions may be produced when parent-child edges appear below branch nodes. We propose an optimized holistic twig join algorithm, OTS, for efficient processing of twig queries. By pre-checking at three steps, OTS not only reduces the CPU cost but also eliminates redundant path solutions. As a result, it broadens the class of queries whose CPU complexity is linear in the sum of the sizes of the input lists and the output lists. Experimental results on various datasets indicate that OTS performs significantly better than existing methods.
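To make the problem statement concrete, the following is a minimal sketch of what "finding all occurrences of a twig pattern" means. It is a naive recursive matcher, not OTS and not a holistic twig join; the pattern representation, the function names (`candidates`, `match`, `twig_matches`) and the sample document are all assumptions introduced for illustration only.

```python
import xml.etree.ElementTree as ET
from itertools import product

# Hypothetical pattern format: (tag, [(axis, child_pattern), ...])
# axis "/" is a parent-child edge, "//" is an ancestor-descendant edge.

def candidates(elem, axis):
    """Elements reachable from elem along the given axis."""
    if axis == "/":
        return list(elem)  # direct children only
    # Proper descendants: Element.iter() also yields elem itself, so skip it.
    return [d for d in elem.iter() if d is not elem]

def match(elem, pattern):
    """All matches of pattern rooted at elem, as tuples in pattern pre-order."""
    tag, branches = pattern
    if elem.tag != tag:
        return []
    per_branch = []
    for axis, child in branches:
        found = [m for c in candidates(elem, axis) for m in match(c, child)]
        if not found:
            return []  # one unmatched branch rules out this element
        per_branch.append(found)
    # Combine one sub-match from every branch into a full twig match.
    return [(elem,) + tuple(e for part in combo for e in part)
            for combo in product(*per_branch)]

def twig_matches(root, pattern):
    """Try every element of the document as the image of the pattern root."""
    return [m for e in root.iter() for m in match(e, pattern)]

DOC = (
    "<lib>"
    "<book><title>T1</title><chapter><author>A1</author></chapter></book>"
    "<book><title>T2</title></book>"
    "<book><author>A3</author></book>"
    "</lib>"
)
root = ET.fromstring(DOC)

# Twig with branch node "book": book[/title]//author
ad_pattern = ("book", [("/", ("title", [])), ("//", ("author", []))])
# Same twig, but with a parent-child edge below the branch node: book[/title]/author
pc_pattern = ("book", [("/", ("title", [])), ("/", ("author", []))])

ad_result = [(m[1].text, m[2].text) for m in twig_matches(root, ad_pattern)]
pc_result = [(m[1].text, m[2].text) for m in twig_matches(root, pc_pattern)]
print(ad_result)
print(pc_result)
```

In this toy document only the first `book` satisfies both branches of the ancestor-descendant pattern, and no `book` satisfies the parent-child variant. The path `book/title` still matches in the first two books, which is the kind of partial path solution that contributes nothing to the final answer; how OTS avoids producing such solutions is not specified in the abstract and is not modeled here.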