可扩展标记语言关键字搜索的高效top-k算法

IET Softw. Pub Date : 2012-10-04 DOI:10.1049/iet-sen.2011.0082

Hang Yu, Zhihong Deng, Ning Gao

{"title":"可扩展标记语言关键字搜索的高效top-k算法","authors":"Hang Yu, Zhihong Deng, Ning Gao","doi":"10.1049/iet-sen.2011.0082","DOIUrl":null,"url":null,"abstract":"The ability to compute top- k matches to eXtensible Markup Language (XML) queries is gaining importance owing to the increasing of large XML repositories. Current work on top- k match to XML queries mainly focuses on employing XPath, XQuery or NEXI as the query language, whereas little work has concerned on top- k match to XML keyword search. In this study, the authors propose a novel two-layer-based index construction and associated algorithm for efficiently computing top- k results for XML keyword search. Our core contribution, the two-layer-based inverted Index and associated algorithm for XML keyword search take both score-sorted-sequence and Dewey ID-sorted-sequence into consideration, and thus gain performance benefits during querying process. The authors have conducted expensive experiments and our experimental results show efficiency advantages compared with existing approaches.","PeriodicalId":13395,"journal":{"name":"IET Softw.","volume":"220 1","pages":"342-349"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Efficient top-k algorithm for eXtensible Markup Language keyword search\",\"authors\":\"Hang Yu, Zhihong Deng, Ning Gao\",\"doi\":\"10.1049/iet-sen.2011.0082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The ability to compute top- k matches to eXtensible Markup Language (XML) queries is gaining importance owing to the increasing of large XML repositories. Current work on top- k match to XML queries mainly focuses on employing XPath, XQuery or NEXI as the query language, whereas little work has concerned on top- k match to XML keyword search. In this study, the authors propose a novel two-layer-based index construction and associated algorithm for efficiently computing top- k results for XML keyword search. Our core contribution, the two-layer-based inverted Index and associated algorithm for XML keyword search take both score-sorted-sequence and Dewey ID-sorted-sequence into consideration, and thus gain performance benefits during querying process. The authors have conducted expensive experiments and our experimental results show efficiency advantages compared with existing approaches.\",\"PeriodicalId\":13395,\"journal\":{\"name\":\"IET Softw.\",\"volume\":\"220 1\",\"pages\":\"342-349\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Softw.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1049/iet-sen.2011.0082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Softw.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/iet-sen.2011.0082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

由于大型XML存储库的增加，计算与可扩展标记语言(eXtensible Markup Language, XML)查询的top- k匹配的能力变得越来越重要。目前关于XML查询top- k匹配的工作主要集中在使用XPath、XQuery或NEXI作为查询语言，而关于XML关键字搜索top- k匹配的工作很少。在这项研究中，作者提出了一种新的基于两层的索引结构和相关算法，以有效地计算XML关键字搜索的top- k结果。我们的核心贡献是用于XML关键字搜索的基于两层的倒排索引和相关算法，它同时考虑了分数排序序列和Dewey id排序序列，从而在查询过程中获得性能优势。作者进行了昂贵的实验，与现有方法相比，我们的实验结果显示出效率优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Efficient top-k algorithm for eXtensible Markup Language keyword search

The ability to compute top- k matches to eXtensible Markup Language (XML) queries is gaining importance owing to the increasing of large XML repositories. Current work on top- k match to XML queries mainly focuses on employing XPath, XQuery or NEXI as the query language, whereas little work has concerned on top- k match to XML keyword search. In this study, the authors propose a novel two-layer-based index construction and associated algorithm for efficiently computing top- k results for XML keyword search. Our core contribution, the two-layer-based inverted Index and associated algorithm for XML keyword search take both score-sorted-sequence and Dewey ID-sorted-sequence into consideration, and thus gain performance benefits during querying process. The authors have conducted expensive experiments and our experimental results show efficiency advantages compared with existing approaches.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Softw.

自引率

0.00%

发文量