{"title":"Performance Evaluation of Inverted Index Traversal Techniques","authors":"Kun Jiang, Xingshen Song, Yuexiang Yang","doi":"10.1109/CSE.2014.315","DOIUrl":null,"url":null,"abstract":"Large search engines process thousands of queries per second over billions of documents, making query processing a major performance bottleneck. In this paper, we study an important class of index traversal techniques called dynamic pruning, which can efficiently reduce the query computational resources. We give brief overviews of index structure and skipping structure that used to quicken the term-centric and document centric index traversal strategies. Due to the fact that document centric is more efficient than term-centric for large indexes, we review several state-of-the-art dynamic pruning approaches based on document-centric scoring. We then present explicit experimental comparisons of the above index traversal techniques on TREC GOV2 datasets with detailed analytical conclusions. Finally, we provide the prospective future study of dynamic pruning with large scale of indexes.","PeriodicalId":258990,"journal":{"name":"2014 IEEE 17th International Conference on Computational Science and Engineering","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 17th International Conference on Computational Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSE.2014.315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Large search engines process thousands of queries per second over billions of documents, making query processing a major performance bottleneck. In this paper, we study an important class of index traversal techniques called dynamic pruning, which can efficiently reduce the query computational resources. We give brief overviews of index structure and skipping structure that used to quicken the term-centric and document centric index traversal strategies. Due to the fact that document centric is more efficient than term-centric for large indexes, we review several state-of-the-art dynamic pruning approaches based on document-centric scoring. We then present explicit experimental comparisons of the above index traversal techniques on TREC GOV2 datasets with detailed analytical conclusions. Finally, we provide the prospective future study of dynamic pruning with large scale of indexes.