Immediate-Access Indexing Using Space-Efficient Extensible Arrays

Alistair Moffat
{"title":"Immediate-Access Indexing Using Space-Efficient Extensible Arrays","authors":"Alistair Moffat","doi":"10.1145/3572960.3572984","DOIUrl":null,"url":null,"abstract":"The array is a fundamental data object in most programs. Its key functionality – storage of and access to a set of same-type elements in O(1) time per operation – is also widely employed in other more sophisticated data structures. In an extensible array the number of elements in the set is unknown at the time the program is initiated, and the array might continue to grow right through the program’s execution. In this paper we explore the use of extensible arrays in connection with the task of inverted index construction. We develop and test a space-efficient extensible array arrangement that has been previously described but not to our knowledge employed in practice, and show that it adds considerable flexibility to the index construction process while incurring only modest run-time overheads as a result of access indirections.","PeriodicalId":106265,"journal":{"name":"Proceedings of the 26th Australasian Document Computing Symposium","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th Australasian Document Computing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3572960.3572984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The array is a fundamental data object in most programs. Its key functionality – storage of and access to a set of same-type elements in O(1) time per operation – is also widely employed in other more sophisticated data structures. In an extensible array the number of elements in the set is unknown at the time the program is initiated, and the array might continue to grow right through the program’s execution. In this paper we explore the use of extensible arrays in connection with the task of inverted index construction. We develop and test a space-efficient extensible array arrangement that has been previously described but not to our knowledge employed in practice, and show that it adds considerable flexibility to the index construction process while incurring only modest run-time overheads as a result of access indirections.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用空间高效的可扩展数组的即时访问索引
数组是大多数程序中的基本数据对象。它的关键功能——每次操作在O(1)时间内存储和访问一组相同类型的元素——也被广泛应用于其他更复杂的数据结构中。在可扩展数组中,在程序启动时,集合中的元素数量是未知的,并且该数组可能在程序执行期间继续增长。在本文中,我们探讨了可扩展数组在倒排索引构造任务中的应用。我们开发并测试了一种空间效率高的可扩展数组安排,这种安排以前已经描述过,但据我们所知尚未在实践中使用,并表明它为索引构建过程增加了相当大的灵活性,同时由于间接访问而只产生适度的运行时开销。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Immediate-Access Indexing Using Space-Efficient Extensible Arrays Investigating Language Use by Polarised Groups on Twitter: A Case Study of the Bushfires Robustness of Neural Rankers to Typos: A Comparative Study Neural Rankers for Effective Screening Prioritisation in Medical Systematic Review Literature Search The Task: Distinguishing Tasks and Sessions in Legal Information Retrieval
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1