Multidimensional Cluster Sampling View on Large Databases for Approximate Query Processing

Tomohiro Inoue, A. Krishna, R. Gopalan
{"title":"Multidimensional Cluster Sampling View on Large Databases for Approximate Query Processing","authors":"Tomohiro Inoue, A. Krishna, R. Gopalan","doi":"10.1109/EDOC.2015.24","DOIUrl":null,"url":null,"abstract":"Approximate query processing with relatively small random samples is an effective way to deal with many queries on large databases. However, small random samples might miss relevant records for highly selective queries due to insufficient coverage. A multidimensional index tree called the k-MDI was proposed as an effective sampling scheme for highly selective decision support queries. It has been shown to support a fast response time and high accuracy, whereas implementation of the k-MDI on database tables was not discussed. This paper proposes the Multidimensional Cluster Sampling View based on the k-MDI. The view can be implemented with ease using common database tables and can be manipulated by SQL statements. Furthermore, it is able to provide trustable approximate answers quickly for any query condition. The response time and accuracy of approximation are validated on a large dataset based on TPC-DS specifications.","PeriodicalId":112281,"journal":{"name":"2015 IEEE 19th International Enterprise Distributed Object Computing Conference","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 19th International Enterprise Distributed Object Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EDOC.2015.24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Approximate query processing with relatively small random samples is an effective way to deal with many queries on large databases. However, small random samples might miss relevant records for highly selective queries due to insufficient coverage. A multidimensional index tree called the k-MDI was proposed as an effective sampling scheme for highly selective decision support queries. It has been shown to support a fast response time and high accuracy, whereas implementation of the k-MDI on database tables was not discussed. This paper proposes the Multidimensional Cluster Sampling View based on the k-MDI. The view can be implemented with ease using common database tables and can be manipulated by SQL statements. Furthermore, it is able to provide trustable approximate answers quickly for any query condition. The response time and accuracy of approximation are validated on a large dataset based on TPC-DS specifications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向近似查询处理的大型数据库多维聚类抽样视图
使用相对较小的随机样本进行近似查询处理是处理大型数据库中大量查询的有效方法。然而,由于覆盖率不足,小的随机样本可能会错过高选择性查询的相关记录。提出了一种称为k-MDI的多维索引树作为高选择性决策支持查询的有效抽样方案。它已被证明支持快速响应时间和高准确性,而k-MDI在数据库表上的实现没有被讨论。本文提出了基于k-MDI的多维聚类采样视图。该视图可以使用公共数据库表轻松实现,并且可以通过SQL语句进行操作。此外,对于任何查询条件,该算法都能快速提供可靠的近似答案。在基于TPC-DS规范的大型数据集上验证了近似的响应时间和精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Workflow for Model Driven Game Development A Graph-Based Approach for Containment Checking of Behavior Models of Software Systems A Prediction Framework for Proactively Monitoring Aggregate Process-Performance Indicators A Domain Specific Language for Secure Outsourcing of Computation to the Cloud ERP Systems' Usage in the German IT Service Industry: An Exploratory Multi-case Study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1