Information Guided Data Sampling and Recovery Using Bitmap Indexing

Tzu-Hsuan Wei, Soumya Dutta, Han-Wei Shen
{"title":"Information Guided Data Sampling and Recovery Using Bitmap Indexing","authors":"Tzu-Hsuan Wei, Soumya Dutta, Han-Wei Shen","doi":"10.1109/PacificVis.2018.00016","DOIUrl":null,"url":null,"abstract":"Creating a data representation is a common approach for efficient and effective data management and exploration. The compressed bitmap indexing is one of the emerging data representation used for large-scale data exploration. Performing sampling on the bitmapindexing based data representation allows further reduction of storage overhead and be more flexible to meet the requirements of different applications. In this paper, we propose two approaches to solve two potential limitations when exploring and visualizing the data using sampling-based bitmap indexing data representation. First, we propose an adaptive sampling approach called information guided stratified sampling (IGStS) for creating compact sampled datasets that preserves the important characteristics of the raw data. Furthermore, we propose a novel data recovery approach to reconstruct the irregular subsampled dataset into a volume dataset with regular grid structure for qualitative post-hoc data exploration and visualization. The quantitative and visual efficacy of our proposed data sampling and recovery approaches are demonstrated through multiple experiments and applications.","PeriodicalId":164616,"journal":{"name":"2018 IEEE Pacific Visualization Symposium (PacificVis)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Pacific Visualization Symposium (PacificVis)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PacificVis.2018.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

Creating a data representation is a common approach for efficient and effective data management and exploration. The compressed bitmap indexing is one of the emerging data representation used for large-scale data exploration. Performing sampling on the bitmapindexing based data representation allows further reduction of storage overhead and be more flexible to meet the requirements of different applications. In this paper, we propose two approaches to solve two potential limitations when exploring and visualizing the data using sampling-based bitmap indexing data representation. First, we propose an adaptive sampling approach called information guided stratified sampling (IGStS) for creating compact sampled datasets that preserves the important characteristics of the raw data. Furthermore, we propose a novel data recovery approach to reconstruct the irregular subsampled dataset into a volume dataset with regular grid structure for qualitative post-hoc data exploration and visualization. The quantitative and visual efficacy of our proposed data sampling and recovery approaches are demonstrated through multiple experiments and applications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用位图索引的信息引导数据采样和恢复
创建数据表示是实现高效数据管理和探索的常用方法。压缩位图索引是一种新兴的用于大规模数据探索的数据表示形式。在基于位图索引的数据表示上执行采样可以进一步减少存储开销,并且更灵活地满足不同应用程序的需求。在本文中,我们提出了两种方法来解决使用基于采样的位图索引数据表示来探索和可视化数据时的两个潜在限制。首先,我们提出了一种自适应采样方法,称为信息引导分层采样(IGStS),用于创建紧凑的采样数据集,保留原始数据的重要特征。此外,我们提出了一种新的数据恢复方法,将不规则的子采样数据重构为具有规则网格结构的体数据集,用于定性的事后数据探索和可视化。通过多个实验和应用证明了我们提出的数据采样和恢复方法的定量和视觉效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Composite Visual Mapping for Time Series Visualization Optimal Algorithms for Compact Linear Layouts TagNet: Toward Tag-Based Sentiment Analysis of Large Social Media Data An Evaluation of Perceptually Complementary Views for Multivariate Data Know Your Enemy: Identifying Quality Problems of Time Series Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1