首页 > 最新文献

HILDA '16最新文献

英文 中文
Big data exploration requires collaboration between visualization and data infrastructures 大数据探索需要可视化和数据基础设施之间的协作
Pub Date : 2016-06-26 DOI: 10.1145/2939502.2939518
Danyel Fisher
As datasets grow to tera- and petabyte sizes, exploratory data visualization becomes very difficult: a screen is limited to a few million pixels, and main memory to a few tens of millions of data points. Yet these very large scale analyses are of tremendous interest to industry and academia. This paper discusses some of the major challenges involved in data analytics at scale, including issues of computation, communication, and rendering. It identifies techniques for handling large scale data, grouped into "look at less of it," and "look at it faster." Using these techniques involves a number of difficult design tradeoffs for both the ways that data can be represented, and the ways that users can interact with the visualizations.
当数据集增长到兆字节和拍字节大小时,探索性数据可视化变得非常困难:屏幕被限制为几百万像素,主存储器被限制为几千万个数据点。然而,这些非常大规模的分析引起了工业界和学术界的极大兴趣。本文讨论了涉及大规模数据分析的一些主要挑战,包括计算、通信和呈现问题。它确定了处理大规模数据的技术,分为“少看”和“快看”。使用这些技术需要在数据的表示方式和用户与可视化交互的方式方面进行许多困难的设计权衡。
{"title":"Big data exploration requires collaboration between visualization and data infrastructures","authors":"Danyel Fisher","doi":"10.1145/2939502.2939518","DOIUrl":"https://doi.org/10.1145/2939502.2939518","url":null,"abstract":"As datasets grow to tera- and petabyte sizes, exploratory data visualization becomes very difficult: a screen is limited to a few million pixels, and main memory to a few tens of millions of data points. Yet these very large scale analyses are of tremendous interest to industry and academia. This paper discusses some of the major challenges involved in data analytics at scale, including issues of computation, communication, and rendering. It identifies techniques for handling large scale data, grouped into \"look at less of it,\" and \"look at it faster.\" Using these techniques involves a number of difficult design tradeoffs for both the ways that data can be represented, and the ways that users can interact with the visualizations.","PeriodicalId":356971,"journal":{"name":"HILDA '16","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126872504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Bridging the gap between user intention and model parameters for human-in-the-loop data analytics 弥合用户意图和人在循环数据分析模型参数之间的差距
Pub Date : 2016-06-26 DOI: 10.1145/2939502.2939505
J. Self, R. K. Vinayagam, J. T. Fry, Chris North
Exploratory data analysis is challenging given the complexity of data. Models find structure in the data lessening the complexity for users. These models have parameters that can be adjusted to explore the data from many different angles providing more ways to learn about the data. "Human in the loop" means users can interact with the parameters to explore alternative structures. This exploration allows for discovery. This paper examines usability issues of Human-Model Interaction (HMI) for data analytics. In particular, we bridge the gaps between a user's intention and the parameters of a WMDS model during HMI communication.
鉴于数据的复杂性,探索性数据分析具有挑战性。模型在数据中发现结构,为用户减少复杂性。这些模型具有可以调整的参数,以便从许多不同的角度探索数据,从而提供更多了解数据的方法。“人在循环”意味着用户可以与参数交互以探索替代结构。这种探索会带来发现。本文探讨了用于数据分析的人机交互(HMI)的可用性问题。特别是,我们在HMI通信中弥合了用户意图与WMDS模型参数之间的差距。
{"title":"Bridging the gap between user intention and model parameters for human-in-the-loop data analytics","authors":"J. Self, R. K. Vinayagam, J. T. Fry, Chris North","doi":"10.1145/2939502.2939505","DOIUrl":"https://doi.org/10.1145/2939502.2939505","url":null,"abstract":"Exploratory data analysis is challenging given the complexity of data. Models find structure in the data lessening the complexity for users. These models have parameters that can be adjusted to explore the data from many different angles providing more ways to learn about the data. \"Human in the loop\" means users can interact with the parameters to explore alternative structures. This exploration allows for discovery. This paper examines usability issues of Human-Model Interaction (HMI) for data analytics. In particular, we bridge the gaps between a user's intention and the parameters of a WMDS model during HMI communication.","PeriodicalId":356971,"journal":{"name":"HILDA '16","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128152104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
A DeVIL-ish approach to inconsistency in interactive visualizations 交互式可视化中不一致的魔鬼式方法
Pub Date : 2016-06-26 DOI: 10.1145/2939502.2939517
Yifan Wu, J. Hellerstein, Eugene Wu
Declarative languages have a long tradition in both the database systems and data visualization communities, separating specifications from implementations. In databases, declarative languages like SQL shield application programmers from changes to physical and logical properties like disk layouts, indexes and schema changes. In data visualization, declarative languages like Polaris, ggplot2 and Vega shield visualization programmers from variations in rendering, including screen layout, resolution, and color schemes. Declarative languages have been considered a foundational step forward in both communities.
声明性语言在数据库系统和数据可视化社区中有着悠久的传统,将规范与实现分离开来。在数据库中,像SQL这样的声明性语言保护应用程序程序员不受磁盘布局、索引和模式更改等物理和逻辑属性更改的影响。在数据可视化中,像Polaris、ggplot2和Vega这样的声明性语言使可视化程序员免受呈现变化的影响,包括屏幕布局、分辨率和配色方案。声明式语言被认为是两个社区向前迈出的基础一步。
{"title":"A DeVIL-ish approach to inconsistency in interactive visualizations","authors":"Yifan Wu, J. Hellerstein, Eugene Wu","doi":"10.1145/2939502.2939517","DOIUrl":"https://doi.org/10.1145/2939502.2939517","url":null,"abstract":"Declarative languages have a long tradition in both the database systems and data visualization communities, separating specifications from implementations. In databases, declarative languages like SQL shield application programmers from changes to physical and logical properties like disk layouts, indexes and schema changes. In data visualization, declarative languages like Polaris, ggplot2 and Vega shield visualization programmers from variations in rendering, including screen layout, resolution, and color schemes. Declarative languages have been considered a foundational step forward in both communities.","PeriodicalId":356971,"journal":{"name":"HILDA '16","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131193470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Clustering provenance facilitating provenance exploration through data abstraction 聚类溯源通过数据抽象促进溯源探索
Pub Date : 2016-06-26 DOI: 10.1145/2939502.2939508
Linus Karsai, A. Fekete, J. Kay, P. Missier
As digital objects become increasingly important in people's lives, people may need to understand the provenance, or lineage and history, of an important digital object, to understand how it was produced. This is particularly important for objects created from large, multi-source collections of personal data. As the metadata describing provenance, Provenance Data, is commonly represented as a labelled directed acyclic graph, the challenge is to create effective interfaces onto such graphs so that people can understand the provenance of key digital objects. This unsolved problem is especially challenging for the case of novice and intermittent users and complex provenance graphs. We tackle this by creating an interface based on a clustering approach. This was designed to enable users to view provenance graphs, and to simplify complex graphs by combining several nodes. Our core contribution is the design of a prototype interface that supports clustering and its analytic evaluation in terms of desirable properties of visualisation interfaces.
随着数字物品在人们生活中变得越来越重要,人们可能需要了解一个重要数字物品的来源,或血统和历史,以了解它是如何产生的。这对于从大型、多来源的个人数据集合创建的对象尤其重要。由于描述起源的元数据,即起源数据,通常被表示为标记的有向无环图,因此挑战是在这些图上创建有效的接口,以便人们能够理解关键数字对象的起源。这个未解决的问题对于新手和间歇性用户以及复杂的来源图来说尤其具有挑战性。我们通过创建基于集群方法的接口来解决这个问题。其目的是使用户能够查看出处图,并通过组合多个节点来简化复杂的图。我们的核心贡献是设计一个支持聚类的原型界面,并根据可视化界面的理想属性对其进行分析评估。
{"title":"Clustering provenance facilitating provenance exploration through data abstraction","authors":"Linus Karsai, A. Fekete, J. Kay, P. Missier","doi":"10.1145/2939502.2939508","DOIUrl":"https://doi.org/10.1145/2939502.2939508","url":null,"abstract":"As digital objects become increasingly important in people's lives, people may need to understand the provenance, or lineage and history, of an important digital object, to understand how it was produced. This is particularly important for objects created from large, multi-source collections of personal data. As the metadata describing provenance, Provenance Data, is commonly represented as a labelled directed acyclic graph, the challenge is to create effective interfaces onto such graphs so that people can understand the provenance of key digital objects. This unsolved problem is especially challenging for the case of novice and intermittent users and complex provenance graphs. We tackle this by creating an interface based on a clustering approach. This was designed to enable users to view provenance graphs, and to simplify complex graphs by combining several nodes. Our core contribution is the design of a prototype interface that supports clustering and its analytic evaluation in terms of desirable properties of visualisation interfaces.","PeriodicalId":356971,"journal":{"name":"HILDA '16","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129127827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
TrendQuery: a system for interactive exploration of trends TrendQuery:一个交互式趋势探索系统
Pub Date : 2016-06-26 DOI: 10.1145/2939502.2939514
N. Kamat, Eugene Wu, Arnab Nandi
The surfacing of trends from data collections such as user-generated content streams and news articles is a popular and important data analysis activity, used in applications such as business intelligence, quantitative stock trading and, social media exploration. Unlike traditional content analysis, trend analysis includes an additional vital time dimension: a trend can be defined as a temporal pattern over a group of semantically related items. The unsupervised discovery of trends is often not sufficient, either due to inadequacies in the trend analysis algorithm, or because the data collection itself does not possess all of the information to identify the trend. Thus, it is necessary for an expert human-in-the-loop to be involved in the process of trend analysis. To this end, we introduce TrendQuery, a system designed towards iterative and interactive surfacing of trends. Our system provides a set of trends to the expert, and enumerates iterative operations to curate the result. This process continues until the expert is satisfied with the surfaced trends. Since the space of possible tweaks to the result can be extremely large, the system continually provides feedback and guidance to the expert to prioritize possible operations. Our system allows interactive curation of trends providing better insights than a purely unsupervised approach.
从用户生成的内容流和新闻文章等数据收集中揭示趋势是一项流行且重要的数据分析活动,用于商业智能、定量股票交易和社交媒体探索等应用。与传统的内容分析不同,趋势分析包括一个额外的重要时间维度:趋势可以定义为一组语义相关项上的时间模式。无监督的趋势发现往往是不够的,要么是由于趋势分析算法的不足,要么是因为数据收集本身不具备识别趋势的所有信息。因此,在趋势分析的过程中,有必要有专家参与。为此,我们介绍了TrendQuery,这是一个设计用于趋势迭代和交互式呈现的系统。我们的系统为专家提供了一组趋势,并列举了迭代操作来管理结果。这个过程一直持续到专家对表面趋势感到满意为止。由于对结果的可能调整空间可能非常大,系统不断向专家提供反馈和指导,以确定可能操作的优先级。我们的系统允许对趋势进行交互式管理,提供比纯粹的无监督方法更好的见解。
{"title":"TrendQuery: a system for interactive exploration of trends","authors":"N. Kamat, Eugene Wu, Arnab Nandi","doi":"10.1145/2939502.2939514","DOIUrl":"https://doi.org/10.1145/2939502.2939514","url":null,"abstract":"The surfacing of trends from data collections such as user-generated content streams and news articles is a popular and important data analysis activity, used in applications such as business intelligence, quantitative stock trading and, social media exploration. Unlike traditional content analysis, trend analysis includes an additional vital time dimension: a trend can be defined as a temporal pattern over a group of semantically related items. The unsupervised discovery of trends is often not sufficient, either due to inadequacies in the trend analysis algorithm, or because the data collection itself does not possess all of the information to identify the trend. Thus, it is necessary for an expert human-in-the-loop to be involved in the process of trend analysis.\u0000 To this end, we introduce TrendQuery, a system designed towards iterative and interactive surfacing of trends. Our system provides a set of trends to the expert, and enumerates iterative operations to curate the result. This process continues until the expert is satisfied with the surfaced trends. Since the space of possible tweaks to the result can be extremely large, the system continually provides feedback and guidance to the expert to prioritize possible operations. Our system allows interactive curation of trends providing better insights than a purely unsupervised approach.","PeriodicalId":356971,"journal":{"name":"HILDA '16","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125393212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
HILDA '16
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1