Information Visualization: Latest Articles

Riverside: A design study on visualization for situation awareness in cybersecurity
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-08-17 | DOI: 10.1177/14738716231189220
Kaitlyn DeValk, N. Elmqvist
Real-time situation awareness is a key challenge of cybersecurity defense. Visual analytics has been utilized for this purpose, but existing tools tend to require detailed knowledge about the network, which can be challenging in large-scale, production networks. We conducted an interview study involving 24 security professionals to gather requirements for the design, development, and evaluation of visualization to aid situation awareness in cybersecurity. Using these findings, we designed a visualization tool – called RIVERSIDE – for providing a real-time view of the dynamically changing computer network to support situation awareness. We evaluated Riverside in a user study involving 10 participants. Participants were placed in an incident response scenario that tasked them to identify malicious activity on a network. 20% of the users identified all attack components, while an additional 40% missed only one component.
Citations: 0
Lodestar: Supporting rapid prototyping of data science workflows through data-driven analysis recommendations
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-08-14 | DOI: 10.1177/14738716231190429
Deepthi Raghunandan, Zhe Cui, Kartik Krishnan, Segen Tirfe, Shenzhi Shi, Tejaswi Darshan Shrestha, L. Battle, N. Elmqvist
Keeping abreast of current trends, technologies, and best practices in visualization and data analysis is becoming increasingly difficult, especially for fledgling data scientists. In this paper, we propose Lodestar, an interactive computational notebook that allows users to quickly explore and construct new data science workflows by selecting from a list of automated analysis recommendations. We derive our recommendations from directed graphs of known analysis states, with two input sources: one manually curated from online data science tutorials, and another extracted through semi-automatic analysis of a corpus of over 6000 Jupyter notebooks. We validated Lodestar through three separate user studies. The first was a formative evaluation involving novices learning data science using the tool; we used the feedback from this study to improve the tool. This was followed by a summative study involving both new and returning participants from the formative evaluation to test the efficacy of our improvements. We also engaged professional data scientists in an expert review assessing the utility of the different recommendations. Overall, our results suggest that both novice and professional users find Lodestar useful for rapidly creating data science workflows.
Citations: 0
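The Lodestar abstract describes deriving recommendations from directed graphs of known analysis states. As a rough illustration only, and not the authors' implementation, the sketch below builds such a graph from a few example workflows and ranks the steps that most often follow a given state. The step names, the `build_transition_graph` and `recommend_next` helpers, and the frequency-based ranking are all assumptions made for this example.

```python
from collections import Counter

def build_transition_graph(workflows):
    """Count how often one analysis step directly follows another.

    Returns a dict mapping each step to a Counter of its successors,
    i.e. a weighted directed graph of analysis states.
    """
    graph = {}
    for steps in workflows:
        for current, following in zip(steps, steps[1:]):
            graph.setdefault(current, Counter())[following] += 1
    return graph

def recommend_next(graph, state, k=3):
    """Return up to k successors of `state`, most frequent first."""
    successors = graph.get(state, Counter())
    return [step for step, _ in successors.most_common(k)]

# Hypothetical workflows, standing in for steps mined from tutorials or notebooks.
workflows = [
    ["load_csv", "drop_missing", "describe", "plot_histogram"],
    ["load_csv", "describe", "plot_histogram"],
    ["load_csv", "drop_missing", "fit_regression"],
]

graph = build_transition_graph(workflows)
print(recommend_next(graph, "load_csv"))
```

In this toy data, "drop_missing" follows "load_csv" twice and "describe" once, so the former is ranked first. A real system would presumably weigh more context than the single preceding step.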
Incidental graphical perception: How marks and display time influence accuracy
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-08-11 | DOI: 10.1177/14738716231189218
João Moreira, Daniel Mendes, Daniel Gonçalves
Incidental visualizations are meant to be perceived at-a-glance, on-the-go, and during short exposure times, but are not seen on demand. Instead, they appear in people’s fields of view during an ongoing primary task. They differ from glanceable visualizations because the information is not received on demand, and they differ from ambient visualizations because the information is not continuously embedded in the environment. However, current graphical perception guidelines do not consider situations where information is presented at specific moments during brief exposure times without being the user’s primary focus. Therefore, we conducted a crowdsourced user study with 99 participants to understand how accurate people’s incidental graphical perception is. Each participant was tested on one of the three conditions: position of dots, length of lines, and angle of lines. We varied the number of elements for each combination and the display time. During the study, participants were asked to perform reproduction tasks, where they had to recreate a previously shown stimulus in each. Our results indicate that incidental graphical perception can be accurate when using position, length, and angles. Furthermore, we argue that incidental visualizations should be designed for low exposure times (between 300 and 1000 ms).
Citations: 0
Impact of incidental visualizations on primary tasks
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-06-20 | DOI: 10.1177/14738716231180892
João Moreira, Daniel Mendes, Daniel Gonçalves
Incidental visualizations are meant to be seen at-a-glance, on-the-go, and during short exposure times. They will always appear side-by-side with an ongoing primary task while providing ancillary information relevant to that task. They differ from glanceable visualizations because looking at them is never their major focus, and they differ from ambient visualizations because they are not embedded in the environment, but appear when needed. However, unlike glanceable and ambient visualizations that have been studied in the past, incidental visualizations have yet to be explored in-depth. In particular, it is still not clear what their impact is on the users’ performance of primary tasks. Therefore, we conducted an empirical online between-subjects user study where participants had to play a maze game as their primary task. Their goal was to complete several mazes as quickly as possible to maximize their score. This game was chosen to be a cognitively demanding task, bound to be significantly affected if incidental visualizations have a meaningful impact. At the same time, they had to answer a question that appeared while playing, regarding the path followed so far. Then, for half the participants, an incidental visualization was shown for a short period while playing, containing information useful for answering the question. We analyzed various metrics to understand how the maze performance was impacted by the incidental visualization. Additionally, we aimed to understand if working memory would influence how the maze was played and how visualizations were perceived. We concluded that incidental visualizations of the type used in this study do not disrupt people while they play the maze as their primary task. Furthermore, our results strongly suggested that the information conveyed by the visualization improved their performance in answering the question. Finally, working memory had no impact on the participants’ results.
Information Visualization, volume 22, pages 307-322
Citations: 0
An investigation into various visualization tools for complex biological networks
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-06-15 | DOI: 10.1177/14738716231181545
H. Alzahrani, S. Fernstad
Network biology has become crucial to understanding the complex structural characteristics of biological systems. Consequently, advanced visualization approaches are needed to support the investigation of such structures, and several network visualization tools have subsequently been developed to help researchers analyze intricate biological networks. While these tools support a range of analytical and interactive features, it is sometimes unclear to a data analyst or visualization designer which features are of most relevance to biologists. Thus, this study investigates and identifies essential factors for the visualization of complex biological networks using a mixed methodology approach. Based on the findings, essential factors were categorized as either generic or heuristic, where the former concern different analytical and interactive functionalities, such as an efficient layout, advanced search capabilities, plugin availability, graph analysis and user-friendliness, while the latter concern usability, such as information coding, flexibility, orientation and help. Furthermore, the findings indicate that 12 of the 15 generic factors identified were moderately important, while all 10 heuristic factors identified herein were moderately important.
Information Visualization, volume 22, pages 323-339
Citations: 0
Visual analytics for monitoring credit scoring models
Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-06-15 | DOI: 10.1177/14738716231180803
Daiane Rodrigues Baldo, Murilo Santos Regio, Isabel Harb Manssour
Financial institutions use credit scoring models to predict the default of their customers and assist in decision-making about the granting of credit. As a large volume of credit transactions is generated daily alongside a potential increase in this information with the advent of Open Finance, it is challenging to monitor this information quickly so we can act in case these models lose performance. Considering this context, our research aims to provide a Visual Analytics approach to assist in monitoring credit models. For this, initially, we carried out a systematic review of the literature on the subject and conducted semi-structured interviews with 13 domain experts. Considering the needs raised with this study, we created a prototype called Visual Analytics for monitoring Credit Scoring models (VACS). The main contributions of this work are twofold: the requirements gathered through interviews with specialists, which allowed the analysis of how the models are monitored within financial institutions, something that is not disclosed and that can help in the standardization of the monitoring process; and VACS, which was evaluated by four domain experts who considered it a very complete and easy-to-use tool.
Citations: 1
Waffster: Hierarchical waffle charts for budget visualization
Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-05-30 | DOI: 10.1177/14738716231173730
Antoine Béland, Florent Daudens, Thomas Hurtut
Understanding and consuming public budget data is a key issue, helping citizens gain insight into their democratic and political systems. The goal of this work is to present Waffster, a user-friendly representation supporting the understanding of such data. The proposed representation enables the browsing, searching, comparing, and presenting of the hierarchically arranged components and quantities in budgets. In this paper, we first conduct a thorough survey of online public budget visualizations. Then, in collaboration with Le Devoir, a Canadian daily newspaper, we propose a novel unit-based hierarchical design based on waffle charts. We evaluate this design using a controlled user study to compare it to a treemap-based layout, and a case study conducted with Le Devoir during the 2018 provincial election campaign in Québec.
Citations: 0
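A waffle chart is unit-based: each cell stands for a fixed share of the total, so budget amounts have to be converted into whole numbers of cells. The sketch below shows one common way to do that, the largest-remainder method, which keeps the cell counts summing exactly to the grid size. It is an illustration only, not Waffster's layout algorithm; the `allocate_cells` helper and the budget figures are made up for this example.

```python
def allocate_cells(amounts, total_cells=100):
    """Split `total_cells` waffle cells across items in proportion to `amounts`.

    Uses the largest-remainder method: give every item the floor of its
    exact share, then hand leftover cells to the largest fractional parts.
    """
    total = sum(amounts.values())
    exact = {name: value / total * total_cells for name, value in amounts.items()}
    cells = {name: int(share) for name, share in exact.items()}
    leftover = total_cells - sum(cells.values())
    by_remainder = sorted(exact, key=lambda name: exact[name] - cells[name], reverse=True)
    for name in by_remainder[:leftover]:
        cells[name] += 1
    return cells

# Hypothetical top-level budget (arbitrary units) and one nested level.
budget = {"Health": 455, "Education": 302, "Transport": 243}
top_level = allocate_cells(budget, total_cells=100)

health_items = {"Hospitals": 300, "Clinics": 100, "Prevention": 55}
# Sub-items share only the cells already assigned to their parent category.
health_cells = allocate_cells(health_items, total_cells=top_level["Health"])

print(top_level)
print(health_cells)
```

Applying the same function within a parent's cells, as in the second call, is one simple way to respect a hierarchy: child counts always add up to the parent's count.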
Comparing overlapping data distributions using visualization
Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-05-23 | DOI: 10.1177/14738716231173731
Eric Newburger, Niklas Elmqvist
We present results from a preregistered and crowdsourced user study where we asked members of the general population to determine whether two samples represented using different forms of data visualizations are drawn from the same or different populations. Such a task reduces to assessing whether the overlap between the two visualized samples is large enough to suggest similar or different origins. When using idealized normal curves fitted on the samples, it is essentially a graphical formulation of the classic Student’s t-test. However, we speculate that using more sophisticated visual representations, such as bar histograms, Wilkinson dot plots, strip plots, or Tukey boxplots, will allow people both to be more accurate at this task and to better understand its meaning. In other words, the purpose of our study is to explore which visualization best scaffolds novices in making graphical inferences about data. However, our results indicate that the more abstracted idealized bell curve representation of the task yields more accuracy.
Citations: 0
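The abstract above frames the task as a graphical version of Student's t-test. For readers who want the numerical counterpart, the sketch below computes the classic pooled-variance t statistic for two samples using only the standard library. The sample values and the `student_t` helper name are invented for illustration; turning the statistic into a p-value would additionally require the t distribution, which is left out here.

```python
import math
import statistics

def student_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for Student's two-sample t-test.

    Assumes both samples come from populations with equal variance,
    which is the assumption behind the pooled-variance form.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
    var_a, var_b = statistics.variance(sample_a), statistics.variance(sample_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / std_err, df

# Two made-up samples whose means differ by one unit.
t_stat, dof = student_t([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])
print(t_stat, dof)
```

A t statistic near zero means the samples overlap heavily relative to their spread; larger magnitudes suggest different origins.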
TopoBERT: Exploring the topology of fine-tuned word representations
IF 2.3 | Zone 4 (Computer Science) | Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING | Pub Date: 2023-05-01 | DOI: 10.1177/14738716231168671
Archit Rathore, Yichu Zhou, Vivek Srikumar, Bei Wang
Transformer-based language models such as BERT and its variants have found widespread use in natural language processing (NLP). A common way of using these models is to fine-tune them to improve their performance on a specific task. However, it is currently unclear how the fine-tuning process affects the underlying structure of the word embeddings from these models. We present TopoBERT, a visual analytics system for interactively exploring the fine-tuning process of various transformer-based models – across multiple fine-tuning batch updates, subsequent layers of the model, and different NLP tasks – from a topological perspective. The system uses the mapper algorithm from topological data analysis (TDA) to generate a graph that approximates the shape of a model’s embedding space for an input dataset. TopoBERT enables its users (e.g. experts in NLP and linguistics) to (1) interactively explore the fine-tuning process across different model-task pairs, (2) visualize the shape of embedding spaces at multiple scales and layers, and (3) connect linguistic and contextual information about the input dataset with the topology of the embedding space. Using TopoBERT, we provide various use cases to exemplify its applications in exploring fine-tuned word embeddings. We further demonstrate the utility of TopoBERT, which enables users to generate insights about the fine-tuning process and provides support for empirical validation of these insights.
Information Visualization, volume 22, pages 186-208.
Citations: 5
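The mapper construction that the TopoBERT abstract refers to can be illustrated with a toy sketch. This is not TopoBERT's implementation: the function names `mapper_graph` and `single_linkage`, the choice of filter function, the interval cover parameters, and the distance-threshold clustering are all assumptions made for illustration. The sketch covers the range of a filter function with overlapping intervals, clusters the points that fall in each interval, and links two clusters whenever they share a point.

```python
import math
from itertools import combinations


def single_linkage(points, members, eps):
    """Group `members` into connected components, joining points within `eps`."""
    unvisited = set(members)
    clusters = []
    while unvisited:
        stack = [unvisited.pop()]
        cluster = set(stack)
        while stack:
            i = stack.pop()
            near = {j for j in unvisited
                    if math.dist(points[i], points[j]) <= eps}
            unvisited -= near
            cluster |= near
            stack.extend(near)
        clusters.append(frozenset(cluster))
    return clusters


def mapper_graph(points, filter_fn, n_intervals=4, overlap=0.25, eps=1.0):
    """Toy mapper: returns (nodes, edges); each node is a set of point indices."""
    values = [filter_fn(p) for p in points]
    lo, hi = min(values), max(values)
    # Fall back to unit length when every filter value is identical.
    length = (hi - lo) / n_intervals or 1.0
    pad = length * overlap  # how far each interval extends into its neighbors
    nodes = []
    for k in range(n_intervals):
        a = lo + k * length - pad
        b = lo + (k + 1) * length + pad
        members = [i for i, v in enumerate(values) if a <= v <= b]
        nodes.extend(single_linkage(points, members, eps))
    # Two clusters are connected when they share at least one point.
    edges = {(i, j) for i, j in combinations(range(len(nodes)), 2)
             if nodes[i] & nodes[j]}
    return nodes, edges


# Usage: eight points on a line, filtered by their x coordinate.
line = [(float(x), 0.0) for x in range(8)]
nodes, edges = mapper_graph(line, lambda p: p[0], n_intervals=2)
```

For real embeddings the filter would typically be a projection or a density estimate, and the clustering step a library routine. The toy version only shows how the cover, the per-interval clustering, and the overlap test combine into a graph.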
Voronoi treemap in Manhattan distance and Chebyshev distance
IF 2.3 · Zone 4, Computer Science · Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-04-20 · DOI: 10.1177/14738716231167181
Yan Chao Wang, Yi Xing, J. Zhang
The ordinary Voronoi treemap, generated with the Euclidean distance function, has the flexibility to slightly adjust its layout when visualizing time-varying hierarchical data, which improves visual quality, preserves neighborhood relationships, and keeps the layout stable. However, because its layout is formed by segments with arbitrary orientations, shape stability between successive layouts at different time indexes is poor, which makes it harder for users to understand the plot and track the same node. In this paper, we propose novel Voronoi treemaps in Manhattan distance and Chebyshev distance, respectively, such that the segments in the new layouts have only four orientations (horizontal, vertical, and ±45° to the x-axis). The new layouts not only inherit the abilities of the ordinary Voronoi treemap but also preserve good shape stability. To achieve this, we first focus on the weighted bisector between two sites in Manhattan distance and design a bisector generation method for different weight values of the two sites, as the bisector is the foundation for forming a diagram. A divide-and-conquer method is then used to assemble the bisectors into a Voronoi diagram, and a Voronoi treemap layout is finally obtained by using Lloyd’s method to iteratively adjust the diagram. Moreover, we prove that the treemap algorithm in Manhattan distance can be adapted via a linear transformation to also generate the Voronoi treemap in Chebyshev distance, avoiding the need to design an additional algorithm. The computational properties of the proposed methods are first evaluated to check whether the layouts can be generated quickly and accurately. The perceptual properties are then evaluated quantitatively and qualitatively, based on quality metrics and user studies, respectively. The results demonstrate that the proposed Voronoi treemaps have layout stability similar to that of the ordinary Voronoi treemap, but better visual quality and shape stability. Our algorithms are simple and resolution-independent, and they provide alternatives to the ordinary Voronoi treemap.
Information Visualization, volume 22, pages 246-264.
Citations: 1
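The link between Manhattan and Chebyshev distance mentioned in the Voronoi treemap abstract can be checked with a short sketch. The abstract does not say which linear transformation the authors use, so the map (x, y) -> (x + y, x - y) below is an assumption: it is the standard identity under which the Manhattan distance between two points equals the Chebyshev distance between their images. The helper names are hypothetical, and the sites are unweighted, unlike the weighted sites in the paper.

```python
def manhattan(p, q):
    """L1 distance between two 2D points."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def chebyshev(p, q):
    """L-infinity distance between two 2D points."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))


def rotate(p):
    """Assumed linear map (x, y) -> (x + y, x - y): a 45 degree rotation with scaling."""
    return (p[0] + p[1], p[0] - p[1])


def nearest_site(point, sites, dist):
    """Index of the site closest to `point` under `dist` (first index wins ties)."""
    return min(range(len(sites)), key=lambda i: dist(point, sites[i]))


# Usage: the Voronoi cell a point belongs to is the same in both settings.
sites = [(0, 0), (10, 0), (5, 8)]
point = (6, 1)
cell_l1 = nearest_site(point, sites, manhattan)
cell_linf = nearest_site(rotate(point), [rotate(s) for s in sites], chebyshev)
```

Because the map preserves every pairwise distance in this sense, an unweighted Voronoi diagram computed in one metric can be carried over to the other by transforming the sites and the resulting cell boundaries.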