检查科学出版物中的数据可视化陷阱。

4区 计算机科学 Q1 Arts and Humanities Visual Computing for Industry, Biomedicine, and Art Pub Date : 2021-10-29 DOI:10.1186/s42492-021-00092-y
Vinh T Nguyen, Kwanghee Jung, Vibhuti Gupta
{"title":"检查科学出版物中的数据可视化陷阱。","authors":"Vinh T Nguyen,&nbsp;Kwanghee Jung,&nbsp;Vibhuti Gupta","doi":"10.1186/s42492-021-00092-y","DOIUrl":null,"url":null,"abstract":"<p><p>Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran's Q test and McNemar's test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.</p>","PeriodicalId":52384,"journal":{"name":"Visual Computing for Industry, Biomedicine, and Art","volume":"4 1","pages":"27"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556474/pdf/","citationCount":"5","resultStr":"{\"title\":\"Examining data visualization pitfalls in scientific publications.\",\"authors\":\"Vinh T Nguyen,&nbsp;Kwanghee Jung,&nbsp;Vibhuti Gupta\",\"doi\":\"10.1186/s42492-021-00092-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran's Q test and McNemar's test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.</p>\",\"PeriodicalId\":52384,\"journal\":{\"name\":\"Visual Computing for Industry, Biomedicine, and Art\",\"volume\":\"4 1\",\"pages\":\"27\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556474/pdf/\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visual Computing for Industry, Biomedicine, and Art\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.1186/s42492-021-00092-y\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Arts and Humanities\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visual Computing for Industry, Biomedicine, and Art","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1186/s42492-021-00092-y","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 5

摘要

数据可视化融合了艺术和科学,通过图形表示从数据中传达故事。考虑到不同的问题、应用程序、需求和设计目标,将这两个组件充分结合起来是一项挑战。美术组件包括为用户创造具有视觉吸引力且易于解释的图形,而科学组件则需要准确呈现大量输入数据。由于缺乏科学成分,可视化无法为实际数据创建正确的表示,从而导致错误的感知、解释和决策。如果故意制造不正确的视觉表现来欺骗观众,情况可能会更糟。为了解决图形表示中的常见缺陷,本文着重于识别和理解图形表示中错误信息的根本原因。我们回顾了从索引数据库中收集的科学出版物中误导性的数据可视化示例,然后将它们投影到视觉传达的基本单位上,如颜色、形状、大小和空间方向。此外,应用文本挖掘技术从常见的可视化陷阱中提取实用的见解。Cochran’s Q测试和McNemar’s测试是为了检验在颜色、形状、大小和空间方向上常见错误的比例是否存在差异。调查结果表明,饼状图是最常被误用的图形表示形式,而大小是最关键的问题。我们还观察到,在颜色、形状、大小和空间方向上的错误比例有统计学上的显著差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Examining data visualization pitfalls in scientific publications.

Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran's Q test and McNemar's test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Visual Computing for Industry, Biomedicine, and Art
Visual Computing for Industry, Biomedicine, and Art Arts and Humanities-Visual Arts and Performing Arts
CiteScore
5.60
自引率
0.00%
发文量
28
审稿时长
5 weeks
期刊最新文献
Discrimination between leucine-rich glioma-inactivated 1 antibody encephalitis and gamma-aminobutyric acid B receptor antibody encephalitis based on ResNet18. Hyperparameter optimization for cardiovascular disease data-driven prognostic system. Survey of methods and principles in three-dimensional reconstruction from two-dimensional medical images. Vision transformer architecture and applications in digital health: a tutorial and survey. DB-DCAFN: dual-branch deformable cross-attention fusion network for bacterial segmentation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1