STTAR: Surgical Tool Tracking using off-the-shelf Augmented Reality Head-Mounted Displays
Pub Date: 2022-08-17 | DOI: 10.48550/arXiv.2208.08880
A. Martin-Gomez, Haowei Li, T. Song, Sheng Yang, Guangzhi Wang, H. Ding, Nassir Navab, Zhe Zhao, M. Armand
The use of Augmented Reality (AR) for navigation has proven beneficial in assisting physicians during surgical procedures. These applications commonly require knowing the pose of surgical tools and patients in order to provide visual information that surgeons can use while performing the task. Existing medical-grade tracking systems use infrared cameras placed inside the Operating Room (OR) to identify retro-reflective markers attached to objects of interest and compute their pose. Some commercially available AR Head-Mounted Displays (HMDs) use similar cameras for self-localization, hand tracking, and estimating objects' depth. This work presents a framework that uses the built-in cameras of AR HMDs to enable accurate tracking of retro-reflective markers without integrating any additional electronics into the HMD. The proposed framework can track multiple tools simultaneously without prior knowledge of their geometry and only requires establishing a local network between the headset and a workstation. Our results show that markers can be detected and tracked with an accuracy of 0.09 ± 0.06 mm in lateral translation, 0.42 ± 0.32 mm in longitudinal translation, and 0.80 ± 0.39° for rotations around the vertical axis. Furthermore, to showcase the relevance of the proposed framework, we evaluate the system's performance in the context of surgical procedures. This use case was designed to replicate the scenario of k-wire insertions in orthopedic procedures. For evaluation, seven surgeons were provided with visual navigation and asked to perform 24 injections using the proposed framework. A second study with ten participants investigated the capabilities of the framework in more general scenarios. Results from these studies showed accuracy comparable to that reported in the literature for AR-based navigation procedures.
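As background on the geometry involved: once the 3D positions of a tool's retro-reflective spheres have been observed, its pose relative to a previously captured marker configuration can be recovered by rigid registration of corresponding points. The sketch below shows the standard SVD-based (Kabsch) step in Python with NumPy; it is an illustrative building block under assumed inputs, not the authors' tracking pipeline.

import numpy as np

def rigid_pose(ref_pts, obs_pts):
    """Least-squares rigid transform (R, t) mapping ref_pts onto obs_pts.

    ref_pts, obs_pts: (N, 3) arrays of corresponding marker positions.
    Returns R (3x3 rotation) and t (3,) such that obs ≈ R @ ref + t."""
    ref_c = ref_pts.mean(axis=0)
    obs_c = obs_pts.mean(axis=0)
    # Cross-covariance of the centered point sets.
    H = (ref_pts - ref_c).T @ (obs_pts - obs_c)
    U, _, Vt = np.linalg.svd(H)
    # Guard against a reflection in the least-squares solution.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = obs_c - R @ ref_c
    return R, t

A call such as rigid_pose(tool_markers_reference, tool_markers_detected) would return the tool's current pose; the names are hypothetical.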
VACSEN: A Visualization Approach for Noise Awareness in Quantum Computing
Pub Date: 2022-07-28 | DOI: 10.1109/TVCG.2022.3209455
Shaolun Ruan, Yong Wang, Weiwen Jiang, Y. Mao, Qian-Guo Guan
Quantum computing has attracted considerable public attention due to its exponential speedup over classical computing for certain problems. Despite its advantages, today's quantum computers intrinsically suffer from noise and are error-prone. To guarantee high fidelity in the execution of a quantum algorithm, it is crucial to inform users of the noise of the quantum computer being used and of the compiled physical circuits. However, an intuitive and systematic way to make users aware of quantum computing noise is still missing. In this paper, we fill this gap by proposing VACSEN, a novel visualization approach for noise-aware quantum computing. It provides a holistic picture of the noise of quantum computing through multiple interactively coordinated views: a Computer Evolution View with a circuit-like design provides an overview of the temporal evolution of the noise of different quantum computers, a Circuit Filtering View facilitates quick filtering of multiple compiled physical circuits for the same quantum algorithm, and a Circuit Comparison View with a coupled bar chart enables detailed comparison of the filtered compiled circuits. We extensively evaluate the performance of VACSEN through two case studies on quantum algorithms of different scales and in-depth interviews with 12 quantum computing users. The results demonstrate the effectiveness and usability of VACSEN in achieving noise-aware quantum computing.
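For context on what "noise awareness" means quantitatively, a common first-order summary a user can compute from published calibration data is the estimated success probability of a compiled circuit: the product of (1 - error rate) over its gates and measured qubits. The sketch below illustrates that calculation only; the numbers, the independence assumption, and the function name are made up for illustration and are not taken from the paper.

def estimated_fidelity(gate_error_rates, readout_error_rates):
    """First-order fidelity estimate: product of (1 - error) over all gates
    executed and all qubits measured. Assumes independent error channels."""
    fidelity = 1.0
    for err in gate_error_rates:       # one entry per gate in the compiled circuit
        fidelity *= (1.0 - err)
    for err in readout_error_rates:    # one entry per measured qubit
        fidelity *= (1.0 - err)
    return fidelity

# Hypothetical calibration numbers for a 3-gate, 2-qubit circuit.
print(estimated_fidelity([0.001, 0.001, 0.02], [0.03, 0.025]))  # ≈ 0.925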
Path Tracing in 2D, 3D, and Physicalized Networks
Pub Date: 2022-07-23 | DOI: 10.48550/arXiv.2207.11586
Michael J. McGuffin, Ryan Servera, Marie Forest
It is common to advise against using 3D to visualize abstract data such as networks; however, Ware and Mitchell's 2008 study showed that path tracing in a network is less error-prone in 3D than in 2D. It is unclear, however, whether 3D retains its advantage when the 2D presentation of a network is improved using edge-routing and when simple interaction techniques for exploring the network are available. We address this with two studies of path tracing under new conditions. The first study was preregistered, involved 34 users, and compared 2D and 3D layouts that the user could rotate and move in virtual reality with a handheld controller. Error rates were lower in 3D than in 2D, despite the use of edge-routing in 2D and the use of mouse-driven interactive highlighting of edges. The second study involved 12 users and investigated data physicalization, comparing 3D layouts in virtual reality against physical 3D printouts of networks augmented with a Microsoft HoloLens headset. No difference in error rate was found, but users performed a variety of actions with their fingers in the physical condition that can inform new interaction techniques.
ScrollyVis: Interactive visual authoring of guided dynamic narratives for scientific scrollytelling
Pub Date: 2022-07-07 | DOI: 10.48550/arXiv.2207.03616
Eric Mörth, S. Bruckner, N. Smit
Visual stories are an effective and powerful tool to convey specific information to a diverse public. Scrollytelling is a recent visual storytelling technique extensively used on the web, where content appears or changes as users scroll up or down a page. By employing the familiar gesture of scrolling as its primary interaction mechanism, it provides users with a sense of control, exploration, and discoverability while still offering a simple and intuitive interface. In this paper, we present a novel approach for authoring, editing, and presenting data-driven scientific narratives using scrollytelling. Our method flexibly integrates common sources such as images, text, and video, but also supports more specialized visualization techniques such as interactive maps as well as scalar field and mesh data visualizations. We show that scrolling navigation can be used to traverse dynamic narratives and demonstrate how it can be combined with interactive parameter exploration. The resulting system consists of an extensible web-based authoring tool capable of exporting stand-alone stories that can be hosted on any web server. We demonstrate the power and utility of our approach with case studies from several scientific fields and a user study including 12 participants of diverse professional backgrounds. Furthermore, an expert in creating interactive articles assessed the usefulness of our approach and the quality of the created stories.
Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data - An Algorithm and A Benchmark
Pub Date: 2022-06-27 | DOI: 10.48550/arXiv.2206.13932
P. Guillou, Jules Vidal, Julien Tierny
This paper introduces an efficient algorithm for persistence diagram computation, given an input piecewise linear scalar field f defined on a d-dimensional simplicial complex K, with d ≤ 3. Our work revisits the seminal algorithm "PairSimplices" [31], [103] with discrete Morse theory (DMT) [34], [80], which greatly reduces the number of input simplices to consider. Further, we also extend to DMT and accelerate the stratification strategy described in "PairSimplices" [31], [103] for the fast computation of the 0th and (d-1)th diagrams, denoted D0(f) and Dd-1(f). Minimum-saddle persistence pairs (D0(f)) and saddle-maximum persistence pairs (Dd-1(f)) are efficiently computed by processing, with a Union-Find data structure, the unstable sets of 1-saddles and the stable sets of (d-1)-saddles. We provide a detailed description of the (optional) handling of the boundary component of K when processing (d-1)-saddles. This fast pre-computation for dimensions 0 and (d-1) enables an aggressive specialization of [4] to the 3D case, which results in a drastic reduction of the number of input simplices for the computation of D1(f), the intermediate layer of the sandwich. Finally, we document several performance improvements via shared-memory parallelism. We provide an open-source implementation of our algorithm for reproducibility purposes. We also contribute a reproducible benchmark package, which exploits three-dimensional data from a public repository and compares our algorithm to a variety of publicly available implementations. Extensive experiments indicate that our algorithm improves the time performance of the seminal "PairSimplices" algorithm it extends by two orders of magnitude. Moreover, it also improves memory footprint and time performance over a selection of 14 competing approaches, with a substantial gain over the fastest available approaches, while producing a strictly identical output. We illustrate the utility of our contributions with an application to the fast and robust extraction of persistent 1-dimensional generators on surfaces, volume data, and high-dimensional point clouds.
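To make the role of the Union-Find concrete, here is a compact illustrative routine that computes the 0-dimensional (minimum-saddle) persistence pairs D0(f) of a scalar field on the vertices of a graph using the elder rule: when components merge, the one with the higher (younger) minimum dies. This is the textbook building block that the paper accelerates and extends, not the paper's optimized DMT-based algorithm.

def persistence_pairs_0d(values, edges):
    """values: list of scalar values, one per vertex.
    edges:  iterable of (u, v) index pairs forming the 1-skeleton.
    Returns (birth, death) value pairs; the global minimum never dies."""
    n = len(values)
    parent = list(range(n))
    comp_min = list(range(n))            # vertex holding each component's minimum

    def find(x):                         # Union-Find with path compression
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)

    pairs = []
    processed = [False] * n
    for v in sorted(range(n), key=lambda i: values[i]):   # sweep by increasing value
        processed[v] = True
        roots = {find(u) for u in adj[v] if processed[u]}
        if not roots:
            continue                     # v is a local minimum: a new component is born
        # Elder rule: the neighbouring component with the lowest minimum survives.
        survivor = min(roots, key=lambda r: values[comp_min[r]])
        for r in roots:
            if r != survivor:
                pairs.append((values[comp_min[r]], values[v]))  # component dies at v
                parent[r] = survivor
        parent[v] = survivor
    return pairs

# Toy example: a path graph with two minima; the younger one dies at the saddle.
print(persistence_pairs_0d([0.0, 2.0, 1.0, 3.0], [(0, 1), (1, 2), (2, 3)]))  # [(1.0, 2.0)]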
Adaptive color transfer from images to terrain visualizations
Pub Date: 2022-05-30 | DOI: 10.48550/arXiv.2205.14908
Mingguang Wu, Yanjie Sun, Shangjing Jiang
Terrain mapping is not only dedicated to communicating how high or steep a landscape is but can also help to indicate how we feel about a place. However, crafting effective and expressive elevation colors is challenging for both nonexperts and experts. In this paper, we present a two-step image-to-terrain color transfer method that can transfer color from arbitrary images to diverse terrain models. First, we present a new image color organization method that organizes discrete, irregular image colors into a continuous, regular color grid that facilitates a series of color operations, such as local and global searching, categorical color selection, and sequential color interpolation. Second, we quantify a series of subjective concerns in elevation color crafting, such as the "lower, higher" principle, color conventions, and aerial perspective. We also define a measure of color similarity between images and terrain visualizations that accounts for aesthetic quality. We then mathematically formulate image-to-terrain color transfer as a dual-objective optimization problem and offer a heuristic search method to solve it. Finally, we compare elevation colors from our method with a standard color scheme and a representative color scale generation tool on four test terrains. The evaluations show that the elevation colors from the proposed method are the most effective and that our results are visually favorable. We also show that our method can transfer emotion from images to terrain visualizations.
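To illustrate the sequential color interpolation step in isolation: given a handful of control colors ordered by elevation (respecting the "lower, higher" principle), per-elevation colors can be obtained by piecewise-linear interpolation. The sketch below does this naively in RGB with NumPy; the paper's method instead operates on colors organized in a perceptual grid and optimizes two objectives, so this is only a simplified illustration with made-up control colors.

import numpy as np

def elevation_ramp(control_colors, control_elevs, query_elevs):
    """Piecewise-linear interpolation of RGB control colors over elevation.

    control_colors: (K, 3) RGB colors in [0, 1], ordered low to high elevation.
    control_elevs:  (K,) increasing elevations of the control colors.
    query_elevs:    (M,) elevations to colorize.
    Returns an (M, 3) array of interpolated colors."""
    control_colors = np.asarray(control_colors, dtype=float)
    control_elevs = np.asarray(control_elevs, dtype=float)
    query_elevs = np.asarray(query_elevs, dtype=float)
    out = np.empty((query_elevs.size, 3))
    for c in range(3):  # interpolate each channel independently
        out[:, c] = np.interp(query_elevs, control_elevs, control_colors[:, c])
    return out

# Hypothetical ramp: green lowlands -> brown midlands -> white peaks.
ramp = elevation_ramp([[0.2, 0.5, 0.2], [0.5, 0.4, 0.25], [1.0, 1.0, 1.0]],
                      [0.0, 1500.0, 3000.0],
                      np.linspace(0.0, 3000.0, 256))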
V4D: Voxel for 4D Novel View Synthesis
Pub Date: 2022-05-28 | DOI: 10.48550/arXiv.2205.14332
Wanshui Gan, Hongbin Xu, Yi Huang, Shifeng Chen, N. Yokoya
Neural radiance fields have made a remarkable breakthrough in novel view synthesis for 3D static scenes. However, in the 4D setting (e.g., dynamic scenes), the performance of existing methods is still limited by the capacity of the neural network, typically a multilayer perceptron (MLP). In this paper, we utilize 3D voxels to model the 4D neural radiance field, abbreviated as V4D, where the 3D voxels have two formats. The first regularly models the 3D space and then uses the sampled local 3D feature, together with the time index, to model the density field and the texture field with a tiny MLP. The second is a look-up-table (LUT) format used for pixel-level refinement, where the pseudo-surface produced by volume rendering serves as guidance for learning a 2D pixel-level refinement mapping. The proposed LUT-based refinement module achieves a performance gain at little computational cost and can serve as a plug-and-play module in the novel view synthesis task. Moreover, we propose a more effective conditional positional encoding for 4D data that achieves a performance gain with negligible computational burden. Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance at a low computational cost. The relevant code is available at https://github.com/GANWANSHUI/V4D.
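To make the first voxel format concrete: a feature is fetched from a regular 3D grid by trilinear interpolation and, together with the time index, fed to a small MLP that predicts density and color. The NumPy sketch below shows only the trilinear-lookup step under assumed conventions (query coordinates already in voxel units and inside the grid); it is not the authors' code.

import numpy as np

def trilinear_lookup(grid, pts):
    """grid: (X, Y, Z, C) array of per-voxel features.
    pts:  (N, 3) continuous coordinates in voxel units, inside the grid.
    Returns (N, C) trilinearly interpolated features."""
    p0 = np.floor(pts).astype(int)
    # Clamp so that p0 + 1 stays inside the grid.
    p0 = np.clip(p0, 0, np.array(grid.shape[:3]) - 2)
    w = pts - p0                                  # fractional offsets in [0, 1]
    feat = 0.0
    for dx in (0, 1):
        for dy in (0, 1):
            for dz in (0, 1):
                corner = grid[p0[:, 0] + dx, p0[:, 1] + dy, p0[:, 2] + dz]
                weight = ((dx * w[:, 0] + (1 - dx) * (1 - w[:, 0])) *
                          (dy * w[:, 1] + (1 - dy) * (1 - w[:, 1])) *
                          (dz * w[:, 2] + (1 - dz) * (1 - w[:, 2])))
                feat = feat + weight[:, None] * corner
    return feat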
Strolling in Room-Scale VR: Hex-Core-MK1 Omnidirectional Treadmill
Pub Date: 2022-04-18 | DOI: 10.48550/arXiv.2204.08437
Ziyao Wang, Chiyi Liu, Jialiang Chen, Yao Yao, Dazheng Fang, Zhiyi Shi, Rui Yan, Yiye Wang, Kanjian Zhang, Hai Wang, Haikun Wei
A natural locomotion interface is critical to the development of many VR applications. Household VR applications have two basic requirements: a natural, immersive experience and minimal space occupation. Existing locomotion strategies generally do not satisfy both requirements well at the same time. This paper presents a novel omnidirectional treadmill (ODT) system named Hex-Core-MK1 (HCMK1). By implementing two kinds of mirror-symmetrical spiral rollers to generate an omnidirectional velocity field, the proposed system is capable of providing real walking experiences with full freedom of movement in an area as small as 1.76 m², while delivering great advantages over several existing ODT systems in terms of weight, volume, latency, and dynamic performance. Compared with Infinadeck and HCP, the two best motor-driven ODTs so far, the 8 cm height of HCMK1 is only 20% of Infinadeck's and 50% of HCP's. In addition, HCMK1 is a lightweight device weighing only 110 kg, which opens possibilities for further expanding VR scenarios, such as terrain simulation. The system latency of HCMK1 is only 9 ms. Experiments show that HCMK1 can deliver a starting acceleration of 16.00 m/s² and a braking acceleration of 30.00 m/s².
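As a quick sanity check on what the reported dynamics mean in practice (using a typical walking speed of about 1.4 m/s, which is an assumption, not a figure from the paper): matching walking speed at 16 m/s² takes under 0.1 s, and braking from it at 30 m/s² covers only a few centimetres, which is consistent with keeping the walkable area near 1.76 m².

v_walk = 1.4                     # assumed walking speed in m/s (not from the paper)
a_start, a_brake = 16.0, 30.0    # reported accelerations in m/s^2

t_start = v_walk / a_start            # ~0.0875 s to match walking speed
d_brake = v_walk**2 / (2 * a_brake)   # ~0.033 m travelled while braking
print(t_start, d_brake)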
Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts
Pub Date: 2022-03-29 | DOI: 10.48550/arXiv.2203.15258
Xiaohe Ma, Ya-Qi Yu, Hongzhi Wu, Kun Zhou
We present a novel framework to efficiently acquire anisotropic reflectance in a pixel-independent fashion, using a deep gated mixture-of-experts. While existing work employs a unified network to handle all possible input, our network automatically learns to condition on the input for enhanced reconstruction. We train a gating module that takes photometric measurements as input and selects one out of a number of specialized decoders for reflectance reconstruction, essentially trading generality for quality. A common pre-trained latent-transform module is also appended to each decoder to offset the burden of the increased number of decoders. In addition, the illumination conditions during acquisition can be jointly optimized. The effectiveness of our framework is validated on a wide variety of challenging near-planar samples with a lightstage. Compared with the state-of-the-art technique, our quality is improved with the same number of input images, and the number of input images can be reduced to about one third for results of equal quality. We further generalize the framework to enhance a state-of-the-art technique for non-planar reflectance scanning.
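For readers unfamiliar with the gating idea, the sketch below shows a minimal per-sample gated mixture-of-experts in PyTorch: a gating network scores the photometric measurement vector and routes it to exactly one specialized decoder, followed by a shared latent-transform module. The layer sizes, routing scheme, and input/output dimensions are illustrative assumptions, not the paper's architecture or training procedure.

import torch
import torch.nn as nn

class GatedMoEDecoder(nn.Module):
    """Minimal per-sample gated mixture-of-experts (illustrative sizes)."""
    def __init__(self, in_dim, latent_dim, out_dim, n_experts=8):
        super().__init__()
        self.latent_dim = latent_dim
        # Gating module: scores each expert from the photometric measurements.
        self.gate = nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(),
                                  nn.Linear(64, n_experts))
        # Specialized decoders, one per expert.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(in_dim, 256), nn.ReLU(),
                          nn.Linear(256, latent_dim))
            for _ in range(n_experts))
        # Common latent-transform module appended after every decoder.
        self.latent_transform = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                                              nn.Linear(256, out_dim))

    def forward(self, x):
        # Hard routing: each measurement vector is sent to exactly one expert.
        choice = self.gate(x).argmax(dim=-1)          # (B,)
        latent = x.new_zeros(x.shape[0], self.latent_dim)
        for k, expert in enumerate(self.experts):
            mask = choice == k
            if mask.any():
                latent[mask] = expert(x[mask])
        return self.latent_transform(latent)

# Example: 64 photometric measurements in, 10 reflectance parameters out.
model = GatedMoEDecoder(in_dim=64, latent_dim=32, out_dim=10)
y = model(torch.randn(5, 64))

Note that the hard argmax routing shown here is not differentiable; in practice a gating network would be trained with a differentiable relaxation or a separate classification objective.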
Revisiting the Design Patterns of Composite Visualizations
Pub Date: 2022-03-20 | DOI: 10.48550/arXiv.2203.10476
Dazhen Deng, Weiwei Cui, Xiyu Meng, Mengye Xu, Yu Liao, Haidong Zhang, Yingcai Wu
Composite visualization is a popular design strategy that represents complex datasets by integrating multiple visualizations in a meaningful and aesthetic layout, such as juxtaposition, overlay, and nesting. With this strategy, numerous novel designs have been proposed in visualization publications to accomplish various visual analytic tasks. However, the design patterns of composite visualization remain poorly understood, leaving practitioners without a holistic design space or concrete examples to draw on. In this paper, we revisit the composite visualizations in IEEE VIS publications and examine which visualization types are composed together and how. To achieve this, we first constructed a corpus of composite visualizations from the publications and analyzed common practices, such as the distribution of patterns and the co-occurrence of visualization types. From this analysis, we obtained insights into the utility of different design patterns and their potential pros and cons. Furthermore, we discuss usage scenarios of our taxonomy and corpus and how future research on visualization composition can build on this study.
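As an illustration of the kind of corpus analysis described (the entries and field names below are invented, not taken from the paper's corpus): each composite visualization can be recorded as a composition pattern plus the set of visualization types it combines, and pattern distributions and type co-occurrences then fall out of simple counting.

from collections import Counter
from itertools import combinations

# Hypothetical corpus entries: (composition pattern, visualization types used).
corpus = [
    ("juxtaposition", ["bar chart", "line chart"]),
    ("overlay",       ["map", "scatterplot"]),
    ("nesting",       ["matrix", "bar chart"]),
    ("juxtaposition", ["scatterplot", "bar chart"]),
]

pattern_counts = Counter(pattern for pattern, _ in corpus)
cooccurrence = Counter()
for _, types in corpus:
    for a, b in combinations(sorted(types), 2):
        cooccurrence[(a, b)] += 1

print(pattern_counts.most_common())   # distribution of composition patterns
print(cooccurrence.most_common())     # which visualization types appear together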