Tasks Reflected in the Eyes: Egocentric Gaze-Aware Visual Task Type Recognition in Virtual Reality

IF 6.5 IEEE transactions on visualization and computer graphics Pub Date : 2024-09-10 DOI:10.1109/TVCG.2024.3456164

Zhimin Wang;Feng Lu

{"title":"Tasks Reflected in the Eyes: Egocentric Gaze-Aware Visual Task Type Recognition in Virtual Reality","authors":"Zhimin Wang;Feng Lu","doi":"10.1109/TVCG.2024.3456164","DOIUrl":null,"url":null,"abstract":"With eye tracking finding widespread utility in augmented reality and virtual reality headsets, eye gaze has the potential to recognize users' visual tasks and adaptively adjust virtual content displays, thereby enhancing the intelligence of these headsets. However, current studies on visual task recognition often focus on scene-specific tasks, like copying tasks for office environments, which lack applicability to new scenarios, e.g., museums. In this paper, we propose four scene-agnostic task types for facilitating task type recognition across a broader range of scenarios. We present a new dataset that includes eye and head movement data recorded from 20 participants while they engaged in four task types across 15 360-degree VR videos. Using this dataset, we propose an egocentric gaze-aware task type recognition method, TRCLP, which achieves promising results. Additionally, we illustrate the practical applications of task type recognition with three examples. Our work offers valuable insights for content developers in designing task-aware intelligent applications. Our dataset and source code are available at zhimin-wang.github.io/TaskTypeRecognition.html.","PeriodicalId":94035,"journal":{"name":"IEEE transactions on visualization and computer graphics","volume":"30 11","pages":"7277-7287"},"PeriodicalIF":6.5000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on visualization and computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10670454/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With eye tracking finding widespread utility in augmented reality and virtual reality headsets, eye gaze has the potential to recognize users' visual tasks and adaptively adjust virtual content displays, thereby enhancing the intelligence of these headsets. However, current studies on visual task recognition often focus on scene-specific tasks, like copying tasks for office environments, which lack applicability to new scenarios, e.g., museums. In this paper, we propose four scene-agnostic task types for facilitating task type recognition across a broader range of scenarios. We present a new dataset that includes eye and head movement data recorded from 20 participants while they engaged in four task types across 15 360-degree VR videos. Using this dataset, we propose an egocentric gaze-aware task type recognition method, TRCLP, which achieves promising results. Additionally, we illustrate the practical applications of task type recognition with three examples. Our work offers valuable insights for content developers in designing task-aware intelligent applications. Our dataset and source code are available at zhimin-wang.github.io/TaskTypeRecognition.html.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

反映在眼睛中的任务：虚拟现实中以自我为中心的凝视感知视觉任务类型识别。

随着眼球跟踪技术在增强现实和虚拟现实头戴设备中的广泛应用，眼球凝视技术有望识别用户的视觉任务，并自适应地调整虚拟内容显示，从而提高这些头戴设备的智能化程度。然而，目前有关视觉任务识别的研究往往侧重于特定场景的任务，如办公室环境中的复印任务，缺乏对博物馆等新场景的适用性。在本文中，我们提出了四种与场景无关的任务类型，以便在更广泛的场景中促进任务类型的识别。我们展示了一个新的数据集，其中包括 20 名参与者在 15 个 360 度 VR 视频中参与四种任务类型时记录的眼球和头部运动数据。利用该数据集，我们提出了一种以自我为中心的凝视感知任务类型识别方法 TRCLP，该方法取得了可喜的成果。此外，我们还通过三个实例说明了任务类型识别的实际应用。我们的工作为内容开发人员设计任务感知智能应用程序提供了宝贵的见解。我们的数据集和源代码见 zhimin-wang.github.io/TaskTypeRecognition.html。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on visualization and computer graphics

自引率

0.00%

发文量

期刊最新文献

HYVE: Hybrid Vertex Encoder for Neural Distance Fields. Errata to "DiffCap: Diffusion-Based Real-Time Human Motion Capture Using Sparse IMUs and a Monocular Camera". A Systematic Evaluation of Dragging Interaction Using Raycasting in Virtual Reality. MARRS: Masked Autoregressive Unit-based Reaction Synthesis. Visualization Tasks for Unlabeled Graphs.