Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual Reality

ACM Transactions on Graphics (TOG) Pub Date : 2023-12-04 DOI:10.1145/3618320

Haiyan Jiang, Dongdong Weng, Xiaonuo Dongye, Le Luo, Zhenliang Zhang

{"title":"Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual Reality","authors":"Haiyan Jiang, Dongdong Weng, Xiaonuo Dongye, Le Luo, Zhenliang Zhang","doi":"10.1145/3618320","DOIUrl":null,"url":null,"abstract":"National Key Laboratory of General Artificial Intelligence, Beijing Institute for General Artificial Intelligence (BIGAI), China Retrieving out-of-reach objects is a crucial task in virtual reality (VR). One of the most commonly used approaches for this task is the gesture-based approach, which allows for bare-hand, eyes-free, and direct retrieval. However, previous work has primarily focused on assigned gesture design, neglecting the context. This can make it challenging to accurately retrieve an object from a large number of objects due to the one-to-one mapping metaphor, limitations of finger poses, and memory burdens. There is a general consensus that objects and contexts are related, which suggests that the object expected to be retrieved is related to the context, including the scene and the objects with which users interact. As such, we propose a commonsense knowledge-driven joint reasoning approach for object retrieval, where human grasping gestures and context are modeled using an And-Or graph (AOG). This approach enables users to accurately retrieve objects from a large number of candidate objects by using natural grasping gestures based on their experience of grasping physical objects. Experimental results demonstrate that our proposed approach improves retrieval accuracy. We also propose an object retrieval system based on the proposed approach. Two user studies show that our system enables efficient object retrieval in virtual environments (VEs).","PeriodicalId":7077,"journal":{"name":"ACM Transactions on Graphics (TOG)","volume":"73 22","pages":"1 - 18"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Graphics (TOG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3618320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

National Key Laboratory of General Artificial Intelligence, Beijing Institute for General Artificial Intelligence (BIGAI), China Retrieving out-of-reach objects is a crucial task in virtual reality (VR). One of the most commonly used approaches for this task is the gesture-based approach, which allows for bare-hand, eyes-free, and direct retrieval. However, previous work has primarily focused on assigned gesture design, neglecting the context. This can make it challenging to accurately retrieve an object from a large number of objects due to the one-to-one mapping metaphor, limitations of finger poses, and memory burdens. There is a general consensus that objects and contexts are related, which suggests that the object expected to be retrieved is related to the context, including the scene and the objects with which users interact. As such, we propose a commonsense knowledge-driven joint reasoning approach for object retrieval, where human grasping gestures and context are modeled using an And-Or graph (AOG). This approach enables users to accurately retrieve objects from a large number of candidate objects by using natural grasping gestures based on their experience of grasping physical objects. Experimental results demonstrate that our proposed approach improves retrieval accuracy. We also propose an object retrieval system based on the proposed approach. Two user studies show that our system enables efficient object retrieval in virtual environments (VEs).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

虚拟现实中物体检索的常识知识驱动联合推理方法

检索遥不可及物体是虚拟现实(VR)中的一项关键任务。这项任务最常用的方法之一是基于手势的方法，它允许徒手、无眼和直接检索。然而，之前的工作主要集中在指定的手势设计上，而忽略了上下文。由于一对一的映射比喻、手指姿势的限制和内存负担，这使得从大量对象中准确检索对象变得具有挑战性。人们普遍认为对象和上下文是相关的，这表明期望检索的对象与上下文相关，包括场景和用户与之交互的对象。因此，我们提出了一种常识知识驱动的对象检索联合推理方法，其中人类抓取手势和上下文使用and - or图(AOG)建模。该方法使用户能够根据抓取物理对象的经验，通过使用自然抓取手势，从大量候选对象中准确地检索对象。实验结果表明，该方法提高了检索精度。并在此基础上提出了一个对象检索系统。两项用户研究表明，我们的系统能够在虚拟环境(VEs)中实现有效的对象检索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Graphics (TOG)

自引率

0.00%

发文量

期刊最新文献

GeoLatent: A Geometric Approach to Latent Space Design for Deformable Shape Generators An Implicit Neural Representation for the Image Stack: Depth, All in Focus, and High Dynamic Range Rectifying Strip Patterns From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans Warped-Area Reparameterization of Differential Path Integrals