Prediction of Delay-Free Scene for Quadruped Robot Teleoperation: Integrating Delayed Data With User Commands

IF 5.3 · JCR Q2 (Robotics) · CAS Tier 2 (Computer Science) · IEEE Robotics and Automation Letters · Pub Date: 2025-01-29 · DOI: 10.1109/LRA.2025.3536222
Seunghyeon Ha;Seongyong Kim;Soo-Chul Lim
Volume: 10, Issue: 3 · Pages: 2846-2853 · Publication type: Journal Article · Open access: No
Citations: 0

Abstract

Teleoperation systems are utilized in various controllable systems, including vehicles, manipulators, and quadruped robots. However, during teleoperation, communication delays can cause users to receive delayed feedback, which reduces controllability and increases the risk faced by the remote robot. To address this issue, we propose a delay-free video generation model based on user commands that allows users to receive real-time feedback despite communication delays. Our model predicts delay-free video by integrating delayed data (video, point cloud, and robot status) from the robot with the user's real-time commands. The LiDAR point cloud data, which is part of the delayed data, is used to predict the contents of areas outside the camera frame during robot rotation. We constructed our proposed model by modifying the transformer-based video prediction model VPTR-NAR to effectively integrate these data. For our experiments, we acquired a navigation dataset from a quadruped robot, and this dataset was used to train and test our proposed model. We evaluated the model's performance by comparing it with existing video prediction models and conducting an ablation study to verify the effectiveness of its utilization of command and point cloud data.
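The core idea described above — buffering the operator's commands and, when a delayed observation arrives, replaying the commands issued during the delay window through a prediction model — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the class and method names are hypothetical, and the `model` callable stands in for the authors' modified VPTR-NAR network.

```python
from collections import deque

class DelayFreePredictor:
    """Buffers user commands so that, when a delayed observation arrives,
    the commands issued during the delay window can be fed to a prediction
    model to estimate the current (delay-free) scene."""

    def __init__(self, model):
        # `model` is a stand-in for the paper's modified VPTR-NAR:
        # a callable taking (delayed_observation, pending_commands).
        self.model = model
        self.commands = deque()  # (timestamp, command) pairs, oldest first

    def record_command(self, t, cmd):
        self.commands.append((t, cmd))

    def predict(self, obs, obs_time, now):
        # Commands older than the observation are already reflected in it.
        while self.commands and self.commands[0][0] <= obs_time:
            self.commands.popleft()
        # Commands issued during the delay window drive the prediction.
        pending = [cmd for t, cmd in self.commands if t <= now]
        return self.model(obs, pending)
```

With a toy model that simply accumulates commands onto the observation, a frame captured at t=1.5 s and arriving at t=3.0 s is corrected using only the commands issued in between, which matches the delay-compensation behavior the abstract describes.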
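The abstract also notes that LiDAR points are used to predict areas outside the camera frame during rotation. The geometric intuition can be sketched with a pinhole projection: points invisible at one camera yaw fall inside the image after the camera rotates. Everything here is an assumption for illustration (the intrinsics `fx`, `cx`, the axis convention, and the function itself are hypothetical, not from the paper).

```python
import numpy as np

def project_points(points, yaw, fx=500.0, cx=320.0, width=640):
    """Rotate the camera by `yaw` radians about the vertical axis and
    project 3D points (camera frame: x right, y down, z forward) with a
    pinhole model. Returns the pixel u-coordinates of points that land
    inside the image."""
    c, s = np.cos(yaw), np.sin(yaw)
    R = np.array([[c, 0.0, s],
                  [0.0, 1.0, 0.0],
                  [-s, 0.0, c]])       # yaw rotation about the y (vertical) axis
    cam = points @ R.T                 # points expressed in the rotated camera frame
    in_front = cam[:, 2] > 0.1         # keep points ahead of the camera
    u = fx * cam[in_front, 0] / cam[in_front, 2] + cx
    visible = (u >= 0) & (u < width)
    return u[visible]
```

A point 45° to the camera's right projects outside a 640-pixel-wide image at yaw 0, but after the camera yaws toward it the same point projects to the image center — the out-of-frame content the point cloud can supply during rotation.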
Source journal: IEEE Robotics and Automation Letters
Category: Computer Science – Computer Science Applications
CiteScore: 9.60
Self-citation rate: 15.40%
Annual article count: 1428
Journal description: The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.
Latest articles in this journal:
- Distributed Model Predictive Control for Energy and Comfort Optimization in Large Buildings Using Piecewise Affine Approximation
- Equi-RO: A 4D mmWave Radar Odometry via Equivariant Networks
- Close-Fitting Dressing Assistance Based on State Estimation of Feet and Garments With Semantic-Based Visual Attention
- MDF-Net: A Misclassified Data Fusion Network for Locomotion Mode Misclassification Detection and Correction in Lower-Limb Exoskeletons
- GPR-Net: Geometric-Positional Collaborative Point Cloud Registration Network for Repetitive Geometric Structures