A multi-modal gesture recognition system in a Human-Robot Interaction scenario

2009 IEEE International Workshop on Robotic and Sensors Environments. Pub Date: 2009-12-18. DOI: 10.1109/ROSE.2009.5355984

Zhi Li, R. Jarvis


Citations: 15

Abstract

Recognition of non-verbal gestures is essential for robots to understand a user's state and intention in a Human-Robot Interaction (HRI) scenario. In this paper a multi-modal system is proposed to recognize a user's hand gestures and estimate body poses from the robot's viewpoint only. A range camera is employed to acquire depth data at a high frame rate. Depth data is useful for image segmentation, object detection, and localization in 3D space. A pair of stereo cameras is used to sense the user's head gestures and eye gaze direction, which provide useful information about where the user's attention is directed. Both hand shapes and hand trajectories are recognized. Full body pose configurations are estimated using a model-based algorithm. Poses are tracked by a Particle Filter method and refined by a gradient-based search in the neighborhood of the particles with the largest weights.
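
The tracking idea in the last sentence of the abstract (a particle filter whose highest-weight particles are refined by a local gradient-based search) can be illustrated with a toy sketch. This is not the authors' implementation: the Gaussian observation model, the pose represented as a plain list of numbers, the central-difference gradient, and every function name and parameter value below are assumptions chosen only to keep the example small and self-contained.

```python
import math
import random


def log_likelihood(pose, observation, sigma=0.5):
    """Toy observation model (assumed): Gaussian log-likelihood of a pose vector."""
    sq = sum((p - o) ** 2 for p, o in zip(pose, observation))
    return -sq / (2.0 * sigma ** 2)


def numerical_gradient(f, pose, eps=1e-4):
    """Central-difference gradient of f at pose."""
    grad = []
    for i in range(len(pose)):
        hi = list(pose)
        lo = list(pose)
        hi[i] += eps
        lo[i] -= eps
        grad.append((f(hi) - f(lo)) / (2.0 * eps))
    return grad


def refine(pose, f, step=0.05, iters=10):
    """Gradient ascent on f from pose; a step is kept only if f improves."""
    best, best_val = list(pose), f(pose)
    for _ in range(iters):
        grad = numerical_gradient(f, best)
        cand = [p + step * g for p, g in zip(best, grad)]
        val = f(cand)
        if val <= best_val:
            break
        best, best_val = cand, val
    return best


def particle_filter_step(particles, observation, rng, motion_noise=0.2, top_k=5):
    """One predict / refine / weight / resample cycle (hypothetical parameters)."""
    def f(pose):
        return log_likelihood(pose, observation)

    # Predict: diffuse every particle with Gaussian motion noise.
    predicted = [[p + rng.gauss(0.0, motion_noise) for p in pose]
                 for pose in particles]

    # Refine only the top_k most likely particles by local gradient ascent.
    predicted.sort(key=f, reverse=True)
    refined = [refine(pose, f) for pose in predicted[:top_k]] + predicted[top_k:]

    # Weight: normalise likelihoods, subtracting the max log value for stability.
    logw = [f(pose) for pose in refined]
    max_logw = max(logw)
    weights = [math.exp(lw - max_logw) for lw in logw]
    total = sum(weights)
    weights = [w / total for w in weights]

    # Estimate: weighted mean pose over all particles.
    dims = len(observation)
    estimate = [sum(w * pose[d] for w, pose in zip(weights, refined))
                for d in range(dims)]

    # Resample: draw a new particle set in proportion to the weights.
    resampled = rng.choices(refined, weights=weights, k=len(refined))
    return [list(pose) for pose in resampled], estimate
```

To try it, seed `random.Random`, draw a few hundred random poses, and call `particle_filter_step` repeatedly with a fixed observation; the returned estimate should settle near that observation. Refining only the best few particles keeps the extra cost small while still pulling the particle set toward a nearby likelihood peak.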


