Textual description of human activities by tracking head and hand motions

Object recognition supported by user interaction for service robots Pub Date : 2002-12-10 DOI:10.1109/ICPR.2002.1048491

A. Kojima, Takeshi Tamura, K. Fukunaga

引用次数: 20

Abstract

We propose a method for describing human activities from video images by tracking human skin regions: facial and hand regions. To detect skin regions robustly, three kinds of probabilistic information are extracted and integrated using Dempster-Shafer theory. The main difficulty in transforming video images into textual descriptions is bridging the semantic gap between them. By associating visual features of head and hand motion with natural language concepts, appropriate syntactic components such as verbs, objects, etc. are determined and translated into natural language.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过跟踪头部和手部的运动来对人类活动进行文字描述

我们提出了一种通过跟踪人体皮肤区域(面部和手部)来描述视频图像中人类活动的方法。为了对皮肤区域进行鲁棒检测，利用Dempster-Shafer理论提取和整合了三种概率信息。将视频图像转换为文本描述的主要困难是弥合它们之间的语义差距。通过将头部和手部运动的视觉特征与自然语言概念联系起来，确定适当的句法成分，如动词、宾语等，并将其翻译成自然语言。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Object recognition supported by user interaction for service robots

自引率

0.00%

发文量

期刊最新文献

Pattern recognition for humanitarian de-mining Data clustering using evidence accumulation Facial expression recognition using pseudo 3-D hidden Markov models Speeding up SVM decision based on mirror points Real-time tracking and estimation of plane pose