Real-Time Human Gaze Estimation

2019 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2019-12-01 DOI:10.1109/DICTA47822.2019.8945919

T. Rowntree, C. Pontecorvo, I. Reid

引用次数: 1

Abstract

This paper describes a system for estimating the course gaze or 1D head pose of multiple people in a video stream from a moving camera in an indoor scene. The system runs at 30 Hz and can detect human heads with a F-Score of 87.2% and predict their gaze with an average error 20.9° including when they are facing directly away from the camera. The system uses two Convolutional Neural Networks (CNNs) for head detection and gaze estimation respectively and uses common tracking and filtering techniques for smoothing predictions over time. This paper is application-focused and so describes the individual components of the system as well as the techniques used for collecting data and training the CNNs.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

实时人眼注视估计

本文描述了一种用于估计室内场景中移动摄像机视频流中多人的航向凝视或1D头部姿势的系统。该系统以30赫兹的频率运行，可以检测到人类头部的f值为87.2%，并预测他们的凝视，平均误差为20.9°，包括他们直接远离相机的时候。该系统使用两个卷积神经网络(cnn)分别进行头部检测和凝视估计，并使用常见的跟踪和过滤技术随着时间的推移平滑预测。本文以应用为中心，因此描述了系统的各个组成部分以及用于收集数据和训练cnn的技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 Digital Image Computing: Techniques and Applications (DICTA)

自引率

0.00%

发文量