Sign Language Fingerspelling Classification from Depth and Color Images Using a Deep Belief Network

2014 Canadian Conference on Computer and Robot Vision Pub Date : 2014-05-06 DOI:10.1109/CRV.2014.20

Lucas Rioux-Maldague, P. Giguère

引用次数: 45

Abstract

Automatic sign language recognition is an open problem that has received a lot of attention recently, not only because of its usefulness to signers, but also due to the numerous applications a sign classifier can have. In this article, we present a new feature extraction technique for hand pose recognition using depth and intensity images captured from a Microsoft Kinect sensor. We applied our technique to American Sign Language finger spelling classification using a Deep Belief Network, for which our feature extraction technique is tailored. We evaluated our results on a multi-user data set with two scenarios: one with all known users and one with an unseen user. We achieved 99% recall and precision on the first, and 77% recall and 79% precision on the second. Our method is also capable of real-time sign classification and is adaptive to any environment or lightning intensity.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度信念网络的深度和颜色图像手语拼写分类

自动手语识别是最近受到广泛关注的一个开放问题，不仅是因为它对手语者的有用性，还因为符号分类器可以有许多应用。在这篇文章中，我们提出了一种新的特征提取技术，用于手部姿势识别，使用从微软Kinect传感器捕获的深度和强度图像。我们将该技术应用于使用深度信念网络的美国手语手指拼写分类，我们的特征提取技术是为此量身定制的。我们用两种场景在一个多用户数据集上评估我们的结果:一种是所有已知用户，另一种是不可见用户。我们在第一次上达到了99%的查全率和查准率，在第二次上达到了77%的查全率和79%的查准率。我们的方法还能够实时标记分类，并适应任何环境或闪电强度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 Canadian Conference on Computer and Robot Vision

自引率

0.00%

发文量