German Sign Language Translation using 3D Hand Pose Estimation and Deep Learning

2020 IEEE REGION 10 CONFERENCE (TENCON) Pub Date : 2020-11-16 DOI:10.1109/TENCON50793.2020.9293763

S. Mohanty, Supriya Prasad, Tanvi Sinha, B. N. Krupa

引用次数: 0

Abstract

Sign language is the primary medium of communication for the majority of the world’s population suffering from disabling hearing loss that creates a barrier between the hearing and the hearing-impaired people. In this paper, sign language translation is undertaken for German Sign Language (GSL) characters from a single image by leveraging the technique of 3D object detection. We make use of a three-network architecture that performs segmentation, keypoint localization, and elevation from a two-dimensional plane to the three-dimensional space, from a single RGB image containing the signed gesture. Thirty gestures have been used and the best results were obtained using a combination of pose representation coordinates, joint angles, and pool layer features of AlexNet for classification. The system gives a character error rate of 0.29, a reduction of error rate by 12.12% when compared to the state-of-the-art approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于三维手部姿态估计和深度学习的德语手语翻译

手语是世界上大多数听力丧失致残性人群的主要交流媒介，听力丧失致残性听力损失在听障人群和听障人群之间造成了障碍。本文利用三维物体检测技术对单幅图像中的德国手语(GSL)字符进行了手语翻译。我们利用一个三网络架构来执行分割、关键点定位和从二维平面到三维空间的提升，从一个包含签名手势的单一RGB图像。使用了30种手势，结合姿态表示坐标、关节角度和AlexNet的池层特征进行分类，得到了最好的结果。该系统的字符错误率为0.29，与最先进的方法相比，错误率降低了12.12%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 IEEE REGION 10 CONFERENCE (TENCON)

自引率

0.00%

发文量