基于混合ConvNet-LSTM网络的印度手语识别

IF 0.4 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC EMITTER-International Journal of Engineering Technology Pub Date : 2021-06-16 DOI:10.24003/emitter.v9i1.613
Muthu Mariappan, V. Gomathi
{"title":"基于混合ConvNet-LSTM网络的印度手语识别","authors":"Muthu Mariappan, V. Gomathi","doi":"10.24003/emitter.v9i1.613","DOIUrl":null,"url":null,"abstract":"Dynamic hand gesture recognition is a challenging task of Human-Computer Interaction (HCI) and Computer Vision. The potential application areas of gesture recognition include sign language translation, video gaming, video surveillance, robotics, and gesture-controlled home appliances. In the proposed research, gesture recognition is applied to recognize sign language words from real-time videos. Classifying the actions from video sequences requires both spatial and temporal features. The proposed system handles the former by the Convolutional Neural Network (CNN), which is the core of several computer vision solutions and the latter by the Recurrent Neural Network (RNN), which is more efficient in handling the sequences of movements. Thus, the real-time Indian sign language (ISL) recognition system is developed using the hybrid CNN-RNN architecture. The system is trained with the proposed CasTalk-ISL dataset. The ultimate purpose of the presented research is to deploy a real-time sign language translator to break the hurdles present in the communication between hearing-impaired people and normal people. The developed system achieves 95.99% top-1 accuracy and 99.46% top-3 accuracy on the test dataset. The obtained results outperform the existing approaches using various deep models on different datasets.","PeriodicalId":40905,"journal":{"name":"EMITTER-International Journal of Engineering Technology","volume":"37 1","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2021-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Indian Sign Language Recognition through Hybrid ConvNet-LSTM Networks\",\"authors\":\"Muthu Mariappan, V. Gomathi\",\"doi\":\"10.24003/emitter.v9i1.613\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dynamic hand gesture recognition is a challenging task of Human-Computer Interaction (HCI) and Computer Vision. The potential application areas of gesture recognition include sign language translation, video gaming, video surveillance, robotics, and gesture-controlled home appliances. In the proposed research, gesture recognition is applied to recognize sign language words from real-time videos. Classifying the actions from video sequences requires both spatial and temporal features. The proposed system handles the former by the Convolutional Neural Network (CNN), which is the core of several computer vision solutions and the latter by the Recurrent Neural Network (RNN), which is more efficient in handling the sequences of movements. Thus, the real-time Indian sign language (ISL) recognition system is developed using the hybrid CNN-RNN architecture. The system is trained with the proposed CasTalk-ISL dataset. The ultimate purpose of the presented research is to deploy a real-time sign language translator to break the hurdles present in the communication between hearing-impaired people and normal people. The developed system achieves 95.99% top-1 accuracy and 99.46% top-3 accuracy on the test dataset. The obtained results outperform the existing approaches using various deep models on different datasets.\",\"PeriodicalId\":40905,\"journal\":{\"name\":\"EMITTER-International Journal of Engineering Technology\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2021-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EMITTER-International Journal of Engineering Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24003/emitter.v9i1.613\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EMITTER-International Journal of Engineering Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24003/emitter.v9i1.613","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 5

摘要

动态手势识别是人机交互(HCI)和计算机视觉领域的一个具有挑战性的课题。手势识别的潜在应用领域包括手语翻译、视频游戏、视频监控、机器人和手势控制的家用电器。本研究将手势识别应用于实时视频中的手语单词识别。从视频序列中对动作进行分类需要空间特征和时间特征。该系统使用卷积神经网络(CNN)处理前者,卷积神经网络是几种计算机视觉解决方案的核心;使用递归神经网络(RNN)处理后者,后者在处理动作序列时效率更高。因此,采用CNN-RNN混合架构开发了实时印度手语识别系统。系统使用提出的CasTalk-ISL数据集进行训练。本研究的最终目的是部署一种实时手语翻译器,以打破听障人士与正常人之间沟通的障碍。所开发的系统在测试数据集上达到95.99%的top-1准确率和99.46%的top-3准确率。在不同的数据集上使用不同的深度模型,得到的结果优于现有的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Indian Sign Language Recognition through Hybrid ConvNet-LSTM Networks
Dynamic hand gesture recognition is a challenging task of Human-Computer Interaction (HCI) and Computer Vision. The potential application areas of gesture recognition include sign language translation, video gaming, video surveillance, robotics, and gesture-controlled home appliances. In the proposed research, gesture recognition is applied to recognize sign language words from real-time videos. Classifying the actions from video sequences requires both spatial and temporal features. The proposed system handles the former by the Convolutional Neural Network (CNN), which is the core of several computer vision solutions and the latter by the Recurrent Neural Network (RNN), which is more efficient in handling the sequences of movements. Thus, the real-time Indian sign language (ISL) recognition system is developed using the hybrid CNN-RNN architecture. The system is trained with the proposed CasTalk-ISL dataset. The ultimate purpose of the presented research is to deploy a real-time sign language translator to break the hurdles present in the communication between hearing-impaired people and normal people. The developed system achieves 95.99% top-1 accuracy and 99.46% top-3 accuracy on the test dataset. The obtained results outperform the existing approaches using various deep models on different datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
EMITTER-International Journal of Engineering Technology
EMITTER-International Journal of Engineering Technology ENGINEERING, ELECTRICAL & ELECTRONIC-
自引率
0.00%
发文量
7
审稿时长
12 weeks
期刊最新文献
Hardware Trojan Detection and Mitigation in NoC using Key authentication and Obfuscation Techniques Estimation of Confidence in the Dialogue based on Eye Gaze and Head Movement Information Experimental Study of Hydroformed Al6061T4 Elliptical Tube Samples under Different Internal Pressures Numerical Study of a Wind Turbine Blade Modification Using 30° Angle Winglet on Clark Y Foil 3D Visualization for Lung Surface Images of Covid-19 Patients based on U-Net CNN Segmentation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1