Automatic Hand Gesture Recognition with Semantic Segmentation and Deep Learning

Bristy Chanda, H. Nyeem
{"title":"Automatic Hand Gesture Recognition with Semantic Segmentation and Deep Learning","authors":"Bristy Chanda, H. Nyeem","doi":"10.1109/icaeee54957.2022.9836425","DOIUrl":null,"url":null,"abstract":"Automatic Hand Gesture Recognition is a key requirement for variety of applications, including translation of Sign Language, Human-Computer Interaction (HCI) and, ubiquitous vision-based systems. Due to the lighting variance and complicated background in the input image set of gestures, meeting this criterion remains a challenge. This paper introduces semantic segmentation to deep learning-based hand gesture recognition system for sign language translation. Building on the U - Net architecture, the proposed model obtains the semantically segmented mask of the input image, which is then fed to convolutional neural networks (CNNs) for multiclass classification. The proposed model is trained and tested for four different depths of the CNN architectures followed by the comparison with some pre-trained CNN architectures such as Inception V3, VGG16, VGG19, ResNet50. The proposed model is evaluated on National University of Singapore (NUS) hand posture dataset II (subset A), which contains 2000 images in 10 classes. A significant recognition rate of 97.15 % is achieved for the proposed model outperforming a set of prominent models and demonstrating its promises for sign language translation.","PeriodicalId":383872,"journal":{"name":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaeee54957.2022.9836425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Automatic Hand Gesture Recognition is a key requirement for variety of applications, including translation of Sign Language, Human-Computer Interaction (HCI) and, ubiquitous vision-based systems. Due to the lighting variance and complicated background in the input image set of gestures, meeting this criterion remains a challenge. This paper introduces semantic segmentation to deep learning-based hand gesture recognition system for sign language translation. Building on the U - Net architecture, the proposed model obtains the semantically segmented mask of the input image, which is then fed to convolutional neural networks (CNNs) for multiclass classification. The proposed model is trained and tested for four different depths of the CNN architectures followed by the comparison with some pre-trained CNN architectures such as Inception V3, VGG16, VGG19, ResNet50. The proposed model is evaluated on National University of Singapore (NUS) hand posture dataset II (subset A), which contains 2000 images in 10 classes. A significant recognition rate of 97.15 % is achieved for the proposed model outperforming a set of prominent models and demonstrating its promises for sign language translation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于语义分割和深度学习的自动手势识别
自动手势识别是各种应用的关键要求,包括手语翻译,人机交互(HCI)和无处不在的基于视觉的系统。由于手势输入图像集的光照变化和背景复杂,满足这一标准仍然是一个挑战。将语义分割引入到基于深度学习的手势识别系统中,用于手语翻译。该模型基于U - Net架构,获取输入图像的语义分割掩码,然后将其送入卷积神经网络进行多类分类。该模型对四种不同深度的CNN架构进行了训练和测试,然后与一些预训练的CNN架构(如Inception V3, VGG16, VGG19, ResNet50)进行了比较。该模型在新加坡国立大学(NUS)的手部姿势数据集II(子集A)上进行了评估,该数据集包含10个类别的2000张图像。该模型的识别率达到了97.15%,超过了一组著名的模型,证明了它在手语翻译方面的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design of a Multi-band Sierpinski Carpet Fractal Antenna With Modified Ground Plane Effect of Number of Modes of EMD in Respiratory Rate Estimation from PPG Signal An User Interest and Payment-aware Automated Car Parking System for the Bangladeshi People Using Android Application An Improved Load Frequency Control Strategy for Single & Multi-Area Power System Wall Shear Stress Assessment of Aorta with Varying Low-density Lipoprotein Concentration
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1