Tracked Robot Control with Hand Gesture Based on MediaPipe

Marthed Wameed, Ahmed M. ALKAMACHI, E. Erçelebi
{"title":"Tracked Robot Control with Hand Gesture Based on MediaPipe","authors":"Marthed Wameed, Ahmed M. ALKAMACHI, E. Erçelebi","doi":"10.22153/kej.2023.04.004","DOIUrl":null,"url":null,"abstract":"Hand gestures are currently considered one of the most accurate ways to communicate in many applications, such as sign language, controlling robots, the virtual world, smart homes, and the field of video games. Several techniques are used to detect and classify hand gestures, for instance using gloves that contain several sensors or depending on computer vision. In this work, computer vision is utilized instead of using gloves to control the robot's movement. That is because gloves need complicated electrical connections that limit user mobility, sensors may be costly to replace, and gloves can spread skin illnesses between users. Based on computer vision, the MediaPipe (MP) method is used. This method is a modern method that is discovered by Google. This method is described by detecting and classifying hand gestures by identifying 21 three-dimensional points on the hand, and by comparing the dimensions of those points. This is how the hand gestures are classified. After detecting and classifying the hand gestures, the system controls the tracked robot through hand gestures in real time, as each hand gesture has a specific movement that the tracked robot performs. In this work, some important paragraphs concluded that the MP method is more accurate and faster in response than the Deep Learning (DL) method, specifically the Convolution Neural Network (CNN). The experimental results shows the accuracy of this method in real time through the effect of environmental elements decreases in some cases when environmental factors change. Environmental elements are such light intensity, distance, and tilt angle (between the hand gesture and camera).The reason for this is that in some cases, the fingers are closed together, and some fingers are not fully closed or opened and the accuracy of the camera used is not good with the changing environmental factors. This leads to the inability of the algorithm used to classify hand gestures correctly (the classification accuracy decrease), and thus response time of the tracked robot's movement increases. That does not present possibility for the system to determine whether the finger is closed or opened.","PeriodicalId":7637,"journal":{"name":"Al-Khwarizmi Engineering Journal","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Al-Khwarizmi Engineering Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22153/kej.2023.04.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Hand gestures are currently considered one of the most accurate ways to communicate in many applications, such as sign language, controlling robots, the virtual world, smart homes, and the field of video games. Several techniques are used to detect and classify hand gestures, for instance using gloves that contain several sensors or depending on computer vision. In this work, computer vision is utilized instead of using gloves to control the robot's movement. That is because gloves need complicated electrical connections that limit user mobility, sensors may be costly to replace, and gloves can spread skin illnesses between users. Based on computer vision, the MediaPipe (MP) method is used. This method is a modern method that is discovered by Google. This method is described by detecting and classifying hand gestures by identifying 21 three-dimensional points on the hand, and by comparing the dimensions of those points. This is how the hand gestures are classified. After detecting and classifying the hand gestures, the system controls the tracked robot through hand gestures in real time, as each hand gesture has a specific movement that the tracked robot performs. In this work, some important paragraphs concluded that the MP method is more accurate and faster in response than the Deep Learning (DL) method, specifically the Convolution Neural Network (CNN). The experimental results shows the accuracy of this method in real time through the effect of environmental elements decreases in some cases when environmental factors change. Environmental elements are such light intensity, distance, and tilt angle (between the hand gesture and camera).The reason for this is that in some cases, the fingers are closed together, and some fingers are not fully closed or opened and the accuracy of the camera used is not good with the changing environmental factors. This leads to the inability of the algorithm used to classify hand gestures correctly (the classification accuracy decrease), and thus response time of the tracked robot's movement increases. That does not present possibility for the system to determine whether the finger is closed or opened.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于MediaPipe的跟踪机器人手势控制
手势目前被认为是许多应用中最准确的交流方式之一,如手语、控制机器人、虚拟世界、智能家居和视频游戏领域。有几种技术用于检测和分类手势,例如使用包含多个传感器的手套或依赖计算机视觉。在这项工作中,利用计算机视觉代替手套来控制机器人的运动。这是因为手套需要复杂的电气连接,这限制了用户的移动性,更换传感器的成本可能很高,而且手套会在用户之间传播皮肤疾病。采用了基于计算机视觉的MediaPipe (MP)方法。这种方法是谷歌发现的一种现代方法。该方法通过识别手部的21个三维点,并通过比较这些点的尺寸来检测和分类手势。这就是手势的分类方式。在对手势进行检测和分类后,系统通过手势实时控制履带机器人,因为每个手势都有履带机器人执行的特定动作。在这项工作中,一些重要的段落得出结论,MP方法比深度学习(DL)方法更准确,响应速度更快,特别是卷积神经网络(CNN)。实验结果表明,在某些情况下,当环境因素发生变化时,该方法通过环境因素的实时影响而降低了精度。环境元素包括光线强度、距离和倾斜角度(手势和相机之间)。造成这种情况的原因是,在某些情况下,手指闭合在一起,有些手指没有完全闭合或打开,使用的相机精度随着环境因素的变化而不好。这导致所用算法无法正确对手势进行分类(分类精度降低),从而增加了履带机器人的运动响应时间。这并不表示系统有可能确定手指是闭合还是打开。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
GSM-enabled Wireless Patient Monitoring System Integrating Microcontroller for Managing Vital Signstal signs Performance Analysis of Different Machine Learning Algorithms for Predictive Maintenance Modeling and Simulation of Hydraulic Proportional Control Valves with Different Types of Controllers s Enhanced Gait Phases Recognition by EMG and Kinematics Information Fusion and a Minimal Recording Setuping Setup Assessment of Foot Deformities in Patient with Knee Osteoarthritis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1