Jingqi Ma, Kai Huang, Zeyu Jiao, Chentong Li, Liangsheng Wu
{"title":"A Gesture Recognition Method Based on Yolov4 Network","authors":"Jingqi Ma, Kai Huang, Zeyu Jiao, Chentong Li, Liangsheng Wu","doi":"10.1109/ICESIT53460.2021.9696503","DOIUrl":null,"url":null,"abstract":"With the diverse development of human-computer interaction. The gesture recognition-based interaction has a large-scale application prospect in collaborative robotics and smart home control. However, due to the similarity of gestures and occlusion, the previous methods have problems with the poor accuracy and shift of detection box. Aiming at the above issues, a gesture recognition method based on the Yolov4 deep learning algorithm is proposed. Firstly, gesture images were collected and annotated, and the data was processed by the GridMask and scale adjustment of data augmentation in order to improve the generalization performance of the network. Then K-means clustering algorithm was used to cluster the annotation boxes in the annotation dataset, by this way, the anchor box of YOLOV4 was optimized to improve the IOU accuracy. Finally, during the training process, focal loss and Consine warmup were adopted to improve the unbalanced sample number of classes and overfitting of the network. The experimental results shows that the proposed algorithm outperforms the main target detection models which include Yolov4, Yolov3 and Faster RCNN, the average recognition accuracy of this method reaches 99.4% and the FPS is 33fps. The proposed algorithm has good real-time performance.","PeriodicalId":164745,"journal":{"name":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICESIT53460.2021.9696503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the diverse development of human-computer interaction. The gesture recognition-based interaction has a large-scale application prospect in collaborative robotics and smart home control. However, due to the similarity of gestures and occlusion, the previous methods have problems with the poor accuracy and shift of detection box. Aiming at the above issues, a gesture recognition method based on the Yolov4 deep learning algorithm is proposed. Firstly, gesture images were collected and annotated, and the data was processed by the GridMask and scale adjustment of data augmentation in order to improve the generalization performance of the network. Then K-means clustering algorithm was used to cluster the annotation boxes in the annotation dataset, by this way, the anchor box of YOLOV4 was optimized to improve the IOU accuracy. Finally, during the training process, focal loss and Consine warmup were adopted to improve the unbalanced sample number of classes and overfitting of the network. The experimental results shows that the proposed algorithm outperforms the main target detection models which include Yolov4, Yolov3 and Faster RCNN, the average recognition accuracy of this method reaches 99.4% and the FPS is 33fps. The proposed algorithm has good real-time performance.