Attention-Based 2-D Hand Keypoints Localization

IF 2.2 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Sensors Letters Pub Date : 2024-08-13 DOI:10.1109/LSENS.2024.3443072

H Pallab Jyoti Dutta;M. K. Bhuyan

引用次数: 0

Abstract

Hand keypoint localization is used extensively in human–computer interaction, but accurate localization is challenging due to closeness between the fingers and the keypoints, occlusion, varied hand poses, complex backgrounds, and extreme lighting conditions. Despite much research, challenges persist. Therefore, we propose an encoder–decoder architecture aided by a novel attention module to precisely localize hand keypoints. The attention module captures keypoint-relevant features at two different scales that encompass local and global characteristics. Further, the loss function teaches the model to remove spurious detected keypoints in the initial learning phase. The proposed architecture outputs precise keypoint locations, as indicated by the qualitative and quantitative results. Evaluation of two benchmark RGB image datasets, comprising all the challenges encountered in keypoint localization, resulted in endpoint errors as low as 2.78 and 1.85 pixels and 98.50% and 99.77% correct keypoints, respectively. This shows the proposed model's effectiveness and ability to overcome challenges.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于注意力的二维手部关键点定位

手部关键点定位被广泛应用于人机交互中，但由于手指与关键点之间的距离、遮挡、不同的手部姿势、复杂的背景和极端的光照条件，精确定位具有挑战性。尽管进行了大量研究，但挑战依然存在。因此，我们提出了一种编码器-解码器架构，并辅以新颖的注意力模块来精确定位手部关键点。注意力模块在两个不同尺度上捕捉与关键点相关的特征，包括局部和全局特征。此外，在初始学习阶段，损失函数会教导模型去除检测到的虚假关键点。定性和定量结果表明，所提出的架构能输出精确的关键点位置。对两个基准 RGB 图像数据集（包括关键点定位中遇到的所有挑战）的评估结果显示，端点误差分别低至 2.78 和 1.85 像素，关键点正确率分别为 98.50% 和 99.77%。这表明了所提出模型的有效性和克服挑战的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊