EVA: Key values eclosion with space anchor used in hand pose estimation and shape reconstruction

IF 6.8 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS Information Sciences Pub Date : 2025-02-24 DOI:10.1016/j.ins.2025.122003
Xuefeng Li , Xiangbo Lin
{"title":"EVA: Key values eclosion with space anchor used in hand pose estimation and shape reconstruction","authors":"Xuefeng Li ,&nbsp;Xiangbo Lin","doi":"10.1016/j.ins.2025.122003","DOIUrl":null,"url":null,"abstract":"<div><div>3D hand pose estimation and shape reconstruction from single RGB image face challenges of self-occlusion, object occlusion, and depth ambiguity. Previous methods tried efforts to detect relevant information from images directly. Differently, this paper considers the task as a union of detection and generation. A novel framework called Key Value Eclosion is proposed. It utilizes powerful Diffusion generation strategies to gradually generate and refine occluded joints, vertices, and depth, using visible 2D joint locations as clues. To make the latent codes more comprehensive for hand shape reconstruction, 2D image features are transformed into 3D space using the proposed Space Anchor based feature inverse projection strategy. Integrating the Space Anchor based feature inverse projection into the Key Values Eclosion framework, a complete hand pose estimation and shape reconstruction model called EVA is constructed. The EVA model demonstrates excellent accuracy on both aligned and unaligned metrics using the HO-3D and DexYCB datasets. Especially, the improvement on Mean Error and Trans&amp;Scale metrics are about 30%~50%, compared to state-of-the-art methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"706 ","pages":"Article 122003"},"PeriodicalIF":6.8000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525001355","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

3D hand pose estimation and shape reconstruction from single RGB image face challenges of self-occlusion, object occlusion, and depth ambiguity. Previous methods tried efforts to detect relevant information from images directly. Differently, this paper considers the task as a union of detection and generation. A novel framework called Key Value Eclosion is proposed. It utilizes powerful Diffusion generation strategies to gradually generate and refine occluded joints, vertices, and depth, using visible 2D joint locations as clues. To make the latent codes more comprehensive for hand shape reconstruction, 2D image features are transformed into 3D space using the proposed Space Anchor based feature inverse projection strategy. Integrating the Space Anchor based feature inverse projection into the Key Values Eclosion framework, a complete hand pose estimation and shape reconstruction model called EVA is constructed. The EVA model demonstrates excellent accuracy on both aligned and unaligned metrics using the HO-3D and DexYCB datasets. Especially, the improvement on Mean Error and Trans&Scale metrics are about 30%~50%, compared to state-of-the-art methods.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
EVA:键值融合与空间锚用于手部姿态估计和形状重建
单幅RGB图像的三维手姿估计和形状重建面临着自遮挡、目标遮挡和深度模糊的挑战。以往的方法都是直接从图像中检测相关信息。不同的是,本文将任务视为检测和生成的结合。提出了一种新的键值融合框架。它利用强大的扩散生成策略,以可见的2D关节位置为线索,逐步生成和细化闭塞的关节,顶点和深度。为了使潜在代码更全面地用于手部形状重建,利用所提出的基于空间锚点的特征逆投影策略将二维图像特征转换到三维空间。将基于空间锚点的特征逆投影与关键值融合框架相结合,构建了完整的手部姿态估计与形状重建模型EVA。EVA模型在使用HO-3D和DexYCB数据集的对齐和未对齐指标上都具有出色的准确性。特别是,与最先进的方法相比,平均误差和Trans&;Scale指标的改进约为30%~50%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
期刊最新文献
Editorial Board Tensorized topological manifold for multiple kernel clustering LPCLNet: Leveraging local pixel-wise contrastive learning for image tampering localization UHTS-DRL: A deep reinforcement learning framework for integrated agile satellite observation and data transmission scheduling A framework for technological bottleneck detection and collaborative optimization in heterogeneous parallel networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1