Hand PointNet-based 3D Hand Pose Estimation in Egocentric RGB-D Images
Van-Hung Le, Van-Nam Hoang, Hai Vu, Thi-Lan Le, Thanh-Hai Tran, V. Vu
2020 International Conference on Advanced Technologies for Communications (ATC), 2020-10-08. DOI: 10.1109/ATC50776.2020.9255478
Abstract
Recently, understanding hand and object manipulation has become an active topic in the First-Person Vision (FPV) community. In this study, we present an initial investigation into estimating 3-D hand joints using a state-of-the-art neural network. We first propose a pre-processing step that separates hand regions from the cluttered background. We then deploy a complete pipeline for estimating 3-D hand joints based on Hand PointNet (HPN), which demonstrates state-of-the-art hand pose estimation performance on depth data. We apply a fine-tuning scheme to HPN on the CVAR [1] and UCI-EGO [2] datasets for 3-D hand pose estimation. In the experiments, we evaluate the estimated results obtained with the pre-processing step to assess the effectiveness of the proposed method. The results show that the 3-D joint estimation errors decrease compared with using the full hand data on different datasets such as MSRA, NYU, and ICVL. In particular, we measure the estimation errors for missing and occluded data. The experimental results indicate that a large gap still exists between the results of the un-occluded and occluded cases. Based on this initial study, we intend to investigate more deeply techniques that address object occlusions and self-occlusions, which make it hard for current networks to localize hidden joints/parts of the hand.
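The evaluation above is reported in terms of 3-D joint estimation errors. The following is a minimal illustrative sketch (not the authors' code) of the standard mean per-joint 3-D error metric, assuming predicted and ground-truth joints are given as N×J×3 arrays of xyz coordinates in millimetres:

```python
import numpy as np

def mean_per_joint_error(pred, gt):
    """Mean Euclidean distance (in mm) between predicted and
    ground-truth 3-D joints.

    pred, gt : arrays of shape (N, J, 3) -- N frames, J joints, xyz.
    Returns the average error over all frames and joints.
    """
    # Per-joint Euclidean distance for every frame, shape (N, J).
    dist = np.linalg.norm(pred - gt, axis=-1)
    return dist.mean()

# Hypothetical usage with random data standing in for real predictions.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    gt = rng.uniform(-100, 100, size=(5, 21, 3))       # 21 hand joints
    pred = gt + rng.normal(scale=10.0, size=gt.shape)  # noisy estimate
    print(f"mean per-joint error: {mean_per_joint_error(pred, gt):.2f} mm")
```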