LAR-Pose: Lightweight human pose estimation with adaptive regression loss

IF 5.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neurocomputing Pub Date : 2025-02-25 DOI:10.1016/j.neucom.2025.129777

Xudong Lou , Xin Lin , Henan Zeng , Xiangxian Zhu

{"title":"LAR-Pose: Lightweight human pose estimation with adaptive regression loss","authors":"Xudong Lou , Xin Lin , Henan Zeng , Xiangxian Zhu","doi":"10.1016/j.neucom.2025.129777","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, LAR-Pose, a lightweight, high-resolution network for human pose estimation driven by adaptive regression loss is proposed and experimentally demonstrated based on MS COCO and MPII. The architecture of the LAR-Pose comprises two main components. One is a lightweight high-resolution backbone network, which utilizes a parallel high-resolution architecture with conditional channel weighting block to reduce the model size and computational complexity. The other is a dynamic residual refinement network, which calculates residuals from pseudo-heatmaps and scaling factors, improving training concentration for consistent distribution estimation, rather than predicting coordinates or heatmaps directly. Specific coordinates are derived through integral heatmap regression, effectively minimizing quantization errors. Our adaptive regression loss, which uses a flow model to fit the distribution of residuals in real-time, provides more sensitive parameter feedback than conventional heatmap loss, ensuring differentiability and continuity during backpropagation while enhancing performance. With a relatively small parameter scale, LAR-Pose achieves an AP of 73.5 on MS COCO and a PCKh of 90.9 on MPII, while the results outperform most advanced small networks and approach the performance of large networks.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"633 ","pages":"Article 129777"},"PeriodicalIF":5.5000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225004497","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, LAR-Pose, a lightweight, high-resolution network for human pose estimation driven by adaptive regression loss is proposed and experimentally demonstrated based on MS COCO and MPII. The architecture of the LAR-Pose comprises two main components. One is a lightweight high-resolution backbone network, which utilizes a parallel high-resolution architecture with conditional channel weighting block to reduce the model size and computational complexity. The other is a dynamic residual refinement network, which calculates residuals from pseudo-heatmaps and scaling factors, improving training concentration for consistent distribution estimation, rather than predicting coordinates or heatmaps directly. Specific coordinates are derived through integral heatmap regression, effectively minimizing quantization errors. Our adaptive regression loss, which uses a flow model to fit the distribution of residuals in real-time, provides more sensitive parameter feedback than conventional heatmap loss, ensuring differentiability and continuity during backpropagation while enhancing performance. With a relatively small parameter scale, LAR-Pose achieves an AP of 73.5 on MS COCO and a PCKh of 90.9 on MPII, while the results outperform most advanced small networks and approach the performance of large networks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.