InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-Based Action Recognition

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2024-09-26 DOI:10.1109/TPAMI.2024.3466212

Seunggeun Chi;Hyung-Gun Chi;Qixing Huang;Karthik Ramani

{"title":"InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-Based Action Recognition","authors":"Seunggeun Chi;Hyung-Gun Chi;Qixing Huang;Karthik Ramani","doi":"10.1109/TPAMI.2024.3466212","DOIUrl":null,"url":null,"abstract":"Skeleton-based action recognition has made significant advancements recently, with models like InfoGCN showcasing remarkable accuracy. However, these models exhibit a key limitation: they necessitate complete action observation prior to classification, which constrains their applicability in real-time situations such as surveillance and robotic systems. To overcome this barrier, we introduce InfoGCN++, an innovative extension of InfoGCN, explicitly developed for online skeleton-based action recognition. InfoGCN++ augments the abilities of the original InfoGCN model by allowing real-time categorization of action types, independent of the observation sequence’s length. It transcends conventional approaches by learning from current and anticipated future movements, thereby creating a more thorough representation of the entire sequence. Our approach to prediction is managed as an extrapolation issue, grounded on observed actions. To enable this, InfoGCN++ incorporates Neural Ordinary Differential Equations, a concept that lets it effectively model the continuous evolution of hidden states. Following rigorous evaluations on three skeleton-based action recognition benchmarks, InfoGCN++ demonstrates exceptional performance in online action recognition. It consistently equals or exceeds existing techniques, highlighting its significant potential to reshape the landscape of real-time action recognition applications. Consequently, this work represents a major leap forward from InfoGCN, pushing the limits of what’s possible in online, skeleton-based action recognition.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 1","pages":"514-528"},"PeriodicalIF":18.6000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10694798/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Skeleton-based action recognition has made significant advancements recently, with models like InfoGCN showcasing remarkable accuracy. However, these models exhibit a key limitation: they necessitate complete action observation prior to classification, which constrains their applicability in real-time situations such as surveillance and robotic systems. To overcome this barrier, we introduce InfoGCN++, an innovative extension of InfoGCN, explicitly developed for online skeleton-based action recognition. InfoGCN++ augments the abilities of the original InfoGCN model by allowing real-time categorization of action types, independent of the observation sequence’s length. It transcends conventional approaches by learning from current and anticipated future movements, thereby creating a more thorough representation of the entire sequence. Our approach to prediction is managed as an extrapolation issue, grounded on observed actions. To enable this, InfoGCN++ incorporates Neural Ordinary Differential Equations, a concept that lets it effectively model the continuous evolution of hidden states. Following rigorous evaluations on three skeleton-based action recognition benchmarks, InfoGCN++ demonstrates exceptional performance in online action recognition. It consistently equals or exceeds existing techniques, highlighting its significant potential to reshape the landscape of real-time action recognition applications. Consequently, this work represents a major leap forward from InfoGCN, pushing the limits of what’s possible in online, skeleton-based action recognition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

InfoGCN++：通过预测未来学习表征，实现基于骨架的在线动作识别

基于骨骼的动作识别最近取得了重大进展，像InfoGCN这样的模型显示出了惊人的准确性。然而，这些模型表现出一个关键的局限性：它们需要在分类之前进行完整的动作观察，这限制了它们在监视和机器人系统等实时情况下的适用性。为了克服这一障碍，我们引入了infocn ++，这是infocn的一个创新扩展，明确地为基于骨架的在线动作识别而开发。InfoGCN++增强了原始InfoGCN模型的能力，允许对动作类型进行实时分类，而不依赖于观察序列的长度。它超越了传统的方法，从当前和预期的未来运动中学习，从而创造了整个序列的更彻底的表示。我们的预测方法是作为一个外推问题来管理的，以观察到的行为为基础。为了实现这一点，infogcn++结合了神经常微分方程（Neural常微分方程），这是一个可以有效地为隐藏状态的连续演化建模的概念。经过对三个基于骨架的动作识别基准的严格评估，infogcn++在在线动作识别方面展示了卓越的性能。它始终等于或超过现有的技术，突出了其重塑实时动作识别应用领域的巨大潜力。因此，这项工作代表了InfoGCN的一个重大飞跃，推动了在线、基于骨架的动作识别的极限。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量