vAQA-SS: Vision-based action quality assessment for style-based skiing

IF 3.4 2区工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Displays Pub Date : 2025-07-01 Epub Date: 2025-03-04 DOI:10.1016/j.displa.2025.103020

Yijia Wen , Xiaoyan Luo , Lei Zheng , Liangnan Qi , Xiaofeng Shi

{"title":"vAQA-SS: Vision-based action quality assessment for style-based skiing","authors":"Yijia Wen , Xiaoyan Luo , Lei Zheng , Liangnan Qi , Xiaofeng Shi","doi":"10.1016/j.displa.2025.103020","DOIUrl":null,"url":null,"abstract":"<div><div>Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed <em>or</em>AQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed <em>wr</em>AQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (<em>or</em>AQA) and relative evaluation (<em>wr</em>AQA), surpassing other similar sports AQA methods.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"88 ","pages":"Article 103020"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000575","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed orAQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed wrAQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (orAQA) and relative evaluation (wrAQA), surpassing other similar sports AQA methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

vAQA-SS：基于视觉的基于风格的滑雪动作质量评估

基于视觉的动作质量评估（AQA）旨在评估视频数据中的动作质量，与人类专家的主观评分保持一致。由于不同的运动项目所带来的独特挑战，很难设计出一套适用于所有运动项目的统一的AQA系统。因此，目前许多体育项目的AQA方法侧重于特定的项目，如跳水和体操。相比之下，以高动态动作和复杂户外场景为特点的滑雪AQA则面临着额外的挑战。因此，我们为基于风格的滑雪构建了一个特定的数据集，该数据集专注于运动员的运动风格和执行，包含各种滑雪项目，并对动作类别和运动员的最终分数进行了详细的注释，名为skiing -6。利用该数据集，我们设计了两个基于视觉的滑雪动作质量评估（vAQA-SS）模型。一种模型通过测量输入视频中运动员的动作质量直接生成绝对AQA分数，而不需要任何外部参考，称为orAQA，它基于视频数据的低水平时空特征和高水平姿势特征来评估运动员的表现。另一个模型计算相对AQA分数，从源输入视频和参考视频（称为wrAQA）中得出运动员动作的表现分数。最后，我们在ski -6和SkiTB上进行了大量实验，以证明我们的vAQA-SS模型的有效性。结果表明，我们的方法在绝对评价（orAQA）和相对评价（wrAQA）两方面都取得了显著的进步，超越了其他类似的体育AQA方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Displays 工程技术-工程：电子与电气

CiteScore

4.60

自引率

25.60%

发文量

138

审稿时长

92 days

期刊介绍： Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.