vAQA-SS: Vision-based action quality assessment for style-based skiing

IF 3.4 2区 工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Displays Pub Date : 2025-07-01 Epub Date: 2025-03-04 DOI:10.1016/j.displa.2025.103020
Yijia Wen , Xiaoyan Luo , Lei Zheng , Liangnan Qi , Xiaofeng Shi
{"title":"vAQA-SS: Vision-based action quality assessment for style-based skiing","authors":"Yijia Wen ,&nbsp;Xiaoyan Luo ,&nbsp;Lei Zheng ,&nbsp;Liangnan Qi ,&nbsp;Xiaofeng Shi","doi":"10.1016/j.displa.2025.103020","DOIUrl":null,"url":null,"abstract":"<div><div>Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed <em>or</em>AQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed <em>wr</em>AQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (<em>or</em>AQA) and relative evaluation (<em>wr</em>AQA), surpassing other similar sports AQA methods.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"88 ","pages":"Article 103020"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000575","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Vision-based Action Quality Assessment (AQA) aims to evaluate action quality in video data, aligning with the subjective scores of human experts. Due to the unique challenges posed by different sports, it is difficult to design a uniform AQA system applicable to all sports. Consequently, many current sports AQA methods focus on specific disciplines such as diving and gymnastics. In contrast, skiing AQA, characterized by high-dynamic actions and complex outdoor scenes, faces additional challenges. Therefore, we constructed a specific dataset for style-based skiing, which focuses athlete’s movement style and execution, encompassing diverse skiing events with detailed annotations on action classes and athletes’ final scores, named Skiing-6. Leveraging this dataset, we designed two vision-based skiing action quality assessment (vAQA-SS) models. One model directly generates an absolute AQA score by measuring the quality of an athlete’s actions in the input video without any external reference, termed orAQA, which assesses athlete performance based on low-level spatiotemporal features of the video data alongside high-level pose features. The other model calculates a relative AQA score, deriving the performance score of an athlete’s actions from the source input video with a reference video, termed wrAQA. Finally, we conducted extensive experiments on Skiing-6 and SkiTB to demonstrate the effectiveness of our vAQA-SS models. The results demonstrate that our approach achieves significant improvements in both absolute evaluation (orAQA) and relative evaluation (wrAQA), surpassing other similar sports AQA methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
vAQA-SS:基于视觉的基于风格的滑雪动作质量评估
基于视觉的动作质量评估(AQA)旨在评估视频数据中的动作质量,与人类专家的主观评分保持一致。由于不同的运动项目所带来的独特挑战,很难设计出一套适用于所有运动项目的统一的AQA系统。因此,目前许多体育项目的AQA方法侧重于特定的项目,如跳水和体操。相比之下,以高动态动作和复杂户外场景为特点的滑雪AQA则面临着额外的挑战。因此,我们为基于风格的滑雪构建了一个特定的数据集,该数据集专注于运动员的运动风格和执行,包含各种滑雪项目,并对动作类别和运动员的最终分数进行了详细的注释,名为skiing -6。利用该数据集,我们设计了两个基于视觉的滑雪动作质量评估(vAQA-SS)模型。一种模型通过测量输入视频中运动员的动作质量直接生成绝对AQA分数,而不需要任何外部参考,称为orAQA,它基于视频数据的低水平时空特征和高水平姿势特征来评估运动员的表现。另一个模型计算相对AQA分数,从源输入视频和参考视频(称为wrAQA)中得出运动员动作的表现分数。最后,我们在ski -6和SkiTB上进行了大量实验,以证明我们的vAQA-SS模型的有效性。结果表明,我们的方法在绝对评价(orAQA)和相对评价(wrAQA)两方面都取得了显著的进步,超越了其他类似的体育AQA方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Displays
Displays 工程技术-工程:电子与电气
CiteScore
4.60
自引率
25.60%
发文量
138
审稿时长
92 days
期刊介绍: Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.
期刊最新文献
An end-to-end Chinese-Braille translation method based on mT5: Vocabulary expansion and structural enhancement A degradation-adaptive deep-sea polymetallic nodule image segmentation framework with dual-frequency fusion for Jiaolong submersible Decoder-enhanced and semantic-aware layered image compression for human and machine Using siamese networks with transfer learning for dental identification on small-samples datasets Modeling epistemic uncertainty in 3D Gaussian Splatting for robust scene reconstruction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1