Detecting Outliers in Cardiopulmonary Exercise Testing Data of Ski Racers – A Comparison of Methods and their Effect on the Performance of Fatigue Prediction

N. Baumgartner, C. Kranzinger, S. Kranzinger, C. Snyder, T. Stöggl, B. Resch
International Journal of Computer Science in Sport, published 2023-03-01
DOI: 10.2478/ijcss-2023-0005 (https://doi.org/10.2478/ijcss-2023-0005)

Abstract

In sports science, cardiopulmonary data is used to assess the exercise intensity, performance, and health status of athletes and to derive relevant target values. However, sensors may produce flawed data, and the data may contain a wide variety of artifacts, which can lead to false conclusions. Appropriate, customized pre-processing algorithms are therefore a vital prerequisite for reliable and valid analysis results. To find adequate outlier detection methods for this type of data, we compared three algorithms by applying them to seven ergospirometric measures of junior ski racing athletes, and then applied a model to predict fatigue during skiing based on the pre-processed data. All methods consistently labelled values outside a realistic range as outliers, and mean values and standard deviations changed in similar ways; the methods differed, however, in how they handled changing trends, recurring patterns, and subsequent outliers. Decomposing the sensor data into components (trend, seasonality, remainder) before dealing with outliers increased average predictive performance the most. However, pre-processing improved prediction results markedly for some study participants and not for others. Handling outliers correctly before deriving information from ergospirometric data is therefore recommended, but more research is needed to find methods that achieve more consistent improvement.
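The abstract does not name the three algorithms that were compared, so the following is only a generic sketch of the decomposition idea it describes: split a signal into trend, seasonal, and remainder components, then look for outliers in the remainder. The function names, the rolling-median trend, the per-phase median as the seasonal component, the interquartile-range fences, and the parameters `period`, `window`, and `k` are all assumptions made for illustration and are not taken from the paper.

```python
from statistics import median, quantiles


def rolling_median(values, window):
    """Centered rolling median; the window shrinks at the series edges."""
    half = window // 2
    return [
        median(values[max(0, i - half): i + half + 1])
        for i in range(len(values))
    ]


def decompose(values, period, window):
    """Split a series into trend, seasonal, and remainder components."""
    trend = rolling_median(values, window)
    detrended = [v - t for v, t in zip(values, trend)]
    # Seasonal component: median of the detrended values at each phase
    # of the (assumed) recurring pattern of length `period`.
    phase_medians = [median(detrended[p::period]) for p in range(period)]
    seasonal = [phase_medians[i % period] for i in range(len(values))]
    remainder = [d - s for d, s in zip(detrended, seasonal)]
    return trend, seasonal, remainder


def flag_outliers(values, period, window, k=3.0):
    """Flag points whose remainder lies outside the IQR fences.

    Returns (flags, cleaned), where flagged points in `cleaned` are
    replaced by trend + seasonal and all other points are unchanged.
    """
    trend, seasonal, remainder = decompose(values, period, window)
    q1, _, q3 = quantiles(remainder, n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    flags = [r < low or r > high for r in remainder]
    cleaned = [
        t + s if f else v
        for v, t, s, f in zip(values, trend, seasonal, flags)
    ]
    return flags, cleaned
```

In this sketch a spike is judged against the value expected from the local trend and the recurring pattern, not against the global mean, which is one plausible reason a decomposition step could treat trends and recurring patterns differently from a simple range check. Note that on nearly noise-free data the interquartile range of the remainder can be close to zero, in which case the fences become very tight and many points are flagged.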