Detecting Outliers in Cardiopulmonary Exercise Testing Data of Ski Racers – A Comparison of Methods and their Effect on the Performance of Fatigue Prediction

Q2 Computer Science International Journal of Computer Science in Sport Pub Date : 2023-03-01 DOI:10.2478/ijcss-2023-0005

N. Baumgartner, C. Kranzinger, S. Kranzinger, C. Snyder, T. Stöggl, B. Resch

{"title":"Detecting Outliers in Cardiopulmonary Exercise Testing Data of Ski Racers – A Comparison of Methods and their Effect on the Performance of Fatigue Prediction","authors":"N. Baumgartner, C. Kranzinger, S. Kranzinger, C. Snyder, T. Stöggl, B. Resch","doi":"10.2478/ijcss-2023-0005","DOIUrl":null,"url":null,"abstract":"Abstract In sports science, cardiopulmonary data is used to assess exercise intensity, performance and health status of athletes and derive relevant target values. However, sensors may produce flawed data and data may include a wide variety of artifacts, which could potentially lead to false conclusions. Thus, appropriate and customized pre-processing algorithms are a vital prerequisite for producing reliable and valid analysis results. To find adequate outlier detection methods for this type of data, we compared three algorithms by applying them on seven ergospirometric measures of junior ski racing athletes and applied a model to predict fatigue during skiing based on the pre-processed data. While values that lie outside a realistic spectrum were consistently labelled as outliers by all methods, and mean values and standard deviations changed in similar ways, methods differed from each other when it comes to changing trends, recurring patterns, and subsequent outliers. Decomposing the sensor data into different components (trend, seasonality, remainder) before dealing with outliers increased average predictive performance the most. However, pre-processing remarkably improved prediction results for certain study participants and not for others. Thus, handling outliers correctly prior to deriving information from ergospirometric data is recommended but more research should be conducted to find methods that achieve more consistent improvement.","PeriodicalId":38466,"journal":{"name":"International Journal of Computer Science in Sport","volume":"22 1","pages":"65 - 79"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Science in Sport","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ijcss-2023-0005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract In sports science, cardiopulmonary data is used to assess exercise intensity, performance and health status of athletes and derive relevant target values. However, sensors may produce flawed data and data may include a wide variety of artifacts, which could potentially lead to false conclusions. Thus, appropriate and customized pre-processing algorithms are a vital prerequisite for producing reliable and valid analysis results. To find adequate outlier detection methods for this type of data, we compared three algorithms by applying them on seven ergospirometric measures of junior ski racing athletes and applied a model to predict fatigue during skiing based on the pre-processed data. While values that lie outside a realistic spectrum were consistently labelled as outliers by all methods, and mean values and standard deviations changed in similar ways, methods differed from each other when it comes to changing trends, recurring patterns, and subsequent outliers. Decomposing the sensor data into different components (trend, seasonality, remainder) before dealing with outliers increased average predictive performance the most. However, pre-processing remarkably improved prediction results for certain study participants and not for others. Thus, handling outliers correctly prior to deriving information from ergospirometric data is recommended but more research should be conducted to find methods that achieve more consistent improvement.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

滑雪运动员心肺运动测试数据异常值的检测——疲劳预测方法的比较及其对性能的影响

在运动科学中，心肺数据被用来评估运动员的运动强度、表现和健康状况，并得出相关的目标值。然而，传感器可能产生有缺陷的数据，数据可能包括各种各样的伪影，这可能导致错误的结论。因此，适当和定制的预处理算法是产生可靠和有效的分析结果的重要前提。为了找到适合这类数据的异常值检测方法，我们对三种算法进行了比较，将它们应用于初级滑雪比赛运动员的七项人体呼吸量测量，并基于预处理数据应用了一个模型来预测滑雪过程中的疲劳。虽然所有方法都将超出现实范围的值标记为异常值，并且平均值和标准差以类似的方式变化，但当涉及到变化趋势，重复模式和随后的异常值时，方法各不相同。在处理异常值之前，将传感器数据分解为不同的组成部分(趋势、季节性、剩余)，可以最大程度地提高平均预测性能。然而，预处理显著改善了某些研究参与者的预测结果，而对其他参与者则没有。因此，建议在从肺活量计数据中获得信息之前正确处理异常值，但应该进行更多的研究以找到实现更一致改善的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊