A multi-season machine learning approach to examine the training load and injury relationship in professional soccer

Pub Date : 2024-04-22 DOI:10.3233/jsa-240718
Aritra Majumdar, Rashid Bakirov, Dan Hodges, Sean McCullagh, Tim Rees
{"title":"A multi-season machine learning approach to examine the training load and injury relationship in professional soccer","authors":"Aritra Majumdar, Rashid Bakirov, Dan Hodges, Sean McCullagh, Tim Rees","doi":"10.3233/jsa-240718","DOIUrl":null,"url":null,"abstract":"OBJECTIVES: The purpose of this study was to use machine learning to examine the relationship between training load and soccer injury with a multi-season dataset from one English Premier League club. METHODS: Participants were 35 male professional soccer players (aged 25.79±3.75 years, range 18–37 years; height 1.80±0.07 m, range 1.63–1.95 m; weight 80.70±6.78 kg, range 66.03–93.70 kg), with data collected from the 2014–2015 season until the 2018–2019 season. A total of 106 training loads variables (40 GPS data, 6 personal information, 14 physical data, 4 psychological data and 14 ACWR, 14 MSWR and 14 EWMA data) were examined in relation to 133 non-contact injuries, with a high imbalance ratio of 0.013. RESULTS: XGBoost and Artificial Neural Network were implemented to train the machine learning models using four and a half seasons’ data, with the developed models subsequently tested on the following half season’s data. During the first four and a half seasons, there were 341 injuries; during the next half season there were 37 injuries. To interpret and visualize the output of each model and the contribution of each feature (i.e., training load) towards the model, we used the Shapley Additive Explanations (SHAP) approach. Of 37 injuries, XGBoost correctly predicted 26 injuries, with recall and precision of 73% and 10% respectively. Artificial Neural Network correctly predicted 28 injuries, with recall and precision of 77% and 13% respectively. In the model using Artificial Neural Network (the relatively more accurate model), last injury area and weight appeared to be the most important features contributing to the prediction of injury. CONCLUSIONS: This was the first study of its kind to use Artificial Neural Network and a multi-season dataset for injury prediction. Our results demonstrate the potential to predict injuries with high recall, thereby identifying most of the injury cases, albeit, due to high class imbalance, precision suffered. This approach to using machine learning provides potentially valuable insights for soccer organizations and practitioners when monitoring load injuries.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jsa-240718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

OBJECTIVES: The purpose of this study was to use machine learning to examine the relationship between training load and soccer injury with a multi-season dataset from one English Premier League club. METHODS: Participants were 35 male professional soccer players (aged 25.79±3.75 years, range 18–37 years; height 1.80±0.07 m, range 1.63–1.95 m; weight 80.70±6.78 kg, range 66.03–93.70 kg), with data collected from the 2014–2015 season until the 2018–2019 season. A total of 106 training loads variables (40 GPS data, 6 personal information, 14 physical data, 4 psychological data and 14 ACWR, 14 MSWR and 14 EWMA data) were examined in relation to 133 non-contact injuries, with a high imbalance ratio of 0.013. RESULTS: XGBoost and Artificial Neural Network were implemented to train the machine learning models using four and a half seasons’ data, with the developed models subsequently tested on the following half season’s data. During the first four and a half seasons, there were 341 injuries; during the next half season there were 37 injuries. To interpret and visualize the output of each model and the contribution of each feature (i.e., training load) towards the model, we used the Shapley Additive Explanations (SHAP) approach. Of 37 injuries, XGBoost correctly predicted 26 injuries, with recall and precision of 73% and 10% respectively. Artificial Neural Network correctly predicted 28 injuries, with recall and precision of 77% and 13% respectively. In the model using Artificial Neural Network (the relatively more accurate model), last injury area and weight appeared to be the most important features contributing to the prediction of injury. CONCLUSIONS: This was the first study of its kind to use Artificial Neural Network and a multi-season dataset for injury prediction. Our results demonstrate the potential to predict injuries with high recall, thereby identifying most of the injury cases, albeit, due to high class imbalance, precision suffered. This approach to using machine learning provides potentially valuable insights for soccer organizations and practitioners when monitoring load injuries.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
采用多赛季机器学习方法研究职业足球训练负荷与受伤之间的关系
研究目的本研究的目的是利用机器学习技术,通过一个英超俱乐部的多赛季数据集来研究训练负荷与足球损伤之间的关系。方法:参与者为 35 名男性职业足球运动员(年龄为 25.79±3.75岁,范围为 18-37 岁;身高为 1.80±0.07米,范围为 1.63-1.95 米;体重为 80.70±6.78公斤,范围为 66.03-93.70 公斤),数据收集时间为 2014-2015 赛季至 2018-2019 赛季。共研究了106个训练负荷变量(40个GPS数据、6个个人信息、14个体能数据、4个心理数据和14个ACWR、14个MSWR和14个EWMA数据)与133次非接触性损伤的关系,不平衡比高达0.013。结果:使用 XGBoost 和人工神经网络对四个半赛季的数据进行了机器学习模型的训练,随后在接下来的半个赛季的数据中对所开发的模型进行了测试。在前四个半赛季中,共有 341 人受伤;在后半个赛季中,共有 37 人受伤。为了解释和直观显示每个模型的输出结果以及每个特征(即训练负荷)对模型的贡献,我们使用了夏普利加法解释(SHAP)方法。在 37 例伤害中,XGBoost 正确预测了 26 例伤害,召回率和精确率分别为 73% 和 10%。人工神经网络正确预测了 28 起伤害事故,召回率和精确率分别为 77% 和 13%。在使用人工神经网络的模型(相对更准确的模型)中,最后的损伤面积和重量似乎是预测损伤的最重要特征。结论:这是首次使用人工神经网络和多赛季数据集进行损伤预测的研究。我们的研究结果表明,尽管由于类的高度不平衡,精确度受到了影响,但仍有可能以高召回率预测受伤情况,从而识别出大多数受伤病例。这种使用机器学习的方法为足球组织和从业人员监测负荷伤害提供了潜在的宝贵见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1