Charlotte Wenzel, Thomas Liebig, Adrian Swoboda, Rika Smolareck, Marit L Schlagheck, David Walzik, Andreas Groll, Richie P Goulding, Philipp Zimmer
{"title":"Machine learning predicts peak oxygen uptake and peak power output for customizing cardiopulmonary exercise testing using non-exercise features.","authors":"Charlotte Wenzel, Thomas Liebig, Adrian Swoboda, Rika Smolareck, Marit L Schlagheck, David Walzik, Andreas Groll, Richie P Goulding, Philipp Zimmer","doi":"10.1007/s00421-024-05543-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Cardiopulmonary exercise testing (CPET) is considered the gold standard for assessing cardiorespiratory fitness. To ensure consistent performance of each test, it is necessary to adapt the power increase of the test protocol to the physical characteristics of each individual. This study aimed to use machine learning models to determine individualized ramp protocols based on non-exercise features. We hypothesized that machine learning models will predict peak oxygen uptake ( <math><mover><mi>V</mi> <mo>˙</mo></mover> </math> O<sub>2peak</sub>) and peak power output (PPO) more accurately than conventional multiple linear regression (MLR).</p><p><strong>Methods: </strong>The cross-sectional study was conducted with 274 (♀168, ♂106) participants who performed CPET on a cycle ergometer. Machine learning models and multiple linear regression were used to predict <math><mover><mi>V</mi> <mo>˙</mo></mover> </math> O<sub>2peak</sub> and PPO using non-exercise features. The accuracy of the models was compared using criteria such as root mean square error (RMSE). Shapley additive explanation (SHAP) was applied to determine the feature importance.</p><p><strong>Results: </strong>The most accurate machine learning model was the random forest (RMSE: 6.52 ml/kg/min [95% CI 5.21-8.17]) for <math><mover><mi>V</mi> <mo>˙</mo></mover> </math> O<sub>2peak</sub> prediction and the gradient boosting regression (RMSE: 43watts [95% CI 35-52]) for PPO prediction. Compared to the MLR, the machine learning models reduced the RMSE by up to 28% and 22% for prediction of <math><mover><mi>V</mi> <mo>˙</mo></mover> </math> O<sub>2peak</sub> and PPO, respectively. Furthermore, SHAP ranked body composition data such as skeletal muscle mass and extracellular water as the most impactful features.</p><p><strong>Conclusion: </strong>Machine learning models predict <math><mover><mi>V</mi> <mo>˙</mo></mover> </math> O<sub>2peak</sub> and PPO more accurately than MLR and can be used to individualize CPET protocols. Features that provide information about the participant's body composition contribute most to the improvement of these predictions.</p><p><strong>Trial registration number: </strong>DRKS00031401 (6 March 2023, retrospectively registered).</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519113/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00421-024-05543-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Cardiopulmonary exercise testing (CPET) is considered the gold standard for assessing cardiorespiratory fitness. To ensure consistent performance of each test, it is necessary to adapt the power increase of the test protocol to the physical characteristics of each individual. This study aimed to use machine learning models to determine individualized ramp protocols based on non-exercise features. We hypothesized that machine learning models will predict peak oxygen uptake ( O2peak) and peak power output (PPO) more accurately than conventional multiple linear regression (MLR).
Methods: The cross-sectional study was conducted with 274 (♀168, ♂106) participants who performed CPET on a cycle ergometer. Machine learning models and multiple linear regression were used to predict O2peak and PPO using non-exercise features. The accuracy of the models was compared using criteria such as root mean square error (RMSE). Shapley additive explanation (SHAP) was applied to determine the feature importance.
Results: The most accurate machine learning model was the random forest (RMSE: 6.52 ml/kg/min [95% CI 5.21-8.17]) for O2peak prediction and the gradient boosting regression (RMSE: 43watts [95% CI 35-52]) for PPO prediction. Compared to the MLR, the machine learning models reduced the RMSE by up to 28% and 22% for prediction of O2peak and PPO, respectively. Furthermore, SHAP ranked body composition data such as skeletal muscle mass and extracellular water as the most impactful features.
Conclusion: Machine learning models predict O2peak and PPO more accurately than MLR and can be used to individualize CPET protocols. Features that provide information about the participant's body composition contribute most to the improvement of these predictions.
Trial registration number: DRKS00031401 (6 March 2023, retrospectively registered).