Min Yang, Huiqin Zhang, Minglan Yu, Yunxuan Xu, Bo Xiang, Xiaopeng Yao
{"title":"使用基于心率变异性的可解释机器学习模型辅助识别抑郁症患者:一项回顾性研究。","authors":"Min Yang, Huiqin Zhang, Minglan Yu, Yunxuan Xu, Bo Xiang, Xiaopeng Yao","doi":"10.1186/s12888-024-06384-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Depression has emerged as a global public health concern with high incidence and disability rates, which are timely imperative to identify and intervene in clinical practice. The objective of this study was to explore the association between heart rate variability (HRV) and depression, with the aim of establishing and validating machine learning models for the auxiliary diagnosis of depression.</p><p><strong>Methods: </strong>The data of 465 outpatients from the Affiliated Hospital of Southwest Medical University were selected for the study. The study population was then randomly divided into training and test sets in a 7:3 ratio. Logistic regression (LR), support vector machine (SVM), random forest (RF) and eXtreme gradient boosting (XGBoost) algorithm models were used to construct risk prediction models in the training set, and the model performance was verified in the test set. The four models were evaluated by the area under the receiver operating characteristic curve (ROC), calibration curve and the decision curve analysis (DCA). Furthermore, we employed the SHapley Additive exPlanations (SHAP) method to illustrate the effects of the features attributed to the model.</p><p><strong>Results: </strong>There were 237 people in the depressed group and 228 in the non-depressed group. In the training set (n = 325) and test set (n = 140), the area under of the curve(AUC) values of the XGBoost model are 0.92 [95% confidence interval (CI) 0.888,0.95] and 0.82 (95% CI 0.754,0.892)] respectively, which are higher than the other three models. The XGBoost model has excellent predictive efficacy and clinical utility. The SHAP method was ranked according to the importance of the degree of influence on the model, with age, heart rate, Standard deviation of the NN intervals (SDNN), two nonlinear parameters of HRV and sex considered to be the top 6 predictors.</p><p><strong>Conclusion: </strong>We provided a feasibility study of HRV as a potential biomarker for depression. The proposed model based on HRV provides clinicians with a quantitative auxiliary diagnostic tool, which is assist to improving the accuracy and efficiency of depression diagnosis, and can also be utilized for the monitoring and prevention of depression.</p>","PeriodicalId":9029,"journal":{"name":"BMC Psychiatry","volume":"24 1","pages":"914"},"PeriodicalIF":4.6000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654159/pdf/","citationCount":"0","resultStr":"{\"title\":\"Auxiliary identification of depression patients using interpretable machine learning models based on heart rate variability: a retrospective study.\",\"authors\":\"Min Yang, Huiqin Zhang, Minglan Yu, Yunxuan Xu, Bo Xiang, Xiaopeng Yao\",\"doi\":\"10.1186/s12888-024-06384-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Depression has emerged as a global public health concern with high incidence and disability rates, which are timely imperative to identify and intervene in clinical practice. The objective of this study was to explore the association between heart rate variability (HRV) and depression, with the aim of establishing and validating machine learning models for the auxiliary diagnosis of depression.</p><p><strong>Methods: </strong>The data of 465 outpatients from the Affiliated Hospital of Southwest Medical University were selected for the study. The study population was then randomly divided into training and test sets in a 7:3 ratio. Logistic regression (LR), support vector machine (SVM), random forest (RF) and eXtreme gradient boosting (XGBoost) algorithm models were used to construct risk prediction models in the training set, and the model performance was verified in the test set. The four models were evaluated by the area under the receiver operating characteristic curve (ROC), calibration curve and the decision curve analysis (DCA). Furthermore, we employed the SHapley Additive exPlanations (SHAP) method to illustrate the effects of the features attributed to the model.</p><p><strong>Results: </strong>There were 237 people in the depressed group and 228 in the non-depressed group. In the training set (n = 325) and test set (n = 140), the area under of the curve(AUC) values of the XGBoost model are 0.92 [95% confidence interval (CI) 0.888,0.95] and 0.82 (95% CI 0.754,0.892)] respectively, which are higher than the other three models. The XGBoost model has excellent predictive efficacy and clinical utility. The SHAP method was ranked according to the importance of the degree of influence on the model, with age, heart rate, Standard deviation of the NN intervals (SDNN), two nonlinear parameters of HRV and sex considered to be the top 6 predictors.</p><p><strong>Conclusion: </strong>We provided a feasibility study of HRV as a potential biomarker for depression. The proposed model based on HRV provides clinicians with a quantitative auxiliary diagnostic tool, which is assist to improving the accuracy and efficiency of depression diagnosis, and can also be utilized for the monitoring and prevention of depression.</p>\",\"PeriodicalId\":9029,\"journal\":{\"name\":\"BMC Psychiatry\",\"volume\":\"24 1\",\"pages\":\"914\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654159/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Psychiatry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12888-024-06384-w\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12888-024-06384-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
摘要
目的:抑郁症已成为全球关注的公共卫生问题,其发病率和致残率高,迫切需要在临床实践中及时识别和干预。本研究的目的是探讨心率变异性(HRV)与抑郁症之间的关系,目的是建立和验证用于抑郁症辅助诊断的机器学习模型。方法:选取西南医科大学附属医院465例门诊患者为研究对象。然后将研究人群按7:3的比例随机分为训练组和测试组。在训练集中采用Logistic回归(LR)、支持向量机(SVM)、随机森林(RF)和极端梯度提升(XGBoost)算法模型构建风险预测模型,并在测试集中验证模型的性能。采用受试者工作特征曲线(ROC)下面积、校正曲线下面积和决策曲线分析(DCA)对4种模型进行评价。此外,我们采用SHapley加性解释(SHAP)方法来说明归因于模型的特征的影响。结果:抑郁组237人,非抑郁组228人。在训练集(n = 325)和测试集(n = 140)中,XGBoost模型的曲线下面积(AUC)值分别为0.92[95%置信区间(CI) 0.888,0.95]和0.82 (95% CI 0.754,0.892)],均高于其他三种模型。XGBoost模型具有良好的预测效果和临床应用价值。根据对模型影响程度的重要程度对SHAP方法进行排序,年龄、心率、NN区间标准差(SDNN)、HRV和性别两个非线性参数被认为是前6个预测因子。结论:我们提供了HRV作为抑郁症潜在生物标志物的可行性研究。基于HRV的模型为临床医生提供了一种定量的辅助诊断工具,有助于提高抑郁症诊断的准确性和效率,也可用于抑郁症的监测和预防。
Auxiliary identification of depression patients using interpretable machine learning models based on heart rate variability: a retrospective study.
Objective: Depression has emerged as a global public health concern with high incidence and disability rates, which are timely imperative to identify and intervene in clinical practice. The objective of this study was to explore the association between heart rate variability (HRV) and depression, with the aim of establishing and validating machine learning models for the auxiliary diagnosis of depression.
Methods: The data of 465 outpatients from the Affiliated Hospital of Southwest Medical University were selected for the study. The study population was then randomly divided into training and test sets in a 7:3 ratio. Logistic regression (LR), support vector machine (SVM), random forest (RF) and eXtreme gradient boosting (XGBoost) algorithm models were used to construct risk prediction models in the training set, and the model performance was verified in the test set. The four models were evaluated by the area under the receiver operating characteristic curve (ROC), calibration curve and the decision curve analysis (DCA). Furthermore, we employed the SHapley Additive exPlanations (SHAP) method to illustrate the effects of the features attributed to the model.
Results: There were 237 people in the depressed group and 228 in the non-depressed group. In the training set (n = 325) and test set (n = 140), the area under of the curve(AUC) values of the XGBoost model are 0.92 [95% confidence interval (CI) 0.888,0.95] and 0.82 (95% CI 0.754,0.892)] respectively, which are higher than the other three models. The XGBoost model has excellent predictive efficacy and clinical utility. The SHAP method was ranked according to the importance of the degree of influence on the model, with age, heart rate, Standard deviation of the NN intervals (SDNN), two nonlinear parameters of HRV and sex considered to be the top 6 predictors.
Conclusion: We provided a feasibility study of HRV as a potential biomarker for depression. The proposed model based on HRV provides clinicians with a quantitative auxiliary diagnostic tool, which is assist to improving the accuracy and efficiency of depression diagnosis, and can also be utilized for the monitoring and prevention of depression.
期刊介绍:
BMC Psychiatry is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of psychiatric disorders, as well as related molecular genetics, pathophysiology, and epidemiology.