Development of an explainable machine learning model for predicting depression in adolescent girls with non-suicidal self-injury: A cross-sectional multicenter study

IF 4.9 2区 医学 Q1 CLINICAL NEUROLOGY Journal of affective disorders Pub Date : 2025-03-15 DOI:10.1016/j.jad.2025.03.080
Ben Niu , Mengjie Wan , Yongjie Zhou
{"title":"Development of an explainable machine learning model for predicting depression in adolescent girls with non-suicidal self-injury: A cross-sectional multicenter study","authors":"Ben Niu ,&nbsp;Mengjie Wan ,&nbsp;Yongjie Zhou","doi":"10.1016/j.jad.2025.03.080","DOIUrl":null,"url":null,"abstract":"<div><div>Non-suicidal self-injury (NSSI) in adolescent girls is a critical predictor of subsequent depression and suicide risk, yet current tools lack both accuracy and clinical interpretability. We developed the first explainable machine learning model integrating multicenter psychosocial data to predict depression among Chinese adolescent girls with NSSI, addressing the critical need for culturally tailored risk stratification tools. In this cross - sectional observational study, our model was developed using data from 14 hospitals. We used five categories of data as predictors, including individual, family, school, psychosocial, and behavioral and lifestyle factors. We compared seven machine learning models and selected the best one to develop final model and the Shapley Additive exPlanations (SHAP) method were used to explain model prediction. The Random Forest (RF) model was compared against six other machine learning algorithms. We assessed the discrimination using the area under receiver operating characteristic (AUROC) with 95 % CIs. Using the development dataset (<em>n</em> = 1163) and predictive model building process, a simplified model containing only the top 20 features had similar predictive performance to the full model, the RF model outperformed six algorithms (AUROC = 0.964 [0.945–0.975]), demonstrating superior discriminative power and robustness. The top ten risk predictors were Borderline personality, Rumination, Perceived stress, Hopelessness, Self-esteem, Sleep quality, Loneliness, Resilience, Parental care, and Problem-focused coping. We developed a three-tiered, color-coded web-based clinical tool to operationalize predictions, enabling real-time risk stratification and personalized interventions. Our study bridges machine learning and clinical interpretability to advance precision mental health interventions for vulnerable adolescent populations.</div></div>","PeriodicalId":14963,"journal":{"name":"Journal of affective disorders","volume":"379 ","pages":"Pages 690-702"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of affective disorders","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165032725004264","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Non-suicidal self-injury (NSSI) in adolescent girls is a critical predictor of subsequent depression and suicide risk, yet current tools lack both accuracy and clinical interpretability. We developed the first explainable machine learning model integrating multicenter psychosocial data to predict depression among Chinese adolescent girls with NSSI, addressing the critical need for culturally tailored risk stratification tools. In this cross - sectional observational study, our model was developed using data from 14 hospitals. We used five categories of data as predictors, including individual, family, school, psychosocial, and behavioral and lifestyle factors. We compared seven machine learning models and selected the best one to develop final model and the Shapley Additive exPlanations (SHAP) method were used to explain model prediction. The Random Forest (RF) model was compared against six other machine learning algorithms. We assessed the discrimination using the area under receiver operating characteristic (AUROC) with 95 % CIs. Using the development dataset (n = 1163) and predictive model building process, a simplified model containing only the top 20 features had similar predictive performance to the full model, the RF model outperformed six algorithms (AUROC = 0.964 [0.945–0.975]), demonstrating superior discriminative power and robustness. The top ten risk predictors were Borderline personality, Rumination, Perceived stress, Hopelessness, Self-esteem, Sleep quality, Loneliness, Resilience, Parental care, and Problem-focused coping. We developed a three-tiered, color-coded web-based clinical tool to operationalize predictions, enabling real-time risk stratification and personalized interventions. Our study bridges machine learning and clinical interpretability to advance precision mental health interventions for vulnerable adolescent populations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
开发一种可解释的机器学习模型,用于预测非自杀性自伤少女的抑郁情况:一项横断面多中心研究。
青春期女孩的非自杀性自伤(NSSI)是随后抑郁和自杀风险的重要预测因子,但目前的工具缺乏准确性和临床可解释性。我们开发了第一个可解释的机器学习模型,整合多中心社会心理数据来预测中国青少年自伤女孩的抑郁,解决了对文化量身定制的风险分层工具的迫切需求。在这项横断面观察性研究中,我们的模型是使用14家医院的数据开发的。我们使用五类数据作为预测因素,包括个人、家庭、学校、社会心理、行为和生活方式因素。我们比较了7种机器学习模型,从中选择了最好的一种来开发最终模型,并使用Shapley加性解释(SHAP)方法来解释模型预测。随机森林(RF)模型与其他六种机器学习算法进行了比较。我们用接受者工作特征下面积(AUROC)评估鉴别,95% % ci。利用开发数据集(n = 1163)和预测模型构建过程,仅包含前20个特征的简化模型具有与完整模型相似的预测性能,RF模型优于6种算法(AUROC = 0.964[0.945-0.975]),显示出优越的判别能力和鲁棒性。前十名的风险预测因素是:边缘型人格、沉思、感知压力、绝望、自尊、睡眠质量、孤独、适应力、父母照顾和以问题为中心的应对。我们开发了一个三层、彩色编码的基于网络的临床工具,用于操作预测,实现实时风险分层和个性化干预。我们的研究将机器学习和临床可解释性结合起来,为弱势青少年群体推进精确的心理健康干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of affective disorders
Journal of affective disorders 医学-精神病学
CiteScore
10.90
自引率
6.10%
发文量
1319
审稿时长
9.3 weeks
期刊介绍: The Journal of Affective Disorders publishes papers concerned with affective disorders in the widest sense: depression, mania, mood spectrum, emotions and personality, anxiety and stress. It is interdisciplinary and aims to bring together different approaches for a diverse readership. Top quality papers will be accepted dealing with any aspect of affective disorders, including neuroimaging, cognitive neurosciences, genetics, molecular biology, experimental and clinical neurosciences, pharmacology, neuroimmunoendocrinology, intervention and treatment trials.
期刊最新文献
Concerns regarding scope and methodological framing in "Mining the risk factors for stroke occurrence and dietary protective factors based on the NHANES database: Analysis using SHAP". Psychological distress and mental health service use trends in Japan (2013-2022): Focusing on the change before and after the COVID-19 pandemic. Influence of childhood excess weight on cognitive, behavioural and emotional outcomes. Associations between multiple neurodevelopmental disorders and mental health in children. Acid accumulation in bipolar disorder.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1