Interpretable machine learning approaches for children's ADHD detection using clinical assessment data: an online web application deployment.

IF 3.4 2区医学 Q2 PSYCHIATRY BMC Psychiatry Pub Date : 2025-02-17 DOI:10.1186/s12888-025-06573-1

Han Qin, Lili Zhang, Jianhong Wang, Weiheng Yan, Xi Wang, Xia Qu, Nan Peng, Lin Wang

{"title":"Interpretable machine learning approaches for children's ADHD detection using clinical assessment data: an online web application deployment.","authors":"Han Qin, Lili Zhang, Jianhong Wang, Weiheng Yan, Xi Wang, Xia Qu, Nan Peng, Lin Wang","doi":"10.1186/s12888-025-06573-1","DOIUrl":null,"url":null,"abstract":"Background: Attention-deficit/hyperactivity disorder (ADHD) is a prevalent mental disorder characterized by hyperactivity, impulsivity, and inattention. This study aims to develop a verifiable and interpretable machine learning model to identify ADHD and its subtypes in children using clinical assessment scales data.Methods: This study utilized the ADHD-200 dataset, including demographic data, Behavioral Rating Scale, and Wechsler Intelligence Scale assessments, to train and validate our models. The model's performance was evaluated using 10-fold cross-validation within the internal dataset, and the best model will be used for external validation. Seven machine learning models were evaluated. The SHapley Additive exPlanations (SHAP) method was employed for model interpretation. Finally, a web application will deploy the prediction model to provide ADHD probabilities based on user input.Results: The Random Forest (RF) model performing best in identifying ADHD and the Support Vector Machine (SVM) model excelling in distinguishing ADHD subtypes. The RF model achieved an AUC of 0.99 in 10-fold cross-validation and an AUC of 0.99 in external validation, and the SVM model achieved a micro-average AUC of 0.96 and an accuracy of 0.83 in internal validation and a micro-average AUC of 0.96 and an accuracy of 0.85 in external validation. We used SHAP to interpret the models, revealing that higher ADHD Index pushed the model towards ADHD classification. Additionally, lower IQ scores were correlated with a higher likelihood of ADHD, consistent with previous studies. The dependency analysis found that the model can identify different behavioral scales. We deployed the final model online using a web application and showed users how the model made decisions.Conclusions: Our findings highlight the potential of using machine learning and clinical assessment scales to support the diagnosis and subtype identification of ADHD in children, offering a practical solution for improving diagnostic accuracy and efficiency in clinical settings.","PeriodicalId":9029,"journal":{"name":"BMC Psychiatry","volume":"25 1","pages":"139"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11834570/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12888-025-06573-1","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHIATRY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Attention-deficit/hyperactivity disorder (ADHD) is a prevalent mental disorder characterized by hyperactivity, impulsivity, and inattention. This study aims to develop a verifiable and interpretable machine learning model to identify ADHD and its subtypes in children using clinical assessment scales data.

Methods: This study utilized the ADHD-200 dataset, including demographic data, Behavioral Rating Scale, and Wechsler Intelligence Scale assessments, to train and validate our models. The model's performance was evaluated using 10-fold cross-validation within the internal dataset, and the best model will be used for external validation. Seven machine learning models were evaluated. The SHapley Additive exPlanations (SHAP) method was employed for model interpretation. Finally, a web application will deploy the prediction model to provide ADHD probabilities based on user input.

Results: The Random Forest (RF) model performing best in identifying ADHD and the Support Vector Machine (SVM) model excelling in distinguishing ADHD subtypes. The RF model achieved an AUC of 0.99 in 10-fold cross-validation and an AUC of 0.99 in external validation, and the SVM model achieved a micro-average AUC of 0.96 and an accuracy of 0.83 in internal validation and a micro-average AUC of 0.96 and an accuracy of 0.85 in external validation. We used SHAP to interpret the models, revealing that higher ADHD Index pushed the model towards ADHD classification. Additionally, lower IQ scores were correlated with a higher likelihood of ADHD, consistent with previous studies. The dependency analysis found that the model can identify different behavioral scales. We deployed the final model online using a web application and showed users how the model made decisions.

Conclusions: Our findings highlight the potential of using machine learning and clinical assessment scales to support the diagnosis and subtype identification of ADHD in children, offering a practical solution for improving diagnostic accuracy and efficiency in clinical settings.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用临床评估数据的儿童ADHD检测的可解释机器学习方法：在线web应用程序部署。

背景：注意缺陷/多动障碍（ADHD）是一种以多动、冲动和注意力不集中为特征的普遍精神障碍。本研究旨在开发一种可验证且可解释的机器学习模型，利用临床评估量表数据识别儿童ADHD及其亚型。方法：本研究利用ADHD-200数据集，包括人口统计数据、行为评定量表和韦氏智力量表评估，来训练和验证我们的模型。模型的性能在内部数据集中使用10倍交叉验证进行评估，最佳模型将用于外部验证。对七个机器学习模型进行了评估。采用SHapley加性解释（SHAP）方法进行模型解释。最后，一个web应用程序将部署预测模型，根据用户输入提供ADHD概率。结果：随机森林（Random Forest， RF）模型对ADHD的识别效果最好，支持向量机（Support Vector Machine， SVM）模型对ADHD亚型的识别效果最好。RF模型10倍交叉验证的AUC为0.99，外部验证的AUC为0.99，支持向量机模型内部验证的微平均AUC为0.96，精度为0.83，外部验证的微平均AUC为0.96，精度为0.85。我们使用SHAP来解释模型，发现较高的ADHD指数将模型推向ADHD分类。此外，较低的智商得分与较高的多动症可能性相关，这与之前的研究一致。依赖分析发现，该模型可以识别不同的行为尺度。我们使用web应用程序在线部署最终模型，并向用户展示模型如何做出决策。结论：我们的研究结果强调了使用机器学习和临床评估量表来支持儿童ADHD的诊断和亚型识别的潜力，为提高临床诊断的准确性和效率提供了一种实用的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMC Psychiatry 医学-精神病学

CiteScore

5.90

自引率

4.50%

发文量

716

审稿时长

3-6 weeks

期刊介绍： BMC Psychiatry is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of psychiatric disorders, as well as related molecular genetics, pathophysiology, and epidemiology.