Prediction of early-onset bipolar using electronic health records

IF 7 1区 医学 Q1 PSYCHIATRY Journal of Child Psychology and Psychiatry Pub Date : 2025-02-18 DOI:10.1111/jcpp.14131
Bo Wang, Yi-Han Sheu, Hyunjoon Lee, Robert G. Mealer, Victor M. Castro, Jordan W. Smoller
{"title":"Prediction of early-onset bipolar using electronic health records","authors":"Bo Wang,&nbsp;Yi-Han Sheu,&nbsp;Hyunjoon Lee,&nbsp;Robert G. Mealer,&nbsp;Victor M. Castro,&nbsp;Jordan W. Smoller","doi":"10.1111/jcpp.14131","DOIUrl":null,"url":null,"abstract":"<div>\n \n <section>\n \n <h3> Background</h3>\n \n <p>Early identification of bipolar disorder (BD) provides an important opportunity for timely intervention. In this study, we aimed to develop machine learning models using large-scale electronic health record (EHR) data including clinical notes for predicting early-onset BD.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Structured and unstructured data were extracted from the longitudinal EHR of the Mass General Brigham health system. We defined three cohorts aged 10–25 years: (1) the full youth cohort (<i>N</i> = 300,398); (2) a subcohort defined by having a mental health visit (<i>N</i> = 105,461); and (3) a subcohort defined by having a diagnosis of mood disorder or ADHD (<i>N</i> = 35,213). By adopting a prospective landmark modeling approach that aligns with clinical practice, we developed and validated a range of machine learning models, across different cohorts and prediction windows.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>We found the two tree-based models, random forests (RF) and light gradient-boosting machine (LGBM), achieving good discriminative performance across different clinical settings (area under the receiver operating characteristic curve 0.76–0.88 for RF and 0.74–0.89 for LGBM). In addition, we showed comparable performance can be achieved with a greatly reduced set of features, demonstrating computational efficiency can be attained without significant compromise of model accuracy.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>Good discriminative performance for models predicting early-onset BD can be achieved utilizing large-scale EHR data. Our study offers a scalable and accurate method for identifying youth at risk for BD that could help inform clinical decision-making and facilitate early intervention. Future work includes evaluating the portability of our approach to other healthcare systems and exploring considerations regarding possible implementation.</p>\n </section>\n </div>","PeriodicalId":187,"journal":{"name":"Journal of Child Psychology and Psychiatry","volume":"66 8","pages":"1141-1154"},"PeriodicalIF":7.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Child Psychology and Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://acamh.onlinelibrary.wiley.com/doi/10.1111/jcpp.14131","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Early identification of bipolar disorder (BD) provides an important opportunity for timely intervention. In this study, we aimed to develop machine learning models using large-scale electronic health record (EHR) data including clinical notes for predicting early-onset BD.

Methods

Structured and unstructured data were extracted from the longitudinal EHR of the Mass General Brigham health system. We defined three cohorts aged 10–25 years: (1) the full youth cohort (N = 300,398); (2) a subcohort defined by having a mental health visit (N = 105,461); and (3) a subcohort defined by having a diagnosis of mood disorder or ADHD (N = 35,213). By adopting a prospective landmark modeling approach that aligns with clinical practice, we developed and validated a range of machine learning models, across different cohorts and prediction windows.

Results

We found the two tree-based models, random forests (RF) and light gradient-boosting machine (LGBM), achieving good discriminative performance across different clinical settings (area under the receiver operating characteristic curve 0.76–0.88 for RF and 0.74–0.89 for LGBM). In addition, we showed comparable performance can be achieved with a greatly reduced set of features, demonstrating computational efficiency can be attained without significant compromise of model accuracy.

Conclusions

Good discriminative performance for models predicting early-onset BD can be achieved utilizing large-scale EHR data. Our study offers a scalable and accurate method for identifying youth at risk for BD that could help inform clinical decision-making and facilitate early intervention. Future work includes evaluating the portability of our approach to other healthcare systems and exploring considerations regarding possible implementation.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用电子健康记录预测早发性双相情感障碍
背景:双相情感障碍(BD)的识别为及时干预提供了重要的机会。在这项研究中,我们旨在利用包括临床记录在内的大规模电子健康记录(EHR)数据开发机器学习模型,以预测早发性bd。方法从麻省总医院布里格姆卫生系统的纵向电子健康记录中提取结构化和非结构化数据。我们定义了三个10-25岁的队列:(1)全青年队列(N = 300,398);(2)以心理健康访问定义的亚队列(N = 105,461);(3)诊断为情绪障碍或ADHD的亚队列(N = 35213)。通过采用与临床实践相一致的前瞻性里程碑建模方法,我们开发并验证了一系列机器学习模型,涵盖不同的队列和预测窗口。结果我们发现两种基于树的模型,随机森林(RF)和光梯度增强机(LGBM),在不同的临床环境中具有良好的判别性能(RF和LGBM的受试者工作特征曲线下面积分别为0.76-0.88和0.74-0.89)。此外,我们还展示了使用大大减少的特征集可以实现相当的性能,这表明可以在不显著损害模型精度的情况下获得计算效率。结论利用大规模电子病历数据,预测早发性BD的模型具有良好的判别性能。我们的研究提供了一种可扩展且准确的方法来识别有双相障碍风险的年轻人,这有助于为临床决策提供信息并促进早期干预。未来的工作包括评估我们的方法对其他医疗保健系统的可移植性,并探索有关可能实施的考虑因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
13.80
自引率
5.30%
发文量
169
审稿时长
1 months
期刊介绍: The Journal of Child Psychology and Psychiatry (JCPP) is a highly regarded international publication that focuses on the fields of child and adolescent psychology and psychiatry. It is recognized for publishing top-tier, clinically relevant research across various disciplines related to these areas. JCPP has a broad global readership and covers a diverse range of topics, including: Epidemiology: Studies on the prevalence and distribution of mental health issues in children and adolescents. Diagnosis: Research on the identification and classification of childhood disorders. Treatments: Psychotherapeutic and psychopharmacological interventions for child and adolescent mental health. Behavior and Cognition: Studies on the behavioral and cognitive aspects of childhood disorders. Neuroscience and Neurobiology: Research on the neural and biological underpinnings of child mental health. Genetics: Genetic factors contributing to the development of childhood disorders. JCPP serves as a platform for integrating empirical research, clinical studies, and high-quality reviews from diverse perspectives, theoretical viewpoints, and disciplines. This interdisciplinary approach is a key feature of the journal, as it fosters a comprehensive understanding of child and adolescent mental health. The Journal of Child Psychology and Psychiatry is published 12 times a year and is affiliated with the Association for Child and Adolescent Mental Health (ACAMH), which supports the journal's mission to advance knowledge and practice in the field of child and adolescent mental health.
期刊最新文献
Effectiveness of the school-based internet intervention StresSOS for the prevention of mental health problems in young people: a randomized controlled trial as part of the ProHEAD consortium. Research Review: Measuring life impact of youth mental health difficulties: scoping umbrella review of 80 instruments. Violent and sexual victimisation and incident anxiety, mood and substance use disorders in childhood and adolescence: a co-sibling study. Children's oppositional defiant disorder and mother-child interpersonal brain synchrony: the role of maternal meta-emotion philosophy. Directionality of longitudinal associations between frontostriatal structural connectivity and depressive symptoms in adolescent girls.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1