A predictive model for identification of pediatric individuals with common variable immunodeficiency through electronic medical records

IF 11.2 1区 医学 Q1 ALLERGY Journal of Allergy and Clinical Immunology Pub Date : 2025-07-01 Epub Date: 2025-03-07 DOI:10.1016/j.jaci.2025.02.032
Nouf Alsaati MD, MMSc , Chris Penney MSc , Ingo Helbig MD , Kathleen E. Sullivan MD, PhD
{"title":"A predictive model for identification of pediatric individuals with common variable immunodeficiency through electronic medical records","authors":"Nouf Alsaati MD, MMSc ,&nbsp;Chris Penney MSc ,&nbsp;Ingo Helbig MD ,&nbsp;Kathleen E. Sullivan MD, PhD","doi":"10.1016/j.jaci.2025.02.032","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Common variable immunodeficiency (CVID) is characterized by recurrent sinopulmonary infections. However, in the pediatric population, recurrent sinopulmonary infections early in life are common, which can render key clinical features of CVID less distinctive. Accordingly, the diagnosis of CVID is often delayed owing to the heterogeneous nature of the presentation and the broad range of ages of onset. A 10-year lag in diagnosis has been found for CVID, and there is a critical need for improved time to diagnosis.</div></div><div><h3>Objective</h3><div>Our aim was to utilize machine learning techniques to identify a clinical signature of CVID in a pediatric population.</div></div><div><h3>Methods</h3><div>Our selected cohort included 112 individuals with CVID and 627 controls. The controls were restricted from having other medical conditions associated with infection. A machine learning data set was constructed by summing patient-level counts of clinical metrics. A total of 3 supervised machine learning classifiers were trained, tuned, and performance-tested. We validated our findings using a distinct control cohort with high medical complexity and tested a logistic regression approach.</div></div><div><h3>Results</h3><div>Key features associated with CVID were chest radiograph count, number of antibiotic prescriptions, and number of common infections. Our Extreme Gradient Boosting (XGBoost) model best predicted eventual CVID diagnosis, with an F1 score of 0.77, a total of 21 of 29 CVID diagnoses classified correctly (8 false-negative results), and 179 of 183 patients without CVID correctly classified (4 false-positive results) up to 10 years before the eventual clinical diagnosis. Key features with a robust association with pediatric CVID were the frequency of common infections and antibiotic prescriptions.</div></div><div><h3>Conclusion</h3><div>In spite of a high frequency of infections in the comparator population, the clinical signature of pediatric CVID was sufficiently distinctive to enable early identification.</div></div>","PeriodicalId":14936,"journal":{"name":"Journal of Allergy and Clinical Immunology","volume":"156 1","pages":"Pages 186-194"},"PeriodicalIF":11.2000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Allergy and Clinical Immunology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0091674925002635","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ALLERGY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction

Common variable immunodeficiency (CVID) is characterized by recurrent sinopulmonary infections. However, in the pediatric population, recurrent sinopulmonary infections early in life are common, which can render key clinical features of CVID less distinctive. Accordingly, the diagnosis of CVID is often delayed owing to the heterogeneous nature of the presentation and the broad range of ages of onset. A 10-year lag in diagnosis has been found for CVID, and there is a critical need for improved time to diagnosis.

Objective

Our aim was to utilize machine learning techniques to identify a clinical signature of CVID in a pediatric population.

Methods

Our selected cohort included 112 individuals with CVID and 627 controls. The controls were restricted from having other medical conditions associated with infection. A machine learning data set was constructed by summing patient-level counts of clinical metrics. A total of 3 supervised machine learning classifiers were trained, tuned, and performance-tested. We validated our findings using a distinct control cohort with high medical complexity and tested a logistic regression approach.

Results

Key features associated with CVID were chest radiograph count, number of antibiotic prescriptions, and number of common infections. Our Extreme Gradient Boosting (XGBoost) model best predicted eventual CVID diagnosis, with an F1 score of 0.77, a total of 21 of 29 CVID diagnoses classified correctly (8 false-negative results), and 179 of 183 patients without CVID correctly classified (4 false-positive results) up to 10 years before the eventual clinical diagnosis. Key features with a robust association with pediatric CVID were the frequency of common infections and antibiotic prescriptions.

Conclusion

In spite of a high frequency of infections in the comparator population, the clinical signature of pediatric CVID was sufficiently distinctive to enable early identification.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过电子病历识别儿童共同可变免疫缺陷个体的预测模型
简介:常见的变异性免疫缺陷(CVID)以复发性肺感染为特征。然而,在儿童人群中,生命早期复发性肺感染很常见,这可能使CVID的关键临床特征不那么明显。因此,由于表现的异质性和发病的广泛年龄,CVID的诊断经常被延迟。CVID的诊断滞后10年,迫切需要改善诊断时间。目的:利用机器学习技术识别儿科人群CVID的临床特征。方法:我们选择的队列包括112名CVID患者和627名对照组。对照组被限制患有与感染相关的其他疾病。通过将临床指标的患者级计数相加,构建了机器学习数据集。三个监督机器学习分类器进行了训练、调优和性能测试。我们使用具有高度医疗复杂性的独特对照队列验证了我们的发现,并测试了逻辑回归方法。结果:与CVID相关的主要特征是胸片计数、抗生素处方和常见感染数量。我们的极端梯度增强模型预测最终CVID诊断的最佳F1评分为0.77,29例CVID诊断中有21例正确分类(假阴性:8),183例非CVID患者中有179例正确分类(假阳性:4),最长可达10年最终临床诊断。与儿科CVID密切相关的关键特征是常见感染的频率和抗生素处方。结论:尽管比较人群中感染的频率很高,但儿科CVID的临床特征足够独特,可以早期识别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
25.90
自引率
7.70%
发文量
1302
审稿时长
38 days
期刊介绍: The Journal of Allergy and Clinical Immunology is a prestigious publication that features groundbreaking research in the fields of Allergy, Asthma, and Immunology. This influential journal publishes high-impact research papers that explore various topics, including asthma, food allergy, allergic rhinitis, atopic dermatitis, primary immune deficiencies, occupational and environmental allergy, and other allergic and immunologic diseases. The articles not only report on clinical trials and mechanistic studies but also provide insights into novel therapies, underlying mechanisms, and important discoveries that contribute to our understanding of these diseases. By sharing this valuable information, the journal aims to enhance the diagnosis and management of patients in the future.
期刊最新文献
Context-specific genetic effects inform endotypes and treatment in asthma Reply Corrigendum Editorial Board Cover 1
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1