Nouf Alsaati, Chris Penney, Ingo Helbig, Kathleen E Sullivan
{"title":"A Predictive Model for Identification of Pediatric Individuals With Common Variable Immunodeficiency through Electronic Medical Records.","authors":"Nouf Alsaati, Chris Penney, Ingo Helbig, Kathleen E Sullivan","doi":"10.1016/j.jaci.2025.02.032","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Common variable immunodeficiency (CVID) is characterized by recurrent sinopulmonary infections. However, in the pediatric population, recurrent sinopulmonary infections early in life are common, which can render key clinical features of CVID less distinctive. Accordingly, the diagnosis of CVID is often delayed due to the heterogeneous nature of the presentation and the broad age of onset. A 10-year lag in diagnosis has been found for CVID, and a critical need is improved time-to-diagnosis.</p><p><strong>Objective: </strong>To utilize machine-learning techniques to identify a clinical signature of CVID in a pediatric population.</p><p><strong>Methods: </strong>Our selected cohort included 112 individuals with CVID and 627 controls. Controls were restricted from having other medical conditions associated with infection. A machine learning dataset was constructed by summing patient-level counts of clinical metrics. Three supervised machine learning classifiers were trained, tuned, and performance-tested. We validated our findings using a distinct control cohort with high medical complexity and tested a logistic regression approach.</p><p><strong>Results: </strong>Key features associated with CVID were chest X-ray count, antibiotic prescriptions, and number of common infections. Our Extreme Gradient Boosting model best predicted eventual CVID diagnosis with an F1 score of 0.77, with 21 of 29 CVID diagnoses classified correctly (false negative: 8), and 179 of 183 non-CVID patients correctly classified (false positive: 4), up to 10 years prior to the eventual clinical diagnosis. Key features with a robust association with pediatric CVID were the frequency of common infections and antibiotic prescriptions.</p><p><strong>Conclusion: </strong>In spite of a high frequency of infections in the comparator population, the clinical signature of pediatric CVID was sufficiently distinctive to enable early identification.</p>","PeriodicalId":14936,"journal":{"name":"Journal of Allergy and Clinical Immunology","volume":" ","pages":""},"PeriodicalIF":11.4000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Allergy and Clinical Immunology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jaci.2025.02.032","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ALLERGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Common variable immunodeficiency (CVID) is characterized by recurrent sinopulmonary infections. However, in the pediatric population, recurrent sinopulmonary infections early in life are common, which can render key clinical features of CVID less distinctive. Accordingly, the diagnosis of CVID is often delayed due to the heterogeneous nature of the presentation and the broad age of onset. A 10-year lag in diagnosis has been found for CVID, and a critical need is improved time-to-diagnosis.
Objective: To utilize machine-learning techniques to identify a clinical signature of CVID in a pediatric population.
Methods: Our selected cohort included 112 individuals with CVID and 627 controls. Controls were restricted from having other medical conditions associated with infection. A machine learning dataset was constructed by summing patient-level counts of clinical metrics. Three supervised machine learning classifiers were trained, tuned, and performance-tested. We validated our findings using a distinct control cohort with high medical complexity and tested a logistic regression approach.
Results: Key features associated with CVID were chest X-ray count, antibiotic prescriptions, and number of common infections. Our Extreme Gradient Boosting model best predicted eventual CVID diagnosis with an F1 score of 0.77, with 21 of 29 CVID diagnoses classified correctly (false negative: 8), and 179 of 183 non-CVID patients correctly classified (false positive: 4), up to 10 years prior to the eventual clinical diagnosis. Key features with a robust association with pediatric CVID were the frequency of common infections and antibiotic prescriptions.
Conclusion: In spite of a high frequency of infections in the comparator population, the clinical signature of pediatric CVID was sufficiently distinctive to enable early identification.
期刊介绍:
The Journal of Allergy and Clinical Immunology is a prestigious publication that features groundbreaking research in the fields of Allergy, Asthma, and Immunology. This influential journal publishes high-impact research papers that explore various topics, including asthma, food allergy, allergic rhinitis, atopic dermatitis, primary immune deficiencies, occupational and environmental allergy, and other allergic and immunologic diseases. The articles not only report on clinical trials and mechanistic studies but also provide insights into novel therapies, underlying mechanisms, and important discoveries that contribute to our understanding of these diseases. By sharing this valuable information, the journal aims to enhance the diagnosis and management of patients in the future.