基于宿主基因表达秩的集成机器学习算法对急性细菌和病毒感染的鲁棒诊断:多队列模型开发和验证研究。

IF 7.1 2区 医学 Q1 MEDICAL LABORATORY TECHNOLOGY Clinical chemistry Pub Date : 2025-01-21 DOI:10.1093/clinchem/hvae220
Yifei Shen,Dongsheng Han,Wenxin Qu,Fei Yu,Dan Zhang,Yifan Xu,Enhui Shen,Qinjie Chu,Michael P Timko,Longjiang Fan,Shufa Zheng,Yu Chen
{"title":"基于宿主基因表达秩的集成机器学习算法对急性细菌和病毒感染的鲁棒诊断:多队列模型开发和验证研究。","authors":"Yifei Shen,Dongsheng Han,Wenxin Qu,Fei Yu,Dan Zhang,Yifan Xu,Enhui Shen,Qinjie Chu,Michael P Timko,Longjiang Fan,Shufa Zheng,Yu Chen","doi":"10.1093/clinchem/hvae220","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nThe accurate and prompt diagnosis of infections is essential for improving patient outcomes and preventing bacterial drug resistance. Host gene expression profiling as an approach to infection diagnosis holds great potential in assisting early and accurate diagnosis of infection.\r\n\r\nMETHODS\r\nTo improve the precision of infection diagnosis, we developed InfectDiagno, a rank-based ensemble machine learning algorithm for infection diagnosis via host gene expression patterns. Eleven data sets were used as training data sets for the method development, and the InfectDiagno algorithm was optimized by multi-cohort training samples. Nine data sets were used as independent validation data sets for the method. We further validated the diagnostic capacity of InfectDiagno in a prospective clinical cohort.\r\n\r\nRESULTS\r\nAfter selecting 100 feature genes based on their gene expression ranks for infection prediction, we trained a classifier using both a noninfected-vs-infected area under the receiver-operating characteristic curve (area under the curve [AUC] 0.95 [95% CI, 0.93-0.97]) and a bacterial-vs-viral AUC 0.95 (95% CI, 0.93-0.97). We then used the noninfected/infected classifier together with the bacterial/viral classifier to build a discriminating infection diagnosis model. The sensitivity was 0.931 and 0.872, and specificity 0.963 and 0.929, for bacterial and viral infections, respectively. We then applied InfectDiagno to a prospective clinical cohort (n = 517), and found it classified 95% of the samples correctly.\r\n\r\nCONCLUSIONS\r\nOur study shows that the InfectDiagno algorithm is a powerful and robust tool to accurately identify infection in a real-world patient population, which has the potential to profoundly improve clinical care in the field of infection diagnosis.","PeriodicalId":10690,"journal":{"name":"Clinical chemistry","volume":"10 1","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust Diagnosis of Acute Bacterial and Viral Infections via Host Gene Expression Rank-Based Ensemble Machine Learning Algorithm: A Multi-Cohort Model Development and Validation Study.\",\"authors\":\"Yifei Shen,Dongsheng Han,Wenxin Qu,Fei Yu,Dan Zhang,Yifan Xu,Enhui Shen,Qinjie Chu,Michael P Timko,Longjiang Fan,Shufa Zheng,Yu Chen\",\"doi\":\"10.1093/clinchem/hvae220\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\r\\nThe accurate and prompt diagnosis of infections is essential for improving patient outcomes and preventing bacterial drug resistance. Host gene expression profiling as an approach to infection diagnosis holds great potential in assisting early and accurate diagnosis of infection.\\r\\n\\r\\nMETHODS\\r\\nTo improve the precision of infection diagnosis, we developed InfectDiagno, a rank-based ensemble machine learning algorithm for infection diagnosis via host gene expression patterns. Eleven data sets were used as training data sets for the method development, and the InfectDiagno algorithm was optimized by multi-cohort training samples. Nine data sets were used as independent validation data sets for the method. We further validated the diagnostic capacity of InfectDiagno in a prospective clinical cohort.\\r\\n\\r\\nRESULTS\\r\\nAfter selecting 100 feature genes based on their gene expression ranks for infection prediction, we trained a classifier using both a noninfected-vs-infected area under the receiver-operating characteristic curve (area under the curve [AUC] 0.95 [95% CI, 0.93-0.97]) and a bacterial-vs-viral AUC 0.95 (95% CI, 0.93-0.97). We then used the noninfected/infected classifier together with the bacterial/viral classifier to build a discriminating infection diagnosis model. The sensitivity was 0.931 and 0.872, and specificity 0.963 and 0.929, for bacterial and viral infections, respectively. We then applied InfectDiagno to a prospective clinical cohort (n = 517), and found it classified 95% of the samples correctly.\\r\\n\\r\\nCONCLUSIONS\\r\\nOur study shows that the InfectDiagno algorithm is a powerful and robust tool to accurately identify infection in a real-world patient population, which has the potential to profoundly improve clinical care in the field of infection diagnosis.\",\"PeriodicalId\":10690,\"journal\":{\"name\":\"Clinical chemistry\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1093/clinchem/hvae220\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/clinchem/hvae220","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景准确和及时的感染诊断对于改善患者预后和预防细菌耐药至关重要。宿主基因表达谱作为一种诊断感染的方法,在帮助早期和准确诊断感染方面具有很大的潜力。方法为了提高感染诊断的准确性,我们开发了一种基于秩的集成机器学习算法,用于通过宿主基因表达模式诊断感染。采用11个数据集作为训练数据集进行方法开发,并通过多队列训练样本对感染诊断算法进行优化。9个数据集作为该方法的独立验证数据集。我们在一项前瞻性临床队列研究中进一步验证了感染诊断的诊断能力。结果根据基因表达等级选择100个特征基因进行感染预测后,我们使用接收者操作特征曲线下的非感染vs感染区域(曲线下面积[AUC] 0.95 [95% CI, 0.93-0.97])和细菌vs病毒AUC 0.95 (95% CI, 0.93-0.97)训练分类器。然后,我们将非感染/感染分类器与细菌/病毒分类器结合使用,建立了鉴别感染诊断模型。细菌感染和病毒感染的敏感性分别为0.931和0.872,特异性分别为0.963和0.929。然后,我们将感染诊断应用于前瞻性临床队列(n = 517),发现它对95%的样本进行了正确分类。结论我们的研究表明,感染诊断算法是一种强大而稳健的工具,可以准确识别现实世界患者群体中的感染,具有深刻改善感染诊断领域临床护理的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Robust Diagnosis of Acute Bacterial and Viral Infections via Host Gene Expression Rank-Based Ensemble Machine Learning Algorithm: A Multi-Cohort Model Development and Validation Study.
BACKGROUND The accurate and prompt diagnosis of infections is essential for improving patient outcomes and preventing bacterial drug resistance. Host gene expression profiling as an approach to infection diagnosis holds great potential in assisting early and accurate diagnosis of infection. METHODS To improve the precision of infection diagnosis, we developed InfectDiagno, a rank-based ensemble machine learning algorithm for infection diagnosis via host gene expression patterns. Eleven data sets were used as training data sets for the method development, and the InfectDiagno algorithm was optimized by multi-cohort training samples. Nine data sets were used as independent validation data sets for the method. We further validated the diagnostic capacity of InfectDiagno in a prospective clinical cohort. RESULTS After selecting 100 feature genes based on their gene expression ranks for infection prediction, we trained a classifier using both a noninfected-vs-infected area under the receiver-operating characteristic curve (area under the curve [AUC] 0.95 [95% CI, 0.93-0.97]) and a bacterial-vs-viral AUC 0.95 (95% CI, 0.93-0.97). We then used the noninfected/infected classifier together with the bacterial/viral classifier to build a discriminating infection diagnosis model. The sensitivity was 0.931 and 0.872, and specificity 0.963 and 0.929, for bacterial and viral infections, respectively. We then applied InfectDiagno to a prospective clinical cohort (n = 517), and found it classified 95% of the samples correctly. CONCLUSIONS Our study shows that the InfectDiagno algorithm is a powerful and robust tool to accurately identify infection in a real-world patient population, which has the potential to profoundly improve clinical care in the field of infection diagnosis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Clinical chemistry
Clinical chemistry 医学-医学实验技术
CiteScore
11.30
自引率
4.30%
发文量
212
审稿时长
1.7 months
期刊介绍: Clinical Chemistry is a peer-reviewed scientific journal that is the premier publication for the science and practice of clinical laboratory medicine. It was established in 1955 and is associated with the Association for Diagnostics & Laboratory Medicine (ADLM). The journal focuses on laboratory diagnosis and management of patients, and has expanded to include other clinical laboratory disciplines such as genomics, hematology, microbiology, and toxicology. It also publishes articles relevant to clinical specialties including cardiology, endocrinology, gastroenterology, genetics, immunology, infectious diseases, maternal-fetal medicine, neurology, nutrition, oncology, and pediatrics. In addition to original research, editorials, and reviews, Clinical Chemistry features recurring sections such as clinical case studies, perspectives, podcasts, and Q&A articles. It has the highest impact factor among journals of clinical chemistry, laboratory medicine, pathology, analytical chemistry, transfusion medicine, and clinical microbiology. The journal is indexed in databases such as MEDLINE and Web of Science.
期刊最新文献
Reflecting on 70 Years of Clinical Chemistry. Robust Diagnosis of Acute Bacterial and Viral Infections via Host Gene Expression Rank-Based Ensemble Machine Learning Algorithm: A Multi-Cohort Model Development and Validation Study. How Can Digital PCR Support the Rapid Development of New Detection Tests in Future Pandemics? A Multianalyte Machine Learning Model to Detect Wrong Blood in Complete Blood Count Tube Errors in a Pediatric Setting Structural Variation Interpretation in the Genome Sequencing Era: Lessons from Cytogenetics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1