Towards XAI agnostic explainability to assess differential diagnosis for Meningitis diseases

Aya Messai, Ahlem Drif, A. Ouyahia, Meriem Guechi, Mounira Rais, Lars Kaderali, Hocine Cherifi
{"title":"Towards XAI agnostic explainability to assess differential diagnosis for Meningitis diseases","authors":"Aya Messai, Ahlem Drif, A. Ouyahia, Meriem Guechi, Mounira Rais, Lars Kaderali, Hocine Cherifi","doi":"10.1088/2632-2153/ad4a1f","DOIUrl":null,"url":null,"abstract":"\n Meningitis, characterized by meninges and cerebrospinal fluid (CSF) inflammation, poses diagnostic challenges due to diverse clinical manifestations. This work introduces an explainable AI automatic medical decision methodology that determines critical features and their relevant values for the differential diagnosis of various meningitis cases. We proceed with knowledge acquisition to define the rules for this research. Currently, we have established the etiological diagnosis of Meningococcaemia, Meningococcal Meningitis, Tuberculous Meningitis, Aseptic Meningitis, Haemophilus influenzae Meningitis, and Pneumococcal Meningitis. The data preprocessing was conducted after collecting data from samples with meningitis diseases at Setif Hospital in Algeria. Tree-based ensemble methods were then applied to assess the model’s performance. Finally, we implement an XAI agnostic explainability approach based on the SHapley Additive exPlanations technique to attribute each feature’s contribution to the model’s output. Experiments were conducted on the collected dataset and the SINAN database, obtained from the Brazilian Government’s Health Information System on Notifiable Diseases, which comprises 6729 patients aged over 18 years. The Extreme Gradient Boosting model was chosen for its superior performance metrics (Accuracy: 0.90, AUROC: 0.94, and F1-score: 0.98). Setif’s hospital data revealed notable performance metrics (Accuracy: 0.7143, F1-Score: 0.7857). This study's findings showcase each feature's contribution to the model’s predictions and diagnosis. It also reveals critical biomarker ranges associated with distinct types of Meningitis. Significant diagnostic effect was found for Meningococcal Meningitis with elevated neutrophil levels (>40%) and balanced lymphocyte levels (40-60%). Tuberculous Meningitis demonstrated low neutrophil levels (<60%) and elevated lymphocyte levels (>60%). Haemophilus influenzae meningitis exhibited a predominance of neutrophils (>80%), while Aseptic meningitis showed lower neutrophil levels (<40%) and lymphocyte levels within the range of 50-60%. The majority of the AI automatic medical decision results are twinned with validation by our team of infectious disease experts, confirming the alignment of algorithmic diagnoses with clinical practices.","PeriodicalId":503691,"journal":{"name":"Machine Learning: Science and Technology","volume":" 38","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning: Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2632-2153/ad4a1f","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Meningitis, characterized by meninges and cerebrospinal fluid (CSF) inflammation, poses diagnostic challenges due to diverse clinical manifestations. This work introduces an explainable AI automatic medical decision methodology that determines critical features and their relevant values for the differential diagnosis of various meningitis cases. We proceed with knowledge acquisition to define the rules for this research. Currently, we have established the etiological diagnosis of Meningococcaemia, Meningococcal Meningitis, Tuberculous Meningitis, Aseptic Meningitis, Haemophilus influenzae Meningitis, and Pneumococcal Meningitis. The data preprocessing was conducted after collecting data from samples with meningitis diseases at Setif Hospital in Algeria. Tree-based ensemble methods were then applied to assess the model’s performance. Finally, we implement an XAI agnostic explainability approach based on the SHapley Additive exPlanations technique to attribute each feature’s contribution to the model’s output. Experiments were conducted on the collected dataset and the SINAN database, obtained from the Brazilian Government’s Health Information System on Notifiable Diseases, which comprises 6729 patients aged over 18 years. The Extreme Gradient Boosting model was chosen for its superior performance metrics (Accuracy: 0.90, AUROC: 0.94, and F1-score: 0.98). Setif’s hospital data revealed notable performance metrics (Accuracy: 0.7143, F1-Score: 0.7857). This study's findings showcase each feature's contribution to the model’s predictions and diagnosis. It also reveals critical biomarker ranges associated with distinct types of Meningitis. Significant diagnostic effect was found for Meningococcal Meningitis with elevated neutrophil levels (>40%) and balanced lymphocyte levels (40-60%). Tuberculous Meningitis demonstrated low neutrophil levels (<60%) and elevated lymphocyte levels (>60%). Haemophilus influenzae meningitis exhibited a predominance of neutrophils (>80%), while Aseptic meningitis showed lower neutrophil levels (<40%) and lymphocyte levels within the range of 50-60%. The majority of the AI automatic medical decision results are twinned with validation by our team of infectious disease experts, confirming the alignment of algorithmic diagnoses with clinical practices.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
实现 XAI 不可知论的可解释性,以评估脑膜炎疾病的鉴别诊断
脑膜炎以脑膜和脑脊液(CSF)炎症为特征,临床表现多种多样,给诊断带来了挑战。本作品介绍了一种可解释的人工智能自动医疗决策方法,它能确定各种脑膜炎病例鉴别诊断的关键特征及其相关值。我们从知识获取入手,为这项研究确定规则。目前,我们已经确定了脑膜炎球菌血症、脑膜炎球菌性脑膜炎、结核性脑膜炎、无菌性脑膜炎、流感嗜血杆菌性脑膜炎和肺炎球菌性脑膜炎的病因诊断。数据预处理是在收集阿尔及利亚塞提夫医院脑膜炎疾病样本数据后进行的。然后采用基于树的集合方法来评估模型的性能。最后,我们采用了基于 SHapley Additive exPlanations 技术的 XAI 不可知论可解释性方法,以确定每个特征对模型输出的贡献。实验是在收集的数据集和 SINAN 数据库上进行的,SINAN 数据库来自巴西政府的应申报疾病健康信息系统,其中包括 6729 名 18 岁以上的患者。极端梯度提升模型因其卓越的性能指标(准确率:0.90;AUROC:0.94;F1-score:0.98)而被选中。Setif 的医院数据显示了显著的性能指标(准确率:0.7143,F1-分数:0.7857)。这项研究的结果展示了每个特征对模型预测和诊断的贡献。它还揭示了与不同类型脑膜炎相关的关键生物标志物范围。中性粒细胞水平升高(>40%)和淋巴细胞水平平衡(40-60%)对脑膜炎球菌性脑膜炎有显著的诊断效果。结核性脑膜炎的中性粒细胞水平较低(60%)。流感嗜血杆菌脑膜炎显示中性粒细胞占主导地位(>80%),而无菌性脑膜炎显示中性粒细胞水平较低(<40%),淋巴细胞水平在 50-60% 之间。大多数人工智能自动医疗决策结果都经过了我们的传染病专家团队的验证,确认了算法诊断与临床实践的一致性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
On the Benefit of Attention in Inverse Design of Thin Films Filters Predictive Models for Inorganic Materials Thermoelectric Properties with Machine Learning Benchmarking machine learning interatomic potentials via phonon anharmonicity Application of Deep Learning-based Fuzzy Systems to Analyze the Overall Risk of Mortality in Glioblastoma Multiforme Formation Energy Prediction of Neutral Single-Atom Impurities in 2D Materials using Tree-based Machine Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1