首页 > 最新文献

Intelligence-based medicine最新文献

英文 中文
Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis 基于ct的卵巢肿瘤可靠诊断的混合视觉变换器和异常模型
Pub Date : 2025-01-01 Epub Date: 2025-02-18 DOI: 10.1016/j.ibmed.2025.100227
Eman Hussein Alshdaifat , Hasan Gharaibeh , Amer Mahmoud Sindiani , Rola Madain , Asma'a Mohammad Al-Mnayyis , Hamad Yahia Abu Mhanna , Rawan Eimad Almahmoud , Hanan Fawaz Akhdar , Mohammad Amin , Ahmad Nasayreh , Raneem Hamad
Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.
卵巢癌是一个主要的全球健康问题,其特点是死亡率高,缺乏准确的诊断方法。快速准确地检测卵巢癌对于改善患者预后和制定适当的治疗方案至关重要。医学影像学方法是鉴别卵巢癌的必要手段;然而,实现准确的诊断仍然是一个挑战。本文提出了一种强大的卵巢癌检测方法,包括使用Xception_ViT模型对良性和恶性肿瘤进行识别和分类。之所以选择这种混合方法,是因为它结合了传统的基于cnn的模型(如Xception)的优势和现代基于transformer的模型(如ViT)的能力。这种组合允许模型利用Xception,它从图像中提取特征。然后使用视觉转换器(Vision Transformer, ViT)模型来识别不同视觉元素之间的联系,增强模型对复杂组件的理解。最后将多层感知器(MLP)层与所提出的图像分类模型相结合。使用约旦阿卜杜拉国王大学医院(KAUH)的三个计算机断层扫描(CT)图像数据集评估该模型的有效性。第一个数据集包括卵巢癌计算机断层扫描数据集(KAUH-OCCTD),第二个数据集是良性卵巢肿瘤数据集(KAUH-BOTD),第三个数据集是恶性卵巢肿瘤数据集(KAUH-MOTD)。从500名妇女中收集的三个数据集以其卵巢肿瘤分类的多样性为特征,是约旦首次收集此类数据集。所提出的Xception_ViT模型在KAUH-OCCTD数据集上识别卵巢癌的准确率为98.09%,在KAUH-BOTD和KAUH-MOTD数据集上区分卵巢良恶性肿瘤的准确率分别为96.05%和98.73%。提出的模型在所有三个数据集上都优于预训练模型。结果表明,该模型能够对卵巢肿瘤进行分类。该方法还可以大大提高新手放射科医生评估卵巢恶性肿瘤的效率,并协助妇科医生为这些个体提供改进的治疗方案。
{"title":"Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis","authors":"Eman Hussein Alshdaifat ,&nbsp;Hasan Gharaibeh ,&nbsp;Amer Mahmoud Sindiani ,&nbsp;Rola Madain ,&nbsp;Asma'a Mohammad Al-Mnayyis ,&nbsp;Hamad Yahia Abu Mhanna ,&nbsp;Rawan Eimad Almahmoud ,&nbsp;Hanan Fawaz Akhdar ,&nbsp;Mohammad Amin ,&nbsp;Ahmad Nasayreh ,&nbsp;Raneem Hamad","doi":"10.1016/j.ibmed.2025.100227","DOIUrl":"10.1016/j.ibmed.2025.100227","url":null,"abstract":"<div><div>Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100227"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes 使用pca增强的XGBoost和SMOTE-ENN预测孕产妇健康风险,以改善医疗保健结果
Pub Date : 2025-01-01 Epub Date: 2025-10-03 DOI: 10.1016/j.ibmed.2025.100300
Rahmatul Kabir Rasel Sarker , Sadman Hafij , Md Adib Yasir , Md Assaduzzaman , Md Monir Hossain Shimul , Md Kamrul Hossain

Background

Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.

Methods

The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.

Results

The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.

Conclusion

This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.
产妇保健仍然是全球优先事项,特别是在资源匮乏的环境中,及时识别风险至关重要。传统的机器学习模型通常存在泛化能力差、数据不平衡和计算效率低下的问题。本研究提出了一个增强的预测模型,结合SMOTE-ENN数据平衡和主成分分析(PCA)与XGBoost,利用最小的、易于收集的临床特征来提高孕产妇风险分类的准确性。方法从公共信息库中获取1014份孕产妇健康记录,包括7项生理特征。预处理包括使用SMOTE-ENN进行标准化、标签编码和类平衡。采用主成分分析法进行降维,提高计算性能,减少过拟合。对决策树、随机森林、LightGBM、梯度增强和支持向量机等几种机器学习分类器进行了评估,最终选择XGBoost作为最终模型。性能指标包括准确性、精密度、召回率、f1评分、ROC-AUC和10倍交叉验证。结果pca增强的XGBoost模型具有最高的准确率(97.73%)、精密度(98%)、召回率(98%)和f1评分(98%)。它优于所有其他模型,特别是在识别高风险病例时,以最小的假阴性。交叉验证证实了模型的稳健性(平均准确率为98.39%),所有类别的ROC-AUC得分均超过0.998,表明分类性能接近完美。结论:本研究验证了一种产妇健康风险预测模型,该模型可扩展用于资源受限环境,并可在所选降维方法的限制下解释。它的简单,高精度和可推广性使其成为早期临床决策和干预的有前途的工具。
{"title":"Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes","authors":"Rahmatul Kabir Rasel Sarker ,&nbsp;Sadman Hafij ,&nbsp;Md Adib Yasir ,&nbsp;Md Assaduzzaman ,&nbsp;Md Monir Hossain Shimul ,&nbsp;Md Kamrul Hossain","doi":"10.1016/j.ibmed.2025.100300","DOIUrl":"10.1016/j.ibmed.2025.100300","url":null,"abstract":"<div><h3>Background</h3><div>Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.</div></div><div><h3>Methods</h3><div>The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.</div></div><div><h3>Results</h3><div>The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.</div></div><div><h3>Conclusion</h3><div>This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the intersection of cochlear implants and artificial intelligence: A mixed-method systematic and scoping review 探索人工耳蜗与人工智能的交叉:一种混合方法的系统和范围综述
Pub Date : 2025-01-01 Epub Date: 2025-09-24 DOI: 10.1016/j.ibmed.2025.100296
Aurenzo Gonçalves Mocelin , Pedro Angelo Basei de Paula , Daniel Tiepolo Kochinski , Thayná Cristina Wiezbicki , Rogério de Azevedo Hamerschmidt , Mayara Risnei Watanabe , Rogério Hamerschmidt

Objective

This study systematically evaluates the role of artificial intelligence (AI) in cochlear implant (CI) technology, focusing on speech enhancement, automated fitting, AI-assisted surgery, predictive modeling, and rehabilitation. The review identifies key advancements, existing limitations, and areas for future development.

Methods

Following PRISMA guidelines, we conducted a systematic search across PubMed, IEEE Xplore, Scopus, ScienceDirect, and Embase. We included peer-reviewed primary data studies on AI applications in CIs. The selected studies were categorized into thematic subdomains, such as noise suppression, adaptive programming, AI-driven surgical planning, and telemedicine applications.

Results

From an initial pool of 743 records, 129 studies met the eligibility criteria and were included in the final analysis. These studies were categorized into eleven thematic subdomains. The review identified the main application areas and emerging research fronts at the intersection of artificial intelligence and cochlear implant technologies, including speech enhancement, automated fitting, predictive modeling, rehabilitation support, and AI-assisted surgery.

Discussion and conclusion

AI is transforming CI technology by improving speech perception, personalization, and surgical precision. However, challenges persist, including computational constraints, data heterogeneity, and the need for large-scale clinical validation. Future research should prioritize energy-efficient AI architectures, regulatory approval pathways, and ethical considerations in automated decision-making. Advancing AI-driven telemedicine solutions can expand CI accessibility, reducing the need for in-person programming. Addressing these challenges will accelerate the development of more adaptive and user-centered CI solutions, ultimately enhancing auditory rehabilitation and quality of life for CI users.
目的系统评估人工智能(AI)在人工耳蜗(CI)技术中的作用,重点关注语音增强、自动验配、人工智能辅助手术、预测建模和康复。该审查确定了主要进展、现有限制和未来发展的领域。方法遵循PRISMA指南,我们在PubMed、IEEE explore、Scopus、ScienceDirect和Embase中进行了系统搜索。我们纳入了人工智能在ci中的应用的同行评议的原始数据研究。选定的研究被分类为主题子领域,如噪声抑制、自适应编程、人工智能驱动的手术计划和远程医疗应用。结果从最初的743份记录中,有129项研究符合资格标准,并被纳入最终分析。这些研究分为11个主题子领域。该综述确定了人工智能和人工耳蜗技术交叉的主要应用领域和新兴研究前沿,包括语音增强、自动装配、预测建模、康复支持和人工智能辅助手术。人工智能正在通过提高语音感知、个性化和手术精度来改变CI技术。然而,挑战依然存在,包括计算限制、数据异质性和大规模临床验证的需要。未来的研究应优先考虑节能的人工智能架构、监管审批途径和自动化决策中的道德考虑。推进人工智能驱动的远程医疗解决方案可以扩大CI的可访问性,减少对亲自编程的需求。解决这些挑战将加速开发更具适应性和以用户为中心的CI解决方案,最终提高CI用户的听觉康复和生活质量。
{"title":"Exploring the intersection of cochlear implants and artificial intelligence: A mixed-method systematic and scoping review","authors":"Aurenzo Gonçalves Mocelin ,&nbsp;Pedro Angelo Basei de Paula ,&nbsp;Daniel Tiepolo Kochinski ,&nbsp;Thayná Cristina Wiezbicki ,&nbsp;Rogério de Azevedo Hamerschmidt ,&nbsp;Mayara Risnei Watanabe ,&nbsp;Rogério Hamerschmidt","doi":"10.1016/j.ibmed.2025.100296","DOIUrl":"10.1016/j.ibmed.2025.100296","url":null,"abstract":"<div><h3>Objective</h3><div>This study systematically evaluates the role of artificial intelligence (AI) in cochlear implant (CI) technology, focusing on speech enhancement, automated fitting, AI-assisted surgery, predictive modeling, and rehabilitation. The review identifies key advancements, existing limitations, and areas for future development.</div></div><div><h3>Methods</h3><div>Following PRISMA guidelines, we conducted a systematic search across PubMed, IEEE Xplore, Scopus, ScienceDirect, and Embase. We included peer-reviewed primary data studies on AI applications in CIs. The selected studies were categorized into thematic subdomains, such as noise suppression, adaptive programming, AI-driven surgical planning, and telemedicine applications.</div></div><div><h3>Results</h3><div>From an initial pool of 743 records, 129 studies met the eligibility criteria and were included in the final analysis. These studies were categorized into eleven thematic subdomains. The review identified the main application areas and emerging research fronts at the intersection of artificial intelligence and cochlear implant technologies, including speech enhancement, automated fitting, predictive modeling, rehabilitation support, and AI-assisted surgery.</div></div><div><h3>Discussion and conclusion</h3><div>AI is transforming CI technology by improving speech perception, personalization, and surgical precision. However, challenges persist, including computational constraints, data heterogeneity, and the need for large-scale clinical validation. Future research should prioritize energy-efficient AI architectures, regulatory approval pathways, and ethical considerations in automated decision-making. Advancing AI-driven telemedicine solutions can expand CI accessibility, reducing the need for in-person programming. Addressing these challenges will accelerate the development of more adaptive and user-centered CI solutions, ultimately enhancing auditory rehabilitation and quality of life for CI users.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100296"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145264862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expression of concern for DieT Transformer model with PCA-ADE integration for advanced multi-class brain tumor classification by Mohammad Amin, Khalid M.O. Nahar, et al. [Intell.-Base Med. 11, (2025), 100192, https://doi.org/10.1016/j.ibmed.2024.100192] Mohammad Amin, Khalid M.O. Nahar等人对PCA-ADE集成的DieT Transformer模型在晚期多级别脑肿瘤分类中的关注表达[intel]。-基础医学,(2025),100192,https://doi.org/10.1016/j.ibmed.2024.100192]
Pub Date : 2025-01-01
{"title":"Expression of concern for DieT Transformer model with PCA-ADE integration for advanced multi-class brain tumor classification by Mohammad Amin, Khalid M.O. Nahar, et al. [Intell.-Base Med. 11, (2025), 100192, https://doi.org/10.1016/j.ibmed.2024.100192]","authors":"","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100279"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145733186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting pediatric emergency department arrivals: Evaluating the role of exogenous variables using deep learning models 预测儿科急诊科到达:使用深度学习模型评估外生变量的作用
Pub Date : 2025-01-01 Epub Date: 2025-11-12 DOI: 10.1016/j.ibmed.2025.100313
Egbe-Etu Etu , Jordan Larot , Kindness Etu , Joshua Emakhu , Sara Masoud , Imokhai Tenebe , Gaojian Huang , Satheesh Gunaga , Joseph Miller

Background

Forecasting pediatric emergency department (ED) demand remains a critical challenge in healthcare operations. This study aimed to identify exogenous variables influencing pediatric ED visits and evaluate the performance of different forecasting models.

Method

Using a retrospective observational design, we analyzed 192,347 pediatric ED visits across nine hospitals in Southeast Michigan between 2017 and 2019. Patient data were aggregated into daily arrival counts and enriched with exogenous variables such as weather, air quality, pollen, calendar, Google search trends, and chief complaints. Feature selection was performed using XGBoost and SHapley Additive exPlanations to identify the most influential predictors. Three forecasting models were developed: a Naïve baseline, Long Short-Term Memory (LSTM), and an attention-based neural network. The models were evaluated across 1-day, 7-day, and 14-day forecasting horizons using mean absolute percentage error (MAPE) and R2 metrics.

Results

LSTM and attention-based model significantly outperformed the Naïve baseline across all horizons. The LSTM model incorporating calendar data achieved the best 1-day forecast (MAPE: 8.71 %, R2: 0.67). For 7-day forecasts, the attention-based model using chief complaint data performed best (MAPE: 9.18 %, R2: 0.57). At 14 days, the attention-based model without exogenous inputs outperformed most LSTM variants, reflecting superior performance in long-range forecasting. Among exogenous variables, calendar and chief complaint data added the most predictive value, while Google Trends and pollen data introduced noise and diminished model performance.

Conclusion

Combining deep learning architectures with selected external data improves pediatric ED arrival forecasting. From an operational perspective, such forecasts can support more efficient staffing, reduce wait times, and mitigate ED crowding.
背景预测儿科急诊科(ED)的需求仍然是医疗保健业务的关键挑战。本研究旨在确定影响儿科急诊科就诊的外生变量,并评估不同预测模型的性能。方法采用回顾性观察设计,分析2017年至2019年密歇根州东南部9家医院的192,347例儿科急诊科就诊情况。患者数据汇总为每日到达计数,并丰富了外生变量,如天气、空气质量、花粉、日历、谷歌搜索趋势和主诉。使用XGBoost和SHapley加性解释进行特征选择,以确定最具影响力的预测因子。开发了三种预测模型:Naïve基线,长短期记忆(LSTM)和基于注意的神经网络。使用平均绝对百分比误差(MAPE)和R2指标对模型进行1天、7天和14天的预测期评估。结果slstm和基于注意力的模型在所有视界上都显著优于Naïve基线。结合日历数据的LSTM模型获得了最好的1天预测(MAPE: 8.71%, R2: 0.67)。对于7天的预测,使用主诉数据的基于注意力的模型表现最好(MAPE: 9.18%, R2: 0.57)。在第14天,没有外源输入的基于注意力的模型优于大多数LSTM变体,反映出在长期预测方面的优越性能。在外源变量中,日历和主诉数据的预测价值最高,而谷歌趋势和花粉数据引入了噪声,降低了模型的性能。结论将深度学习架构与选定的外部数据相结合可以提高儿科急诊科的到来预测。从操作的角度来看,这样的预测可以支持更有效的人员配置,减少等待时间,并缓解急诊科拥挤。
{"title":"Forecasting pediatric emergency department arrivals: Evaluating the role of exogenous variables using deep learning models","authors":"Egbe-Etu Etu ,&nbsp;Jordan Larot ,&nbsp;Kindness Etu ,&nbsp;Joshua Emakhu ,&nbsp;Sara Masoud ,&nbsp;Imokhai Tenebe ,&nbsp;Gaojian Huang ,&nbsp;Satheesh Gunaga ,&nbsp;Joseph Miller","doi":"10.1016/j.ibmed.2025.100313","DOIUrl":"10.1016/j.ibmed.2025.100313","url":null,"abstract":"<div><h3>Background</h3><div>Forecasting pediatric emergency department (ED) demand remains a critical challenge in healthcare operations. This study aimed to identify exogenous variables influencing pediatric ED visits and evaluate the performance of different forecasting models.</div></div><div><h3>Method</h3><div>Using a retrospective observational design, we analyzed 192,347 pediatric ED visits across nine hospitals in Southeast Michigan between 2017 and 2019. Patient data were aggregated into daily arrival counts and enriched with exogenous variables such as weather, air quality, pollen, calendar, Google search trends, and chief complaints. Feature selection was performed using XGBoost and SHapley Additive exPlanations to identify the most influential predictors. Three forecasting models were developed: a Naïve baseline, Long Short-Term Memory (LSTM), and an attention-based neural network. The models were evaluated across 1-day, 7-day, and 14-day forecasting horizons using mean absolute percentage error (MAPE) and R<sup>2</sup> metrics.</div></div><div><h3>Results</h3><div>LSTM and attention-based model significantly outperformed the Naïve baseline across all horizons. The LSTM model incorporating calendar data achieved the best 1-day forecast (MAPE: 8.71 %, R<sup>2</sup>: 0.67). For 7-day forecasts, the attention-based model using chief complaint data performed best (MAPE: 9.18 %, R<sup>2</sup>: 0.57). At 14 days, the attention-based model without exogenous inputs outperformed most LSTM variants, reflecting superior performance in long-range forecasting. Among exogenous variables, calendar and chief complaint data added the most predictive value, while Google Trends and pollen data introduced noise and diminished model performance.</div></div><div><h3>Conclusion</h3><div>Combining deep learning architectures with selected external data improves pediatric ED arrival forecasting. From an operational perspective, such forecasts can support more efficient staffing, reduce wait times, and mitigate ED crowding.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100313"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145519506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting the prevalence of cardiovascular diseases using machine learning algorithms 使用机器学习算法预测心血管疾病的患病率
Pub Date : 2025-01-01 Epub Date: 2025-01-20 DOI: 10.1016/j.ibmed.2025.100199
Bernada E. Sianga , Maurice C. Mbago , Amina S. Msengwa
Cardiovascular Diseases (CVDs) are the major cause of morbidity, disability, and mortality worldwide and are the most life-threatening diseases. Early detection and appropriate action can significantly reduce the effects and complications of CVD. Prediction of the likelihood that an individual can develop CVD adverse outcomes is essential. Machine learning methods are used to predict the risk of CVD incidences. Optimal model parameters were obtained using the grid search and randomized search methods. A hyperparameter tuning method with the highest accuracy was used to find the optimal parameters for the six algorithms used in this study. Two experiments were deployed: the first was training and testing the CVD dataset using hyperparameterized ML algorithms excluding geographical features, and the second included geographical features. The geographical features are air humidity, temperature and education status of a location. The performances of the two experiments were compared using classification metrics. The findings revealed that the performance of the second experiment outperformed the first experiment. XGBoost achieved the highest accuracy of 95.24 %, followed by the decision tree 93.87 % and support vector machine 92.87 % when geographical features were included (second experiment). Including geographical risk factors in predicting CVD is crucial as they contribute to the probability of developing CVD incidences.
心血管疾病(cvd)是世界范围内发病、残疾和死亡的主要原因,也是最危及生命的疾病。早期发现和适当的行动可以显著减少心血管疾病的影响和并发症。预测个体发生心血管疾病不良后果的可能性至关重要。机器学习方法用于预测心血管疾病发病率的风险。采用网格搜索和随机搜索方法获得最优模型参数。采用精度最高的超参数整定方法对六种算法进行了参数优化。部署了两个实验:第一个是使用排除地理特征的超参数化ML算法训练和测试CVD数据集,第二个是包含地理特征。地理特征是指一个地点的空气湿度、温度和教育状况。使用分类指标对两个实验的性能进行比较。结果显示,第二个实验的表现优于第一个实验。在包含地理特征时,XGBoost的准确率最高,为95.24%,其次是决策树(93.87%)和支持向量机(92.87%)(第二次实验)。在预测心血管疾病时包括地理危险因素是至关重要的,因为它们有助于心血管疾病发生的概率。
{"title":"Predicting the prevalence of cardiovascular diseases using machine learning algorithms","authors":"Bernada E. Sianga ,&nbsp;Maurice C. Mbago ,&nbsp;Amina S. Msengwa","doi":"10.1016/j.ibmed.2025.100199","DOIUrl":"10.1016/j.ibmed.2025.100199","url":null,"abstract":"<div><div>Cardiovascular Diseases (CVDs) are the major cause of morbidity, disability, and mortality worldwide and are the most life-threatening diseases. Early detection and appropriate action can significantly reduce the effects and complications of CVD. Prediction of the likelihood that an individual can develop CVD adverse outcomes is essential. Machine learning methods are used to predict the risk of CVD incidences. Optimal model parameters were obtained using the grid search and randomized search methods. A hyperparameter tuning method with the highest accuracy was used to find the optimal parameters for the six algorithms used in this study. Two experiments were deployed: the first was training and testing the CVD dataset using hyperparameterized ML algorithms excluding geographical features, and the second included geographical features. The geographical features are air humidity, temperature and education status of a location. The performances of the two experiments were compared using classification metrics. The findings revealed that the performance of the second experiment outperformed the first experiment. XGBoost achieved the highest accuracy of 95.24 %, followed by the decision tree 93.87 % and support vector machine 92.87 % when geographical features were included (second experiment). Including geographical risk factors in predicting CVD is crucial as they contribute to the probability of developing CVD incidences.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100199"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital 基于Mbarara地区转诊医院电子病历数据的新生儿败血症预测算法
Pub Date : 2025-01-01 Epub Date: 2025-01-07 DOI: 10.1016/j.ibmed.2025.100198
Peace Ezeobi Dennis , Angella Musiimenta , William Wasswa , Stella Kyoyagala

Introduction

Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.

Methods

We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.

Results

The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.

Conclusions

The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.
新生儿败血症是一项全球性挑战,对新生儿发病率和死亡率有重要影响。目前的诊断方法依赖于传统的培养方法,这一过程需要时间,并导致及时做出治疗决定的延误。本研究提出了一种利用Mbarara地区转诊医院(MRRH)电子病历(EMR)数据的机器学习算法,以提高新生儿败血症的早期发现和治疗。方法对2015年10月至2019年9月期间在MRRH新生儿重症监护病房(NICU)住院至少48小时并接受至少一次脓毒症评估的新生儿数据集进行回顾性研究。482例符合纳入标准的新生儿记录,数据集包括38个新生儿败血症筛查参数。该研究考虑了脓毒症评估的两种结果:如果血液培养呈阳性,则培养呈阳性;如果培养呈阴性,但使用抗生素至少120小时,则临床呈阳性。我们实施了k-fold交叉验证,k设置为10,以保证模型的稳健训练和测试。七个机器学习模型被训练来将输入分类为脓毒症阳性或阴性,并将它们的表现与医生的诊断进行比较。结果本研究结果表明,结合产妇危险因素、新生儿临床体征和实验室检查(该算法的灵敏度和特异性至少为95%)提出的算法优于医生诊断(灵敏度= 89%,特异性= 11%)。采用径向基函数、多项式核的SVM模型和AUROC最高达98%的DT模型均优于其他模型。结论结合产妇危险因素、新生儿临床体征和实验室检查,有助于提高对新生儿脓毒症的预测。进一步的研究需要在前瞻性试验中评估潜在的性能改善和临床疗效。
{"title":"A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital","authors":"Peace Ezeobi Dennis ,&nbsp;Angella Musiimenta ,&nbsp;William Wasswa ,&nbsp;Stella Kyoyagala","doi":"10.1016/j.ibmed.2025.100198","DOIUrl":"10.1016/j.ibmed.2025.100198","url":null,"abstract":"<div><h3>Introduction</h3><div>Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.</div></div><div><h3>Methods</h3><div>We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.</div></div><div><h3>Results</h3><div>The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.</div></div><div><h3>Conclusions</h3><div>The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100198"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based approach to diagnose lung cancer using CT-scan images 基于深度学习的ct扫描图像肺癌诊断方法
Pub Date : 2025-01-01 Epub Date: 2024-12-12 DOI: 10.1016/j.ibmed.2024.100188
Mohammad Q. Shatnawi, Qusai Abuein, Romesaa Al-Quraan
The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.
本研究的工作重点是使用计算机断层扫描(CT)自动分类和预测肺癌,采用深度学习(DL)策略,特别是增强型卷积神经网络(cnn),以实现快速准确的图像分析。本研究设计并开发了包括ConvNeXtSmall、VGG16、ResNet50、InceptionV3和EfficientNetB0在内的预训练模型,用于肺癌分类。数据集分为4类,包括腺癌图像338张,大细胞癌图像187张,鳞状细胞癌图像260张,正常图像215张。值得注意的是,增强的CNN模型实现了前所未有的100%的测试精度,优于所有其他模型,包括ConvNeXt为87%,VGG16为99%,ResNet50为94.5%,InceptionV3为76.9%,EfficientNetB0为97.9%。这项研究被认为是第一个使用增强型CNN达到100%测试准确率的研究,通过应用复杂的图像增强技术和创新的模型架构,展示了肺癌检测方面的重大进步。这突出了增强CNN模型在改变肺癌诊断方面的潜力,并强调了将先进的图像处理技术整合到临床实践中的重要性。
{"title":"Deep learning-based approach to diagnose lung cancer using CT-scan images","authors":"Mohammad Q. Shatnawi,&nbsp;Qusai Abuein,&nbsp;Romesaa Al-Quraan","doi":"10.1016/j.ibmed.2024.100188","DOIUrl":"10.1016/j.ibmed.2024.100188","url":null,"abstract":"<div><div>The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100188"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-driven graph-based machine learning for non-invasive diagnosis of NAFLD 基于注意力驱动图的机器学习在非侵入性NAFLD诊断中的应用
Pub Date : 2025-01-01 Epub Date: 2025-08-21 DOI: 10.1016/j.ibmed.2025.100288
Ekta Srivastava , Sarath Mohan , Tapan Kumar Gandhi , Ashok Kumar Choudhury , Sandeep Kumar
An estimated 25%–30% of the global population is affected by non-alcoholic fatty liver disease (NAFLD), a silent yet progressive condition that can advance from simple steatosis to severe stages like non-alcoholic steatohepatitis (NASH), fibrosis, and cirrhosis, significantly heightening the risk of liver cancer. Currently, the gold-standard method for staging NAFLD is liver biopsy, an invasive procedure with risks such as bleeding, infection, and sampling error. Due to its high cost and impracticality for routine monitoring, there is a critical need for reliable, non-invasive diagnostic tools capable of effectively identifying NAFLD stages. We developed a graph-based framework in which each patient is represented as a node in a similarity network. Edges are formed via k-nearest neighbors (KNN) on standardized clinical and biochemical features, with missing values imputed by KNN to preserve biologically plausible variability. A two-layer Graph Attention Network (GAT) then learns edge-specific attention weights to focus on the most informative inter-patient relationships. Tested on a proprietary ILBS cohort (n = 622), our model achieved 75.2% accuracy (AUC = 0.768; F1 = 0.752), an 11% absolute improvement over Support Vector Machines and Random Forests, and demonstrated robustness in 10-fold cross-validation and adversarial noise tests. On a separate public dataset (n = 80) spanning lipidomic, glycomic, fatty acid, and hormone panels, it exceeded 99% accuracy (AUC > 0.99). Attention-based explanations further highlighted key patient similarities driving each prediction. These findings suggest that attention-driven graph learning can clearly improve non-invasive NAFLD staging, enabling early detection and supporting personalized disease monitoring in diverse clinical settings.
据估计,全球25%-30%的人口受到非酒精性脂肪性肝病(NAFLD)的影响,这是一种沉默但进展的疾病,可从单纯的脂肪变性发展到严重阶段,如非酒精性脂肪性肝炎(NASH)、纤维化和肝硬化,显著增加了肝癌的风险。目前,NAFLD分期的金标准方法是肝活检,这是一种侵入性手术,存在出血、感染和抽样错误等风险。由于其高成本和常规监测的不实用性,迫切需要能够有效识别NAFLD分期的可靠、非侵入性诊断工具。我们开发了一个基于图形的框架,其中每个患者都表示为相似网络中的节点。边缘是通过标准化临床和生化特征的k近邻(KNN)形成的,缺失值由KNN输入以保持生物学上合理的可变性。然后,两层图注意网络(GAT)学习边缘特定注意权重,以关注最具信息量的患者间关系。在专有的ILBS队列(n = 622)上进行测试,我们的模型达到了75.2%的准确率(AUC = 0.768; F1 = 0.752),比支持向量机和随机森林提高了11%,并在10倍交叉验证和对抗噪声测试中显示出鲁棒性。在一个独立的公共数据集(n = 80)上,包括脂质组、糖糖组、脂肪酸组和激素组,准确率超过99% (AUC > 0.99)。基于注意力的解释进一步强调了驱动每种预测的关键患者相似性。这些发现表明,注意力驱动的图学习可以明显改善非侵入性NAFLD的分期,使早期发现成为可能,并在不同的临床环境中支持个性化的疾病监测。
{"title":"Attention-driven graph-based machine learning for non-invasive diagnosis of NAFLD","authors":"Ekta Srivastava ,&nbsp;Sarath Mohan ,&nbsp;Tapan Kumar Gandhi ,&nbsp;Ashok Kumar Choudhury ,&nbsp;Sandeep Kumar","doi":"10.1016/j.ibmed.2025.100288","DOIUrl":"10.1016/j.ibmed.2025.100288","url":null,"abstract":"<div><div>An estimated 25%–30% of the global population is affected by non-alcoholic fatty liver disease (NAFLD), a silent yet progressive condition that can advance from simple steatosis to severe stages like non-alcoholic steatohepatitis (NASH), fibrosis, and cirrhosis, significantly heightening the risk of liver cancer. Currently, the gold-standard method for staging NAFLD is liver biopsy, an invasive procedure with risks such as bleeding, infection, and sampling error. Due to its high cost and impracticality for routine monitoring, there is a critical need for reliable, non-invasive diagnostic tools capable of effectively identifying NAFLD stages. We developed a graph-based framework in which each patient is represented as a node in a similarity network. Edges are formed via k-nearest neighbors (KNN) on standardized clinical and biochemical features, with missing values imputed by KNN to preserve biologically plausible variability. A two-layer Graph Attention Network (GAT) then learns edge-specific attention weights to focus on the most informative inter-patient relationships. Tested on a proprietary ILBS cohort (n = 622), our model achieved 75.2% accuracy (AUC = 0.768; F1 = 0.752), an 11% absolute improvement over Support Vector Machines and Random Forests, and demonstrated robustness in 10-fold cross-validation and adversarial noise tests. On a separate public dataset (n = 80) spanning lipidomic, glycomic, fatty acid, and hormone panels, it exceeded 99% accuracy (AUC <span><math><mo>&gt;</mo></math></span> 0.99). Attention-based explanations further highlighted key patient similarities driving each prediction. These findings suggest that attention-driven graph learning can clearly improve non-invasive NAFLD staging, enabling early detection and supporting personalized disease monitoring in diverse clinical settings.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100288"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144912336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-aware and interpretable deep learning framework for dental caries classification 隐私感知和可解释的龋齿分类深度学习框架
Pub Date : 2025-01-01 Epub Date: 2025-08-25 DOI: 10.1016/j.ibmed.2025.100294
Jashvant Kumar , Khaled Mohamad Almustafa , Rand Madanat , Akhilesh Kumar Sharma , Muhammed Sutcu , Juliano Katrib
Dental caries remains one of the most prevalent and persistent chronic diseases globally, affecting individuals across all age groups and posing a significant burden on public health systems. Early detection is critical to prevent the progression of tooth decay, reduce treatment complexity, and improve long-term oral health outcomes. In response to these clinical demands, this study presents a comprehensive, privacy-aware, and interpretable deep learning framework for the automated classification of dental caries from X-ray images. The approach addresses the issues of class imbalance, low Resolution image and privacy preserved patient's medical images.The framework is structured into three progressive phases that incorporate supervised learning through Convolutional Neural Networks (CNN), ResNet-18, and DenseNet; unsupervised clustering using Principal Component Analysis (PCA); and a decentralized federated learning strategy to ensure secure model training across distributed datasets. The experimental dataset consists of 957 labelled dental radiographs, including 174 healthy and 783 carious cases, emphasizing the issue of class imbalance. Initial baseline models achieved an accuracy of 84 %, which improved to 96 % following strategic data augmentation and class balancing interventions. PCA-based clustering visualizations revealed well-separated clusters (Silhouette Score: 0.6660), confirming the discriminative power of the selected features. Meanwhile, the federated learning implementation preserved data confidentiality without sacrificing performance, reinforcing the model's suitability for real-world clinical deployment. Collectively, these findings validate the framework's robustness, interpretability, and adaptability, offering a scalable and ethically aligned solution for AI-driven dental diagnostics in modern healthcare systems.
龋齿仍然是全球最普遍和最持久的慢性疾病之一,影响所有年龄组的个体,并对公共卫生系统构成重大负担。早期发现对于防止蛀牙恶化、减少治疗复杂性和改善长期口腔健康结果至关重要。为了响应这些临床需求,本研究提出了一个全面的、隐私意识的、可解释的深度学习框架,用于从x射线图像中自动分类龋齿。该方法解决了分类不平衡、图像分辨率低和患者医学图像隐私保护等问题。该框架分为三个渐进阶段,包括通过卷积神经网络(CNN)、ResNet-18和DenseNet进行监督学习;基于主成分分析(PCA)的无监督聚类;以及分散的联邦学习策略,以确保跨分布式数据集的安全模型训练。实验数据集由957张标记的牙科x光片组成,其中包括174张健康病例和783张龋齿病例,强调了类别不平衡的问题。初始基线模型的准确率为84%,在策略数据增强和班级平衡干预后提高到96%。基于pca的聚类可视化显示了分离良好的聚类(剪影得分:0.6660),证实了所选特征的判别能力。同时,联邦学习实现在不牺牲性能的情况下保护了数据机密性,增强了模型对现实世界临床部署的适用性。总的来说,这些发现验证了框架的稳健性、可解释性和适应性,为现代医疗保健系统中人工智能驱动的牙科诊断提供了可扩展和符合道德的解决方案。
{"title":"Privacy-aware and interpretable deep learning framework for dental caries classification","authors":"Jashvant Kumar ,&nbsp;Khaled Mohamad Almustafa ,&nbsp;Rand Madanat ,&nbsp;Akhilesh Kumar Sharma ,&nbsp;Muhammed Sutcu ,&nbsp;Juliano Katrib","doi":"10.1016/j.ibmed.2025.100294","DOIUrl":"10.1016/j.ibmed.2025.100294","url":null,"abstract":"<div><div>Dental caries remains one of the most prevalent and persistent chronic diseases globally, affecting individuals across all age groups and posing a significant burden on public health systems. Early detection is critical to prevent the progression of tooth decay, reduce treatment complexity, and improve long-term oral health outcomes. In response to these clinical demands, this study presents a comprehensive, privacy-aware, and interpretable deep learning framework for the automated classification of dental caries from X-ray images. The approach addresses the issues of class imbalance, low Resolution image and privacy preserved patient's medical images.The framework is structured into three progressive phases that incorporate supervised learning through Convolutional Neural Networks (CNN), ResNet-18, and DenseNet; unsupervised clustering using Principal Component Analysis (PCA); and a decentralized federated learning strategy to ensure secure model training across distributed datasets. The experimental dataset consists of 957 labelled dental radiographs, including 174 healthy and 783 carious cases, emphasizing the issue of class imbalance. Initial baseline models achieved an accuracy of 84 %, which improved to 96 % following strategic data augmentation and class balancing interventions. PCA-based clustering visualizations revealed well-separated clusters (Silhouette Score: 0.6660), confirming the discriminative power of the selected features. Meanwhile, the federated learning implementation preserved data confidentiality without sacrificing performance, reinforcing the model's suitability for real-world clinical deployment. Collectively, these findings validate the framework's robustness, interpretability, and adaptability, offering a scalable and ethically aligned solution for AI-driven dental diagnostics in modern healthcare systems.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100294"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144912406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligence-based medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1