首页 > 最新文献

Intelligence-based medicine最新文献

英文 中文
A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital 基于Mbarara地区转诊医院电子病历数据的新生儿败血症预测算法
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100198
Peace Ezeobi Dennis , Angella Musiimenta , William Wasswa , Stella Kyoyagala

Introduction

Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.

Methods

We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.

Results

The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.

Conclusions

The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.
新生儿败血症是一项全球性挑战,对新生儿发病率和死亡率有重要影响。目前的诊断方法依赖于传统的培养方法,这一过程需要时间,并导致及时做出治疗决定的延误。本研究提出了一种利用Mbarara地区转诊医院(MRRH)电子病历(EMR)数据的机器学习算法,以提高新生儿败血症的早期发现和治疗。方法对2015年10月至2019年9月期间在MRRH新生儿重症监护病房(NICU)住院至少48小时并接受至少一次脓毒症评估的新生儿数据集进行回顾性研究。482例符合纳入标准的新生儿记录,数据集包括38个新生儿败血症筛查参数。该研究考虑了脓毒症评估的两种结果:如果血液培养呈阳性,则培养呈阳性;如果培养呈阴性,但使用抗生素至少120小时,则临床呈阳性。我们实施了k-fold交叉验证,k设置为10,以保证模型的稳健训练和测试。七个机器学习模型被训练来将输入分类为脓毒症阳性或阴性,并将它们的表现与医生的诊断进行比较。结果本研究结果表明,结合产妇危险因素、新生儿临床体征和实验室检查(该算法的灵敏度和特异性至少为95%)提出的算法优于医生诊断(灵敏度= 89%,特异性= 11%)。采用径向基函数、多项式核的SVM模型和AUROC最高达98%的DT模型均优于其他模型。结论结合产妇危险因素、新生儿临床体征和实验室检查,有助于提高对新生儿脓毒症的预测。进一步的研究需要在前瞻性试验中评估潜在的性能改善和临床疗效。
{"title":"A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital","authors":"Peace Ezeobi Dennis ,&nbsp;Angella Musiimenta ,&nbsp;William Wasswa ,&nbsp;Stella Kyoyagala","doi":"10.1016/j.ibmed.2025.100198","DOIUrl":"10.1016/j.ibmed.2025.100198","url":null,"abstract":"<div><h3>Introduction</h3><div>Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.</div></div><div><h3>Methods</h3><div>We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.</div></div><div><h3>Results</h3><div>The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.</div></div><div><h3>Conclusions</h3><div>The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100198"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based approach to diagnose lung cancer using CT-scan images 基于深度学习的ct扫描图像肺癌诊断方法
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2024.100188
Mohammad Q. Shatnawi, Qusai Abuein, Romesaa Al-Quraan
The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.
本研究的工作重点是使用计算机断层扫描(CT)自动分类和预测肺癌,采用深度学习(DL)策略,特别是增强型卷积神经网络(cnn),以实现快速准确的图像分析。本研究设计并开发了包括ConvNeXtSmall、VGG16、ResNet50、InceptionV3和EfficientNetB0在内的预训练模型,用于肺癌分类。数据集分为4类,包括腺癌图像338张,大细胞癌图像187张,鳞状细胞癌图像260张,正常图像215张。值得注意的是,增强的CNN模型实现了前所未有的100%的测试精度,优于所有其他模型,包括ConvNeXt为87%,VGG16为99%,ResNet50为94.5%,InceptionV3为76.9%,EfficientNetB0为97.9%。这项研究被认为是第一个使用增强型CNN达到100%测试准确率的研究,通过应用复杂的图像增强技术和创新的模型架构,展示了肺癌检测方面的重大进步。这突出了增强CNN模型在改变肺癌诊断方面的潜力,并强调了将先进的图像处理技术整合到临床实践中的重要性。
{"title":"Deep learning-based approach to diagnose lung cancer using CT-scan images","authors":"Mohammad Q. Shatnawi,&nbsp;Qusai Abuein,&nbsp;Romesaa Al-Quraan","doi":"10.1016/j.ibmed.2024.100188","DOIUrl":"10.1016/j.ibmed.2024.100188","url":null,"abstract":"<div><div>The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100188"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting patient no-shows using machine learning: A comprehensive review and future research agenda 利用机器学习预测患者未就诊情况:全面回顾与未来研究议程
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100229
Khaled M. Toffaha , Mecit Can Emre Simsekler , Mohammed Atif Omar , Imad ElKebbi
Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.
The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.
Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.
By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.
病人爽约给医疗保健系统带来了巨大挑战,导致资源浪费、成本增加和护理连续性中断。本综述分析了 2010 年至 2025 年间发表的 52 篇论文,探讨了用于预测门诊患者爽约情况的最先进的机器学习(ML)方法。研究显示,该领域取得了重大进展,在 68% 的研究中,逻辑回归(LR)是最常用的模型。基于树的模型、集合方法和深度学习技术在近几年得到了广泛应用,反映了该领域的发展。表现最好的模型的曲线下面积(AUC)得分在 0.75 到 0.95 之间,准确率在 52% 到 99.44% 之间。在方法上,研究人员利用各种采样技术解决了类不平衡等常见难题,并采用了多种特征选择方法来提高模型效率。该综述还强调了考虑时间因素和不同医疗环境中缺席行为的环境依赖性的重要性。利用 ITPOSMO 框架(信息、技术、流程、目标、人员配备、管理和其他资源),该研究确定了当前 ML 方法中的几个差距。主要挑战包括数据质量和完整性、模型可解释性以及与现有医疗保健系统的集成。未来的研究方向包括改进数据收集方法、纳入组织因素、确保符合道德规范的实施,以及开发处理数据不平衡的标准化方法。该综述还建议探索新的数据源,利用 ML 算法分析患者行为模式,并使用迁移学习技术在不同的医疗机构间调整模型。通过应对这些挑战,医疗机构可以利用 ML 改善资源分配,提高患者护理质量,并推进医疗领域的预测分析。这篇全面的综述强调了人工智能在预测病例缺席方面的潜力,同时也承认了其实际应用中的复杂性和挑战。
{"title":"Predicting patient no-shows using machine learning: A comprehensive review and future research agenda","authors":"Khaled M. Toffaha ,&nbsp;Mecit Can Emre Simsekler ,&nbsp;Mohammed Atif Omar ,&nbsp;Imad ElKebbi","doi":"10.1016/j.ibmed.2025.100229","DOIUrl":"10.1016/j.ibmed.2025.100229","url":null,"abstract":"<div><div>Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.</div><div>The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.</div><div>Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.</div><div>By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100229"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143529776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis 基于ct的卵巢肿瘤可靠诊断的混合视觉变换器和异常模型
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100227
Eman Hussein Alshdaifat , Hasan Gharaibeh , Amer Mahmoud Sindiani , Rola Madain , Asma'a Mohammad Al-Mnayyis , Hamad Yahia Abu Mhanna , Rawan Eimad Almahmoud , Hanan Fawaz Akhdar , Mohammad Amin , Ahmad Nasayreh , Raneem Hamad
Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.
卵巢癌是一个主要的全球健康问题,其特点是死亡率高,缺乏准确的诊断方法。快速准确地检测卵巢癌对于改善患者预后和制定适当的治疗方案至关重要。医学影像学方法是鉴别卵巢癌的必要手段;然而,实现准确的诊断仍然是一个挑战。本文提出了一种强大的卵巢癌检测方法,包括使用Xception_ViT模型对良性和恶性肿瘤进行识别和分类。之所以选择这种混合方法,是因为它结合了传统的基于cnn的模型(如Xception)的优势和现代基于transformer的模型(如ViT)的能力。这种组合允许模型利用Xception,它从图像中提取特征。然后使用视觉转换器(Vision Transformer, ViT)模型来识别不同视觉元素之间的联系,增强模型对复杂组件的理解。最后将多层感知器(MLP)层与所提出的图像分类模型相结合。使用约旦阿卜杜拉国王大学医院(KAUH)的三个计算机断层扫描(CT)图像数据集评估该模型的有效性。第一个数据集包括卵巢癌计算机断层扫描数据集(KAUH-OCCTD),第二个数据集是良性卵巢肿瘤数据集(KAUH-BOTD),第三个数据集是恶性卵巢肿瘤数据集(KAUH-MOTD)。从500名妇女中收集的三个数据集以其卵巢肿瘤分类的多样性为特征,是约旦首次收集此类数据集。所提出的Xception_ViT模型在KAUH-OCCTD数据集上识别卵巢癌的准确率为98.09%,在KAUH-BOTD和KAUH-MOTD数据集上区分卵巢良恶性肿瘤的准确率分别为96.05%和98.73%。提出的模型在所有三个数据集上都优于预训练模型。结果表明,该模型能够对卵巢肿瘤进行分类。该方法还可以大大提高新手放射科医生评估卵巢恶性肿瘤的效率,并协助妇科医生为这些个体提供改进的治疗方案。
{"title":"Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis","authors":"Eman Hussein Alshdaifat ,&nbsp;Hasan Gharaibeh ,&nbsp;Amer Mahmoud Sindiani ,&nbsp;Rola Madain ,&nbsp;Asma'a Mohammad Al-Mnayyis ,&nbsp;Hamad Yahia Abu Mhanna ,&nbsp;Rawan Eimad Almahmoud ,&nbsp;Hanan Fawaz Akhdar ,&nbsp;Mohammad Amin ,&nbsp;Ahmad Nasayreh ,&nbsp;Raneem Hamad","doi":"10.1016/j.ibmed.2025.100227","DOIUrl":"10.1016/j.ibmed.2025.100227","url":null,"abstract":"<div><div>Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100227"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Features and eigenspectral densities analyses for machine learning and classification of severities in chronic obstructive pulmonary diseases 慢性阻塞性肺疾病中机器学习和严重程度分类的特征和特征谱密度分析
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100217
Timothy Albiges, Zoheir Sabeur, Banafshe Arbab-Zavar
Chronic Obstructive Pulmonary Disease (COPD) has been presenting highly significant global health challenges for many decades. Equally, it is important to slow down this disease's ever-increasingly challenging impact on hospital patient loads. It has become necessary, if not critical, to capitalise on existing knowledge of advanced artificial intelligence to achieve the early detection of COPD and advance personalised care of COPD patients from their homes. The use of machine learning and reaching out on the classification of the multiple types of COPD severities effectively and at progressively acceptable levels of confidence is of paramount importance. Indeed, this capability will feed into highly effective personalised care of COPD patients from their homes while significantly improving their quality of life.
Auscultation lung sound analysis has emerged as a valuable, non-invasive, and cost-effective remote diagnostic tool of the future for respiratory conditions such as COPD. This research paper introduces a novel machine learning-based approach for classifying multiple COPD severities through the analysis of lung sound data streams. Leveraging two open datasets with diverse acoustic characteristics and clinical manifestations, the research study involves the transformation and decomposition of lung sound data matrices into their eigenspace representation in order to capture key features for machine learning and detection. Early eigenvalue spectra analyses were also performed to discover their distinct manifestations under the multiple established COPD severities. This has led us into projecting our experimental data matrices into their eigenspace with the use of the manifested data features prior to the machine learning process. This was followed by various methods of machine classification of COPD severities successfully. Support Vector Classifiers, Logistic Regression, Random Forests and Naive Bayes Classifiers were deployed. Systematic classifier performance metrics were also adopted; they showed early promising classification accuracies beyond 75 % for distinguishing COPD severities.
This research benchmark contributes to computer-aided medical diagnosis and supports the integration of auscultation lung sound analyses into COPD assessment protocols for individualised patient care and treatment. Future work involves the acquisition of larger volumes of lung sound data while also exploring multi-modal sensing of COPD patients for heterogeneous data fusion to advance COPD severity classification performance.
几十年来,慢性阻塞性肺疾病(COPD)一直是全球健康面临的重大挑战。同样,减缓这种疾病对医院病人负荷的日益严峻的影响也很重要。利用现有的先进人工智能知识来实现COPD的早期发现,并在家中推进COPD患者的个性化护理,即使不是至关重要,也是必要的。机器学习的使用和对多种类型COPD严重程度的有效分类以及逐步可接受的置信度水平的接触是至关重要的。事实上,这种能力将有助于在家中为COPD患者提供高效的个性化护理,同时显著改善他们的生活质量。听诊肺音分析已成为一种有价值的、无创的、具有成本效益的远程诊断工具,用于未来的呼吸系统疾病,如慢性阻塞性肺病。本文介绍了一种新的基于机器学习的方法,通过分析肺声数据流来分类多种COPD严重程度。利用两个具有不同声学特征和临床表现的开放数据集,该研究涉及将肺音数据矩阵转换和分解为其特征空间表示,以捕获用于机器学习和检测的关键特征。还进行了早期特征值谱分析,以发现其在多个已确定的COPD严重程度下的不同表现。这导致我们在机器学习过程之前使用已显示的数据特征将实验数据矩阵投影到它们的特征空间中。随后,各种COPD严重程度的机器分类方法都取得了成功。使用了支持向量分类器、逻辑回归、随机森林和朴素贝叶斯分类器。采用系统分类器性能指标;在区分COPD严重程度方面,他们显示出早期有希望的分类准确率超过75%。该研究基准有助于计算机辅助医疗诊断,并支持将听诊肺音分析整合到COPD评估方案中,以实现个体化患者护理和治疗。未来的工作包括获取更大量的肺声数据,同时探索COPD患者的多模式感知,以进行异构数据融合,以提高COPD严重程度分类性能。
{"title":"Features and eigenspectral densities analyses for machine learning and classification of severities in chronic obstructive pulmonary diseases","authors":"Timothy Albiges,&nbsp;Zoheir Sabeur,&nbsp;Banafshe Arbab-Zavar","doi":"10.1016/j.ibmed.2025.100217","DOIUrl":"10.1016/j.ibmed.2025.100217","url":null,"abstract":"<div><div>Chronic Obstructive Pulmonary Disease (COPD) has been presenting highly significant global health challenges for many decades. Equally, it is important to slow down this disease's ever-increasingly challenging impact on hospital patient loads. It has become necessary, if not critical, to capitalise on existing knowledge of advanced artificial intelligence to achieve the early detection of COPD and advance personalised care of COPD patients from their homes. The use of machine learning and reaching out on the classification of the multiple types of COPD severities effectively and at progressively acceptable levels of confidence is of paramount importance. Indeed, this capability will feed into highly effective personalised care of COPD patients from their homes while significantly improving their quality of life.</div><div>Auscultation lung sound analysis has emerged as a valuable, non-invasive, and cost-effective remote diagnostic tool of the future for respiratory conditions such as COPD. This research paper introduces a novel machine learning-based approach for classifying multiple COPD severities through the analysis of lung sound data streams. Leveraging two open datasets with diverse acoustic characteristics and clinical manifestations, the research study involves the transformation and decomposition of lung sound data matrices into their eigenspace representation in order to capture key features for machine learning and detection. Early eigenvalue spectra analyses were also performed to discover their distinct manifestations under the multiple established COPD severities. This has led us into projecting our experimental data matrices into their eigenspace with the use of the manifested data features prior to the machine learning process. This was followed by various methods of machine classification of COPD severities successfully. Support Vector Classifiers, Logistic Regression, Random Forests and Naive Bayes Classifiers were deployed. Systematic classifier performance metrics were also adopted; they showed early promising classification accuracies beyond 75 % for distinguishing COPD severities.</div><div>This research benchmark contributes to computer-aided medical diagnosis and supports the integration of auscultation lung sound analyses into COPD assessment protocols for individualised patient care and treatment. Future work involves the acquisition of larger volumes of lung sound data while also exploring multi-modal sensing of COPD patients for heterogeneous data fusion to advance COPD severity classification performance.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100217"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143419554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes 使用pca增强的XGBoost和SMOTE-ENN预测孕产妇健康风险,以改善医疗保健结果
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100300
Rahmatul Kabir Rasel Sarker , Sadman Hafij , Md Adib Yasir , Md Assaduzzaman , Md Monir Hossain Shimul , Md Kamrul Hossain

Background

Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.

Methods

The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.

Results

The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.

Conclusion

This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.
产妇保健仍然是全球优先事项,特别是在资源匮乏的环境中,及时识别风险至关重要。传统的机器学习模型通常存在泛化能力差、数据不平衡和计算效率低下的问题。本研究提出了一个增强的预测模型,结合SMOTE-ENN数据平衡和主成分分析(PCA)与XGBoost,利用最小的、易于收集的临床特征来提高孕产妇风险分类的准确性。方法从公共信息库中获取1014份孕产妇健康记录,包括7项生理特征。预处理包括使用SMOTE-ENN进行标准化、标签编码和类平衡。采用主成分分析法进行降维,提高计算性能,减少过拟合。对决策树、随机森林、LightGBM、梯度增强和支持向量机等几种机器学习分类器进行了评估,最终选择XGBoost作为最终模型。性能指标包括准确性、精密度、召回率、f1评分、ROC-AUC和10倍交叉验证。结果pca增强的XGBoost模型具有最高的准确率(97.73%)、精密度(98%)、召回率(98%)和f1评分(98%)。它优于所有其他模型,特别是在识别高风险病例时,以最小的假阴性。交叉验证证实了模型的稳健性(平均准确率为98.39%),所有类别的ROC-AUC得分均超过0.998,表明分类性能接近完美。结论:本研究验证了一种产妇健康风险预测模型,该模型可扩展用于资源受限环境,并可在所选降维方法的限制下解释。它的简单,高精度和可推广性使其成为早期临床决策和干预的有前途的工具。
{"title":"Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes","authors":"Rahmatul Kabir Rasel Sarker ,&nbsp;Sadman Hafij ,&nbsp;Md Adib Yasir ,&nbsp;Md Assaduzzaman ,&nbsp;Md Monir Hossain Shimul ,&nbsp;Md Kamrul Hossain","doi":"10.1016/j.ibmed.2025.100300","DOIUrl":"10.1016/j.ibmed.2025.100300","url":null,"abstract":"<div><h3>Background</h3><div>Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.</div></div><div><h3>Methods</h3><div>The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.</div></div><div><h3>Results</h3><div>The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.</div></div><div><h3>Conclusion</h3><div>This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI and scientific manuscript peer review 生成人工智能和科学手稿同行评审
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100246
Robert Hoyt MD (Associate Clinical Professor), Alfonso Limon PhD (Senior Data Scientist), Anthony Chang MD (Chief Intelligence and Innovation Officer)
{"title":"Generative AI and scientific manuscript peer review","authors":"Robert Hoyt MD (Associate Clinical Professor),&nbsp;Alfonso Limon PhD (Senior Data Scientist),&nbsp;Anthony Chang MD (Chief Intelligence and Innovation Officer)","doi":"10.1016/j.ibmed.2025.100246","DOIUrl":"10.1016/j.ibmed.2025.100246","url":null,"abstract":"","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100246"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144195200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-driven graph-based machine learning for non-invasive diagnosis of NAFLD 基于注意力驱动图的机器学习在非侵入性NAFLD诊断中的应用
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100288
Ekta Srivastava , Sarath Mohan , Tapan Kumar Gandhi , Ashok Kumar Choudhury , Sandeep Kumar
An estimated 25%–30% of the global population is affected by non-alcoholic fatty liver disease (NAFLD), a silent yet progressive condition that can advance from simple steatosis to severe stages like non-alcoholic steatohepatitis (NASH), fibrosis, and cirrhosis, significantly heightening the risk of liver cancer. Currently, the gold-standard method for staging NAFLD is liver biopsy, an invasive procedure with risks such as bleeding, infection, and sampling error. Due to its high cost and impracticality for routine monitoring, there is a critical need for reliable, non-invasive diagnostic tools capable of effectively identifying NAFLD stages. We developed a graph-based framework in which each patient is represented as a node in a similarity network. Edges are formed via k-nearest neighbors (KNN) on standardized clinical and biochemical features, with missing values imputed by KNN to preserve biologically plausible variability. A two-layer Graph Attention Network (GAT) then learns edge-specific attention weights to focus on the most informative inter-patient relationships. Tested on a proprietary ILBS cohort (n = 622), our model achieved 75.2% accuracy (AUC = 0.768; F1 = 0.752), an 11% absolute improvement over Support Vector Machines and Random Forests, and demonstrated robustness in 10-fold cross-validation and adversarial noise tests. On a separate public dataset (n = 80) spanning lipidomic, glycomic, fatty acid, and hormone panels, it exceeded 99% accuracy (AUC > 0.99). Attention-based explanations further highlighted key patient similarities driving each prediction. These findings suggest that attention-driven graph learning can clearly improve non-invasive NAFLD staging, enabling early detection and supporting personalized disease monitoring in diverse clinical settings.
据估计,全球25%-30%的人口受到非酒精性脂肪性肝病(NAFLD)的影响,这是一种沉默但进展的疾病,可从单纯的脂肪变性发展到严重阶段,如非酒精性脂肪性肝炎(NASH)、纤维化和肝硬化,显著增加了肝癌的风险。目前,NAFLD分期的金标准方法是肝活检,这是一种侵入性手术,存在出血、感染和抽样错误等风险。由于其高成本和常规监测的不实用性,迫切需要能够有效识别NAFLD分期的可靠、非侵入性诊断工具。我们开发了一个基于图形的框架,其中每个患者都表示为相似网络中的节点。边缘是通过标准化临床和生化特征的k近邻(KNN)形成的,缺失值由KNN输入以保持生物学上合理的可变性。然后,两层图注意网络(GAT)学习边缘特定注意权重,以关注最具信息量的患者间关系。在专有的ILBS队列(n = 622)上进行测试,我们的模型达到了75.2%的准确率(AUC = 0.768; F1 = 0.752),比支持向量机和随机森林提高了11%,并在10倍交叉验证和对抗噪声测试中显示出鲁棒性。在一个独立的公共数据集(n = 80)上,包括脂质组、糖糖组、脂肪酸组和激素组,准确率超过99% (AUC > 0.99)。基于注意力的解释进一步强调了驱动每种预测的关键患者相似性。这些发现表明,注意力驱动的图学习可以明显改善非侵入性NAFLD的分期,使早期发现成为可能,并在不同的临床环境中支持个性化的疾病监测。
{"title":"Attention-driven graph-based machine learning for non-invasive diagnosis of NAFLD","authors":"Ekta Srivastava ,&nbsp;Sarath Mohan ,&nbsp;Tapan Kumar Gandhi ,&nbsp;Ashok Kumar Choudhury ,&nbsp;Sandeep Kumar","doi":"10.1016/j.ibmed.2025.100288","DOIUrl":"10.1016/j.ibmed.2025.100288","url":null,"abstract":"<div><div>An estimated 25%–30% of the global population is affected by non-alcoholic fatty liver disease (NAFLD), a silent yet progressive condition that can advance from simple steatosis to severe stages like non-alcoholic steatohepatitis (NASH), fibrosis, and cirrhosis, significantly heightening the risk of liver cancer. Currently, the gold-standard method for staging NAFLD is liver biopsy, an invasive procedure with risks such as bleeding, infection, and sampling error. Due to its high cost and impracticality for routine monitoring, there is a critical need for reliable, non-invasive diagnostic tools capable of effectively identifying NAFLD stages. We developed a graph-based framework in which each patient is represented as a node in a similarity network. Edges are formed via k-nearest neighbors (KNN) on standardized clinical and biochemical features, with missing values imputed by KNN to preserve biologically plausible variability. A two-layer Graph Attention Network (GAT) then learns edge-specific attention weights to focus on the most informative inter-patient relationships. Tested on a proprietary ILBS cohort (n = 622), our model achieved 75.2% accuracy (AUC = 0.768; F1 = 0.752), an 11% absolute improvement over Support Vector Machines and Random Forests, and demonstrated robustness in 10-fold cross-validation and adversarial noise tests. On a separate public dataset (n = 80) spanning lipidomic, glycomic, fatty acid, and hormone panels, it exceeded 99% accuracy (AUC <span><math><mo>&gt;</mo></math></span> 0.99). Attention-based explanations further highlighted key patient similarities driving each prediction. These findings suggest that attention-driven graph learning can clearly improve non-invasive NAFLD staging, enabling early detection and supporting personalized disease monitoring in diverse clinical settings.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100288"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144912336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-aware and interpretable deep learning framework for dental caries classification 隐私感知和可解释的龋齿分类深度学习框架
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100294
Jashvant Kumar , Khaled Mohamad Almustafa , Rand Madanat , Akhilesh Kumar Sharma , Muhammed Sutcu , Juliano Katrib
Dental caries remains one of the most prevalent and persistent chronic diseases globally, affecting individuals across all age groups and posing a significant burden on public health systems. Early detection is critical to prevent the progression of tooth decay, reduce treatment complexity, and improve long-term oral health outcomes. In response to these clinical demands, this study presents a comprehensive, privacy-aware, and interpretable deep learning framework for the automated classification of dental caries from X-ray images. The approach addresses the issues of class imbalance, low Resolution image and privacy preserved patient's medical images.The framework is structured into three progressive phases that incorporate supervised learning through Convolutional Neural Networks (CNN), ResNet-18, and DenseNet; unsupervised clustering using Principal Component Analysis (PCA); and a decentralized federated learning strategy to ensure secure model training across distributed datasets. The experimental dataset consists of 957 labelled dental radiographs, including 174 healthy and 783 carious cases, emphasizing the issue of class imbalance. Initial baseline models achieved an accuracy of 84 %, which improved to 96 % following strategic data augmentation and class balancing interventions. PCA-based clustering visualizations revealed well-separated clusters (Silhouette Score: 0.6660), confirming the discriminative power of the selected features. Meanwhile, the federated learning implementation preserved data confidentiality without sacrificing performance, reinforcing the model's suitability for real-world clinical deployment. Collectively, these findings validate the framework's robustness, interpretability, and adaptability, offering a scalable and ethically aligned solution for AI-driven dental diagnostics in modern healthcare systems.
龋齿仍然是全球最普遍和最持久的慢性疾病之一,影响所有年龄组的个体,并对公共卫生系统构成重大负担。早期发现对于防止蛀牙恶化、减少治疗复杂性和改善长期口腔健康结果至关重要。为了响应这些临床需求,本研究提出了一个全面的、隐私意识的、可解释的深度学习框架,用于从x射线图像中自动分类龋齿。该方法解决了分类不平衡、图像分辨率低和患者医学图像隐私保护等问题。该框架分为三个渐进阶段,包括通过卷积神经网络(CNN)、ResNet-18和DenseNet进行监督学习;基于主成分分析(PCA)的无监督聚类;以及分散的联邦学习策略,以确保跨分布式数据集的安全模型训练。实验数据集由957张标记的牙科x光片组成,其中包括174张健康病例和783张龋齿病例,强调了类别不平衡的问题。初始基线模型的准确率为84%,在策略数据增强和班级平衡干预后提高到96%。基于pca的聚类可视化显示了分离良好的聚类(剪影得分:0.6660),证实了所选特征的判别能力。同时,联邦学习实现在不牺牲性能的情况下保护了数据机密性,增强了模型对现实世界临床部署的适用性。总的来说,这些发现验证了框架的稳健性、可解释性和适应性,为现代医疗保健系统中人工智能驱动的牙科诊断提供了可扩展和符合道德的解决方案。
{"title":"Privacy-aware and interpretable deep learning framework for dental caries classification","authors":"Jashvant Kumar ,&nbsp;Khaled Mohamad Almustafa ,&nbsp;Rand Madanat ,&nbsp;Akhilesh Kumar Sharma ,&nbsp;Muhammed Sutcu ,&nbsp;Juliano Katrib","doi":"10.1016/j.ibmed.2025.100294","DOIUrl":"10.1016/j.ibmed.2025.100294","url":null,"abstract":"<div><div>Dental caries remains one of the most prevalent and persistent chronic diseases globally, affecting individuals across all age groups and posing a significant burden on public health systems. Early detection is critical to prevent the progression of tooth decay, reduce treatment complexity, and improve long-term oral health outcomes. In response to these clinical demands, this study presents a comprehensive, privacy-aware, and interpretable deep learning framework for the automated classification of dental caries from X-ray images. The approach addresses the issues of class imbalance, low Resolution image and privacy preserved patient's medical images.The framework is structured into three progressive phases that incorporate supervised learning through Convolutional Neural Networks (CNN), ResNet-18, and DenseNet; unsupervised clustering using Principal Component Analysis (PCA); and a decentralized federated learning strategy to ensure secure model training across distributed datasets. The experimental dataset consists of 957 labelled dental radiographs, including 174 healthy and 783 carious cases, emphasizing the issue of class imbalance. Initial baseline models achieved an accuracy of 84 %, which improved to 96 % following strategic data augmentation and class balancing interventions. PCA-based clustering visualizations revealed well-separated clusters (Silhouette Score: 0.6660), confirming the discriminative power of the selected features. Meanwhile, the federated learning implementation preserved data confidentiality without sacrificing performance, reinforcing the model's suitability for real-world clinical deployment. Collectively, these findings validate the framework's robustness, interpretability, and adaptability, offering a scalable and ethically aligned solution for AI-driven dental diagnostics in modern healthcare systems.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100294"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144912406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient multi-modal fusion framework with advanced AI-driven approaches for automated Parkinson's disease detection 高效的多模态融合框架与先进的人工智能驱动的方法,用于帕金森病的自动检测
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100310
Gouri Shankar Chakraborty , Joy Chakra Bortty , Joy Das , Inshad Rahman Noman , Kanchon Kumar Bishnu , Araf Islam
Parkinson's Disease is a neurological disorder, characterized by gradual loss of dopaminergic neurons in the substantia nigra resulting in: tremors, muscle rigidity, bradykinesia and postural instability. Other symptoms which involve other parts of the body and organs are as follows: loss of smell, difficulty to sleep, and changes in cognition. Being a neurodegenerative disorder, it becomes important to detect the disease in early manner. Different researchers across all over the world are trying to develop such techniques that can be helpful for disease detection process. Manually detection process based on the medical images is complex and time consuming where accuracy and reliability is also questionable. Here, deep learning came to the picture to make the process automatic and reliable where deep neural network-based models are being used to classify different diseases quite accurately and efficiently. Utilizing the potentiality of Artificial Intelligence (AI), a novel work on Parkinson's disease diagnosis has been performed with comprehensive personalized management strategies. Here in this work, AI-powered detection frameworks have been designed for Parkinson's disease classification. Seven Machine Learning models (Logistic Regression, K-Nearest Neighbors, Perceptron, Support Vector Machine, XGBoost, Decision Tree and Random Forest) and five Deep Learning Models (ResNet101, VGG19, Xception, Inception and EfficientNet) were trained and best models have been selected based on the performance analysis. Feature fusion technique with modified classification layers with hyperparameter tuning ensures optimized and remarkable output. LR and VGG19 have been selected where accuracies of 95.74 % for EEG data with LR model, 96.78 % for MRI image-based classification and 97.7 % for spiral and wave-based drawings with proposed fusion VGG19 model.
帕金森病是一种神经系统疾病,其特征是黑质中多巴胺能神经元的逐渐丧失,导致震颤、肌肉僵硬、运动迟缓和姿势不稳定。其他涉及身体其他部位和器官的症状如下:嗅觉丧失、睡眠困难和认知改变。作为一种神经退行性疾病,早期发现它变得很重要。世界各地不同的研究人员都在努力开发有助于疾病检测过程的技术。基于医学图像的人工检测过程复杂且耗时,准确性和可靠性也存在问题。在这里,深度学习的出现使这个过程自动化和可靠,其中基于深度神经网络的模型被用于非常准确和有效地分类不同的疾病。利用人工智能(AI)的潜力,在帕金森病诊断方面开展了一项新的工作,并采用了全面的个性化管理策略。在这项工作中,人工智能检测框架被设计用于帕金森病分类。七个机器学习模型(逻辑回归、k近邻、感知器、支持向量机、XGBoost、决策树和随机森林)和五个深度学习模型(ResNet101、VGG19、Xception、Inception和EfficientNet)进行了训练,并根据性能分析选择了最佳模型。特征融合技术与改进的分类层和超参数调谐保证了优化和显著的输出。选择了LR和VGG19模型,其中LR模型对脑电数据的分类准确率为95.74%,基于MRI图像的分类准确率为96.78%,基于螺旋和波浪的分类准确率为97.7%。
{"title":"Efficient multi-modal fusion framework with advanced AI-driven approaches for automated Parkinson's disease detection","authors":"Gouri Shankar Chakraborty ,&nbsp;Joy Chakra Bortty ,&nbsp;Joy Das ,&nbsp;Inshad Rahman Noman ,&nbsp;Kanchon Kumar Bishnu ,&nbsp;Araf Islam","doi":"10.1016/j.ibmed.2025.100310","DOIUrl":"10.1016/j.ibmed.2025.100310","url":null,"abstract":"<div><div>Parkinson's Disease is a neurological disorder, characterized by gradual loss of dopaminergic neurons in the substantia nigra resulting in: tremors, muscle rigidity, bradykinesia and postural instability. Other symptoms which involve other parts of the body and organs are as follows: loss of smell, difficulty to sleep, and changes in cognition. Being a neurodegenerative disorder, it becomes important to detect the disease in early manner. Different researchers across all over the world are trying to develop such techniques that can be helpful for disease detection process. Manually detection process based on the medical images is complex and time consuming where accuracy and reliability is also questionable. Here, deep learning came to the picture to make the process automatic and reliable where deep neural network-based models are being used to classify different diseases quite accurately and efficiently. Utilizing the potentiality of Artificial Intelligence (AI), a novel work on Parkinson's disease diagnosis has been performed with comprehensive personalized management strategies. Here in this work, AI-powered detection frameworks have been designed for Parkinson's disease classification. Seven Machine Learning models (Logistic Regression, K-Nearest Neighbors, Perceptron, Support Vector Machine, XGBoost, Decision Tree and Random Forest) and five Deep Learning Models (ResNet101, VGG19, Xception, Inception and EfficientNet) were trained and best models have been selected based on the performance analysis. Feature fusion technique with modified classification layers with hyperparameter tuning ensures optimized and remarkable output. LR and VGG19 have been selected where accuracies of 95.74 % for EEG data with LR model, 96.78 % for MRI image-based classification and 97.7 % for spiral and wave-based drawings with proposed fusion VGG19 model.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100310"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligence-based medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1