首页 > 最新文献

Intelligence-based medicine最新文献

英文 中文
Artificial neural network based automatic detection of motor evoked potentials 基于人工神经网络的运动诱发电位自动检测
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100295
Bethel Osuagwu , Hongli Huang , Emily L. McNicol , Vellaisamy A.L. Roy , Aleksandra Vučkovič

Introduction

Motor evoked potentials (MEP) are detected using various methods that determine signal changepoints. The current detection methods perform well given a high signal to noise ratio. However, performance can diminish with artefact such as those arising due to poor signal quality and unwanted electrical potentials. Part of the problem is likely because the methods ignore the morphology of a signal making it impossible to differentiate noise from MEPs.

Methods

For the first time, we investigated a new detection method able to learn MEP morphology using artificial neural networks. To build an MEP detection model, we trained deep neural networks with architectures based on combined CNN and LSTM or self-attention mechanism, using sample MEP data recorded from able-bodied individuals. The MEP detection capability of the models was compared with that of a changepoint based detection method.

Results

Our models reached test accuracy of up to 89.7 ± 1.5 % on average. In a real-world setting evaluation, our models achieved average detection accuracy of up to 94.7 ± 1.2 %, compared with 76.4 ± 5.3 % for the standard changepoint detection method (p = 0.004).

Conclusion

Artificial neural network models can be used for improved automated detection of MEPs.
运动诱发电位(MEP)的检测使用各种方法来确定信号的变化点。当前的检测方法在高信噪比条件下表现良好。然而,由于信号质量差和不需要的电势而产生的伪影会降低性能。部分问题可能是因为这些方法忽略了信号的形态,从而无法区分噪声和mep。方法首次研究了一种基于人工神经网络的MEP形态学检测方法。为了构建MEP检测模型,我们使用健全个体的MEP样本数据,训练了基于CNN和LSTM(自注意机制)相结合的深层神经网络架构。将模型的MEP检测能力与基于变化点的检测方法进行了比较。结果模型的检测准确率平均可达89.7±1.5%。在现实环境评估中,我们的模型实现了高达94.7±1.2%的平均检测精度,而标准变化点检测方法的平均检测精度为76.4±5.3% (p = 0.004)。结论人工神经网络模型可用于改进mep的自动检测。
{"title":"Artificial neural network based automatic detection of motor evoked potentials","authors":"Bethel Osuagwu ,&nbsp;Hongli Huang ,&nbsp;Emily L. McNicol ,&nbsp;Vellaisamy A.L. Roy ,&nbsp;Aleksandra Vučkovič","doi":"10.1016/j.ibmed.2025.100295","DOIUrl":"10.1016/j.ibmed.2025.100295","url":null,"abstract":"<div><h3>Introduction</h3><div>Motor evoked potentials (MEP) are detected using various methods that determine signal changepoints. The current detection methods perform well given a high signal to noise ratio. However, performance can diminish with artefact such as those arising due to poor signal quality and unwanted electrical potentials. Part of the problem is likely because the methods ignore the morphology of a signal making it impossible to differentiate noise from MEPs.</div></div><div><h3>Methods</h3><div>For the first time, we investigated a new detection method able to learn MEP morphology using artificial neural networks. To build an MEP detection model, we trained deep neural networks with architectures based on combined CNN and LSTM or self-attention mechanism, using sample MEP data recorded from able-bodied individuals. The MEP detection capability of the models was compared with that of a changepoint based detection method.</div></div><div><h3>Results</h3><div>Our models reached test accuracy of up to 89.7 ± 1.5 % on average. In a real-world setting evaluation, our models achieved average detection accuracy of up to 94.7 ± 1.2 %, compared with 76.4 ± 5.3 % for the standard changepoint detection method (p = 0.004).</div></div><div><h3>Conclusion</h3><div>Artificial neural network models can be used for improved automated detection of MEPs.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100295"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145094604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced X-ray image classification for pneumonia detection using deep learning based CBAM and SE mechanisms 基于深度学习的CBAM和SE机制增强肺炎x射线图像分类
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100299
Saiprasad Potharaju , Swapnali N. Tambe , Kishore Dasari , N. Srikanth , Rampay Venkatarao , Sagar Tambe

Problem considered

Pneumonia, a global health concern, remains a significant cause of morbidity and mortality, particularly in children under five and the elderly. Diagnostic challenges are pronounced in resource-limited settings, where expertise in radiological interpretation is scarce. X-ray imaging, a common diagnostic tool, often fails to provide accurate results without expert analysis. This gap in timely and precise diagnosis leads to delayed treatments and worsening patient outcomes. The emergence of antibiotic-resistant strains further emphasizes the urgency for innovative diagnostic solutions.

Methods

This research integrates advanced attention mechanisms into convolutional neural networks (CNNs) to enhance pneumonia detection from X-ray images. Utilizing a dataset of 5816 X-rays, preprocessing steps included normalization and data augmentation to improve robustness. The baseline CNN model was augmented with Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SE) networks, which prioritize critical image regions and recalibrate feature channels. Comparative evaluations were conducted using ResNet50 combined with CBAM.

Results

The CBAM-enhanced CNN achieved 98.6 % accuracy, improving upon the baseline CNN's 92.08 %, with a sensitivity of 98.3 % and specificity of 97.9 %. The SE-integrated CNN followed with 96.25 % accuracy, demonstrating superior feature recalibration. ResNet50 with CBAM attained 93.32 % accuracy. Compared to standard CNN models, these models exhibited reduced overfitting, improved generalization, and enhanced feature extraction. The proposed approach ensures a higher precision rate in detecting pneumonia from X-ray images. The model is designed for real-world clinical applications, particularly in low-resource healthcare settings. A lightweight, user-friendly web application was developed to assist radiologists and general practitioners in automated pneumonia detection, reducing reliance on expert interpretation.
肺炎是一个全球性的健康问题,仍然是发病和死亡的一个重要原因,特别是在五岁以下儿童和老年人中。在资源有限的环境中,诊断方面的挑战是明显的,在那里,放射学解释的专业知识是稀缺的。x射线成像是一种常见的诊断工具,在没有专家分析的情况下往往无法提供准确的结果。这种在及时和准确诊断方面的差距导致治疗延误和患者预后恶化。抗生素耐药菌株的出现进一步强调了创新诊断解决方案的紧迫性。方法本研究将先进的注意机制整合到卷积神经网络(cnn)中,以增强对x射线图像的肺炎检测。利用5816个x射线数据集,预处理步骤包括归一化和数据增强以提高鲁棒性。基线CNN模型被卷积块注意模块(CBAM)和压缩激励(SE)网络增强,它们优先考虑关键图像区域并重新校准特征通道。采用ResNet50联合CBAM进行对比评价。结果cbam增强CNN的准确率达到98.6%,比基线CNN的92.08%有所提高,敏感性为98.3%,特异性为97.9%。se集成的CNN以96.25%的准确率紧随其后,显示出优越的特征重新校准。采用CBAM的ResNet50的准确率为93.32%。与标准CNN模型相比,这些模型表现出更少的过拟合、更好的泛化和增强的特征提取。该方法保证了从x射线图像中检测肺炎的较高准确率。该模型是为现实世界的临床应用而设计的,特别是在资源匮乏的医疗保健环境中。开发了一个轻量级、用户友好的web应用程序,以帮助放射科医生和全科医生自动检测肺炎,减少对专家解释的依赖。
{"title":"Enhanced X-ray image classification for pneumonia detection using deep learning based CBAM and SE mechanisms","authors":"Saiprasad Potharaju ,&nbsp;Swapnali N. Tambe ,&nbsp;Kishore Dasari ,&nbsp;N. Srikanth ,&nbsp;Rampay Venkatarao ,&nbsp;Sagar Tambe","doi":"10.1016/j.ibmed.2025.100299","DOIUrl":"10.1016/j.ibmed.2025.100299","url":null,"abstract":"<div><h3>Problem considered</h3><div>Pneumonia, a global health concern, remains a significant cause of morbidity and mortality, particularly in children under five and the elderly. Diagnostic challenges are pronounced in resource-limited settings, where expertise in radiological interpretation is scarce. X-ray imaging, a common diagnostic tool, often fails to provide accurate results without expert analysis. This gap in timely and precise diagnosis leads to delayed treatments and worsening patient outcomes. The emergence of antibiotic-resistant strains further emphasizes the urgency for innovative diagnostic solutions.</div></div><div><h3>Methods</h3><div>This research integrates advanced attention mechanisms into convolutional neural networks (CNNs) to enhance pneumonia detection from X-ray images. Utilizing a dataset of 5816 X-rays, preprocessing steps included normalization and data augmentation to improve robustness. The baseline CNN model was augmented with Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SE) networks, which prioritize critical image regions and recalibrate feature channels. Comparative evaluations were conducted using ResNet50 combined with CBAM.</div></div><div><h3>Results</h3><div>The CBAM-enhanced CNN achieved 98.6 % accuracy, improving upon the baseline CNN's 92.08 %, with a sensitivity of 98.3 % and specificity of 97.9 %. The SE-integrated CNN followed with 96.25 % accuracy, demonstrating superior feature recalibration. ResNet50 with CBAM attained 93.32 % accuracy. Compared to standard CNN models, these models exhibited reduced overfitting, improved generalization, and enhanced feature extraction. The proposed approach ensures a higher precision rate in detecting pneumonia from X-ray images. The model is designed for real-world clinical applications, particularly in low-resource healthcare settings. A lightweight, user-friendly web application was developed to assist radiologists and general practitioners in automated pneumonia detection, reducing reliance on expert interpretation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100299"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fully automatic content-aware tiling pipeline for pathology whole slide images 全自动内容感知平铺管道病理整个幻灯片图像
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100318
Falah Jabar , Lill-Tove Rasmussen Busund , Biagio Ricciuti , Masoud Tafavvoghi , Thomas K. Kilvaer , David J. Pinato , Mette Pøhl , Sigve Andersen , Tom Donnem , David J. Kwiatkowski , Mehrdad Rakaee
Tiling (or patching) histology Whole Slide Images (WSIs) is a required initial step in the development of deep learning (DL) models. Gigapixel-scale WSIs must be divided into smaller, manageable image tiles. Standard WSI tiling techniques often exclude diagnostically important tissue regions or include regions with artifacts such as folds, blurs, and pen-markings, which can significantly degrade DL model performance and analysis. This paper introduces WSI-SmartTiling, a fully automated, deep learning-based, content-aware WSI tiling pipeline designed to include maximal information content from WSI. A supervised DL model for artifact detection was developed using pixel-based semantic segmentation at high magnification (20× and 40x) to classify WSI regions as either artifacts or qualified tissue. The model was trained on a diverse dataset and validated using both internal and external datasets. Quantitative and qualitative evaluations demonstrated its superiority, outperforming state-of-the-art methods with accuracy, precision, recall, and F1 scores exceeding 95 % across all artifact types, along with Dice scores above 94 %. In addition, WSI-SmartTiling integrates a generative adversarial network model to reconstruct tissue regions obscured by pen-markings in various colors, ensuring relevant valuable areas are preserved. Lastly, while excluding artifacts, the pipeline efficiently tiles qualified tissue regions with minimum tissue loss.
In conclusion, this high-resolution preprocessing pipeline can significantly improve pathology WSI-based feature extraction and DL-based classification by minimizing tissue loss and providing high-quality – artifact-free – tissue tiles. The WSI-SmartTiling pipeline is publicly available on GitHub.
铺贴(或修补)组织学全幻灯片图像(wsi)是开发深度学习(DL)模型所需的第一步。千兆像素级wsi必须划分为更小的、可管理的图像块。标准的WSI平铺技术通常会排除诊断上重要的组织区域,或者包括褶皱、模糊和笔标记等伪影区域,这些区域会显著降低DL模型的性能和分析。本文介绍了WSI- smarttiling,这是一个全自动的、基于深度学习的、内容感知的WSI平铺管道,旨在包含来自WSI的最大信息内容。在高倍率(20倍和40倍)下,使用基于像素的语义分割开发了一个用于伪像检测的监督深度学习模型,将WSI区域分类为伪像或合格组织。该模型在不同的数据集上进行训练,并使用内部和外部数据集进行验证。定量和定性评估证明了它的优越性,在准确性、精密度、召回率方面优于最先进的方法,在所有神器类型中F1得分超过95%,Dice得分超过94%。此外,WSI-SmartTiling集成了生成对抗网络模型,以重建被各种颜色的笔标记遮挡的组织区域,确保保留相关的有价值的区域。最后,在排除伪影的同时,管道有效地以最小的组织损失覆盖合格的组织区域。总之,这种高分辨率的预处理管道可以通过最大限度地减少组织损失和提供高质量的无伪影组织块,显著改善基于病理wsi的特征提取和基于dl的分类。WSI-SmartTiling管道在GitHub上是公开的。
{"title":"Fully automatic content-aware tiling pipeline for pathology whole slide images","authors":"Falah Jabar ,&nbsp;Lill-Tove Rasmussen Busund ,&nbsp;Biagio Ricciuti ,&nbsp;Masoud Tafavvoghi ,&nbsp;Thomas K. Kilvaer ,&nbsp;David J. Pinato ,&nbsp;Mette Pøhl ,&nbsp;Sigve Andersen ,&nbsp;Tom Donnem ,&nbsp;David J. Kwiatkowski ,&nbsp;Mehrdad Rakaee","doi":"10.1016/j.ibmed.2025.100318","DOIUrl":"10.1016/j.ibmed.2025.100318","url":null,"abstract":"<div><div>Tiling (or patching) histology Whole Slide Images (WSIs) is a required initial step in the development of deep learning (DL) models. Gigapixel-scale WSIs must be divided into smaller, manageable image tiles. Standard WSI tiling techniques often exclude diagnostically important tissue regions or include regions with artifacts such as folds, blurs, and pen-markings, which can significantly degrade DL model performance and analysis. This paper introduces WSI-SmartTiling, a fully automated, deep learning-based, content-aware WSI tiling pipeline designed to include maximal information content from WSI. A supervised DL model for artifact detection was developed using pixel-based semantic segmentation at high magnification (20× and 40x) to classify WSI regions as either artifacts or qualified tissue. The model was trained on a diverse dataset and validated using both internal and external datasets. Quantitative and qualitative evaluations demonstrated its superiority, outperforming state-of-the-art methods with accuracy, precision, recall, and F1 scores exceeding 95 % across all artifact types, along with Dice scores above 94 %. In addition, WSI-SmartTiling integrates a generative adversarial network model to reconstruct tissue regions obscured by pen-markings in various colors, ensuring relevant valuable areas are preserved. Lastly, while excluding artifacts, the pipeline efficiently tiles qualified tissue regions with minimum tissue loss.</div><div>In conclusion, this high-resolution preprocessing pipeline can significantly improve pathology WSI-based feature extraction and DL-based classification by minimizing tissue loss and providing high-quality – artifact-free – tissue tiles. The WSI-SmartTiling pipeline is publicly available on <span><span>GitHub</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100318"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145683783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting the prevalence of cardiovascular diseases using machine learning algorithms 使用机器学习算法预测心血管疾病的患病率
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100199
Bernada E. Sianga , Maurice C. Mbago , Amina S. Msengwa
Cardiovascular Diseases (CVDs) are the major cause of morbidity, disability, and mortality worldwide and are the most life-threatening diseases. Early detection and appropriate action can significantly reduce the effects and complications of CVD. Prediction of the likelihood that an individual can develop CVD adverse outcomes is essential. Machine learning methods are used to predict the risk of CVD incidences. Optimal model parameters were obtained using the grid search and randomized search methods. A hyperparameter tuning method with the highest accuracy was used to find the optimal parameters for the six algorithms used in this study. Two experiments were deployed: the first was training and testing the CVD dataset using hyperparameterized ML algorithms excluding geographical features, and the second included geographical features. The geographical features are air humidity, temperature and education status of a location. The performances of the two experiments were compared using classification metrics. The findings revealed that the performance of the second experiment outperformed the first experiment. XGBoost achieved the highest accuracy of 95.24 %, followed by the decision tree 93.87 % and support vector machine 92.87 % when geographical features were included (second experiment). Including geographical risk factors in predicting CVD is crucial as they contribute to the probability of developing CVD incidences.
心血管疾病(cvd)是世界范围内发病、残疾和死亡的主要原因,也是最危及生命的疾病。早期发现和适当的行动可以显著减少心血管疾病的影响和并发症。预测个体发生心血管疾病不良后果的可能性至关重要。机器学习方法用于预测心血管疾病发病率的风险。采用网格搜索和随机搜索方法获得最优模型参数。采用精度最高的超参数整定方法对六种算法进行了参数优化。部署了两个实验:第一个是使用排除地理特征的超参数化ML算法训练和测试CVD数据集,第二个是包含地理特征。地理特征是指一个地点的空气湿度、温度和教育状况。使用分类指标对两个实验的性能进行比较。结果显示,第二个实验的表现优于第一个实验。在包含地理特征时,XGBoost的准确率最高,为95.24%,其次是决策树(93.87%)和支持向量机(92.87%)(第二次实验)。在预测心血管疾病时包括地理危险因素是至关重要的,因为它们有助于心血管疾病发生的概率。
{"title":"Predicting the prevalence of cardiovascular diseases using machine learning algorithms","authors":"Bernada E. Sianga ,&nbsp;Maurice C. Mbago ,&nbsp;Amina S. Msengwa","doi":"10.1016/j.ibmed.2025.100199","DOIUrl":"10.1016/j.ibmed.2025.100199","url":null,"abstract":"<div><div>Cardiovascular Diseases (CVDs) are the major cause of morbidity, disability, and mortality worldwide and are the most life-threatening diseases. Early detection and appropriate action can significantly reduce the effects and complications of CVD. Prediction of the likelihood that an individual can develop CVD adverse outcomes is essential. Machine learning methods are used to predict the risk of CVD incidences. Optimal model parameters were obtained using the grid search and randomized search methods. A hyperparameter tuning method with the highest accuracy was used to find the optimal parameters for the six algorithms used in this study. Two experiments were deployed: the first was training and testing the CVD dataset using hyperparameterized ML algorithms excluding geographical features, and the second included geographical features. The geographical features are air humidity, temperature and education status of a location. The performances of the two experiments were compared using classification metrics. The findings revealed that the performance of the second experiment outperformed the first experiment. XGBoost achieved the highest accuracy of 95.24 %, followed by the decision tree 93.87 % and support vector machine 92.87 % when geographical features were included (second experiment). Including geographical risk factors in predicting CVD is crucial as they contribute to the probability of developing CVD incidences.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100199"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital 基于Mbarara地区转诊医院电子病历数据的新生儿败血症预测算法
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100198
Peace Ezeobi Dennis , Angella Musiimenta , William Wasswa , Stella Kyoyagala

Introduction

Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.

Methods

We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.

Results

The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.

Conclusions

The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.
新生儿败血症是一项全球性挑战,对新生儿发病率和死亡率有重要影响。目前的诊断方法依赖于传统的培养方法,这一过程需要时间,并导致及时做出治疗决定的延误。本研究提出了一种利用Mbarara地区转诊医院(MRRH)电子病历(EMR)数据的机器学习算法,以提高新生儿败血症的早期发现和治疗。方法对2015年10月至2019年9月期间在MRRH新生儿重症监护病房(NICU)住院至少48小时并接受至少一次脓毒症评估的新生儿数据集进行回顾性研究。482例符合纳入标准的新生儿记录,数据集包括38个新生儿败血症筛查参数。该研究考虑了脓毒症评估的两种结果:如果血液培养呈阳性,则培养呈阳性;如果培养呈阴性,但使用抗生素至少120小时,则临床呈阳性。我们实施了k-fold交叉验证,k设置为10,以保证模型的稳健训练和测试。七个机器学习模型被训练来将输入分类为脓毒症阳性或阴性,并将它们的表现与医生的诊断进行比较。结果本研究结果表明,结合产妇危险因素、新生儿临床体征和实验室检查(该算法的灵敏度和特异性至少为95%)提出的算法优于医生诊断(灵敏度= 89%,特异性= 11%)。采用径向基函数、多项式核的SVM模型和AUROC最高达98%的DT模型均优于其他模型。结论结合产妇危险因素、新生儿临床体征和实验室检查,有助于提高对新生儿脓毒症的预测。进一步的研究需要在前瞻性试验中评估潜在的性能改善和临床疗效。
{"title":"A neonatal sepsis prediction algorithm using electronic medical record data from Mbarara Regional Referral Hospital","authors":"Peace Ezeobi Dennis ,&nbsp;Angella Musiimenta ,&nbsp;William Wasswa ,&nbsp;Stella Kyoyagala","doi":"10.1016/j.ibmed.2025.100198","DOIUrl":"10.1016/j.ibmed.2025.100198","url":null,"abstract":"<div><h3>Introduction</h3><div>Neonatal sepsis is a global challenge that contributes significantly to neonatal morbidity and mortality. The current diagnostic methods depend on conventional culture methods, a procedure that takes time and leads to delays in making timely treatment decisions. This study proposes a machine learning algorithm utilizing electronic medical record (EMR) data from Mbarara Regional Referral Hospital (MRRH) to enhance early detection and treatment of neonatal sepsis.</div></div><div><h3>Methods</h3><div>We performed a retrospective study on a dataset of neonates hospitalized for at least 48 h in the Neonatal Intensive Care Unit (NICU) at MRRH between October 2015 to September 2019 who received at least one sepsis evaluation. 482 records of neonates met the inclusion criteria and the dataset comprises 38 neonatal sepsis screening parameters. The study considered two outcomes for sepsis evaluations: culture-positive if a blood culture was positive, and clinically positive if cultures were negative but antibiotics were administered for at least 120 h. We implemented k-fold cross-validation with k set to 10 to guarantee robust training and testing of the models. Seven machine learning models were trained to classify inputs as sepsis positive or negative, and their performance was compared with physician diagnoses.</div></div><div><h3>Results</h3><div>The results of this study show that the proposed algorithm, combining maternal risk factors, neonatal clinical signs, and laboratory tests (the algorithm demonstrated a sensitivity and specificity of at least 95 %) outperformed the physician diagnosis (Sensitivity = 89 %, Specificity = 11 %). SVM model with radial basis function, polynomial kernels, and DT model (with the highest AUROC values of 98 %) performed better than the other models.</div></div><div><h3>Conclusions</h3><div>The study shows that the combination of maternal risk factors, neonatal clinical signs, and laboratory tests can help improve the prediction of neonatal sepsis. Further research is warranted to assess the potential performance improvements and clinical efficacy in a prospective trial.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100198"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based approach to diagnose lung cancer using CT-scan images 基于深度学习的ct扫描图像肺癌诊断方法
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2024.100188
Mohammad Q. Shatnawi, Qusai Abuein, Romesaa Al-Quraan
The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.
本研究的工作重点是使用计算机断层扫描(CT)自动分类和预测肺癌,采用深度学习(DL)策略,特别是增强型卷积神经网络(cnn),以实现快速准确的图像分析。本研究设计并开发了包括ConvNeXtSmall、VGG16、ResNet50、InceptionV3和EfficientNetB0在内的预训练模型,用于肺癌分类。数据集分为4类,包括腺癌图像338张,大细胞癌图像187张,鳞状细胞癌图像260张,正常图像215张。值得注意的是,增强的CNN模型实现了前所未有的100%的测试精度,优于所有其他模型,包括ConvNeXt为87%,VGG16为99%,ResNet50为94.5%,InceptionV3为76.9%,EfficientNetB0为97.9%。这项研究被认为是第一个使用增强型CNN达到100%测试准确率的研究,通过应用复杂的图像增强技术和创新的模型架构,展示了肺癌检测方面的重大进步。这突出了增强CNN模型在改变肺癌诊断方面的潜力,并强调了将先进的图像处理技术整合到临床实践中的重要性。
{"title":"Deep learning-based approach to diagnose lung cancer using CT-scan images","authors":"Mohammad Q. Shatnawi,&nbsp;Qusai Abuein,&nbsp;Romesaa Al-Quraan","doi":"10.1016/j.ibmed.2024.100188","DOIUrl":"10.1016/j.ibmed.2024.100188","url":null,"abstract":"<div><div>The work in this research focuses on the automatic classification and prediction of lung cancer using computed tomography (CT) scans, employing Deep Learning (DL) strategies, specifically Enhanced Convolutional Neural Networks (CNNs), to enable rapid and accurate image analysis. This research designed and developed pre-trained models, including ConvNeXtSmall, VGG16, ResNet50, InceptionV3, and EfficientNetB0, to classify lung cancer. The dataset was divided into four classes, consisting of 338 images of adenocarcinoma, 187 images of large cell carcinoma, 260 images of squamous cell carcinoma, and 215 normal images. Notably, The Enhanced CNN model achieved an unprecedented testing accuracy of 100 %, outperforming all other models, which included ConvNeXt at 87 %, VGG16 at 99 %, ResNet50 at 94.5 %, InceptionV3 at 76.9 %, and EfficientNetB0 at 97.9 %. The study of this research is considered the first one that hits 100 % testing accuracy with an Enhanced CNN, demonstrating significant advancements in lung cancer detection through the application of sophisticated image enhancement techniques and innovative model architectures. This highlights the potential of Enhanced CNN models in transforming lung cancer diagnostics and emphasizes the importance of integrating advanced image processing techniques into clinical practice.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100188"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143174358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting patient no-shows using machine learning: A comprehensive review and future research agenda 利用机器学习预测患者未就诊情况:全面回顾与未来研究议程
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100229
Khaled M. Toffaha , Mecit Can Emre Simsekler , Mohammed Atif Omar , Imad ElKebbi
Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.
The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.
Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.
By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.
病人爽约给医疗保健系统带来了巨大挑战,导致资源浪费、成本增加和护理连续性中断。本综述分析了 2010 年至 2025 年间发表的 52 篇论文,探讨了用于预测门诊患者爽约情况的最先进的机器学习(ML)方法。研究显示,该领域取得了重大进展,在 68% 的研究中,逻辑回归(LR)是最常用的模型。基于树的模型、集合方法和深度学习技术在近几年得到了广泛应用,反映了该领域的发展。表现最好的模型的曲线下面积(AUC)得分在 0.75 到 0.95 之间,准确率在 52% 到 99.44% 之间。在方法上,研究人员利用各种采样技术解决了类不平衡等常见难题,并采用了多种特征选择方法来提高模型效率。该综述还强调了考虑时间因素和不同医疗环境中缺席行为的环境依赖性的重要性。利用 ITPOSMO 框架(信息、技术、流程、目标、人员配备、管理和其他资源),该研究确定了当前 ML 方法中的几个差距。主要挑战包括数据质量和完整性、模型可解释性以及与现有医疗保健系统的集成。未来的研究方向包括改进数据收集方法、纳入组织因素、确保符合道德规范的实施,以及开发处理数据不平衡的标准化方法。该综述还建议探索新的数据源,利用 ML 算法分析患者行为模式,并使用迁移学习技术在不同的医疗机构间调整模型。通过应对这些挑战,医疗机构可以利用 ML 改善资源分配,提高患者护理质量,并推进医疗领域的预测分析。这篇全面的综述强调了人工智能在预测病例缺席方面的潜力,同时也承认了其实际应用中的复杂性和挑战。
{"title":"Predicting patient no-shows using machine learning: A comprehensive review and future research agenda","authors":"Khaled M. Toffaha ,&nbsp;Mecit Can Emre Simsekler ,&nbsp;Mohammed Atif Omar ,&nbsp;Imad ElKebbi","doi":"10.1016/j.ibmed.2025.100229","DOIUrl":"10.1016/j.ibmed.2025.100229","url":null,"abstract":"<div><div>Patient no-shows for scheduled medical appointments pose significant challenges to healthcare systems, resulting in wasted resources, increased costs, and disrupted continuity of care. This comprehensive review examines state-of-the-art machine learning (ML) approaches for predicting patient no-shows in outpatient settings, analyzing 52 publications from 2010 to 2025.</div><div>The study reveals significant advancements in the field, with Logistic Regression (LR) as the most commonly used model in 68% of the studies. Tree-based models, ensemble methods, and deep learning techniques have gained traction in recent years, reflecting the field’s evolution. The best-performing models achieved Area Under the Curve (AUC) scores between 0.75 and 0.95, with accuracy ranging from 52% to 99.44%. Methodologically, researchers addressed common challenges such as class imbalance using various sampling techniques and employed a wide range of feature selection methods to improve model efficiency. The review also highlighted the importance of considering temporal factors and the context-dependent nature of no-show behavior across different healthcare settings.</div><div>Using the ITPOSMO framework (Information, Technology, Processes, Objectives, Staffing, Management, and Other Resources), the study identified several gaps in current ML approaches. Key challenges include data quality and completeness, model interpretability, and integration with existing healthcare systems. Future research directions include improving data collection methods, incorporating organizational factors, ensuring ethical implementation, and developing standardized approaches for handling data imbalance. The review also suggests exploring new data sources, utilizing ML algorithms to analyze patient behavior patterns, and using transfer learning techniques to adapt models across different healthcare facilities.</div><div>By addressing these challenges, healthcare providers can leverage ML to improve resource allocation, enhance patient care quality, and advance predictive analytics in healthcare. This comprehensive review underscores the potential of ML in predicting no-shows while acknowledging the complexities and challenges in its practical implementation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100229"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143529776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis 基于ct的卵巢肿瘤可靠诊断的混合视觉变换器和异常模型
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100227
Eman Hussein Alshdaifat , Hasan Gharaibeh , Amer Mahmoud Sindiani , Rola Madain , Asma'a Mohammad Al-Mnayyis , Hamad Yahia Abu Mhanna , Rawan Eimad Almahmoud , Hanan Fawaz Akhdar , Mohammad Amin , Ahmad Nasayreh , Raneem Hamad
Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.
卵巢癌是一个主要的全球健康问题,其特点是死亡率高,缺乏准确的诊断方法。快速准确地检测卵巢癌对于改善患者预后和制定适当的治疗方案至关重要。医学影像学方法是鉴别卵巢癌的必要手段;然而,实现准确的诊断仍然是一个挑战。本文提出了一种强大的卵巢癌检测方法,包括使用Xception_ViT模型对良性和恶性肿瘤进行识别和分类。之所以选择这种混合方法,是因为它结合了传统的基于cnn的模型(如Xception)的优势和现代基于transformer的模型(如ViT)的能力。这种组合允许模型利用Xception,它从图像中提取特征。然后使用视觉转换器(Vision Transformer, ViT)模型来识别不同视觉元素之间的联系,增强模型对复杂组件的理解。最后将多层感知器(MLP)层与所提出的图像分类模型相结合。使用约旦阿卜杜拉国王大学医院(KAUH)的三个计算机断层扫描(CT)图像数据集评估该模型的有效性。第一个数据集包括卵巢癌计算机断层扫描数据集(KAUH-OCCTD),第二个数据集是良性卵巢肿瘤数据集(KAUH-BOTD),第三个数据集是恶性卵巢肿瘤数据集(KAUH-MOTD)。从500名妇女中收集的三个数据集以其卵巢肿瘤分类的多样性为特征,是约旦首次收集此类数据集。所提出的Xception_ViT模型在KAUH-OCCTD数据集上识别卵巢癌的准确率为98.09%,在KAUH-BOTD和KAUH-MOTD数据集上区分卵巢良恶性肿瘤的准确率分别为96.05%和98.73%。提出的模型在所有三个数据集上都优于预训练模型。结果表明,该模型能够对卵巢肿瘤进行分类。该方法还可以大大提高新手放射科医生评估卵巢恶性肿瘤的效率,并协助妇科医生为这些个体提供改进的治疗方案。
{"title":"Hybrid vision transformer and Xception model for reliable CT-based ovarian neoplasms diagnosis","authors":"Eman Hussein Alshdaifat ,&nbsp;Hasan Gharaibeh ,&nbsp;Amer Mahmoud Sindiani ,&nbsp;Rola Madain ,&nbsp;Asma'a Mohammad Al-Mnayyis ,&nbsp;Hamad Yahia Abu Mhanna ,&nbsp;Rawan Eimad Almahmoud ,&nbsp;Hanan Fawaz Akhdar ,&nbsp;Mohammad Amin ,&nbsp;Ahmad Nasayreh ,&nbsp;Raneem Hamad","doi":"10.1016/j.ibmed.2025.100227","DOIUrl":"10.1016/j.ibmed.2025.100227","url":null,"abstract":"<div><div>Ovarian cancer is a major global health concern, characterized by high mortality rates and a lack of accurate diagnostic methods. Rapid and accurate detection of ovarian cancer is essential to improve patient outcomes and formulate appropriate treatment protocols. Medical imaging methods are essential for identifying ovarian cancer; however, achieving accurate diagnosis remains a challenge. This paper presents a robust methodology for ovarian cancer detection, including the identification and classification of benign and malignant tumors, using the Xception_ViT model. This hybrid approach was chosen because it combines the advantages of traditional CNN-based models (such as Xception) with the capabilities of modern Transformers-based models (such as ViT). This combination allows the model to take advantage of Xception, which extracts features from images. The Vision Transformer (ViT) model is then used to identify connections between diverse visual elements, enhancing the model's understanding of complex components. A Multi-Layer Perceptron (MLP) layer is finally integrated with the proposed model for image classification. The effectiveness of the model is evaluated using three computed tomography (CT) image datasets from King Abdullah University Hospital (KAUH) in Jordan. The first dataset consists of the ovarian cancer computed tomography dataset (KAUH-OCCTD), the second is the benign ovarian tumors dataset (KAUH-BOTD), and the third is the malignant ovarian tumors dataset (KAUH-MOTD). The three datasets collected from 500 women are characterized by their diversity in ovarian tumor classification and are the first of their kind collected in Jordan. The proposed model Xception_ViT achieved an accuracy of 98.09 % in identifying ovarian cancer on the KAUH-OCCTD dataset, and an accuracy of 96.05 % and 98.73 % on the KAUH-BOTD and KAUH-MOTD datasets, respectively, in distinguishing between benign and malignant ovarian tumors. The proposed model outperformed the pre-trained models on all three datasets. The results demonstrate that the proposed model can classify ovarian tumors. This method could also greatly enhance the efficiency of novice radiologists in evaluating ovarian malignancies and assist gynecologists in providing improved treatment alternatives for these individuals.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100227"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Features and eigenspectral densities analyses for machine learning and classification of severities in chronic obstructive pulmonary diseases 慢性阻塞性肺疾病中机器学习和严重程度分类的特征和特征谱密度分析
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100217
Timothy Albiges, Zoheir Sabeur, Banafshe Arbab-Zavar
Chronic Obstructive Pulmonary Disease (COPD) has been presenting highly significant global health challenges for many decades. Equally, it is important to slow down this disease's ever-increasingly challenging impact on hospital patient loads. It has become necessary, if not critical, to capitalise on existing knowledge of advanced artificial intelligence to achieve the early detection of COPD and advance personalised care of COPD patients from their homes. The use of machine learning and reaching out on the classification of the multiple types of COPD severities effectively and at progressively acceptable levels of confidence is of paramount importance. Indeed, this capability will feed into highly effective personalised care of COPD patients from their homes while significantly improving their quality of life.
Auscultation lung sound analysis has emerged as a valuable, non-invasive, and cost-effective remote diagnostic tool of the future for respiratory conditions such as COPD. This research paper introduces a novel machine learning-based approach for classifying multiple COPD severities through the analysis of lung sound data streams. Leveraging two open datasets with diverse acoustic characteristics and clinical manifestations, the research study involves the transformation and decomposition of lung sound data matrices into their eigenspace representation in order to capture key features for machine learning and detection. Early eigenvalue spectra analyses were also performed to discover their distinct manifestations under the multiple established COPD severities. This has led us into projecting our experimental data matrices into their eigenspace with the use of the manifested data features prior to the machine learning process. This was followed by various methods of machine classification of COPD severities successfully. Support Vector Classifiers, Logistic Regression, Random Forests and Naive Bayes Classifiers were deployed. Systematic classifier performance metrics were also adopted; they showed early promising classification accuracies beyond 75 % for distinguishing COPD severities.
This research benchmark contributes to computer-aided medical diagnosis and supports the integration of auscultation lung sound analyses into COPD assessment protocols for individualised patient care and treatment. Future work involves the acquisition of larger volumes of lung sound data while also exploring multi-modal sensing of COPD patients for heterogeneous data fusion to advance COPD severity classification performance.
几十年来,慢性阻塞性肺疾病(COPD)一直是全球健康面临的重大挑战。同样,减缓这种疾病对医院病人负荷的日益严峻的影响也很重要。利用现有的先进人工智能知识来实现COPD的早期发现,并在家中推进COPD患者的个性化护理,即使不是至关重要,也是必要的。机器学习的使用和对多种类型COPD严重程度的有效分类以及逐步可接受的置信度水平的接触是至关重要的。事实上,这种能力将有助于在家中为COPD患者提供高效的个性化护理,同时显著改善他们的生活质量。听诊肺音分析已成为一种有价值的、无创的、具有成本效益的远程诊断工具,用于未来的呼吸系统疾病,如慢性阻塞性肺病。本文介绍了一种新的基于机器学习的方法,通过分析肺声数据流来分类多种COPD严重程度。利用两个具有不同声学特征和临床表现的开放数据集,该研究涉及将肺音数据矩阵转换和分解为其特征空间表示,以捕获用于机器学习和检测的关键特征。还进行了早期特征值谱分析,以发现其在多个已确定的COPD严重程度下的不同表现。这导致我们在机器学习过程之前使用已显示的数据特征将实验数据矩阵投影到它们的特征空间中。随后,各种COPD严重程度的机器分类方法都取得了成功。使用了支持向量分类器、逻辑回归、随机森林和朴素贝叶斯分类器。采用系统分类器性能指标;在区分COPD严重程度方面,他们显示出早期有希望的分类准确率超过75%。该研究基准有助于计算机辅助医疗诊断,并支持将听诊肺音分析整合到COPD评估方案中,以实现个体化患者护理和治疗。未来的工作包括获取更大量的肺声数据,同时探索COPD患者的多模式感知,以进行异构数据融合,以提高COPD严重程度分类性能。
{"title":"Features and eigenspectral densities analyses for machine learning and classification of severities in chronic obstructive pulmonary diseases","authors":"Timothy Albiges,&nbsp;Zoheir Sabeur,&nbsp;Banafshe Arbab-Zavar","doi":"10.1016/j.ibmed.2025.100217","DOIUrl":"10.1016/j.ibmed.2025.100217","url":null,"abstract":"<div><div>Chronic Obstructive Pulmonary Disease (COPD) has been presenting highly significant global health challenges for many decades. Equally, it is important to slow down this disease's ever-increasingly challenging impact on hospital patient loads. It has become necessary, if not critical, to capitalise on existing knowledge of advanced artificial intelligence to achieve the early detection of COPD and advance personalised care of COPD patients from their homes. The use of machine learning and reaching out on the classification of the multiple types of COPD severities effectively and at progressively acceptable levels of confidence is of paramount importance. Indeed, this capability will feed into highly effective personalised care of COPD patients from their homes while significantly improving their quality of life.</div><div>Auscultation lung sound analysis has emerged as a valuable, non-invasive, and cost-effective remote diagnostic tool of the future for respiratory conditions such as COPD. This research paper introduces a novel machine learning-based approach for classifying multiple COPD severities through the analysis of lung sound data streams. Leveraging two open datasets with diverse acoustic characteristics and clinical manifestations, the research study involves the transformation and decomposition of lung sound data matrices into their eigenspace representation in order to capture key features for machine learning and detection. Early eigenvalue spectra analyses were also performed to discover their distinct manifestations under the multiple established COPD severities. This has led us into projecting our experimental data matrices into their eigenspace with the use of the manifested data features prior to the machine learning process. This was followed by various methods of machine classification of COPD severities successfully. Support Vector Classifiers, Logistic Regression, Random Forests and Naive Bayes Classifiers were deployed. Systematic classifier performance metrics were also adopted; they showed early promising classification accuracies beyond 75 % for distinguishing COPD severities.</div><div>This research benchmark contributes to computer-aided medical diagnosis and supports the integration of auscultation lung sound analyses into COPD assessment protocols for individualised patient care and treatment. Future work involves the acquisition of larger volumes of lung sound data while also exploring multi-modal sensing of COPD patients for heterogeneous data fusion to advance COPD severity classification performance.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100217"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143419554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes 使用pca增强的XGBoost和SMOTE-ENN预测孕产妇健康风险,以改善医疗保健结果
Pub Date : 2025-01-01 DOI: 10.1016/j.ibmed.2025.100300
Rahmatul Kabir Rasel Sarker , Sadman Hafij , Md Adib Yasir , Md Assaduzzaman , Md Monir Hossain Shimul , Md Kamrul Hossain

Background

Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.

Methods

The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.

Results

The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.

Conclusion

This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.
产妇保健仍然是全球优先事项,特别是在资源匮乏的环境中,及时识别风险至关重要。传统的机器学习模型通常存在泛化能力差、数据不平衡和计算效率低下的问题。本研究提出了一个增强的预测模型,结合SMOTE-ENN数据平衡和主成分分析(PCA)与XGBoost,利用最小的、易于收集的临床特征来提高孕产妇风险分类的准确性。方法从公共信息库中获取1014份孕产妇健康记录,包括7项生理特征。预处理包括使用SMOTE-ENN进行标准化、标签编码和类平衡。采用主成分分析法进行降维,提高计算性能,减少过拟合。对决策树、随机森林、LightGBM、梯度增强和支持向量机等几种机器学习分类器进行了评估,最终选择XGBoost作为最终模型。性能指标包括准确性、精密度、召回率、f1评分、ROC-AUC和10倍交叉验证。结果pca增强的XGBoost模型具有最高的准确率(97.73%)、精密度(98%)、召回率(98%)和f1评分(98%)。它优于所有其他模型,特别是在识别高风险病例时,以最小的假阴性。交叉验证证实了模型的稳健性(平均准确率为98.39%),所有类别的ROC-AUC得分均超过0.998,表明分类性能接近完美。结论:本研究验证了一种产妇健康风险预测模型,该模型可扩展用于资源受限环境,并可在所选降维方法的限制下解释。它的简单,高精度和可推广性使其成为早期临床决策和干预的有前途的工具。
{"title":"Predicting maternal health risk using PCA-enhanced XGBoost and SMOTE-ENN for improved healthcare outcomes","authors":"Rahmatul Kabir Rasel Sarker ,&nbsp;Sadman Hafij ,&nbsp;Md Adib Yasir ,&nbsp;Md Assaduzzaman ,&nbsp;Md Monir Hossain Shimul ,&nbsp;Md Kamrul Hossain","doi":"10.1016/j.ibmed.2025.100300","DOIUrl":"10.1016/j.ibmed.2025.100300","url":null,"abstract":"<div><h3>Background</h3><div>Maternal health remains a global priority, especially in low-resource settings where timely risk identification is critical. Traditional machine learning models often suffer from poor generalizability, data imbalance, and computational inefficiencies. This study proposes an enhanced predictive model combining SMOTE-ENN data balancing and Principal Component Analysis (PCA) with XGBoost to improve maternal risk classification accuracy using minimal, easily collectible clinical features.</div></div><div><h3>Methods</h3><div>The dataset of 1014 maternal health records comprising seven physiological features was sourced from a public repository. Preprocessing involved standardization, label encoding, and class balancing using SMOTE-ENN. PCA was applied for dimensionality reduction to enhance computational performance and reduce overfitting. Several machine learning classifiers including Decision Tree, Random Forest, LightGBM, Gradient Boosting, and SVM were evaluated, with XGBoost selected as the final model. Performance metrics included accuracy, precision, recall, F1-score, ROC-AUC, and 10-fold cross-validation.</div></div><div><h3>Results</h3><div>The PCA-enhanced XGBoost model achieved the highest accuracy (97.73 %), precision (98 %), recall (98 %), and F1-score (98 %). It outperformed all other models, particularly in identifying high-risk cases with minimal false negatives. Cross-validation confirmed the model's robustness (mean accuracy: 98.39 %), and ROC-AUC scores exceeded 0.998 for all classes, indicating near-perfect classification performance.</div></div><div><h3>Conclusion</h3><div>This study validates a maternal health risk prediction model that is scalable for use in resource-constrained environments and interpretable within the limitations of the selected dimensionality-reduction approach. Its simplicity, high accuracy, and generalizability make it a promising tool for early clinical decision-making and intervention.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligence-based medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1