Pub Date : 2025-11-11DOI: 10.1186/s12911-025-03227-w
Yuyao Feng, Leyin Xu, Jiang Shao, Lin Wang, Huanyu Dai, Chaonan Wang, Kang Li, Keqiang Shu, Junye Chen, Yuru Wang, Yiyun Xie, Zhichao Lai, Bao Liu
Background: Atherosclerosis in the carotid artery significantly contributes to embolic events leading to ischemic stroke. Precise identification of unstable carotid plaques through non-invasive imaging, is pivotal for stroke prevention. Artificial intelligence (AI) has demonstrated promise in enhancing the accuracy of plaque risk stratification. This review aims to assess the diagnostic performance of AI algorithms in distinguishing unstable carotid plaques from stable plaques using medical imaging.
Methods: We conducted comprehensive searches in Medline, Embase, Web of Science, IEEE, PubMed, and the Cochrane Library up to June 6, 2023. Eligible studies included those that utilized AI algorithms for identifying unstable carotid plaques from medical images. Binary diagnostic accuracy metrics, including sensitivity, specificity, and Area Under the Curve (AUC), were extracted. QUADAS-AI was used to assess risk of bias of the included studies.
Results: Among the 31 studies reviewed, 14 were subjected to meta-analysis, revealing a pooled sensitivity of 91% (95%CI: 86 - 95%), specificity of 84% (79 - 89%), and AUC of 0.94 (0.91 - 0.95). However, only one study reported external validation, limiting the generalizability of these findings, and substantial heterogeneity was observed (I2 > 90%). Subgroup analyses indicated performance variations based on factors such as sample size, type of AI algorithms (machine learning or deep learning), segmentation methods (manual or automatic), and publication year. Despite observed publication bias and study heterogeneity, the findings underscore the promise of AI-driven approaches in carotid plaque risk stratification.
Conclusions: AI algorithms demonstrated favorable diagnostic performance in identifying unstable carotid plaques. Future research should focus on rigorous validation, ensuring generalizability, and enhancing the explainability of AI algorithms to facilitate their translational use.
Clinical trial number: Not applicable.
背景:颈动脉粥样硬化明显有助于栓塞事件导致缺血性卒中。通过无创成像精确识别不稳定的颈动脉斑块,是预防脑卒中的关键。人工智能(AI)在提高斑块风险分层的准确性方面表现出了希望。本综述旨在评估人工智能算法在区分不稳定颈动脉斑块和稳定斑块方面的诊断性能。方法:我们在Medline, Embase, Web of Science, IEEE, PubMed和Cochrane Library中进行了截至2023年6月6日的综合检索。符合条件的研究包括那些利用人工智能算法从医学图像中识别不稳定颈动脉斑块的研究。提取二元诊断准确性指标,包括敏感性、特异性和曲线下面积(AUC)。采用QUADAS-AI评估纳入研究的偏倚风险。结果:在回顾的31项研究中,14项进行了荟萃分析,结果显示合并敏感性为91% (95% ci: 86 - 95%),特异性为84% (79 - 89%),AUC为0.94(0.91 - 0.95)。然而,只有一项研究报告了外部验证,限制了这些发现的普遍性,并且观察到大量的异质性(I2 bb0 90%)。子组分析表明,性能变化基于样本量、人工智能算法类型(机器学习或深度学习)、分割方法(手动或自动)和出版年份等因素。尽管观察到发表偏倚和研究异质性,研究结果强调了人工智能驱动方法在颈动脉斑块风险分层中的应用前景。结论:人工智能算法在识别不稳定颈动脉斑块方面表现出良好的诊断性能。未来的研究应侧重于严格的验证,确保通用性,并增强人工智能算法的可解释性,以促进其翻译使用。临床试验号:不适用。
{"title":"Artificial intelligence diagnostic performance in image-based vulnerable carotid plaque detection: a systematic review and meta-analysis.","authors":"Yuyao Feng, Leyin Xu, Jiang Shao, Lin Wang, Huanyu Dai, Chaonan Wang, Kang Li, Keqiang Shu, Junye Chen, Yuru Wang, Yiyun Xie, Zhichao Lai, Bao Liu","doi":"10.1186/s12911-025-03227-w","DOIUrl":"10.1186/s12911-025-03227-w","url":null,"abstract":"<p><strong>Background: </strong>Atherosclerosis in the carotid artery significantly contributes to embolic events leading to ischemic stroke. Precise identification of unstable carotid plaques through non-invasive imaging, is pivotal for stroke prevention. Artificial intelligence (AI) has demonstrated promise in enhancing the accuracy of plaque risk stratification. This review aims to assess the diagnostic performance of AI algorithms in distinguishing unstable carotid plaques from stable plaques using medical imaging.</p><p><strong>Methods: </strong>We conducted comprehensive searches in Medline, Embase, Web of Science, IEEE, PubMed, and the Cochrane Library up to June 6, 2023. Eligible studies included those that utilized AI algorithms for identifying unstable carotid plaques from medical images. Binary diagnostic accuracy metrics, including sensitivity, specificity, and Area Under the Curve (AUC), were extracted. QUADAS-AI was used to assess risk of bias of the included studies.</p><p><strong>Results: </strong>Among the 31 studies reviewed, 14 were subjected to meta-analysis, revealing a pooled sensitivity of 91% (95%CI: 86 - 95%), specificity of 84% (79 - 89%), and AUC of 0.94 (0.91 - 0.95). However, only one study reported external validation, limiting the generalizability of these findings, and substantial heterogeneity was observed (I<sup>2</sup> > 90%). Subgroup analyses indicated performance variations based on factors such as sample size, type of AI algorithms (machine learning or deep learning), segmentation methods (manual or automatic), and publication year. Despite observed publication bias and study heterogeneity, the findings underscore the promise of AI-driven approaches in carotid plaque risk stratification.</p><p><strong>Conclusions: </strong>AI algorithms demonstrated favorable diagnostic performance in identifying unstable carotid plaques. Future research should focus on rigorous validation, ensuring generalizability, and enhancing the explainability of AI algorithms to facilitate their translational use.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"419"},"PeriodicalIF":3.8,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12607216/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145494741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Accurate and rapid assessment of fluid status of maintenance hemodialysis (MHD) patients and maintaining fluid balance is essential to ensure the quality of dialysis treatment. Currently, clinical methods for assessing ultrafiltration volume are still insufficient, and reliable tools that are more accurate and rapid are needed. The objective of this study was to construct a model for predicting ultrafiltration volume (UF) in MHD patients based on artificial neural network (ANN) algorithms, to validate and evaluate this model, and to investigate the impact of body composition prior to dialysis on UF in MHD patients.
Methods: A total of 319 patients undergoing MHD treatment at our center were enrolled. Basic demographic and clinical characteristics were collected and evaluated using the hemodialysis information system. Body composition was measured on ≥ 3 separate days before dialysis treatment using an Inbody bioimpedance instrument. The target ultrafiltration volume was determined by nephrologists based on the integration of body composition measurements and clinical characteristics, yielding a dataset of 1,205 entries. Heat maps were used to demonstrate the correlation between body composition and UF in MHD patients, and LASSO regression and multifactorial linear regression were used to screen the relevant indicator factors for final inclusion in the model, and Backpropagation Neural Network model (BPNN) was developed using the MATLAB (R2022a) neural network toolbox to establish the projected relationship between UF and pre-dialysis body composition. The effectiveness of the model was assessed based on the coefficient of determination (R2) and root mean square error (RMSE) of the calculated regression.
Results: The artificial neural network model demonstrated an optimal predictive performance metric of R2 = 0.965 for forecasting ultrafiltration volume in MHD patients. With an average difference of 0.182 L between observed and predicted values, and highlighted the significant influence of certain body composition indicators on UF in MHD patients.
Conclusion: This study effectively demonstrates the predictive role of an artificial neural network model based on pre-dialysis body composition information in estimating ultrafiltration providing a valuable predictive tool to optimize assessment volume for MHD patients, of ultrafiltration volume in MHD patients.
{"title":"Prediction of ultrafiltration volume in maintenance hemodialysis patients using an artificial neural network model based on body composition information.","authors":"Jiaoyan Chen, Jurong Yang, Xianqiong Lu, Jingrong Peng, Liangji He, Wei Tan, Qing Yu, Yunyan Wang","doi":"10.1186/s12911-025-03248-5","DOIUrl":"10.1186/s12911-025-03248-5","url":null,"abstract":"<p><strong>Background: </strong>Accurate and rapid assessment of fluid status of maintenance hemodialysis (MHD) patients and maintaining fluid balance is essential to ensure the quality of dialysis treatment. Currently, clinical methods for assessing ultrafiltration volume are still insufficient, and reliable tools that are more accurate and rapid are needed. The objective of this study was to construct a model for predicting ultrafiltration volume (UF) in MHD patients based on artificial neural network (ANN) algorithms, to validate and evaluate this model, and to investigate the impact of body composition prior to dialysis on UF in MHD patients.</p><p><strong>Methods: </strong>A total of 319 patients undergoing MHD treatment at our center were enrolled. Basic demographic and clinical characteristics were collected and evaluated using the hemodialysis information system. Body composition was measured on ≥ 3 separate days before dialysis treatment using an Inbody bioimpedance instrument. The target ultrafiltration volume was determined by nephrologists based on the integration of body composition measurements and clinical characteristics, yielding a dataset of 1,205 entries. Heat maps were used to demonstrate the correlation between body composition and UF in MHD patients, and LASSO regression and multifactorial linear regression were used to screen the relevant indicator factors for final inclusion in the model, and Backpropagation Neural Network model (BPNN) was developed using the MATLAB (R2022a) neural network toolbox to establish the projected relationship between UF and pre-dialysis body composition. The effectiveness of the model was assessed based on the coefficient of determination (R<sup>2</sup>) and root mean square error (RMSE) of the calculated regression.</p><p><strong>Results: </strong>The artificial neural network model demonstrated an optimal predictive performance metric of R<sup>2</sup> = 0.965 for forecasting ultrafiltration volume in MHD patients. With an average difference of 0.182 L between observed and predicted values, and highlighted the significant influence of certain body composition indicators on UF in MHD patients.</p><p><strong>Conclusion: </strong>This study effectively demonstrates the predictive role of an artificial neural network model based on pre-dialysis body composition information in estimating ultrafiltration providing a valuable predictive tool to optimize assessment volume for MHD patients, of ultrafiltration volume in MHD patients.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"417"},"PeriodicalIF":3.8,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12606949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145494747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting the risk of preterm birth with machine learning and electronic health records in China.","authors":"Lushuai Qian, Hanyue Jia, Zhou Chang, Yanjun Hu, Chunling Chen, Xiaoqing Li, Hongping Zhang","doi":"10.1186/s12911-025-03254-7","DOIUrl":"10.1186/s12911-025-03254-7","url":null,"abstract":"","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"415"},"PeriodicalIF":3.8,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12604261/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145488107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Acute kidney injury (AKI) has been confirmed to be related to the prognosis of aSAH patients. Evaluating the risk of AKI in the early stage is important to avoid the unfavorable outcome of aSAH patients. However, no study has explored the predictive value of machine learning algorithms for AKI after aSAH. This study was designed to develop a machine learning algorithm-based predictive model for AKI among aSAH patients.
Methods: The outcome of this study was the AKI confirmed using the KDIGO criteria. The predictive value of seven machine learning algorithms for the AKI among aSAH patients was explored and verified using the 5-fold cross-validation. The predictive efficiency of machine learning algorithms-based predictive models was evaluated by the area under the receiver operating characteristics curve (AUC). The Shapley Additive explanation method was performed to visualize the importance of features incorporated in machine learning algorithms-based predictive models.
Results: 711 aSAH patients were enrolled with an AKI incidence of 7.7%. The AKI group had higher WFNS (p = 0.011), Hunt Hess (p = 0.006), and lower Glasgow Coma Scale (GCS) (p = 0.004). The multiple aneurysm was more frequently observed in the AKI group (p = 0.027). The AKI group had longer length of ICU stay (p < 0.001), length of hospital stay (p < 0.001), and higher mortality (p < 0.001). Three algorithms performed well in predicting the AKI in the training dataset including the random forest (AUC = 1.000), AdaBoost (AUC = 0.954), and XGBoost (AUC = 0.947). The random forest performed the best in the validation dataset with an AUC of 0.724. The top ten features in the random forest algorithm were GCS, mean blood pressure, initial serum creatinine, cystatin C level, albumin, neutrophil, lactate dehydrogenase, glucose, white blood cell, and sodium.
Conclusions: The random forest model demonstrated superior performance in predicting AKI in aSAH patients, achieving a high AUC value, predictive accuracy, and remarkable stability. This model could help clinicians evaluate the risk of AKI in the early stage and guide therapeutic options among aSAH patients.
{"title":"A machine learning predictive model for acute kidney injury among aneurysmal subarachnoid hemorrhage patients.","authors":"Ruoran Wang, Lingzhu Qian, Yunhui Zeng, Linrui Cai, Min He, Jianguo Xu, Yu Zhang","doi":"10.1186/s12911-025-03156-8","DOIUrl":"10.1186/s12911-025-03156-8","url":null,"abstract":"<p><strong>Background: </strong>Acute kidney injury (AKI) has been confirmed to be related to the prognosis of aSAH patients. Evaluating the risk of AKI in the early stage is important to avoid the unfavorable outcome of aSAH patients. However, no study has explored the predictive value of machine learning algorithms for AKI after aSAH. This study was designed to develop a machine learning algorithm-based predictive model for AKI among aSAH patients.</p><p><strong>Methods: </strong>The outcome of this study was the AKI confirmed using the KDIGO criteria. The predictive value of seven machine learning algorithms for the AKI among aSAH patients was explored and verified using the 5-fold cross-validation. The predictive efficiency of machine learning algorithms-based predictive models was evaluated by the area under the receiver operating characteristics curve (AUC). The Shapley Additive explanation method was performed to visualize the importance of features incorporated in machine learning algorithms-based predictive models.</p><p><strong>Results: </strong>711 aSAH patients were enrolled with an AKI incidence of 7.7%. The AKI group had higher WFNS (p = 0.011), Hunt Hess (p = 0.006), and lower Glasgow Coma Scale (GCS) (p = 0.004). The multiple aneurysm was more frequently observed in the AKI group (p = 0.027). The AKI group had longer length of ICU stay (p < 0.001), length of hospital stay (p < 0.001), and higher mortality (p < 0.001). Three algorithms performed well in predicting the AKI in the training dataset including the random forest (AUC = 1.000), AdaBoost (AUC = 0.954), and XGBoost (AUC = 0.947). The random forest performed the best in the validation dataset with an AUC of 0.724. The top ten features in the random forest algorithm were GCS, mean blood pressure, initial serum creatinine, cystatin C level, albumin, neutrophil, lactate dehydrogenase, glucose, white blood cell, and sodium.</p><p><strong>Conclusions: </strong>The random forest model demonstrated superior performance in predicting AKI in aSAH patients, achieving a high AUC value, predictive accuracy, and remarkable stability. This model could help clinicians evaluate the risk of AKI in the early stage and guide therapeutic options among aSAH patients.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"416"},"PeriodicalIF":3.8,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12604341/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145488110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-06DOI: 10.1186/s12911-025-03246-7
Le Han, Ying Liu, Peng Xian, Xiao Liu, Kai Cao, Li Ren, Yue Chang, Zhangfang Ma, Lei Tian, Shijing Deng, Xuejiao Liu, Yunshuang Liu, Ying Jie
Background: Adolescent depression is a major public health concern. Physical health indicators are rarely included in risk tools. We examined whether adding dry eye disease (DED) to psychosocial and behavioral factors improves prediction of depressive symptoms.
Methods: We analyzed 2,076 adolescent questionnaires (94.5% response) covering ocular health, sleep, electronic device use, social support, and demographics. Five machine-learning classifiers were trained with cross-validation and evaluated for discrimination and calibration.
Results: Models that included DED achieved strong discrimination (AUC ≈ 0.84) and good calibration, with highest accuracy for no and severe depression and lower performance for mild/moderate categories.
Conclusions: Integrating ocular health with psychosocial factors enhances machine-learning prediction of adolescent depression and may support earlier, school-based identification and referral. Given the low-cost, questionnaire-based inputs and favorable calibration, this approach shows promise for population screening and targeted prevention, pending external validation and prospective testing.
{"title":"From dry eye to depression: a machine learning-based framework for predicting adolescent mental health.","authors":"Le Han, Ying Liu, Peng Xian, Xiao Liu, Kai Cao, Li Ren, Yue Chang, Zhangfang Ma, Lei Tian, Shijing Deng, Xuejiao Liu, Yunshuang Liu, Ying Jie","doi":"10.1186/s12911-025-03246-7","DOIUrl":"10.1186/s12911-025-03246-7","url":null,"abstract":"<p><strong>Background: </strong>Adolescent depression is a major public health concern. Physical health indicators are rarely included in risk tools. We examined whether adding dry eye disease (DED) to psychosocial and behavioral factors improves prediction of depressive symptoms.</p><p><strong>Methods: </strong>We analyzed 2,076 adolescent questionnaires (94.5% response) covering ocular health, sleep, electronic device use, social support, and demographics. Five machine-learning classifiers were trained with cross-validation and evaluated for discrimination and calibration.</p><p><strong>Results: </strong>Models that included DED achieved strong discrimination (AUC ≈ 0.84) and good calibration, with highest accuracy for no and severe depression and lower performance for mild/moderate categories.</p><p><strong>Conclusions: </strong>Integrating ocular health with psychosocial factors enhances machine-learning prediction of adolescent depression and may support earlier, school-based identification and referral. Given the low-cost, questionnaire-based inputs and favorable calibration, this approach shows promise for population screening and targeted prevention, pending external validation and prospective testing.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"412"},"PeriodicalIF":3.8,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12593886/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145457612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-06DOI: 10.1186/s12911-025-03223-0
Qiuxiang Zheng, Fobao Lai, Zhiyong Chen
Background: Making precise treatment decisions in esophageal cancer is essential for enhancing patient outcomes and avoiding overtreatment. Traditional approaches relying on special features or shallow learning models often fail to capture the complex, multi-scale patterns embedded in PET/CT imaging data. Recent advances in deep learning provide an opportunity to build more robust, data-driven systems for predictive modeling in oncology.
Methods: We propose a novel deep learning model that integrates convolutional and transformer-based components based on PET/CT data to support treatment decisions for esophageal cancer. The architecture introduces a Convolutional Feature Extractor with split-based residual blocks for efficient local feature capture, a Multi-scale Pooling module for spatial context aggregation, and an Multilayer Perceptron block for predicting. The model was evaluated using several performance metrics such as AUCROC, F1 score, Balanced Accuracy and benchmarked against state-of-the-art convolutional and transformer backbones such as ConvNeXt and Vision Transformer.
Results: The proposed model achieved superior performance across all evaluation metrics, including an AUCROC of 0.9935 and a Balanced Accuracy of 0.9630, outperforming existing models. These results validate the effectiveness of combining local-global representation learning through custom-designed modules. In addition, we conducted ablation studies to further demonstrate the individual contributions and effectiveness of each component within the proposed architecture. By systematically removing or replacing specific modules such as the Convolutional Feature Extractor and Multi-scale Pooling, we observed consistent performance degradation, which highlights the necessity and complementary roles of these components in achieving optimal predictive accuracy.
Conclusions: This study presents a novel hybrid deep learning architecture that enhances treatment decision support for esophageal cancer by leveraging multi-scale spatial encoding. The empirical evidence demonstrates that tailored architectural innovations significantly improve predictive accuracy over existing methods.
{"title":"Treatment decision support for esophageal cancer based on PET/CT data using deep learning.","authors":"Qiuxiang Zheng, Fobao Lai, Zhiyong Chen","doi":"10.1186/s12911-025-03223-0","DOIUrl":"10.1186/s12911-025-03223-0","url":null,"abstract":"<p><strong>Background: </strong>Making precise treatment decisions in esophageal cancer is essential for enhancing patient outcomes and avoiding overtreatment. Traditional approaches relying on special features or shallow learning models often fail to capture the complex, multi-scale patterns embedded in PET/CT imaging data. Recent advances in deep learning provide an opportunity to build more robust, data-driven systems for predictive modeling in oncology.</p><p><strong>Methods: </strong>We propose a novel deep learning model that integrates convolutional and transformer-based components based on PET/CT data to support treatment decisions for esophageal cancer. The architecture introduces a Convolutional Feature Extractor with split-based residual blocks for efficient local feature capture, a Multi-scale Pooling module for spatial context aggregation, and an Multilayer Perceptron block for predicting. The model was evaluated using several performance metrics such as AUCROC, F1 score, Balanced Accuracy and benchmarked against state-of-the-art convolutional and transformer backbones such as ConvNeXt and Vision Transformer.</p><p><strong>Results: </strong>The proposed model achieved superior performance across all evaluation metrics, including an AUCROC of 0.9935 and a Balanced Accuracy of 0.9630, outperforming existing models. These results validate the effectiveness of combining local-global representation learning through custom-designed modules. In addition, we conducted ablation studies to further demonstrate the individual contributions and effectiveness of each component within the proposed architecture. By systematically removing or replacing specific modules such as the Convolutional Feature Extractor and Multi-scale Pooling, we observed consistent performance degradation, which highlights the necessity and complementary roles of these components in achieving optimal predictive accuracy.</p><p><strong>Conclusions: </strong>This study presents a novel hybrid deep learning architecture that enhances treatment decision support for esophageal cancer by leveraging multi-scale spatial encoding. The empirical evidence demonstrates that tailored architectural innovations significantly improve predictive accuracy over existing methods.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"413"},"PeriodicalIF":3.8,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12593911/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145457567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-05DOI: 10.1186/s12911-025-03250-x
Aymen M Al-Hejri, Riyadh M Al-Tam, Archana Harsing Sable, Basheer Almuhaya, Sultan S Alshamrani, Kaled M Alshmrany
Cervical cancer is the leading cause of cancer-related deaths among women worldwide, necessitating early and accurate detection methods. This study introduces a hybrid framework utilizing Vision Transformers (ViT) and ensemble learning-based convolutional neural networks (CNN) models for cervical cancer classification based on Pap smear images. Two prominent datasets, Mendeley LBC and SIPaKMeD, are employed, encompassing nine distinct categories of cervical cell abnormalities. The proposed approach integrates pre-trained CNN models of DenseNet201, Xception, and InceptionResNetV2 to extract high-level features, further fused through ensemble learning. These features are then processed by the ViT-based encoder model designed for improved interpretability and accuracy. Experimental results demonstrate that the hybrid model achieves superior accuracy rates of 97.26%, a recall of 97.27%, a precision of 97.27%, and 96.70% for the F1-score on the Mendeley LBC dataset. For the SIPaKMeD dataset, there was an accuracy of 99.18%, a recall of 99.18%, a precision of 99.15%, and a 99.21% F1-score. On the combined dataset, the model outperformed individual pre-trained models with 95.10% accuracy and a 95.01% F1-score. Moreover, the framework incorporates augmentation with Explainable AI (XAI) techniques, specifically Grad-CAM, to provide transparent and interpretable diagnostic outcomes, enhancing its utility in clinical settings. This research underscores the potential of hybrid AI frameworks in revolutionizing cervical cancer diagnostics by offering accurate, efficient, and interpretable solutions.
{"title":"A hybrid vision transformer with ensemble CNN framework for cervical cancer diagnosis.","authors":"Aymen M Al-Hejri, Riyadh M Al-Tam, Archana Harsing Sable, Basheer Almuhaya, Sultan S Alshamrani, Kaled M Alshmrany","doi":"10.1186/s12911-025-03250-x","DOIUrl":"10.1186/s12911-025-03250-x","url":null,"abstract":"<p><p>Cervical cancer is the leading cause of cancer-related deaths among women worldwide, necessitating early and accurate detection methods. This study introduces a hybrid framework utilizing Vision Transformers (ViT) and ensemble learning-based convolutional neural networks (CNN) models for cervical cancer classification based on Pap smear images. Two prominent datasets, Mendeley LBC and SIPaKMeD, are employed, encompassing nine distinct categories of cervical cell abnormalities. The proposed approach integrates pre-trained CNN models of DenseNet201, Xception, and InceptionResNetV2 to extract high-level features, further fused through ensemble learning. These features are then processed by the ViT-based encoder model designed for improved interpretability and accuracy. Experimental results demonstrate that the hybrid model achieves superior accuracy rates of 97.26%, a recall of 97.27%, a precision of 97.27%, and 96.70% for the F1-score on the Mendeley LBC dataset. For the SIPaKMeD dataset, there was an accuracy of 99.18%, a recall of 99.18%, a precision of 99.15%, and a 99.21% F1-score. On the combined dataset, the model outperformed individual pre-trained models with 95.10% accuracy and a 95.01% F1-score. Moreover, the framework incorporates augmentation with Explainable AI (XAI) techniques, specifically Grad-CAM, to provide transparent and interpretable diagnostic outcomes, enhancing its utility in clinical settings. This research underscores the potential of hybrid AI frameworks in revolutionizing cervical cancer diagnostics by offering accurate, efficient, and interpretable solutions.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"411"},"PeriodicalIF":3.8,"publicationDate":"2025-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12590815/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145451026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: This scoping review aims to synthesize research on artificial intelligence (AI) in predicting open-heart surgery outcomes, evaluating AI model performance, and identifying gaps in data quality, algorithmic bias, and clinical applicability to guide future advancements in personalized surgical planning and patient outcomes.
Methods: Conducted using the PRISMA-ScR guideline, the review involved a systematic search across PubMed, Web of Science, IEEE, and Scopus. Articles were included if they focused on open-heart surgery, utilized AI methods, and were published in English. Exclusion criteria included non-relevance to open-heart surgery, non-original research, and lack of AI techniques. Data extraction included study details, AI methods, and performance metrics. Descriptive statistics were used for analysis.
Results: Of the 64 included studies, 89.06% were retrospective. The most frequently employed algorithm was logistic regression (n = 41), followed by random forest in 38 studies and XGBoost in 32 studies for data analysis. Most studies focused on predicting postoperative outcomes. Mortality, acute kidney injury, and complications were the outcomes that more studies concentrated on. XGBoost, used in 32 studies, exhibited the best performance in 11 of these studies. Deep learning and hybrid models were underutilized. Major limitations included inconsistent model validation, limited prospective data, and lack of diversity in patient populations.
Conclusion: AI demonstrates promising predictive capabilities in open-heart surgery, particularly through machine learning models. These models can already assist surgeons in real-world practice by supporting real-time risk stratification and personalized decision-making, such as identifying high-risk patients for targeted interventions. However, methodological limitations hinder clinical translation. Future work should emphasize prospective validation, explainable AI, and equitable data representation to enhance model reliability and applicability in real-world settings.
目的:本综述旨在综合人工智能(AI)在预测心脏直视手术结果、评估AI模型性能、识别数据质量、算法偏差和临床适用性方面的差距方面的研究,以指导个性化手术计划和患者结果的未来发展。方法:采用PRISMA-ScR指南,对PubMed、Web of Science、IEEE和Scopus进行系统检索。如果文章集中于心脏直视手术,使用人工智能方法,并以英文发表,则纳入其中。排除标准包括与开胸手术无关、非原创性研究和缺乏人工智能技术。数据提取包括研究细节、人工智能方法和性能指标。采用描述性统计进行分析。结果:纳入的64项研究中,89.06%为回顾性研究。使用最多的算法是logistic回归(n = 41),其次是随机森林(38)和XGBoost(32)进行数据分析。大多数研究集中于预测术后结果。死亡率、急性肾损伤和并发症是更多研究关注的结果。在32项研究中使用的XGBoost在其中11项研究中表现出最佳性能。深度学习和混合模型未得到充分利用。主要的限制包括不一致的模型验证,有限的前瞻性数据,以及患者群体缺乏多样性。结论:人工智能在心脏直视手术中展示了有前景的预测能力,特别是通过机器学习模型。这些模型已经可以通过支持实时风险分层和个性化决策来帮助外科医生在现实世界的实践,例如识别高风险患者进行有针对性的干预。然而,方法学的局限性阻碍了临床翻译。未来的工作应强调前瞻性验证、可解释的人工智能和公平的数据表示,以提高模型在现实世界中的可靠性和适用性。
{"title":"Application of artificial intelligence in predicting the results of open-heart surgery: a scoping review.","authors":"Taleb Khodaveisi, Nasim Aslani, Parastoo Amiri, Faezeh Kamrani, Soheila Saeedi","doi":"10.1186/s12911-025-03243-w","DOIUrl":"10.1186/s12911-025-03243-w","url":null,"abstract":"<p><strong>Purpose: </strong>This scoping review aims to synthesize research on artificial intelligence (AI) in predicting open-heart surgery outcomes, evaluating AI model performance, and identifying gaps in data quality, algorithmic bias, and clinical applicability to guide future advancements in personalized surgical planning and patient outcomes.</p><p><strong>Methods: </strong>Conducted using the PRISMA-ScR guideline, the review involved a systematic search across PubMed, Web of Science, IEEE, and Scopus. Articles were included if they focused on open-heart surgery, utilized AI methods, and were published in English. Exclusion criteria included non-relevance to open-heart surgery, non-original research, and lack of AI techniques. Data extraction included study details, AI methods, and performance metrics. Descriptive statistics were used for analysis.</p><p><strong>Results: </strong>Of the 64 included studies, 89.06% were retrospective. The most frequently employed algorithm was logistic regression (n = 41), followed by random forest in 38 studies and XGBoost in 32 studies for data analysis. Most studies focused on predicting postoperative outcomes. Mortality, acute kidney injury, and complications were the outcomes that more studies concentrated on. XGBoost, used in 32 studies, exhibited the best performance in 11 of these studies. Deep learning and hybrid models were underutilized. Major limitations included inconsistent model validation, limited prospective data, and lack of diversity in patient populations.</p><p><strong>Conclusion: </strong>AI demonstrates promising predictive capabilities in open-heart surgery, particularly through machine learning models. These models can already assist surgeons in real-world practice by supporting real-time risk stratification and personalized decision-making, such as identifying high-risk patients for targeted interventions. However, methodological limitations hinder clinical translation. Future work should emphasize prospective validation, explainable AI, and equitable data representation to enhance model reliability and applicability in real-world settings.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"410"},"PeriodicalIF":3.8,"publicationDate":"2025-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12587633/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145451004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Myocardial infarction (MI) is a life-threatening condition caused by sudden interruption of blood supply to the heart. Electrocardiogram (ECG) is the primary tool for MI diagnosis, but interpretation challenges exist. This study aimed to optimize MI detection by developing a hybrid CNN-GRU Deep Learning model (DLM) based on ECG as a diagnostic support tool.
Methods: This retrospective diagnostic study included a total of 56,354 ECGs, comprising 41,871 from patients diagnosed with (MI) and 14,474 from healthy patients. Each ECG record consisted of a 20-second 15-lead recording per individual, sampled at 1000 Hz. The CNN-GRU model was trained on 85% of these ECGs and validated on the remaining 15%. The CNN-GRU model was executed on the pre-processed data using the Pan-Tompkins algorithm obtained from the PhysioNet website (PTB Diagnostic ECG Database), and all recordings were labelled by expert cardiologists. We examined a new model for classifying ECG heartbeats and found that it can compete with advanced models. The performance of the DLM was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, Macro Average, and Weighted Average.
Results: The area under the receiver operating characteristic curve (AUC) of the CNN-GRU model for MI detection was close to one. The CNN-GRU model achieved excellent performance with 15 leads (ACC = 99.43%, sensitivity = 99.71%, specificity = 98.59%). Using lead II alone, performance improved slightly (ACC = 99.73%, sensitivity = 99.75%, specificity = 99.66%). The high AUC and other metrics indicate strong diagnostic ability. Based on the reported results, the CNN-GRU model using lead II was the best model.
Conclusions: The findings suggest that the proposed model can support clinical decision-making and guide future research in cardiovascular medicine.
{"title":"Optimizing myocardial infarction detection: a hybrid CNN-GRU deep learning approach.","authors":"Zahra Aghababaei, Leili Tapak, Mahdi Rasoulinia, Mahlagha Afrasiabi, Seyed Kianoosh Hosseini, Irina Dinu","doi":"10.1186/s12911-025-03217-y","DOIUrl":"10.1186/s12911-025-03217-y","url":null,"abstract":"<p><strong>Background: </strong>Myocardial infarction (MI) is a life-threatening condition caused by sudden interruption of blood supply to the heart. Electrocardiogram (ECG) is the primary tool for MI diagnosis, but interpretation challenges exist. This study aimed to optimize MI detection by developing a hybrid CNN-GRU Deep Learning model (DLM) based on ECG as a diagnostic support tool.</p><p><strong>Methods: </strong>This retrospective diagnostic study included a total of 56,354 ECGs, comprising 41,871 from patients diagnosed with (MI) and 14,474 from healthy patients. Each ECG record consisted of a 20-second 15-lead recording per individual, sampled at 1000 Hz. The CNN-GRU model was trained on 85% of these ECGs and validated on the remaining 15%. The CNN-GRU model was executed on the pre-processed data using the Pan-Tompkins algorithm obtained from the PhysioNet website (PTB Diagnostic ECG Database), and all recordings were labelled by expert cardiologists. We examined a new model for classifying ECG heartbeats and found that it can compete with advanced models. The performance of the DLM was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, Macro Average, and Weighted Average.</p><p><strong>Results: </strong>The area under the receiver operating characteristic curve (AUC) of the CNN-GRU model for MI detection was close to one. The CNN-GRU model achieved excellent performance with 15 leads (ACC = 99.43%, sensitivity = 99.71%, specificity = 98.59%). Using lead II alone, performance improved slightly (ACC = 99.73%, sensitivity = 99.75%, specificity = 99.66%). The high AUC and other metrics indicate strong diagnostic ability. Based on the reported results, the CNN-GRU model using lead II was the best model.</p><p><strong>Conclusions: </strong>The findings suggest that the proposed model can support clinical decision-making and guide future research in cardiovascular medicine.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"408"},"PeriodicalIF":3.8,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12584412/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145444130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}