Pub Date : 2025-01-06DOI: 10.1016/j.cmpb.2024.108581
Frederik Christensen , Deniz Kenan Kılıç , Izabela Ewa Nielsen , Tarec Christoffer El-Galaly , Andreas Glenthøj , Jens Helby , Henrik Frederiksen , Sören Möller , Alexander Djupnes Fuglkjær
Background:
Around 7% of the global population has congenital hemoglobin disorders, with over 300,000 new cases of -thalassemia annually. Diagnosis is costly and inaccurate in low-income regions, often relying on complete blood count (CBC) tests. This study employs machine learning (ML) to classify -thalassemia traits based on gender and CBC, exploring the effects of grouping silent- and non-carriers.
Methods:
The dataset includes 288 individuals with suspected -thalassemia from Sri Lanka. It was classified using eleven discriminant formulae and nine ML models. Outliers were removed using Mahalanobis distance, and resampling was conducted with the synthetic minority oversampling technique (SMOTE) and SMOTE-nominal continuous (NC). The Mann–Whitney U test handled feature extraction and class grouping. ML performance was evaluated with eight criteria.
Results:
The Ehsani formula achieved an area under the receiver operating characteristic curve (ROC-AUC) of 0.66 by grouping silent- and non-carriers. The convolutional neural network (CNN) without feature extraction demonstrated better performance, with an accuracy of 0.85, sensitivity of 0.8, specificity of 0.86, and ROC-AUC of 0.95/0.93 (micro/macro). Performance was maintained even without preprocessing.
Conclusion:
ML models outperformed classical discriminant formulae in classifying -thalassemia using sex and CBC features. A larger dataset could enhance ML model generalization and the impact of feature extraction. Grouping silent- and non-carriers improved ML results, especially with resampling. The silent carriers were not separable from non-carriers regarding the available features.
{"title":"Classification of α-thalassemia data using machine learning models","authors":"Frederik Christensen , Deniz Kenan Kılıç , Izabela Ewa Nielsen , Tarec Christoffer El-Galaly , Andreas Glenthøj , Jens Helby , Henrik Frederiksen , Sören Möller , Alexander Djupnes Fuglkjær","doi":"10.1016/j.cmpb.2024.108581","DOIUrl":"10.1016/j.cmpb.2024.108581","url":null,"abstract":"<div><h3>Background:</h3><div>Around 7% of the global population has congenital hemoglobin disorders, with over 300,000 new cases of <span><math><mi>α</mi></math></span>-thalassemia annually. Diagnosis is costly and inaccurate in low-income regions, often relying on complete blood count (CBC) tests. This study employs machine learning (ML) to classify <span><math><mi>α</mi></math></span>-thalassemia traits based on gender and CBC, exploring the effects of grouping silent- and non-carriers.</div></div><div><h3>Methods:</h3><div>The dataset includes 288 individuals with suspected <span><math><mi>α</mi></math></span>-thalassemia from Sri Lanka. It was classified using eleven discriminant formulae and nine ML models. Outliers were removed using Mahalanobis distance, and resampling was conducted with the synthetic minority oversampling technique (SMOTE) and SMOTE-nominal continuous (NC). The Mann–Whitney U test handled feature extraction and class grouping. ML performance was evaluated with eight criteria.</div></div><div><h3>Results:</h3><div>The Ehsani formula achieved an area under the receiver operating characteristic curve (ROC-AUC) of 0.66 by grouping silent- and non-carriers. The convolutional neural network (CNN) without feature extraction demonstrated better performance, with an accuracy of 0.85, sensitivity of 0.8, specificity of 0.86, and ROC-AUC of 0.95/0.93 (micro/macro). Performance was maintained even without preprocessing.</div></div><div><h3>Conclusion:</h3><div>ML models outperformed classical discriminant formulae in classifying <span><math><mi>α</mi></math></span>-thalassemia using sex and CBC features. A larger dataset could enhance ML model generalization and the impact of feature extraction. Grouping silent- and non-carriers improved ML results, especially with resampling. The silent carriers were not separable from non-carriers regarding the available features.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108581"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-06DOI: 10.1016/j.cmpb.2024.108578
Yunzhi Huang , Luyi Han , Haoran Dou , Sahar Ahmad , Pew-Thian Yap
Background and Objective:
Deformable registration of multimodal brain magnetic resonance images presents significant challenges, primarily due to substantial structural variations between subjects and pronounced differences in appearance across imaging modalities.
Methods:
Here, we propose to symmetrically register images from two modalities based on appearance residuals from one modality to another. Computed with simple subtraction between modalities, the appearance residuals enhance structural details and form a common representation for simplifying multimodal deformable registration. The proposed framework consists of three serially connected modules: (i) an appearance residual module, which learns intensity residual maps between modalities with a cycle-consistent loss; (ii) a deformable registration module, which predicts deformations across subjects based on appearance residuals; and (iii) a deblurring module, which enhances the warped images to match the sharpness of the original images.
Results:
The proposed method, evaluated on two public datasets (HCP and LEMON), achieves the highest registration accuracy with topology preservation when compared with state-of-the-art methods.
Conclusions:
Our residual space-guided registration framework, combined with GAN-based image enhancement, provides an effective solution to the challenges of multimodal deformable registration. By mitigating intensity distribution discrepancies and improving image quality, this approach improves registration accuracy and strengthens its potential for clinical application.
{"title":"Symmetric deformable registration of multimodal brain magnetic resonance images via appearance residuals","authors":"Yunzhi Huang , Luyi Han , Haoran Dou , Sahar Ahmad , Pew-Thian Yap","doi":"10.1016/j.cmpb.2024.108578","DOIUrl":"10.1016/j.cmpb.2024.108578","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Deformable registration of multimodal brain magnetic resonance images presents significant challenges, primarily due to substantial structural variations between subjects and pronounced differences in appearance across imaging modalities.</div></div><div><h3>Methods:</h3><div>Here, we propose to symmetrically register images from two modalities based on appearance residuals from one modality to another. Computed with simple subtraction between modalities, the appearance residuals enhance structural details and form a common representation for simplifying multimodal deformable registration. The proposed framework consists of three serially connected modules: (i) an appearance residual module, which learns intensity residual maps between modalities with a cycle-consistent loss; (ii) a deformable registration module, which predicts deformations across subjects based on appearance residuals; and (iii) a deblurring module, which enhances the warped images to match the sharpness of the original images.</div></div><div><h3>Results:</h3><div>The proposed method, evaluated on two public datasets (HCP and LEMON), achieves the highest registration accuracy with topology preservation when compared with state-of-the-art methods.</div></div><div><h3>Conclusions:</h3><div>Our residual space-guided registration framework, combined with GAN-based image enhancement, provides an effective solution to the challenges of multimodal deformable registration. By mitigating intensity distribution discrepancies and improving image quality, this approach improves registration accuracy and strengthens its potential for clinical application.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"261 ","pages":"Article 108578"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-06DOI: 10.1016/j.cmpb.2025.108592
Wanfang Xie , Zhenyu Liu , Litao Zhao , Meiyun Wang , Jie Tian , Jiangang Liu
Background and Objective
Single-source domain generalization (SSDG) aims to generalize a deep learning (DL) model trained on one source dataset to multiple unseen datasets. This is important for the clinical applications of DL-based models to breast cancer screening, wherein a DL-based model is commonly developed in an institute and then tested in other institutes. One challenge of SSDG is to alleviate the domain shifts using only one domain dataset.
Methods
The present study proposed a domain-invariant features learning framework (DIFLF) for single-source domain. Specifically, a style-augmentation module (SAM) and a content-style disentanglement module (CSDM) are proposed in DIFLF. SAM includes two different color jitter transforms, which transforms each mammogram in the source domain into two synthesized mammograms with new styles. Thus, it can greatly increase the feature diversity of the source domain, reducing the overfitting of the trained model. CSDM includes three feature disentanglement units, which extracts domain-invariant content (DIC) features by disentangling them from domain-specific style (DSS) features, reducing the influence of the domain shifts resulting from different feature distributions. Our code is available for open access on Github (https://github.com/85675/DIFLF).
Results
DIFLF is trained in a private dataset (PRI1), and tested first in another private dataset (PRI2) with similar feature distribution to PRI1 and then tested in two public datasets (INbreast and MIAS) with greatly different feature distributions from PRI1. As revealed by the experiment results, DIFLF presents excellent performance for classifying mammograms in the unseen target datasets of PRI2, INbreast, and MIAS. The accuracy and AUC of DIFLF are 0.917 and 0.928 in PRI2, 0.882 and 0.893 in INbreast, 0.767 and 0.710 in MIAS, respectively.
Conclusions
DIFLF can alleviate the influence of domain shifts only using one source dataset. Moreover, DIFLF can achieve an excellent mammogram classification performance even in the unseen datasets with great feature distribution differences from the training dataset.
{"title":"DIFLF: A domain-invariant features learning framework for single-source domain generalization in mammogram classification","authors":"Wanfang Xie , Zhenyu Liu , Litao Zhao , Meiyun Wang , Jie Tian , Jiangang Liu","doi":"10.1016/j.cmpb.2025.108592","DOIUrl":"10.1016/j.cmpb.2025.108592","url":null,"abstract":"<div><h3>Background and Objective</h3><div>Single-source domain generalization (SSDG) aims to generalize a deep learning (DL) model trained on one source dataset to multiple unseen datasets. This is important for the clinical applications of DL-based models to breast cancer screening, wherein a DL-based model is commonly developed in an institute and then tested in other institutes. One challenge of SSDG is to alleviate the domain shifts using only one domain dataset.</div></div><div><h3>Methods</h3><div>The present study proposed a domain-invariant features learning framework (DIFLF) for single-source domain. Specifically, a style-augmentation module (SAM) and a content-style disentanglement module (CSDM) are proposed in DIFLF. SAM includes two different color jitter transforms, which transforms each mammogram in the source domain into two synthesized mammograms with new styles. Thus, it can greatly increase the feature diversity of the source domain, reducing the overfitting of the trained model. CSDM includes three feature disentanglement units, which extracts domain-invariant content (DIC) features by disentangling them from domain-specific style (DSS) features, reducing the influence of the domain shifts resulting from different feature distributions. Our code is available for open access on Github (<span><span>https://github.com/85675/DIFLF</span><svg><path></path></svg></span>).</div></div><div><h3>Results</h3><div>DIFLF is trained in a private dataset (PRI1), and tested first in another private dataset (PRI2) with similar feature distribution to PRI1 and then tested in two public datasets (INbreast and MIAS) with greatly different feature distributions from PRI1. As revealed by the experiment results, DIFLF presents excellent performance for classifying mammograms in the unseen target datasets of PRI2, INbreast, and MIAS. The accuracy and AUC of DIFLF are 0.917 and 0.928 in PRI2, 0.882 and 0.893 in INbreast, 0.767 and 0.710 in MIAS, respectively.</div></div><div><h3>Conclusions</h3><div>DIFLF can alleviate the influence of domain shifts only using one source dataset. Moreover, DIFLF can achieve an excellent mammogram classification performance even in the unseen datasets with great feature distribution differences from the training dataset.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"261 ","pages":"Article 108592"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anxiety is a psycho-physiological condition associated with an individual's mental state. Long-term anxiety persistence can lead to anxiety disorder, which is the underlying cause of many mental health problems. As such, it is critical to precisely identify anxiety by automated, effective, and user-bias-free ways.
Objective
The objective of this study is to develop an innovative emotionally intelligent Haptic system for anxiety detection, which can be used to track and manage people's anxiety.
Method
The suggested approach incorporates a haptic feedback mechanism that is based on EEG data and is analysed by machine learning algorithms. This allows users to effectively control their emotional well-being by receiving timely feedback and assessments of their anxiety levels. First, the authors use publicly accessible data to present an experimental study for the categorization of human anxiety.
Results
The ensemble model used for the classification produces results with a 97 % accuracy rate, 0.98 recall, 0.99 precision, and a 0.99 F1 score. Furthermore, self-curated data is subjected to an advanced spike analysis algorithm that identifies signal spikes and then quantifies the level of anxiety.
Conclusion
The results obtained demonstrate that haptic stimuli are produced smoothly, offering a comprehensive and innovative method of managing anxiety.
{"title":"An emotionally intelligent haptic system – An efficient solution for anxiety detection and mitigation","authors":"Swapneel Mishra , Saumya Seth , Shrishti Jain , Vasudev Pant , Jolly Parikh , Nupur Chugh , Yugnanda Puri","doi":"10.1016/j.cmpb.2025.108590","DOIUrl":"10.1016/j.cmpb.2025.108590","url":null,"abstract":"<div><h3>Background</h3><div>Anxiety is a psycho-physiological condition associated with an individual's mental state. Long-term anxiety persistence can lead to anxiety disorder, which is the underlying cause of many mental health problems. As such, it is critical to precisely identify anxiety by automated, effective, and user-bias-free ways.</div></div><div><h3>Objective</h3><div>The objective of this study is to develop an innovative emotionally intelligent Haptic system for anxiety detection, which can be used to track and manage people's anxiety.</div></div><div><h3>Method</h3><div>The suggested approach incorporates a haptic feedback mechanism that is based on EEG data and is analysed by machine learning algorithms. This allows users to effectively control their emotional well-being by receiving timely feedback and assessments of their anxiety levels. First, the authors use publicly accessible data to present an experimental study for the categorization of human anxiety.</div></div><div><h3>Results</h3><div>The ensemble model used for the classification produces results with a 97 % accuracy rate, 0.98 recall, 0.99 precision, and a 0.99 F1 score. Furthermore, self-curated data is subjected to an advanced spike analysis algorithm that identifies signal spikes and then quantifies the level of anxiety.</div></div><div><h3>Conclusion</h3><div>The results obtained demonstrate that haptic stimuli are produced smoothly, offering a comprehensive and innovative method of managing anxiety.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108590"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142945897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Predicting potential risk factors for the occurrence of coronary artery lesions (CAL) in children with Kawasaki disease (KD) is critical for subsequent treatment. The aim of our study was to establish and validate a nomograph-based model for identifying children with KD at risk for CAL.
Methods
Hospitalized children with KD attending Wuhan Children's Hospital from Jan 2011 to Dec 2023 were included in the study and were grouped into a training set (4793 cases) and a validation set (2054 cases) using a simple random sampling method in a 7:3 ratio. The analysis was performed using RStudio software, which first used LASSO regression analysis to screen for the best predictors, and then analyzed the screened predictors using logistic regression analysis to derive independent predictors and construct a nomogram model to predict CAL risk. The receiver operating characteristic (ROC) and calibration curves were employed to evaluate the discrimination and calibration of the model. Finally, decision curve analysis (DCA) was utilized to validate the clinical applicability of the models assessed in the data.
Results
Of the 6847 eligible children with KD included, 845 (12 %) were ultimately diagnosed with CAL, of whom 619 were boys (73 %) with a median age of 1.81 (0.74, 3.51) years. Six significant independent predictors were identified, including sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin. Our model has acceptable discriminative power, with areas under the curve at 0.671 and 0.703 in the training and validation sets, respectively. DCA analysis showed that the prediction model had great clinical utility when the threshold probability interval was between 0.1 and 0.5.
Conclusions
We constructed and internally validated a nomograph-based predictive model based on six variables consisting of sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin, which may be useful for earlier identification of children with KD who may have CAL.
{"title":"Development and validation of a nomogram-based prognostic model to predict coronary artery lesions in Kawasaki disease from 6847 children in China","authors":"Changjian Li , Huayong Zhang , Wei Yin , Yong Zhang","doi":"10.1016/j.cmpb.2025.108588","DOIUrl":"10.1016/j.cmpb.2025.108588","url":null,"abstract":"<div><h3>Background and Objective</h3><div>Predicting potential risk factors for the occurrence of coronary artery lesions (CAL) in children with Kawasaki disease (KD) is critical for subsequent treatment. The aim of our study was to establish and validate a nomograph-based model for identifying children with KD at risk for CAL.</div></div><div><h3>Methods</h3><div>Hospitalized children with KD attending Wuhan Children's Hospital from Jan 2011 to Dec 2023 were included in the study and were grouped into a training set (4793 cases) and a validation set (2054 cases) using a simple random sampling method in a 7:3 ratio. The analysis was performed using RStudio software, which first used LASSO regression analysis to screen for the best predictors, and then analyzed the screened predictors using logistic regression analysis to derive independent predictors and construct a nomogram model to predict CAL risk. The receiver operating characteristic (ROC) and calibration curves were employed to evaluate the discrimination and calibration of the model. Finally, decision curve analysis (DCA) was utilized to validate the clinical applicability of the models assessed in the data.</div></div><div><h3>Results</h3><div>Of the 6847 eligible children with KD included, 845 (12 %) were ultimately diagnosed with CAL, of whom 619 were boys (73 %) with a median age of 1.81 (0.74, 3.51) years. Six significant independent predictors were identified, including sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin. Our model has acceptable discriminative power, with areas under the curve at 0.671 and 0.703 in the training and validation sets, respectively. DCA analysis showed that the prediction model had great clinical utility when the threshold probability interval was between 0.1 and 0.5.</div></div><div><h3>Conclusions</h3><div>We constructed and internally validated a nomograph-based predictive model based on six variables consisting of sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin, which may be useful for earlier identification of children with KD who may have CAL.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108588"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142964031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The growing population of elderly neurocritically ill patients highlights the need for effective prognosis prediction tools. This study aims to develop and validate machine learning (ML) models for predicting 28-day mortality in intensive care units (ICUs).
Methods
Data were extracted from the Medical Information Mart for Intensive Care IV(MIMIC-IV) database, focusing on elderly neurocritical ill patients with ICU stays ≥ 24 h. The cohort was split into 70 % for training and 30 % for internal validation. We analyzed 58 variables, including demographics, vital signs, medications, lab results, comorbidities, and medical scores, using Lasso regression to identify predictors of 28-day mortality. Seven ML algorithms were evaluated, and the best model was validated with data from Guizhou Medical University Affiliated Hospital. A log-rank test was used to assess survival differences in Kaplan-Meier curves. Shapley Additive Explanations (SHAP) were used to interpret the best model, while subgroup analysis identified variations in model performance across different populations.
Results
The study included 1,773 elderly neurocritically ill patients, with a 28-day mortality rate of 28.6 %. The Light Gradient Boosting Machine (LightGBM) outperformed other models, achieving an area under the curve (AUC) of 0.896 in internal validation and 0.812 in external validation. Kaplan-Meier analysis showed that higher LightGBM prediction scores correlated with lower survival probabilities. Key predictors identified through SHAP analysis included partial pressure of arterial carbon dioxide (PaCO2), Acute physiology and chronic health evaluation II (APACHE II), white blood cell count, age, and lactate. The LightGBM model demonstrated consistent performance across various subgroups.
Conclusions
The LightGBM model effectively predicts 28-day mortality risk in elderly neurocritically ill patients, aiding clinicians in management and resource allocation. Its reliable performance across diverse subgroups underscores its clinical utility.
{"title":"Machine learning-based 28-day mortality prediction model for elderly neurocritically Ill patients","authors":"Jia Yuan , Jiong Xiong , Jinfeng Yang , Qi Dong , Yin Wang , Yumei Cheng , Xianjun Chen , Ying Liu , Chuan Xiao , Junlin Tao , Shuangzi Lizhang , Yangzi Liujiao , Qimin Chen , Feng Shen","doi":"10.1016/j.cmpb.2025.108589","DOIUrl":"10.1016/j.cmpb.2025.108589","url":null,"abstract":"<div><h3>Background</h3><div>The growing population of elderly neurocritically ill patients highlights the need for effective prognosis prediction tools. This study aims to develop and validate machine learning (ML) models for predicting 28-day mortality in intensive care units (ICUs).</div></div><div><h3>Methods</h3><div>Data were extracted from the Medical Information Mart for Intensive Care IV(MIMIC-IV) database, focusing on elderly neurocritical ill patients with ICU stays ≥ 24 h. The cohort was split into 70 % for training and 30 % for internal validation. We analyzed 58 variables, including demographics, vital signs, medications, lab results, comorbidities, and medical scores, using Lasso regression to identify predictors of 28-day mortality. Seven ML algorithms were evaluated, and the best model was validated with data from Guizhou Medical University Affiliated Hospital. A log-rank test was used to assess survival differences in Kaplan-Meier curves. Shapley Additive Explanations (SHAP) were used to interpret the best model, while subgroup analysis identified variations in model performance across different populations.</div></div><div><h3>Results</h3><div>The study included 1,773 elderly neurocritically ill patients, with a 28-day mortality rate of 28.6 %. The Light Gradient Boosting Machine (LightGBM) outperformed other models, achieving an area under the curve (AUC) of 0.896 in internal validation and 0.812 in external validation. Kaplan-Meier analysis showed that higher LightGBM prediction scores correlated with lower survival probabilities. Key predictors identified through SHAP analysis included partial pressure of arterial carbon dioxide (PaCO<sub>2</sub>), Acute physiology and chronic health evaluation II (APACHE II), white blood cell count, age, and lactate. The LightGBM model demonstrated consistent performance across various subgroups.</div></div><div><h3>Conclusions</h3><div>The LightGBM model effectively predicts 28-day mortality risk in elderly neurocritically ill patients, aiding clinicians in management and resource allocation. Its reliable performance across diverse subgroups underscores its clinical utility.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108589"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-06DOI: 10.1016/j.cmpb.2025.108591
Chao Shi , Qing Yang , Yuantian Wang , Xiangrui Zhao , Shuchang Shi , Lijia Zhang , Sutuke Yibulayimu , Yanzhen Liu , Chendi Liang , Yu Wang , Chunpeng Zhao
Background and objectives
Computer-assisted orthopedic surgical techniques and robotics has improved the therapeutic outcome of pelvic fracture reduction surgery. The preoperative reduction path is one of the prerequisites for robotic movement and an essential reference for manual operation. As the largest irregular bone with complicated morphology, the rotational motion of pelvic fracture fragments impacts the reduction process directly. To address this, the primary objective of this study is to develop an efficient and effective algorithm for automatically planning the reduction trajectory in robot-assisted pelvic fracture surgeries.
Methods
After obtaining rotational and reorientated translational degrees of freedom through the initial and target positions of the fracture fragments, the initial path is acquired through improved path planning method combined with specific designed collision detection algorithm. The final reduction path is post-processed to be shortened and smoothed. The effectiveness of the algorithm was evaluated in various pelvic fracture models with surrounding muscles and was compared with prior relevant implementations.
Results
Simulation results showed the ability of the planner to save time and overcome the state of art in terms of collision detection, path length and smoothness, search time, and surrounding muscle stretching conditions.
Conclusions
The proposed method enables a reasonable reduction path for pelvic fracture, which is demonstrated to be superior in various pelvic fracture scenarios.
{"title":"Automatic path planning for pelvic fracture reduction with multi-degree-of-freedom","authors":"Chao Shi , Qing Yang , Yuantian Wang , Xiangrui Zhao , Shuchang Shi , Lijia Zhang , Sutuke Yibulayimu , Yanzhen Liu , Chendi Liang , Yu Wang , Chunpeng Zhao","doi":"10.1016/j.cmpb.2025.108591","DOIUrl":"10.1016/j.cmpb.2025.108591","url":null,"abstract":"<div><h3>Background and objectives</h3><div>Computer-assisted orthopedic surgical techniques and robotics has improved the therapeutic outcome of pelvic fracture reduction surgery. The preoperative reduction path is one of the prerequisites for robotic movement and an essential reference for manual operation. As the largest irregular bone with complicated morphology, the rotational motion of pelvic fracture fragments impacts the reduction process directly. To address this, the primary objective of this study is to develop an efficient and effective algorithm for automatically planning the reduction trajectory in robot-assisted pelvic fracture surgeries.</div></div><div><h3>Methods</h3><div>After obtaining rotational and reorientated translational degrees of freedom through the initial and target positions of the fracture fragments, the initial path is acquired through improved path planning method combined with specific designed collision detection algorithm. The final reduction path is post-processed to be shortened and smoothed. The effectiveness of the algorithm was evaluated in various pelvic fracture models with surrounding muscles and was compared with prior relevant implementations.</div></div><div><h3>Results</h3><div>Simulation results showed the ability of the planner to save time and overcome the state of art in terms of collision detection, path length and smoothness, search time, and surrounding muscle stretching conditions.</div></div><div><h3>Conclusions</h3><div>The proposed method enables a reasonable reduction path for pelvic fracture, which is demonstrated to be superior in various pelvic fracture scenarios.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"261 ","pages":"Article 108591"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143028082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-06DOI: 10.1016/j.cmpb.2024.108582
Chenlong Guo , Xingsen Mu , Xianwei Wang , Yiming Zhao , Haoran Zhang , Dong Chen
Objective
The study aims to elucidate the mechanisms underlying plaque growth by analyzing the variations in hemodynamic parameters within the plaque region of patients' carotid arteries before and after the development of atherosclerotic lesions.
Methods
The study enrolls 25 patients with common carotid artery stenosis and 25 with tandem carotid artery stenosis. Based on pathological analysis, three-dimensional models of the actual blood vessels before and after the lesion are constructed for two patients within a two-year period. Computational fluid dynamics is employed to conduct unsteady periodic non-Newtonian fluid numerical simulations, enabling an in-depth investigation into the changes in the micro-environment of blood flow.
Results
During the systolic phase of the cardiac cycle, vortex regions are particularly prone to developing at the bifurcation point between the common carotid artery and the distal end of the internal carotid artery. In the early diastolic phase, blood reflux phenomena can be observed within the carotid artery. Towards the end of diastole, there is an expansion of vortex regions at the bifurcation point of the carotid artery. The shoulder region of initial small plaques within the blood vessel is susceptible to developing a low-speed recirculation zone, characterized by significantly reduced shear stress compared to the surrounding areas. Following vascular stenosis, the wall shear stress within the plaque domain generally increases; however, it maintains a consistent pattern of high central values and low upper shoulder values. The shear stress at the upper shoulder of the plaque of tandem carotid stenosis is below 0.4 Pa, whereas the central and lower shoulder regions exhibit shear stress exceeding 40 Pa.
Conclusions
The dynamic parameters of the blood flow micro-environment exhibit variations throughout the cardiac cycle, and temporal disparities exist in local lesions within the carotid artery. Both common and tandem carotid artery stenosis are particularly prone to developing lesions at the shoulder of initial small plaques. The micro-flow characteristics within the plaque domain undergo alterations prior to and following the onset of carotid artery disease. Furthermore, the occurrence of restenosis and rupture is associated with the location of plaque growth.
{"title":"Effect of plaque micro-watershed changes on carotid atherosclerosis","authors":"Chenlong Guo , Xingsen Mu , Xianwei Wang , Yiming Zhao , Haoran Zhang , Dong Chen","doi":"10.1016/j.cmpb.2024.108582","DOIUrl":"10.1016/j.cmpb.2024.108582","url":null,"abstract":"<div><h3>Objective</h3><div>The study aims to elucidate the mechanisms underlying plaque growth by analyzing the variations in hemodynamic parameters within the plaque region of patients' carotid arteries before and after the development of atherosclerotic lesions.</div></div><div><h3>Methods</h3><div>The study enrolls 25 patients with common carotid artery stenosis and 25 with tandem carotid artery stenosis. Based on pathological analysis, three-dimensional models of the actual blood vessels before and after the lesion are constructed for two patients within a two-year period. Computational fluid dynamics is employed to conduct unsteady periodic non-Newtonian fluid numerical simulations, enabling an in-depth investigation into the changes in the micro-environment of blood flow.</div></div><div><h3>Results</h3><div>During the systolic phase of the cardiac cycle, vortex regions are particularly prone to developing at the bifurcation point between the common carotid artery and the distal end of the internal carotid artery. In the early diastolic phase, blood reflux phenomena can be observed within the carotid artery. Towards the end of diastole, there is an expansion of vortex regions at the bifurcation point of the carotid artery. The shoulder region of initial small plaques within the blood vessel is susceptible to developing a low-speed recirculation zone, characterized by significantly reduced shear stress compared to the surrounding areas. Following vascular stenosis, the wall shear stress within the plaque domain generally increases; however, it maintains a consistent pattern of high central values and low upper shoulder values. The shear stress at the upper shoulder of the plaque of tandem carotid stenosis is below 0.4 Pa, whereas the central and lower shoulder regions exhibit shear stress exceeding 40 Pa.</div></div><div><h3>Conclusions</h3><div>The dynamic parameters of the blood flow micro-environment exhibit variations throughout the cardiac cycle, and temporal disparities exist in local lesions within the carotid artery. Both common and tandem carotid artery stenosis are particularly prone to developing lesions at the shoulder of initial small plaques. The micro-flow characteristics within the plaque domain undergo alterations prior to and following the onset of carotid artery disease. Furthermore, the occurrence of restenosis and rupture is associated with the location of plaque growth.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108582"},"PeriodicalIF":4.9,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-04DOI: 10.1016/j.cmpb.2024.108580
Zehang Ning , Bojie Yang , Yuanyuan Wang , Zhifeng Shi , Jinhua Yu , Guoqing Wu
Background and Objective:
Utilizing AI to mine tumor microenvironment information in whole slide images (WSIs) for glioma molecular subtype and prognosis prediction is significant for treatment. Existing weakly-supervised learning frameworks based on multi-instance learning have potential in WSIs analysis, but the large number of patches from WSIs challenges the effective extraction of key local patch and neighboring patch microenvironment info. Therefore, this paper aims to develop an automatic neural network that effectively extracts tumor microenvironment information from WSIs to predict molecular typing and prognosis of glioma.
Methods:
In this paper, we proposed a dual-path pathology analysis (DPPA) framework to enhance the analysis ability of WSIs for glioma diagnosis. Firstly, to mitigate the impact of redundant patches and enhance the integration of salient patch information within a multi-instance learning context, we propose a two-stage attention-based dynamic multi-instance learning network. In the network, two-stage attention and dynamic random sampling are designed to integrate diverse image patch information in pivotal regions adaptively. Secondly, to unearth the wealth of spatial context inherent in WSIs, we build a spatial relationship information quantification module. This module captures the spatial distribution of patches that encompass a variety of tissue structures, shedding light on the tumor microenvironment.
Results:
A large number of experiments on three datasets, two in-house and one public, totaling 1,795 WSIs demonstrate the encouraging performance of the DPPA, with mean area under curves of 0.94, 0.85, and 0.88 in predicting Isocitrate Dehydrogenase 1, Telomerase Reverse Tranase, and 1p/19q respectively, and a mean C-index of 0.82 in prognosis prediction. The proposed model can also group tumors in existing tumor subgroups into good and bad prognoses, with P 0.05 on the Log-rank test.
Conclusions:
The results of multi-center experiments demonstrate that the proposed DPPA surpasses the state-of-the-art models across multiple metrics. Through ablation experiments and survival analysis, the outstanding analytical ability of this model is further validated. Meanwhile, based on the work related to the interpretability of the model, the reliability and validity of the model have also been strongly confirmed. All source codes are released at: https://github.com/nzehang97/DPPA.
{"title":"Dual-path neural network extracts tumor microenvironment information from whole slide images to predict molecular typing and prognosis of Glioma","authors":"Zehang Ning , Bojie Yang , Yuanyuan Wang , Zhifeng Shi , Jinhua Yu , Guoqing Wu","doi":"10.1016/j.cmpb.2024.108580","DOIUrl":"10.1016/j.cmpb.2024.108580","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Utilizing AI to mine tumor microenvironment information in whole slide images (WSIs) for glioma molecular subtype and prognosis prediction is significant for treatment. Existing weakly-supervised learning frameworks based on multi-instance learning have potential in WSIs analysis, but the large number of patches from WSIs challenges the effective extraction of key local patch and neighboring patch microenvironment info. Therefore, this paper aims to develop an automatic neural network that effectively extracts tumor microenvironment information from WSIs to predict molecular typing and prognosis of glioma.</div></div><div><h3>Methods:</h3><div>In this paper, we proposed a dual-path pathology analysis (DPPA) framework to enhance the analysis ability of WSIs for glioma diagnosis. Firstly, to mitigate the impact of redundant patches and enhance the integration of salient patch information within a multi-instance learning context, we propose a two-stage attention-based dynamic multi-instance learning network. In the network, two-stage attention and dynamic random sampling are designed to integrate diverse image patch information in pivotal regions adaptively. Secondly, to unearth the wealth of spatial context inherent in WSIs, we build a spatial relationship information quantification module. This module captures the spatial distribution of patches that encompass a variety of tissue structures, shedding light on the tumor microenvironment.</div></div><div><h3>Results:</h3><div>A large number of experiments on three datasets, two in-house and one public, totaling 1,795 WSIs demonstrate the encouraging performance of the DPPA, with mean area under curves of 0.94, 0.85, and 0.88 in predicting Isocitrate Dehydrogenase 1, Telomerase Reverse Tranase, and 1p/19q respectively, and a mean C-index of 0.82 in prognosis prediction. The proposed model can also group tumors in existing tumor subgroups into good and bad prognoses, with P <span><math><mo><</mo></math></span> 0.05 on the Log-rank test.</div></div><div><h3>Conclusions:</h3><div>The results of multi-center experiments demonstrate that the proposed DPPA surpasses the state-of-the-art models across multiple metrics. Through ablation experiments and survival analysis, the outstanding analytical ability of this model is further validated. Meanwhile, based on the work related to the interpretability of the model, the reliability and validity of the model have also been strongly confirmed. All source codes are released at: <span><span>https://github.com/nzehang97/DPPA</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"261 ","pages":"Article 108580"},"PeriodicalIF":4.9,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142982634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-03DOI: 10.1016/j.cmpb.2024.108567
Lluís-F. Hurtado , Luis Marco-Ruiz , Encarna Segarra , Maria Jose Castro-Bleda , Aurelia Bustos-Moreno , Maria de la Iglesia-Vayá , Juan Francisco Vallalta-Rueda
Background and Objective:
Despite significant investments in the normalization and the standardization of Electronic Health Records (EHRs), free text is still the rule rather than the exception in clinical notes. The use of free text has implications in data reuse methods used for supporting clinical research since the query mechanisms used in cohort definition and patient matching are mainly based on structured data and clinical terminologies. This study aims to develop a method for the secondary use of clinical text by: (a) using Natural Language Processing (NLP) for tagging clinical notes with biomedical terminology; and (b) designing an ontology that maps and classifies all the identified tags to various terminologies and allows for running phenotyping queries.
Methods and Results:
Transformers-based NLP Models, concretely pre-trained RoBERTa language models, were used to process radiology reports and annotate them identifying elements matching UMLS Concept Unique Identifiers (CUIs) definitions. CUIs were mapped into several biomedical ontologies useful for phenotyping (e.g., SNOMED-CT, HPO, ICD-10, FMA, LOINC, and ICPC2, among others) and represented as a lightweight ontology using OWL (Web Ontology Language) constructs. This process resulted in a Linked Knowledge Base (LKB), which allows running expressive queries to retrieve reports that comply with specific criteria using automatic reasoning.
Conclusion:
Although phenotyping tools mostly rely on relational databases, the combination of NLP and Linked Data technologies allows us to build scalable knowledge bases using standard ontologies from the Web of data. Our approach enables us to execute a pipeline which input is free text and automatically maps identified entities to a LKB that allows answering phenotyping queries. In this work, we have only used Spanish radiology reports, although it is extensible to other languages for which suitable corpora are available. This is particularly valuable in regional and national systems dealing with large research databases from different registries and cohorts and plays an essential role in the scalability of large data reuse infrastructures that require indexing and governing distributed data sources.
{"title":"Leveraging Transformers-based models and linked data for deep phenotyping in radiology","authors":"Lluís-F. Hurtado , Luis Marco-Ruiz , Encarna Segarra , Maria Jose Castro-Bleda , Aurelia Bustos-Moreno , Maria de la Iglesia-Vayá , Juan Francisco Vallalta-Rueda","doi":"10.1016/j.cmpb.2024.108567","DOIUrl":"10.1016/j.cmpb.2024.108567","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Despite significant investments in the normalization and the standardization of Electronic Health Records (EHRs), free text is still the rule rather than the exception in clinical notes. The use of free text has implications in data reuse methods used for supporting clinical research since the query mechanisms used in cohort definition and patient matching are mainly based on structured data and clinical terminologies. This study aims to develop a method for the secondary use of clinical text by: (a) using Natural Language Processing (NLP) for tagging clinical notes with biomedical terminology; and (b) designing an ontology that maps and classifies all the identified tags to various terminologies and allows for running phenotyping queries.</div></div><div><h3>Methods and Results:</h3><div>Transformers-based NLP Models, concretely pre-trained RoBERTa language models, were used to process radiology reports and annotate them identifying elements matching UMLS Concept Unique Identifiers (CUIs) definitions. CUIs were mapped into several biomedical ontologies useful for phenotyping (e.g., SNOMED-CT, HPO, ICD-10, FMA, LOINC, and ICPC2, among others) and represented as a lightweight ontology using OWL (Web Ontology Language) constructs. This process resulted in a Linked Knowledge Base (LKB), which allows running expressive queries to retrieve reports that comply with specific criteria using automatic reasoning.</div></div><div><h3>Conclusion:</h3><div>Although phenotyping tools mostly rely on relational databases, the combination of NLP and Linked Data technologies allows us to build scalable knowledge bases using standard ontologies from the Web of data. Our approach enables us to execute a pipeline which input is free text and automatically maps identified entities to a LKB that allows answering phenotyping queries. In this work, we have only used Spanish radiology reports, although it is extensible to other languages for which suitable corpora are available. This is particularly valuable in regional and national systems dealing with large research databases from different registries and cohorts and plays an essential role in the scalability of large data reuse infrastructures that require indexing and governing distributed data sources.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108567"},"PeriodicalIF":4.9,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142945900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}