Objective: Prenatal prediction of CD34+ adequacy supports cord-blood banking by reducing expenses on low-yield units and reserving capacity for clinically promising grafts. We have developed and evaluated a prenatal machine learning model that predicts whether an umbilical cord blood (UCB) unit will meet a clinically supported adequacy threshold (≥1.5×105 CD34+ cells/kg recipient) before collection for single-unit grafts.
Materials and methods: In this retrospective study, we analysed 126,406 records from the Royan Stem Cell Technology Company (RSCT; Tehran, Iran), which included routinely available maternal, neonatal, and family-history variables. A pipeline of imputation (IterativeImputer numeric; SimpleImputer+OrdinalEncoder categorical), feature selection (Extra Trees), and hyperparameter tuning using Bayesian optimisation with model training/evaluation was performed within cross-validation folds. Decision Tree (DT), K-Nearest Neighbours (KNN), Random Forest (RF), Support Vector Machine (SVM), and Multilayer Perceptron (MLP) classifiers were tuned via Bayesian optimisation. Models were ranked by the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS). Majority voting (MV) ensembles were constructed from the top-k models. Model interpretability used SHapley Additive exPlanations (SHAP).
Results: The MV (top-4: RF, KNN, DT, MLP) ensemble achieved an area under the receiver operating characteristic curve (ROC-AUC)=0.808 and an area under the precision-recall curve (PR-AUC)=0.744 on the held-out test set, with an accuracy=0.757, precision=0.726, recall/sensitivity=0.804, F1=0.762, specificity=0.716, and Brier score=0.181. SHAP highlighted history of hepatitis C, birth place, hyperthyroidism, history of anaemia, oral fungus, and rheumatism among the most influential features.
Conclusion: Prenatal prediction of UCB CD34+ adequacy using an interpretable MV ensemble is feasible and accurate to support pre-collection triage and can potentially improve banking efficiency. The resultant model offers a non-invasive tool to enhance the efficiency of cord blood banking by prioritising units with higher transplantation potential.
扫码关注我们
求助内容:
应助结果提醒方式:
