Impact Exploration of Spatiotemporal Feature Derivation and Selection on Machine Learning-Based Predictive Models for Post-Embolization Cerebral Aneurysm Recanalization.
{"title":"Impact Exploration of Spatiotemporal Feature Derivation and Selection on Machine Learning-Based Predictive Models for Post-Embolization Cerebral Aneurysm Recanalization.","authors":"Jing Liao, Kouichi Misaki, Jiro Sakamoto","doi":"10.1007/s13239-024-00721-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To enhance the performance of machine learning (ML) models for the post-embolization recanalization of cerebral aneurysms, we evaluated the impact of hemodynamic feature derivation and selection method on six ML algorithms.</p><p><strong>Methods: </strong>We utilized computational fluid dynamics (CFD) to simulate hemodynamics in 66 cerebral aneurysms from 65 patients, including 57 stable and nine recanalized aneurysms. We derived a total of 107 features for each aneurysm, encompassing four clinical features, 12 morphological features, and 91 hemodynamic features. To investigate the influence of feature derivation and selection methods on the ML models, we employed two derivation methods, simplified and fully derived, in combination with four selection methods: all features, statistically significant analysis, stepwise multivariate logistic regression analysis (stepwise-LR), and recursive feature elimination (RFE). Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and precision-recall curve (AUPRC) on both the training and testing datasets.</p><p><strong>Results: </strong>The AUROC values on the testing dataset exhibited a wide-ranging spectrum, spanning from 0.373 to 0.863. Fully derived features and the RFE selection method demonstrated superior performance in intra-model comparisons. The multi-layer perceptron (MLP) model, trained with RFE-selected fully derived features, achieved the best performance on the testing dataset, with an AUROC value of 0.863 (95% CI: 0.684- 1.000).</p><p><strong>Conclusion: </strong>Our study demonstrated the importance of feature derivation and selection in determining the performance of ML models. This enabled the development of accurate decision-making models without the need to invade the patient.</p>","PeriodicalId":54322,"journal":{"name":"Cardiovascular Engineering and Technology","volume":" ","pages":"394-404"},"PeriodicalIF":1.6000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiovascular Engineering and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s13239-024-00721-6","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/23 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: To enhance the performance of machine learning (ML) models for the post-embolization recanalization of cerebral aneurysms, we evaluated the impact of hemodynamic feature derivation and selection method on six ML algorithms.
Methods: We utilized computational fluid dynamics (CFD) to simulate hemodynamics in 66 cerebral aneurysms from 65 patients, including 57 stable and nine recanalized aneurysms. We derived a total of 107 features for each aneurysm, encompassing four clinical features, 12 morphological features, and 91 hemodynamic features. To investigate the influence of feature derivation and selection methods on the ML models, we employed two derivation methods, simplified and fully derived, in combination with four selection methods: all features, statistically significant analysis, stepwise multivariate logistic regression analysis (stepwise-LR), and recursive feature elimination (RFE). Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and precision-recall curve (AUPRC) on both the training and testing datasets.
Results: The AUROC values on the testing dataset exhibited a wide-ranging spectrum, spanning from 0.373 to 0.863. Fully derived features and the RFE selection method demonstrated superior performance in intra-model comparisons. The multi-layer perceptron (MLP) model, trained with RFE-selected fully derived features, achieved the best performance on the testing dataset, with an AUROC value of 0.863 (95% CI: 0.684- 1.000).
Conclusion: Our study demonstrated the importance of feature derivation and selection in determining the performance of ML models. This enabled the development of accurate decision-making models without the need to invade the patient.
期刊介绍:
Cardiovascular Engineering and Technology is a journal publishing the spectrum of basic to translational research in all aspects of cardiovascular physiology and medical treatment. It is the forum for academic and industrial investigators to disseminate research that utilizes engineering principles and methods to advance fundamental knowledge and technological solutions related to the cardiovascular system. Manuscripts spanning from subcellular to systems level topics are invited, including but not limited to implantable medical devices, hemodynamics and tissue biomechanics, functional imaging, surgical devices, electrophysiology, tissue engineering and regenerative medicine, diagnostic instruments, transport and delivery of biologics, and sensors. In addition to manuscripts describing the original publication of research, manuscripts reviewing developments in these topics or their state-of-art are also invited.