{"title":"时空特征提取和选择对基于机器学习的栓塞后脑动脉瘤再通畅预测模型的影响探索","authors":"Jing Liao, Kouichi Misaki, Jiro Sakamoto","doi":"10.1007/s13239-024-00721-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To enhance the performance of machine learning (ML) models for the post-embolization recanalization of cerebral aneurysms, we evaluated the impact of hemodynamic feature derivation and selection method on six ML algorithms.</p><p><strong>Methods: </strong>We utilized computational fluid dynamics (CFD) to simulate hemodynamics in 66 cerebral aneurysms from 65 patients, including 57 stable and nine recanalized aneurysms. We derived a total of 107 features for each aneurysm, encompassing four clinical features, 12 morphological features, and 91 hemodynamic features. To investigate the influence of feature derivation and selection methods on the ML models, we employed two derivation methods, simplified and fully derived, in combination with four selection methods: all features, statistically significant analysis, stepwise multivariate logistic regression analysis (stepwise-LR), and recursive feature elimination (RFE). Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and precision-recall curve (AUPRC) on both the training and testing datasets.</p><p><strong>Results: </strong>The AUROC values on the testing dataset exhibited a wide-ranging spectrum, spanning from 0.373 to 0.863. Fully derived features and the RFE selection method demonstrated superior performance in intra-model comparisons. The multi-layer perceptron (MLP) model, trained with RFE-selected fully derived features, achieved the best performance on the testing dataset, with an AUROC value of 0.863 (95% CI: 0.684- 1.000).</p><p><strong>Conclusion: </strong>Our study demonstrated the importance of feature derivation and selection in determining the performance of ML models. This enabled the development of accurate decision-making models without the need to invade the patient.</p>","PeriodicalId":54322,"journal":{"name":"Cardiovascular Engineering and Technology","volume":" ","pages":"394-404"},"PeriodicalIF":1.6000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Impact Exploration of Spatiotemporal Feature Derivation and Selection on Machine Learning-Based Predictive Models for Post-Embolization Cerebral Aneurysm Recanalization.\",\"authors\":\"Jing Liao, Kouichi Misaki, Jiro Sakamoto\",\"doi\":\"10.1007/s13239-024-00721-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>To enhance the performance of machine learning (ML) models for the post-embolization recanalization of cerebral aneurysms, we evaluated the impact of hemodynamic feature derivation and selection method on six ML algorithms.</p><p><strong>Methods: </strong>We utilized computational fluid dynamics (CFD) to simulate hemodynamics in 66 cerebral aneurysms from 65 patients, including 57 stable and nine recanalized aneurysms. We derived a total of 107 features for each aneurysm, encompassing four clinical features, 12 morphological features, and 91 hemodynamic features. To investigate the influence of feature derivation and selection methods on the ML models, we employed two derivation methods, simplified and fully derived, in combination with four selection methods: all features, statistically significant analysis, stepwise multivariate logistic regression analysis (stepwise-LR), and recursive feature elimination (RFE). Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and precision-recall curve (AUPRC) on both the training and testing datasets.</p><p><strong>Results: </strong>The AUROC values on the testing dataset exhibited a wide-ranging spectrum, spanning from 0.373 to 0.863. Fully derived features and the RFE selection method demonstrated superior performance in intra-model comparisons. The multi-layer perceptron (MLP) model, trained with RFE-selected fully derived features, achieved the best performance on the testing dataset, with an AUROC value of 0.863 (95% CI: 0.684- 1.000).</p><p><strong>Conclusion: </strong>Our study demonstrated the importance of feature derivation and selection in determining the performance of ML models. This enabled the development of accurate decision-making models without the need to invade the patient.</p>\",\"PeriodicalId\":54322,\"journal\":{\"name\":\"Cardiovascular Engineering and Technology\",\"volume\":\" \",\"pages\":\"394-404\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cardiovascular Engineering and Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s13239-024-00721-6\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/5/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiovascular Engineering and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s13239-024-00721-6","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/23 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
目的:为了提高机器学习(ML)模型在脑动脉瘤栓塞后再通方面的性能,我们评估了血液动力学特征推导和选择方法对六种 ML 算法的影响:我们利用计算流体动力学(CFD)模拟了 65 名患者的 66 个脑动脉瘤的血流动力学,其中包括 57 个稳定动脉瘤和 9 个再闭塞动脉瘤。我们为每个动脉瘤得出了共 107 个特征,包括 4 个临床特征、12 个形态特征和 91 个血液动力学特征。为了研究特征推导和选择方法对 ML 模型的影响,我们采用了简化和完全推导两种推导方法以及四种选择方法:所有特征、统计意义分析、逐步多元逻辑回归分析(逐步-LR)和递归特征剔除(RFE)。在训练数据集和测试数据集上使用接收者工作特征曲线下面积(AUROC)和精确度-召回曲线(AUPRC)评估模型性能:测试数据集上的 AUROC 值范围很广,从 0.373 到 0.863 不等。完全导出特征和 RFE 选择方法在模型内部比较中表现出更优越的性能。在测试数据集上,使用 RFE 选择的完全导出特征训练的多层感知器(MLP)模型取得了最佳性能,AUROC 值为 0.863(95% CI:0.684- 1.000):我们的研究证明了特征推导和选择在决定 ML 模型性能方面的重要性。结论:我们的研究证明了特征推导和选择在决定 ML 模型性能方面的重要性,这使得我们能够在无需侵入患者体内的情况下开发出准确的决策模型。
Impact Exploration of Spatiotemporal Feature Derivation and Selection on Machine Learning-Based Predictive Models for Post-Embolization Cerebral Aneurysm Recanalization.
Purpose: To enhance the performance of machine learning (ML) models for the post-embolization recanalization of cerebral aneurysms, we evaluated the impact of hemodynamic feature derivation and selection method on six ML algorithms.
Methods: We utilized computational fluid dynamics (CFD) to simulate hemodynamics in 66 cerebral aneurysms from 65 patients, including 57 stable and nine recanalized aneurysms. We derived a total of 107 features for each aneurysm, encompassing four clinical features, 12 morphological features, and 91 hemodynamic features. To investigate the influence of feature derivation and selection methods on the ML models, we employed two derivation methods, simplified and fully derived, in combination with four selection methods: all features, statistically significant analysis, stepwise multivariate logistic regression analysis (stepwise-LR), and recursive feature elimination (RFE). Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and precision-recall curve (AUPRC) on both the training and testing datasets.
Results: The AUROC values on the testing dataset exhibited a wide-ranging spectrum, spanning from 0.373 to 0.863. Fully derived features and the RFE selection method demonstrated superior performance in intra-model comparisons. The multi-layer perceptron (MLP) model, trained with RFE-selected fully derived features, achieved the best performance on the testing dataset, with an AUROC value of 0.863 (95% CI: 0.684- 1.000).
Conclusion: Our study demonstrated the importance of feature derivation and selection in determining the performance of ML models. This enabled the development of accurate decision-making models without the need to invade the patient.
期刊介绍:
Cardiovascular Engineering and Technology is a journal publishing the spectrum of basic to translational research in all aspects of cardiovascular physiology and medical treatment. It is the forum for academic and industrial investigators to disseminate research that utilizes engineering principles and methods to advance fundamental knowledge and technological solutions related to the cardiovascular system. Manuscripts spanning from subcellular to systems level topics are invited, including but not limited to implantable medical devices, hemodynamics and tissue biomechanics, functional imaging, surgical devices, electrophysiology, tissue engineering and regenerative medicine, diagnostic instruments, transport and delivery of biologics, and sensors. In addition to manuscripts describing the original publication of research, manuscripts reviewing developments in these topics or their state-of-art are also invited.