Arantxa Ortega-Leon, Arnaud Gucciardi, A. Segado-Arenas, Isabel Benavente-Fernández, Daniel Urda, Ignacio Turias
{"title":"Neurodevelopmental Impairments Prediction in Premature Infants Based on Clinical Data and Machine Learning Techniques","authors":"Arantxa Ortega-Leon, Arnaud Gucciardi, A. Segado-Arenas, Isabel Benavente-Fernández, Daniel Urda, Ignacio Turias","doi":"10.3390/stats7030041","DOIUrl":null,"url":null,"abstract":"Preterm infants are prone to NeuroDevelopmental Impairment (NDI). Some previous works have identified clinical variables that can be potential predictors of NDI. However, machine learning (ML)-based models still present low predictive capabilities when addressing this problem. This work attempts to evaluate the application of ML techniques to predict NDI using clinical data from a cohort of very preterm infants recruited at birth and assessed at 2 years of age. Six different classification models were assessed, using all features, clinician-selected features, and mutual information feature selection. The best results were obtained by ML models trained using mutual information-selected features and employing oversampling, for cognitive and motor impairment prediction, while for language impairment prediction the best setting was clinician-selected features. Although the performance indicators in this local cohort are consistent with similar previous works and still rather poor. This is a clear indication that, in order to obtain better performance rates, further analysis and methods should be considered, and other types of data should be taken into account together with the clinical variables.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/stats7030041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Preterm infants are prone to NeuroDevelopmental Impairment (NDI). Some previous works have identified clinical variables that can be potential predictors of NDI. However, machine learning (ML)-based models still present low predictive capabilities when addressing this problem. This work attempts to evaluate the application of ML techniques to predict NDI using clinical data from a cohort of very preterm infants recruited at birth and assessed at 2 years of age. Six different classification models were assessed, using all features, clinician-selected features, and mutual information feature selection. The best results were obtained by ML models trained using mutual information-selected features and employing oversampling, for cognitive and motor impairment prediction, while for language impairment prediction the best setting was clinician-selected features. Although the performance indicators in this local cohort are consistent with similar previous works and still rather poor. This is a clear indication that, in order to obtain better performance rates, further analysis and methods should be considered, and other types of data should be taken into account together with the clinical variables.
早产儿容易出现神经发育障碍(NDI)。之前的一些研究已经确定了一些临床变量,这些变量可以作为 NDI 的潜在预测因子。然而,在解决这一问题时,基于机器学习(ML)的模型仍显示出较低的预测能力。这项研究试图评估应用 ML 技术预测 NDI 的情况,该技术使用的临床数据来自一组出生时招募并在 2 岁时接受评估的早产儿。通过使用所有特征、临床医生选择的特征和互信息特征选择,对六种不同的分类模型进行了评估。在认知障碍和运动障碍预测方面,使用互信息特征选择和超采样训练的 ML 模型获得了最佳结果,而在语言障碍预测方面,临床医生选择的特征是最佳设置。虽然这一本地队列的性能指标与之前的类似研究一致,但仍然相当差。这清楚地表明,为了获得更好的性能比,应考虑进一步的分析和方法,并将其他类型的数据与临床变量一并考虑在内。