Probabilistic machine learning methods for Down syndrome prenatal screening: unified first and second trimester markers multivariate bayesian risk model versus current contingent sequential strategy
Luis M Lopez-Garcia, A. Bataller-Calatayud, Concha Sanz-Marti, Antonio V Antoli-Frances
{"title":"Probabilistic machine learning methods for Down syndrome prenatal screening: unified first and second trimester markers multivariate bayesian risk model versus current contingent sequential strategy","authors":"Luis M Lopez-Garcia, A. Bataller-Calatayud, Concha Sanz-Marti, Antonio V Antoli-Frances","doi":"10.1145/3502060.3503633","DOIUrl":null,"url":null,"abstract":"The aim of the study is to compare performance of a multivariate bayesian model to estimate Down syndrome (DS) risk from a dataset containing biochemical and ultrasound parameters used in first and second-trimester prenatal screening with current contingent sequential screening strategy in the diagnostic of cases missed in first-trimester screening. Methods: observational study in a retrospective way of collecting data from pregnant women who attended the Lluís Alcanyís, Ribera de Alzira y Verge dels Lliris de Alcoy hospitals in Valencia -Spain- for a period of 10 years (2010-2020). Multivariate normal distributions for affected and unaffected fetuses, specified by the means and the covariance matrix of biochemical and ultrasound parameters were set up. Area under the curve (AUC) of ROC (receiver operating characteristic) for classification of DS missed in first trimester screening was the figure to compare by DeLong test the diagnostic performance of quadratic discriminant analysis from this multivariate Bayesian model to current Naïve-Bayes strategy. Results: no significant differences were found between both strategies. Other than second-trimester biometric variables between them, no relevant correlation between the rest of markers was found. Conclusion: accuracy of other machine learning strategies should be investigated, namely artificial neural network, to improve the identification of first trimester undetected cases.","PeriodicalId":193100,"journal":{"name":"2021 International Symposium on Biomedical Engineering and Computational Biology","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Biomedical Engineering and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3502060.3503633","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The aim of the study is to compare performance of a multivariate bayesian model to estimate Down syndrome (DS) risk from a dataset containing biochemical and ultrasound parameters used in first and second-trimester prenatal screening with current contingent sequential screening strategy in the diagnostic of cases missed in first-trimester screening. Methods: observational study in a retrospective way of collecting data from pregnant women who attended the Lluís Alcanyís, Ribera de Alzira y Verge dels Lliris de Alcoy hospitals in Valencia -Spain- for a period of 10 years (2010-2020). Multivariate normal distributions for affected and unaffected fetuses, specified by the means and the covariance matrix of biochemical and ultrasound parameters were set up. Area under the curve (AUC) of ROC (receiver operating characteristic) for classification of DS missed in first trimester screening was the figure to compare by DeLong test the diagnostic performance of quadratic discriminant analysis from this multivariate Bayesian model to current Naïve-Bayes strategy. Results: no significant differences were found between both strategies. Other than second-trimester biometric variables between them, no relevant correlation between the rest of markers was found. Conclusion: accuracy of other machine learning strategies should be investigated, namely artificial neural network, to improve the identification of first trimester undetected cases.
该研究的目的是比较多变量贝叶斯模型的性能,以估计唐氏综合征(DS)的风险,该模型来自一个包含生化和超声参数的数据集,该数据集用于妊娠早期和中期产前筛查,而当前的随机顺序筛查策略用于诊断妊娠早期筛查中遗漏的病例。方法:观察性研究以回顾性的方式收集在西班牙瓦伦西亚Lluís Alcanyís, Ribera de Alzira y Verge dels Lliris de Alcoy医院就诊的孕妇的数据,为期10年(2010-2020)。建立受影响胎儿和未受影响胎儿的多变量正态分布,由生化和超声参数的均值和协方差矩阵指定。ROC (receiver operating characteristic)曲线下面积(Area under the curve, AUC)为早期妊娠筛查漏诊DS的分类图,通过DeLong检验比较该多元贝叶斯模型与目前Naïve-Bayes策略二次判别分析的诊断性能。结果:两种策略间无显著差异。除中期妊娠生物特征变量外,其余指标间无相关性。结论:应进一步研究其他机器学习策略的准确性,即人工神经网络,以提高对早期妊娠未发现病例的识别。