通过机器学习算法构建新型临床指标模型以预测妊娠期地中海贫血的发生

Frontiers in hematology Pub Date : 2024-04-04 DOI:10.3389/frhem.2024.1341225

Yaoshui Long, Wenxue Bai

{"title":"通过机器学习算法构建新型临床指标模型以预测妊娠期地中海贫血的发生","authors":"Yaoshui Long, Wenxue Bai","doi":"10.3389/frhem.2024.1341225","DOIUrl":null,"url":null,"abstract":"Thalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Furthermore, the economic and time costs of genetic testing for thalassemia prevent many thalassemia patients from being diagnosed in time. To solve this problem, we performed least absolute shrinkage and selection operator (LASSO) regression to analyze the correlation between thalassemia and blood routine indicators containing mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), and red blood cell (RBC). We then built a nomogram to predict the occurrence of thalassemia, and receiver operating characteristic (ROC) curve was used to verify the prediction efficiency of this model. In total, we obtained 7,621 cases, including 847 thalassemia patients and 6,774 non-thalassemia. Among the 847 thalassemia patients, with a positivity rate of 67.2%, 569 cases were positive for α-thalassemia, and with a rate of 31.5%, 267 cases were positive for β-thalassemia. The remaining 11 cases were positive for both α- and β-thalassemia. Based on machine learning algorithm, we screened four optimal indicators, namely, MCV, MCH, RBC, and MCHC. The AUC value of MCV, MCH, RBC, and MCHC were 0.907, 0.906, 0.796, and 0.795, respectively. Moreover, the AUC value of the prediction model was 0.911. In summary, a novel and effective machine learning model was built to predict thalassemia, which functioned accurately, and may provide new insights for the early screening of thalassemia in the future.","PeriodicalId":101407,"journal":{"name":"Frontiers in hematology","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm\",\"authors\":\"Yaoshui Long, Wenxue Bai\",\"doi\":\"10.3389/frhem.2024.1341225\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Thalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Furthermore, the economic and time costs of genetic testing for thalassemia prevent many thalassemia patients from being diagnosed in time. To solve this problem, we performed least absolute shrinkage and selection operator (LASSO) regression to analyze the correlation between thalassemia and blood routine indicators containing mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), and red blood cell (RBC). We then built a nomogram to predict the occurrence of thalassemia, and receiver operating characteristic (ROC) curve was used to verify the prediction efficiency of this model. In total, we obtained 7,621 cases, including 847 thalassemia patients and 6,774 non-thalassemia. Among the 847 thalassemia patients, with a positivity rate of 67.2%, 569 cases were positive for α-thalassemia, and with a rate of 31.5%, 267 cases were positive for β-thalassemia. The remaining 11 cases were positive for both α- and β-thalassemia. Based on machine learning algorithm, we screened four optimal indicators, namely, MCV, MCH, RBC, and MCHC. The AUC value of MCV, MCH, RBC, and MCHC were 0.907, 0.906, 0.796, and 0.795, respectively. Moreover, the AUC value of the prediction model was 0.911. In summary, a novel and effective machine learning model was built to predict thalassemia, which functioned accurately, and may provide new insights for the early screening of thalassemia in the future.\",\"PeriodicalId\":101407,\"journal\":{\"name\":\"Frontiers in hematology\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in hematology\",\"FirstCategoryId\":\"0\",\"ListUrlMain\":\"https://doi.org/10.3389/frhem.2024.1341225\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in hematology","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.3389/frhem.2024.1341225","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

地中海贫血是世界性遗传性血红蛋白病之一，会导致无效红细胞生成、慢性溶血性贫血、代偿性造血扩张、高凝等，母亲携带地中海贫血基因，孩子更容易患重型地中海贫血。此外，地中海贫血基因检测的经济成本和时间成本使许多地中海贫血患者无法及时确诊。为了解决这一问题，我们采用最小绝对收缩和选择算子（LASSO）回归法分析了地中海贫血与血常规指标（包括平均血球容积（MCV）、平均血红蛋白（MCH）、平均血红蛋白浓度（MCHC）和红细胞（RBC））之间的相关性。然后，我们建立了一个预测地中海贫血发生率的提名图，并使用接收器操作特征曲线（ROC）来验证该模型的预测效率。我们总共获得了 7,621 个病例，包括 847 名地中海贫血患者和 6,774 名非地中海贫血患者。在 847 例地中海贫血患者中，α-地中海贫血阳性 569 例，阳性率为 67.2%；β-地中海贫血阳性 267 例，阳性率为 31.5%。其余 11 个病例对α地中海贫血和β地中海贫血均呈阳性。基于机器学习算法，我们筛选出了四个最佳指标，即 MCV、MCH、RBC 和 MCHC。MCV、MCH、RBC和MCHC的AUC值分别为0.907、0.906、0.796和0.795。此外，预测模型的 AUC 值为 0.911。总之，本文建立了一个新颖有效的机器学习模型来预测地中海贫血症，该模型功能准确，可为将来地中海贫血症的早期筛查提供新的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm

Thalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Furthermore, the economic and time costs of genetic testing for thalassemia prevent many thalassemia patients from being diagnosed in time. To solve this problem, we performed least absolute shrinkage and selection operator (LASSO) regression to analyze the correlation between thalassemia and blood routine indicators containing mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), and red blood cell (RBC). We then built a nomogram to predict the occurrence of thalassemia, and receiver operating characteristic (ROC) curve was used to verify the prediction efficiency of this model. In total, we obtained 7,621 cases, including 847 thalassemia patients and 6,774 non-thalassemia. Among the 847 thalassemia patients, with a positivity rate of 67.2%, 569 cases were positive for α-thalassemia, and with a rate of 31.5%, 267 cases were positive for β-thalassemia. The remaining 11 cases were positive for both α- and β-thalassemia. Based on machine learning algorithm, we screened four optimal indicators, namely, MCV, MCH, RBC, and MCHC. The AUC value of MCV, MCH, RBC, and MCHC were 0.907, 0.906, 0.796, and 0.795, respectively. Moreover, the AUC value of the prediction model was 0.911. In summary, a novel and effective machine learning model was built to predict thalassemia, which functioned accurately, and may provide new insights for the early screening of thalassemia in the future.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in hematology

自引率

0.00%

发文量