Lorena Bori, Marco Toschi, Rebeca Esteve, Arantza Delgado, Antonio Pellicer, Marcos Meseguer
{"title":"全自动评估工具的外部验证:对 68,471 个已评分胚胎的回顾性分析。","authors":"Lorena Bori, Marco Toschi, Rebeca Esteve, Arantza Delgado, Antonio Pellicer, Marcos Meseguer","doi":"10.1016/j.fertnstert.2024.10.006","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To externally validate a fully automated embryo classification in in vitro fertilization (IVF) treatments.</p><p><strong>Design: </strong>Retrospective cohort study SUBJECTS: A total of 6,434 patients undergoing 7,352 IVF treatments contributed 70,456 embryos.</p><p><strong>Exposure: </strong>Embryos were evaluated by conventional morphology and retrospectively scored using a fully automated deep learning-based algorithm across conventional IVF, oocyte donation, and PGT-A cycles.</p><p><strong>Main outcome measures: </strong>The primary outcomes were implantation and live birth including odds ratios (ORs) from generalized estimating equation (GEE) models. Secondary outcomes were embryo morphology, euploidy and miscarriage. Exploratory outcomes included comparison between conventional methodology and artificial intelligence (AI) algorithm with areas under the ROC curves (AUCs), agreement degree between AI and embryologists, Cohen's Kappa coefficient and relative risk (RR).</p><p><strong>Results: </strong>Implantation and live birth rates increased as the automatic embryo score rose. The GEE model, controlling for confounders, showed the automatic score was associated with an OR of 1.31 (95%CI[1.25-1.36]) for implantation in treatments using oocytes from patients, and an OR of 1.17 (95%CI[1.14-1.20]) in the oocyte donation program, with no significant association in PGT-A treatments. For live birth, the ORs were 1.27 (95%CI[1.21-1.33]) for patients, 1.16 (95%CI[1.13-1.19]) for donors, and 1.05 (95%CI[1-1.10]) for PGT-A cycles. The average score was higher in embryos with better morphology, in euploid embryos compared to aneuploid embryos, and in embryos that resulted in a full-term pregnancy compared to those that miscarried. Concordance between the highest-scoring embryo and the embryo with the best conventional morphology was 71.4%(95%CI[67.7%-75.0%]) in treatments with patient oocytes and 61.0%(95%CI[58.6%-63.4%]) in the oocyte donation program. Overall, the Cohen's Kappa coefficient was 0.63. The automatic embryo score showed similar AUCs to conventional morphology, although implantation was higher when the transferred embryo matched the highest-scoring embryo from each cohort (57.36% vs. 49.98%). RR indicated a 1.14-fold increase in implantation likelihood when the top-ranked embryo was transferred.</p><p><strong>Conclusion: </strong>Fully automated embryo scoring effectively ranked embryos based on their potential for implantation and live birth. The performance of the conventional methodology was comparable to that of the artificial intelligence-based technology; however, better clinical outcomes were observed when the highest-scoring embryo in the cohort was transferred.</p>","PeriodicalId":12275,"journal":{"name":"Fertility and sterility","volume":null,"pages":null},"PeriodicalIF":6.6000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"External validation of a fully automated evaluation tool: a retrospective analysis on 68,471 scored embryos.\",\"authors\":\"Lorena Bori, Marco Toschi, Rebeca Esteve, Arantza Delgado, Antonio Pellicer, Marcos Meseguer\",\"doi\":\"10.1016/j.fertnstert.2024.10.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To externally validate a fully automated embryo classification in in vitro fertilization (IVF) treatments.</p><p><strong>Design: </strong>Retrospective cohort study SUBJECTS: A total of 6,434 patients undergoing 7,352 IVF treatments contributed 70,456 embryos.</p><p><strong>Exposure: </strong>Embryos were evaluated by conventional morphology and retrospectively scored using a fully automated deep learning-based algorithm across conventional IVF, oocyte donation, and PGT-A cycles.</p><p><strong>Main outcome measures: </strong>The primary outcomes were implantation and live birth including odds ratios (ORs) from generalized estimating equation (GEE) models. Secondary outcomes were embryo morphology, euploidy and miscarriage. Exploratory outcomes included comparison between conventional methodology and artificial intelligence (AI) algorithm with areas under the ROC curves (AUCs), agreement degree between AI and embryologists, Cohen's Kappa coefficient and relative risk (RR).</p><p><strong>Results: </strong>Implantation and live birth rates increased as the automatic embryo score rose. The GEE model, controlling for confounders, showed the automatic score was associated with an OR of 1.31 (95%CI[1.25-1.36]) for implantation in treatments using oocytes from patients, and an OR of 1.17 (95%CI[1.14-1.20]) in the oocyte donation program, with no significant association in PGT-A treatments. For live birth, the ORs were 1.27 (95%CI[1.21-1.33]) for patients, 1.16 (95%CI[1.13-1.19]) for donors, and 1.05 (95%CI[1-1.10]) for PGT-A cycles. The average score was higher in embryos with better morphology, in euploid embryos compared to aneuploid embryos, and in embryos that resulted in a full-term pregnancy compared to those that miscarried. Concordance between the highest-scoring embryo and the embryo with the best conventional morphology was 71.4%(95%CI[67.7%-75.0%]) in treatments with patient oocytes and 61.0%(95%CI[58.6%-63.4%]) in the oocyte donation program. Overall, the Cohen's Kappa coefficient was 0.63. The automatic embryo score showed similar AUCs to conventional morphology, although implantation was higher when the transferred embryo matched the highest-scoring embryo from each cohort (57.36% vs. 49.98%). RR indicated a 1.14-fold increase in implantation likelihood when the top-ranked embryo was transferred.</p><p><strong>Conclusion: </strong>Fully automated embryo scoring effectively ranked embryos based on their potential for implantation and live birth. The performance of the conventional methodology was comparable to that of the artificial intelligence-based technology; however, better clinical outcomes were observed when the highest-scoring embryo in the cohort was transferred.</p>\",\"PeriodicalId\":12275,\"journal\":{\"name\":\"Fertility and sterility\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fertility and sterility\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.fertnstert.2024.10.006\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fertility and sterility","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.fertnstert.2024.10.006","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:对体外受精(IVF)治疗中的全自动胚胎分类进行外部验证:从外部验证体外受精(IVF)治疗中的全自动胚胎分类:设计:回顾性队列研究:暴露:通过传统形态学对胚胎进行评估,并使用基于深度学习的全自动算法对传统试管婴儿、卵母细胞捐赠和PGT-A周期的胚胎进行回顾性评分:主要结果是植入和活产,包括来自广义估计方程(GEE)模型的几率比(ORs)。次要结果为胚胎形态、非整倍体和流产。探索性结果包括传统方法与人工智能(AI)算法的比较、ROC 曲线下面积(AUC)、AI 与胚胎学家之间的一致程度、科恩卡帕系数(Cohen's Kappa coefficient)和相对风险(RR):结果:随着自动胚胎评分的提高,植入率和活产率也随之提高。控制混杂因素的 GEE 模型显示,在使用患者卵母细胞的治疗中,自动评分与植入的 OR 值为 1.31(95%CI[1.25-1.36])相关,在卵母细胞捐献项目中,与植入的 OR 值为 1.17(95%CI[1.14-1.20])相关,而在 PGT-A 治疗中,与植入的 OR 值无显著相关。在活产方面,患者的 OR 值为 1.27(95%CI[1.21-1.33]),捐献者的 OR 值为 1.16(95%CI[1.13-1.19]),PGT-A 周期的 OR 值为 1.05(95%CI[1-1.10])。形态较好的胚胎、优卵胚胎与非优卵胚胎相比,以及足月妊娠的胚胎与流产的胚胎相比,平均得分更高。在使用患者卵母细胞的治疗中,得分最高的胚胎与常规形态最佳的胚胎之间的一致性为 71.4%(95%CI[67.7%-75.0%]),而在卵母细胞捐赠计划中,两者之间的一致性为 61.0%(95%CI[58.6%-63.4%])。总体而言,科恩卡帕系数为 0.63。自动胚胎评分的 AUC 与传统形态学相似,但当移植的胚胎与每个队列中得分最高的胚胎相匹配时,植入率更高(57.36% 对 49.98%)。RR表明,移植得分最高的胚胎时,植入可能性增加了1.14倍:结论:全自动胚胎评分可根据胚胎植入和活产的可能性对胚胎进行有效排名。传统方法的性能与基于人工智能技术的方法不相上下;但是,如果移植队列中得分最高的胚胎,则可观察到更好的临床结果。
External validation of a fully automated evaluation tool: a retrospective analysis on 68,471 scored embryos.
Objective: To externally validate a fully automated embryo classification in in vitro fertilization (IVF) treatments.
Design: Retrospective cohort study SUBJECTS: A total of 6,434 patients undergoing 7,352 IVF treatments contributed 70,456 embryos.
Exposure: Embryos were evaluated by conventional morphology and retrospectively scored using a fully automated deep learning-based algorithm across conventional IVF, oocyte donation, and PGT-A cycles.
Main outcome measures: The primary outcomes were implantation and live birth including odds ratios (ORs) from generalized estimating equation (GEE) models. Secondary outcomes were embryo morphology, euploidy and miscarriage. Exploratory outcomes included comparison between conventional methodology and artificial intelligence (AI) algorithm with areas under the ROC curves (AUCs), agreement degree between AI and embryologists, Cohen's Kappa coefficient and relative risk (RR).
Results: Implantation and live birth rates increased as the automatic embryo score rose. The GEE model, controlling for confounders, showed the automatic score was associated with an OR of 1.31 (95%CI[1.25-1.36]) for implantation in treatments using oocytes from patients, and an OR of 1.17 (95%CI[1.14-1.20]) in the oocyte donation program, with no significant association in PGT-A treatments. For live birth, the ORs were 1.27 (95%CI[1.21-1.33]) for patients, 1.16 (95%CI[1.13-1.19]) for donors, and 1.05 (95%CI[1-1.10]) for PGT-A cycles. The average score was higher in embryos with better morphology, in euploid embryos compared to aneuploid embryos, and in embryos that resulted in a full-term pregnancy compared to those that miscarried. Concordance between the highest-scoring embryo and the embryo with the best conventional morphology was 71.4%(95%CI[67.7%-75.0%]) in treatments with patient oocytes and 61.0%(95%CI[58.6%-63.4%]) in the oocyte donation program. Overall, the Cohen's Kappa coefficient was 0.63. The automatic embryo score showed similar AUCs to conventional morphology, although implantation was higher when the transferred embryo matched the highest-scoring embryo from each cohort (57.36% vs. 49.98%). RR indicated a 1.14-fold increase in implantation likelihood when the top-ranked embryo was transferred.
Conclusion: Fully automated embryo scoring effectively ranked embryos based on their potential for implantation and live birth. The performance of the conventional methodology was comparable to that of the artificial intelligence-based technology; however, better clinical outcomes were observed when the highest-scoring embryo in the cohort was transferred.
期刊介绍:
Fertility and Sterility® is an international journal for obstetricians, gynecologists, reproductive endocrinologists, urologists, basic scientists and others who treat and investigate problems of infertility and human reproductive disorders. The journal publishes juried original scientific articles in clinical and laboratory research relevant to reproductive endocrinology, urology, andrology, physiology, immunology, genetics, contraception, and menopause. Fertility and Sterility® encourages and supports meaningful basic and clinical research, and facilitates and promotes excellence in professional education, in the field of reproductive medicine.