Ping Cao, Ganesh Acharya, Andres Salumets, Masoud Zamani Esteki
{"title":"Large language models to facilitate pregnancy prediction after in vitro fertilization.","authors":"Ping Cao, Ganesh Acharya, Andres Salumets, Masoud Zamani Esteki","doi":"10.1111/aogs.14989","DOIUrl":null,"url":null,"abstract":"<p><p>We evaluated the efficacy of large language models (LLMs), specifically, generative pre-trained transformer-4 (GPT-4), in predicting pregnancy following in vitro fertilization (IVF) treatment and compared its accuracy with results from an original published study. Our findings revealed that GPT-4 can autonomously develop and refine advanced machine learning models for pregnancy prediction with minimal human intervention. The prediction accuracy was 0.79, and the area under the receiver operating characteristic curve (AUROC) was 0.89, exceeding or being at least equivalent to the metrics reported in the original study, that is, 0.78 for accuracy and 0.87 for AUROC. The results suggest that LLMs can facilitate data processing, optimize machine learning models in predicting IVF success rates, and provide data interpretation methods. This capacity can help bridge the knowledge gap between data scientists and medical personnel to solve the most pressing clinical challenges. However, more experiments on diverse and larger datasets are needed to validate and promote broader applications of LLMs in assisted reproduction.</p>","PeriodicalId":6990,"journal":{"name":"Acta Obstetricia et Gynecologica Scandinavica","volume":" ","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Obstetricia et Gynecologica Scandinavica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/aogs.14989","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
We evaluated the efficacy of large language models (LLMs), specifically, generative pre-trained transformer-4 (GPT-4), in predicting pregnancy following in vitro fertilization (IVF) treatment and compared its accuracy with results from an original published study. Our findings revealed that GPT-4 can autonomously develop and refine advanced machine learning models for pregnancy prediction with minimal human intervention. The prediction accuracy was 0.79, and the area under the receiver operating characteristic curve (AUROC) was 0.89, exceeding or being at least equivalent to the metrics reported in the original study, that is, 0.78 for accuracy and 0.87 for AUROC. The results suggest that LLMs can facilitate data processing, optimize machine learning models in predicting IVF success rates, and provide data interpretation methods. This capacity can help bridge the knowledge gap between data scientists and medical personnel to solve the most pressing clinical challenges. However, more experiments on diverse and larger datasets are needed to validate and promote broader applications of LLMs in assisted reproduction.
期刊介绍:
Published monthly, Acta Obstetricia et Gynecologica Scandinavica is an international journal dedicated to providing the very latest information on the results of both clinical, basic and translational research work related to all aspects of women’s health from around the globe. The journal regularly publishes commentaries, reviews, and original articles on a wide variety of topics including: gynecology, pregnancy, birth, female urology, gynecologic oncology, fertility and reproductive biology.