妇科肿瘤多学科小组肿瘤委员会的 ChatGPT:可行性研究。

IF 4.7 1区 医学 Q1 OBSTETRICS & GYNECOLOGY Bjog-An International Journal of Obstetrics and Gynaecology Pub Date : 2024-08-14 DOI:10.1111/1471-0528.17929
Gabriel Levin, Walter Gotlieb, Pedro Ramirez, Raanan Meyer, Yoav Brezinov
{"title":"妇科肿瘤多学科小组肿瘤委员会的 ChatGPT:可行性研究。","authors":"Gabriel Levin,&nbsp;Walter Gotlieb,&nbsp;Pedro Ramirez,&nbsp;Raanan Meyer,&nbsp;Yoav Brezinov","doi":"10.1111/1471-0528.17929","DOIUrl":null,"url":null,"abstract":"<p>The practical medical use of artificial intelligence is rapidly progressing. Specifically, the application of ChatGPT was explored in medical education and even medical clinical data evaluation.<span><sup>1, 2</sup></span> Tumour board is an integral and pivotal part of patient treatment and management in gynaecologic oncology.<span><sup>3</sup></span> It entails the processing of various pathological and clinical parameters, coupled with the familiarity with treatment guidelines in accordance with the various parameters. The participation of ChatGPT in breast cancer tumour board was previously studied, with contrasting results.<span><sup>4, 5</sup></span> We aim to study the feasibility of ChatGPT (Versions 3.5 and 4) as a support tool for endometrial cancer (EC) and ovarian cancer (OC) according to the NCCN and ESGO guidelines.</p><p>Ten EC cases and ten OC cases were fabricated based on experience of authors pertaining to the most complex scenarios discussed in real practice. For EC the following data was formulated: age, histology, stage, grade, lymphovascular space invasion, tumour size and molecular classification—MMR, p53 and POLE mutation status. For OC, the following data was formulated: age, histology and stage.</p><p>We created a new account for ChatGPT 3.5 and purchased and created an account for ChatGPT 4. We used generic prompts for all the cases. The ChatGPT 3.5 and ChatGPT 4 prompt are described (Appendix S1).</p><p>For each tumour board case, we accessed the NCCN and ESGO guidelines separately and recorded their recommendation. All ChatGPT recommendations were judged as correct or incorrect by two independent reviewers (G.L. and Y.B.). Data analysis is described in detail in the Appendix S1.</p><p>We used SPSS 29 for the statistical analysis. As no patient information was used—no ethical board review was needed for this study.</p><p>There were ten cases of EC cancer, stages IA-IIIC with four different histology, and ten cases of OC stages IA-IC3 with five different histology. ChatGPT 3.5 was unable to give a concrete recommendation, and ChatGPT 4 gave a recommendation to all cases. No disagreements between reviewers were noted for all 40 evaluations.</p><p>The rate of correct recommendations was 70% (14/20) for NCCN guidelines and 60% (12/20) for ESGO guidelines (<i>p</i> = 0.512). (Table 1). There were 55% (11/20) of cases with correct recommendations for both guidelines, 20% (4/20) of cases in which a correct recommendation was given only according to one guideline (Figure S1), and 25% (5/20) of cases in which an incorrect recommendation was given. Of those with an incorrect recommendation, 80% (4/5) were EC, stages IA-II, of all histology, and one case of OC, stage IA. Of the four single guidelines correct recommendations, all were EC, with three incorrect recommendations according to ESGO guidelines, including the only two cases with a positive POLE mutation. OC had higher complete correct recommendation as compared to EC (90% vs. 20%, <i>p</i> = 0.005). ChatGPT 4 suggestions for adjuvant treatment are presented in Tables S1 and S2.</p><p>In this feasibility study, we showed that ChatGPT 4 provided correct recommendations in two-thirds of the cases evaluated, however in 25% of cases, mostly endometrial cancer, there was an incorrect recommendation. Endometrial cancer had a lower complete rate of correct recommendations, likely due to the complexity of stage, histology and grade in early stages and in the integration of molecular characterisation of endometrial cancer. More research is required to assess the credibility and configure protocols for the potential use of this tool. However, in a setting of high-volume clinics, or in regions where resources are limiting in terms of expertise, such tools may aid physicians maintain evidenced-based care. Further studies should focus on ChatGPT familiarity with ongoing clinical trials to assess for possible patient eligibility.</p><p>Our limitations include the small number of cases studied and limiting our study to endometrial and ovarian cancer. Additionally, we have used the generic ChatGPT tool without any specific training for our data. Moreover, we have used only two AI platforms in this study, this may limit the generalisability of our results. Importantly, we did not compare the AI-generated recommendation to a multidisciplinary Tumor Board recommendation, which is the ‘gold standard’ in real practice. Finally, all data is correct to the time this manuscript was written. As ChatGPT is a large language model, he is constantly trains on prompts and his output may change and evolve over time. Future prospective real-life evaluation of gynaecologic oncology tumour board is encouraged to better delineate advantages and pitfalls of artificial intelligence tools and their impact on practice.</p><p>Gabriel Levin: conception, design, acquisition of data, analysis and interpretation of data, drafting the article, approval of the final version. Walter Gotlieb: acquisition of data, critical revision of the article, approval of the final version. Pedro Ramirez: acquisition of data, critical revision of the article, approval of the final version. Raanan Meyer: acquisition of data, critical revision of the article, approval of the final version. Yoav Brezinov: conception and design, analysis and interpretation of data, critical revision of the article, approval of the final version.</p><p>This research received no external funding.</p><p>None.</p><p>The authors report no conflict of interest.</p><p>As no patient information was used—no ethical board review was needed for this study.</p>","PeriodicalId":50729,"journal":{"name":"Bjog-An International Journal of Obstetrics and Gynaecology","volume":"132 1","pages":"99-101"},"PeriodicalIF":4.7000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1471-0528.17929","citationCount":"0","resultStr":"{\"title\":\"ChatGPT in a gynaecologic oncology multidisciplinary team tumour board: A feasibility study\",\"authors\":\"Gabriel Levin,&nbsp;Walter Gotlieb,&nbsp;Pedro Ramirez,&nbsp;Raanan Meyer,&nbsp;Yoav Brezinov\",\"doi\":\"10.1111/1471-0528.17929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The practical medical use of artificial intelligence is rapidly progressing. Specifically, the application of ChatGPT was explored in medical education and even medical clinical data evaluation.<span><sup>1, 2</sup></span> Tumour board is an integral and pivotal part of patient treatment and management in gynaecologic oncology.<span><sup>3</sup></span> It entails the processing of various pathological and clinical parameters, coupled with the familiarity with treatment guidelines in accordance with the various parameters. The participation of ChatGPT in breast cancer tumour board was previously studied, with contrasting results.<span><sup>4, 5</sup></span> We aim to study the feasibility of ChatGPT (Versions 3.5 and 4) as a support tool for endometrial cancer (EC) and ovarian cancer (OC) according to the NCCN and ESGO guidelines.</p><p>Ten EC cases and ten OC cases were fabricated based on experience of authors pertaining to the most complex scenarios discussed in real practice. For EC the following data was formulated: age, histology, stage, grade, lymphovascular space invasion, tumour size and molecular classification—MMR, p53 and POLE mutation status. For OC, the following data was formulated: age, histology and stage.</p><p>We created a new account for ChatGPT 3.5 and purchased and created an account for ChatGPT 4. We used generic prompts for all the cases. The ChatGPT 3.5 and ChatGPT 4 prompt are described (Appendix S1).</p><p>For each tumour board case, we accessed the NCCN and ESGO guidelines separately and recorded their recommendation. All ChatGPT recommendations were judged as correct or incorrect by two independent reviewers (G.L. and Y.B.). Data analysis is described in detail in the Appendix S1.</p><p>We used SPSS 29 for the statistical analysis. As no patient information was used—no ethical board review was needed for this study.</p><p>There were ten cases of EC cancer, stages IA-IIIC with four different histology, and ten cases of OC stages IA-IC3 with five different histology. ChatGPT 3.5 was unable to give a concrete recommendation, and ChatGPT 4 gave a recommendation to all cases. No disagreements between reviewers were noted for all 40 evaluations.</p><p>The rate of correct recommendations was 70% (14/20) for NCCN guidelines and 60% (12/20) for ESGO guidelines (<i>p</i> = 0.512). (Table 1). There were 55% (11/20) of cases with correct recommendations for both guidelines, 20% (4/20) of cases in which a correct recommendation was given only according to one guideline (Figure S1), and 25% (5/20) of cases in which an incorrect recommendation was given. Of those with an incorrect recommendation, 80% (4/5) were EC, stages IA-II, of all histology, and one case of OC, stage IA. Of the four single guidelines correct recommendations, all were EC, with three incorrect recommendations according to ESGO guidelines, including the only two cases with a positive POLE mutation. OC had higher complete correct recommendation as compared to EC (90% vs. 20%, <i>p</i> = 0.005). ChatGPT 4 suggestions for adjuvant treatment are presented in Tables S1 and S2.</p><p>In this feasibility study, we showed that ChatGPT 4 provided correct recommendations in two-thirds of the cases evaluated, however in 25% of cases, mostly endometrial cancer, there was an incorrect recommendation. Endometrial cancer had a lower complete rate of correct recommendations, likely due to the complexity of stage, histology and grade in early stages and in the integration of molecular characterisation of endometrial cancer. More research is required to assess the credibility and configure protocols for the potential use of this tool. However, in a setting of high-volume clinics, or in regions where resources are limiting in terms of expertise, such tools may aid physicians maintain evidenced-based care. Further studies should focus on ChatGPT familiarity with ongoing clinical trials to assess for possible patient eligibility.</p><p>Our limitations include the small number of cases studied and limiting our study to endometrial and ovarian cancer. Additionally, we have used the generic ChatGPT tool without any specific training for our data. Moreover, we have used only two AI platforms in this study, this may limit the generalisability of our results. Importantly, we did not compare the AI-generated recommendation to a multidisciplinary Tumor Board recommendation, which is the ‘gold standard’ in real practice. Finally, all data is correct to the time this manuscript was written. As ChatGPT is a large language model, he is constantly trains on prompts and his output may change and evolve over time. Future prospective real-life evaluation of gynaecologic oncology tumour board is encouraged to better delineate advantages and pitfalls of artificial intelligence tools and their impact on practice.</p><p>Gabriel Levin: conception, design, acquisition of data, analysis and interpretation of data, drafting the article, approval of the final version. Walter Gotlieb: acquisition of data, critical revision of the article, approval of the final version. Pedro Ramirez: acquisition of data, critical revision of the article, approval of the final version. Raanan Meyer: acquisition of data, critical revision of the article, approval of the final version. Yoav Brezinov: conception and design, analysis and interpretation of data, critical revision of the article, approval of the final version.</p><p>This research received no external funding.</p><p>None.</p><p>The authors report no conflict of interest.</p><p>As no patient information was used—no ethical board review was needed for this study.</p>\",\"PeriodicalId\":50729,\"journal\":{\"name\":\"Bjog-An International Journal of Obstetrics and Gynaecology\",\"volume\":\"132 1\",\"pages\":\"99-101\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1471-0528.17929\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bjog-An International Journal of Obstetrics and Gynaecology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/1471-0528.17929\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bjog-An International Journal of Obstetrics and Gynaecology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1471-0528.17929","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

人工智能的实际医疗应用正在迅速发展。具体而言,探讨了ChatGPT在医学教育乃至医学临床数据评价中的应用。1,2肿瘤板是妇科肿瘤患者治疗和管理中不可或缺的关键部分它需要处理各种病理和临床参数,再加上熟悉根据各种参数的治疗指南。ChatGPT参与乳腺癌肿瘤治疗的研究之前有过对比结果。我们的目标是根据NCCN和ESGO指南,研究ChatGPT(版本3.5和版本4)作为子宫内膜癌(EC)和卵巢癌(OC)支持工具的可行性。10个EC案例和10个OC案例是根据作者在实际实践中讨论的最复杂场景的经验捏造的。对于EC,制定了以下数据:年龄,组织学,分期,分级,淋巴血管间隙侵犯,肿瘤大小和分子分类- mmr, p53和POLE突变状态。对于OC,制定以下数据:年龄,组织学和分期。我们为ChatGPT 3.5创建了一个新帐户,并为ChatGPT 4购买并创建了一个帐户。我们对所有情况都使用了通用提示。ChatGPT 3.5和ChatGPT 4提示符描述(附录S1)。对于每个肿瘤板病例,我们分别查阅了NCCN和ESGO指南,并记录了他们的建议。所有的ChatGPT建议都由两个独立的审稿人(G.L.和Y.B.)判断为正确或不正确。数据分析在附录S1中有详细描述。采用SPSS 29进行统计分析。由于没有使用患者信息,因此本研究不需要伦理委员会的审查。10例EC癌,分期为IA-IIIC,有4种不同的组织学,10例OC分期为IA-IC3,有5种不同的组织学。ChatGPT 3.5无法给出具体的建议,ChatGPT 4给出了所有案例的建议。所有40项评价均未发现评审人员之间存在分歧。NCCN指南推荐正确率为70% (14/20),ESGO指南推荐正确率为60% (12/20)(p = 0.512)。(表1)。55%(11/20)的病例对两个指南都给出了正确的建议,20%(4/20)的病例只根据一个指南给出了正确的建议(图S1), 25%(5/20)的病例给出了错误的建议。在不正确推荐的患者中,80%(4/5)为所有组织学分期IA- ii期EC, 1例为IA期OC。在四个单一指南正确的建议中,所有建议都是EC,根据ESGO指南有三个不正确的建议,包括仅有的两个阳性POLE突变病例。与EC相比,OC有更高的完全正确推荐(90%对20%,p = 0.005)。ChatGPT 4辅助治疗建议见表S1和表S2。在这项可行性研究中,我们发现在三分之二的评估病例中,ChatGPT 4提供了正确的建议,然而在25%的病例中,主要是子宫内膜癌,有一个不正确的建议。子宫内膜癌的正确建议完成率较低,可能是由于早期阶段、组织学和分级的复杂性以及子宫内膜癌分子特征的整合。需要更多的研究来评估该工具的可信性并为其潜在用途配置协议。然而,在大量诊所的环境中,或者在专业知识资源有限的地区,这些工具可能有助于医生维持循证护理。进一步的研究应关注ChatGPT对正在进行的临床试验的熟悉程度,以评估可能的患者资格。我们的局限性包括研究的病例数量少,并且我们的研究仅限于子宫内膜癌和卵巢癌。此外,我们使用了通用的ChatGPT工具,没有对我们的数据进行任何特定的培训。此外,我们在这项研究中只使用了两个人工智能平台,这可能会限制我们结果的普遍性。重要的是,我们没有将人工智能生成的建议与多学科肿瘤委员会的建议进行比较,后者是实际实践中的“黄金标准”。最后,所有的数据都是正确的,直到本文写作的时间。由于ChatGPT是一个大型的语言模型,他不断地根据提示进行训练,并且他的输出可能会随着时间的推移而变化和发展。鼓励未来对妇科肿瘤的前瞻性现实生活评估,以更好地描述人工智能工具的优点和缺陷及其对实践的影响。Gabriel Levin:构思,设计,获取数据,分析和解释数据,起草文章,批准最终版本。Walter gottlieb:获取数据,对文章进行关键性修订,批准最终版本。 Pedro Ramirez:获取数据,对文章进行关键性修改,批准最终版本。Raanan Meyer:获取数据,对文章进行关键性修改,批准最终版本。Yoav Brezinov:构思和设计,分析和解释数据,对文章进行批判性修改,批准最终版本。本研究未获得外部资助,作者报告无利益冲突。由于没有使用患者信息,因此本研究不需要伦理委员会的审查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ChatGPT in a gynaecologic oncology multidisciplinary team tumour board: A feasibility study

The practical medical use of artificial intelligence is rapidly progressing. Specifically, the application of ChatGPT was explored in medical education and even medical clinical data evaluation.1, 2 Tumour board is an integral and pivotal part of patient treatment and management in gynaecologic oncology.3 It entails the processing of various pathological and clinical parameters, coupled with the familiarity with treatment guidelines in accordance with the various parameters. The participation of ChatGPT in breast cancer tumour board was previously studied, with contrasting results.4, 5 We aim to study the feasibility of ChatGPT (Versions 3.5 and 4) as a support tool for endometrial cancer (EC) and ovarian cancer (OC) according to the NCCN and ESGO guidelines.

Ten EC cases and ten OC cases were fabricated based on experience of authors pertaining to the most complex scenarios discussed in real practice. For EC the following data was formulated: age, histology, stage, grade, lymphovascular space invasion, tumour size and molecular classification—MMR, p53 and POLE mutation status. For OC, the following data was formulated: age, histology and stage.

We created a new account for ChatGPT 3.5 and purchased and created an account for ChatGPT 4. We used generic prompts for all the cases. The ChatGPT 3.5 and ChatGPT 4 prompt are described (Appendix S1).

For each tumour board case, we accessed the NCCN and ESGO guidelines separately and recorded their recommendation. All ChatGPT recommendations were judged as correct or incorrect by two independent reviewers (G.L. and Y.B.). Data analysis is described in detail in the Appendix S1.

We used SPSS 29 for the statistical analysis. As no patient information was used—no ethical board review was needed for this study.

There were ten cases of EC cancer, stages IA-IIIC with four different histology, and ten cases of OC stages IA-IC3 with five different histology. ChatGPT 3.5 was unable to give a concrete recommendation, and ChatGPT 4 gave a recommendation to all cases. No disagreements between reviewers were noted for all 40 evaluations.

The rate of correct recommendations was 70% (14/20) for NCCN guidelines and 60% (12/20) for ESGO guidelines (p = 0.512). (Table 1). There were 55% (11/20) of cases with correct recommendations for both guidelines, 20% (4/20) of cases in which a correct recommendation was given only according to one guideline (Figure S1), and 25% (5/20) of cases in which an incorrect recommendation was given. Of those with an incorrect recommendation, 80% (4/5) were EC, stages IA-II, of all histology, and one case of OC, stage IA. Of the four single guidelines correct recommendations, all were EC, with three incorrect recommendations according to ESGO guidelines, including the only two cases with a positive POLE mutation. OC had higher complete correct recommendation as compared to EC (90% vs. 20%, p = 0.005). ChatGPT 4 suggestions for adjuvant treatment are presented in Tables S1 and S2.

In this feasibility study, we showed that ChatGPT 4 provided correct recommendations in two-thirds of the cases evaluated, however in 25% of cases, mostly endometrial cancer, there was an incorrect recommendation. Endometrial cancer had a lower complete rate of correct recommendations, likely due to the complexity of stage, histology and grade in early stages and in the integration of molecular characterisation of endometrial cancer. More research is required to assess the credibility and configure protocols for the potential use of this tool. However, in a setting of high-volume clinics, or in regions where resources are limiting in terms of expertise, such tools may aid physicians maintain evidenced-based care. Further studies should focus on ChatGPT familiarity with ongoing clinical trials to assess for possible patient eligibility.

Our limitations include the small number of cases studied and limiting our study to endometrial and ovarian cancer. Additionally, we have used the generic ChatGPT tool without any specific training for our data. Moreover, we have used only two AI platforms in this study, this may limit the generalisability of our results. Importantly, we did not compare the AI-generated recommendation to a multidisciplinary Tumor Board recommendation, which is the ‘gold standard’ in real practice. Finally, all data is correct to the time this manuscript was written. As ChatGPT is a large language model, he is constantly trains on prompts and his output may change and evolve over time. Future prospective real-life evaluation of gynaecologic oncology tumour board is encouraged to better delineate advantages and pitfalls of artificial intelligence tools and their impact on practice.

Gabriel Levin: conception, design, acquisition of data, analysis and interpretation of data, drafting the article, approval of the final version. Walter Gotlieb: acquisition of data, critical revision of the article, approval of the final version. Pedro Ramirez: acquisition of data, critical revision of the article, approval of the final version. Raanan Meyer: acquisition of data, critical revision of the article, approval of the final version. Yoav Brezinov: conception and design, analysis and interpretation of data, critical revision of the article, approval of the final version.

This research received no external funding.

None.

The authors report no conflict of interest.

As no patient information was used—no ethical board review was needed for this study.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
10.90
自引率
5.20%
发文量
345
审稿时长
3-6 weeks
期刊介绍: BJOG is an editorially independent publication owned by the Royal College of Obstetricians and Gynaecologists (RCOG). The Journal publishes original, peer-reviewed work in all areas of obstetrics and gynaecology, including contraception, urogynaecology, fertility, oncology and clinical practice. Its aim is to publish the highest quality medical research in women''s health, worldwide.
期刊最新文献
Fetal Fraction of Cell-Free DNA in the Prediction of Adverse Pregnancy Outcomes: A Nationwide Retrospective Cohort Study. The Contribution of Hypertensive Disorders of Pregnancy to Neonatal Unit Admissions and Iatrogenic Preterm Delivery at < 34+0 Weeks' Gestation in the UK: A Population-Based Study Using the National Neonatal Research Database. Unilateral Oophorectomy and Age at Natural Menopause: A Longitudinal Community-Based Cohort Study. Biopsychosocial Approaches for the Management of Female Chronic Pelvic Pain: A Systematic Review. Outcome Reporting in Studies Investigating Treatment for Caesarean Scar Ectopic Pregnancy: A Systematic Review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1