基于临床数据使用机器学习方法的卵巢癌早期预测和风险分层。

IF 3.4 2区 医学 Q1 OBSTETRICS & GYNECOLOGY Journal of Gynecologic Oncology Pub Date : 2024-12-17 DOI:10.3802/jgo.2025.36.e53
Ting Gui, Dongyan Cao, Jiaxin Yang, Zhenhao Wei, Jiatong Xie, Wei Wang, Yang Xiang, Peng Peng
{"title":"基于临床数据使用机器学习方法的卵巢癌早期预测和风险分层。","authors":"Ting Gui, Dongyan Cao, Jiaxin Yang, Zhenhao Wei, Jiatong Xie, Wei Wang, Yang Xiang, Peng Peng","doi":"10.3802/jgo.2025.36.e53","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Our study was aimed to construct a predictive model to advance ovarian cancer diagnosis by machine learning.</p><p><strong>Methods: </strong>A retrospective analysis of patients with pelvic/adnexal/ovarian mass was performed. Potential features related to ovarian cancer were obtained as many as possible. The optimal machine learning algorithm was selected among six candidates through 5-fold cross validation. Top 20 features having the most powerful predictive significance were ranked by Shapley Additive Interpretation (Shap) method. Clinical validation was further performed to confirm whether our model could advance diagnosis of ovarian cancer.</p><p><strong>Results: </strong>A total of 9,799 patients were collected. The inclusion criteria included age >18 years old, the first diagnosis being pelvic/adnexal/ovarian mass of undetermined significance, and pathological report indispensable. Four hundred and thirty-eight dimensional features were obtained after filtration. LightGBM showed the best performance with accuracy 88%. Among the top 20 features, 55% belonged to laboratory test report, 35% came from imaging examination report, and 10% were attributed to basic demographics and main symptom. Age, CA125, and risk of ovarian malignancy algorithm were the top three. Our predictive model performed stably in testing and clinical validation datasets, and was found to advance the diagnosis of ovarian cancer about 17 days before clinical pathological examination.</p><p><strong>Conclusion: </strong>LightGBM was the optimal algorithm for our predictive model with accuracy of 88%. Laboratory test and imaging examination played essential roles in diagnosing ovarian cancer. Our model could advance the diagnosis of ovarian cancer before clinical pathological examination.</p>","PeriodicalId":15868,"journal":{"name":"Journal of Gynecologic Oncology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Early prediction and risk stratification of ovarian cancer based on clinical data using machine learning approaches.\",\"authors\":\"Ting Gui, Dongyan Cao, Jiaxin Yang, Zhenhao Wei, Jiatong Xie, Wei Wang, Yang Xiang, Peng Peng\",\"doi\":\"10.3802/jgo.2025.36.e53\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Our study was aimed to construct a predictive model to advance ovarian cancer diagnosis by machine learning.</p><p><strong>Methods: </strong>A retrospective analysis of patients with pelvic/adnexal/ovarian mass was performed. Potential features related to ovarian cancer were obtained as many as possible. The optimal machine learning algorithm was selected among six candidates through 5-fold cross validation. Top 20 features having the most powerful predictive significance were ranked by Shapley Additive Interpretation (Shap) method. Clinical validation was further performed to confirm whether our model could advance diagnosis of ovarian cancer.</p><p><strong>Results: </strong>A total of 9,799 patients were collected. The inclusion criteria included age >18 years old, the first diagnosis being pelvic/adnexal/ovarian mass of undetermined significance, and pathological report indispensable. Four hundred and thirty-eight dimensional features were obtained after filtration. LightGBM showed the best performance with accuracy 88%. Among the top 20 features, 55% belonged to laboratory test report, 35% came from imaging examination report, and 10% were attributed to basic demographics and main symptom. Age, CA125, and risk of ovarian malignancy algorithm were the top three. Our predictive model performed stably in testing and clinical validation datasets, and was found to advance the diagnosis of ovarian cancer about 17 days before clinical pathological examination.</p><p><strong>Conclusion: </strong>LightGBM was the optimal algorithm for our predictive model with accuracy of 88%. Laboratory test and imaging examination played essential roles in diagnosing ovarian cancer. Our model could advance the diagnosis of ovarian cancer before clinical pathological examination.</p>\",\"PeriodicalId\":15868,\"journal\":{\"name\":\"Journal of Gynecologic Oncology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Gynecologic Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3802/jgo.2025.36.e53\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Gynecologic Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3802/jgo.2025.36.e53","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:建立一种基于机器学习的卵巢癌诊断预测模型。方法:对盆腔/附件/卵巢肿块患者进行回顾性分析。尽可能多地获得与卵巢癌相关的潜在特征。通过5重交叉验证,从6个候选算法中选出最优的机器学习算法。采用Shapley加性解释(Shapley Additive Interpretation, Shap)方法对预测显著性最强的前20个特征进行排序。进一步进行临床验证,以确认我们的模型是否可以提前卵巢癌的诊断。结果:共收集患者9799例。纳入标准:年龄在bb0 ~ 18岁,首次诊断为意义不明的盆腔/附件/卵巢肿块,且必须有病理报告。过滤后得到438个维度特征。LightGBM的准确率最高,达到88%。在前20个特征中,55%来自实验室检查报告,35%来自影像学检查报告,10%来自基本人口统计学和主要症状。年龄、CA125、卵巢恶性肿瘤风险排序前三位。我们的预测模型在测试和临床验证数据集中表现稳定,并在临床病理检查前约17天提前诊断卵巢癌。结论:LightGBM是预测模型的最佳算法,准确率为88%。实验室检查和影像学检查在卵巢癌的诊断中起着重要的作用。该模型可在临床病理检查前提前诊断卵巢癌。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Early prediction and risk stratification of ovarian cancer based on clinical data using machine learning approaches.

Objective: Our study was aimed to construct a predictive model to advance ovarian cancer diagnosis by machine learning.

Methods: A retrospective analysis of patients with pelvic/adnexal/ovarian mass was performed. Potential features related to ovarian cancer were obtained as many as possible. The optimal machine learning algorithm was selected among six candidates through 5-fold cross validation. Top 20 features having the most powerful predictive significance were ranked by Shapley Additive Interpretation (Shap) method. Clinical validation was further performed to confirm whether our model could advance diagnosis of ovarian cancer.

Results: A total of 9,799 patients were collected. The inclusion criteria included age >18 years old, the first diagnosis being pelvic/adnexal/ovarian mass of undetermined significance, and pathological report indispensable. Four hundred and thirty-eight dimensional features were obtained after filtration. LightGBM showed the best performance with accuracy 88%. Among the top 20 features, 55% belonged to laboratory test report, 35% came from imaging examination report, and 10% were attributed to basic demographics and main symptom. Age, CA125, and risk of ovarian malignancy algorithm were the top three. Our predictive model performed stably in testing and clinical validation datasets, and was found to advance the diagnosis of ovarian cancer about 17 days before clinical pathological examination.

Conclusion: LightGBM was the optimal algorithm for our predictive model with accuracy of 88%. Laboratory test and imaging examination played essential roles in diagnosing ovarian cancer. Our model could advance the diagnosis of ovarian cancer before clinical pathological examination.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Gynecologic Oncology
Journal of Gynecologic Oncology ONCOLOGY-OBSTETRICS & GYNECOLOGY
CiteScore
6.00
自引率
2.60%
发文量
84
审稿时长
>12 weeks
期刊介绍: The Journal of Gynecologic Oncology (JGO) is an official publication of the Asian Society of Gynecologic Oncology. Abbreviated title is ''J Gynecol Oncol''. It was launched in 1990. The JGO''s aim is to publish the highest quality manuscripts dedicated to the advancement of care of the patients with gynecologic cancer. It is an international peer-reviewed periodical journal that is published bimonthly (January, March, May, July, September, and November). Supplement numbers are at times published. The journal publishes editorials, original and review articles, correspondence, book review, etc.
期刊最新文献
Ovarian squamous cell carcinoma: clinicopathological features, prognosis and immunotherapy outcomes. Is presumed clinical stage I endometrial cancer using PET-CT and MRI accurate in predicting surgical staging? Fertility-sparing treatment outcomes using immune checkpoint inhibitors in endometrial cancer patients with Lynch syndrome. Poor accuracy of endometrial sampling in patients with uterine carcinosarcomas: a nationwide analysis. Early prediction and risk stratification of ovarian cancer based on clinical data using machine learning approaches.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1