Prediction of Breast Cancer Distant Metastasis by Artificial Intelligence Methods from an Epidemiological Perspective

IF 0.2 Q4 MEDICINE, GENERAL & INTERNAL Istanbul Medical Journal Pub Date : 2022-08-01 DOI:10.4274/imj.galenos.2022.62443
S. Akbulut, F. Yağın, C. Colak
{"title":"Prediction of Breast Cancer Distant Metastasis by Artificial Intelligence Methods from an Epidemiological Perspective","authors":"S. Akbulut, F. Yağın, C. Colak","doi":"10.4274/imj.galenos.2022.62443","DOIUrl":null,"url":null,"abstract":"Introduction: Despite significant advances in breast cancer (BC) management, the prognosis for most patients with distant metastasis remains poor. We predicted distant metastasis in BC patients with artificial intelligence (AI) methods based on genomic biomarkers. Methods: The dataset used in the study included 97 patients with BC, of whom 46 (47%) developed distant metastases, and 51 (53%) did not develop distant metastases, and the expression level of 24,481 genes of these patients. An approach combining Boruta + LASSO methods was applied to identify biomarker genes associated with BC distant metastasis. Mann-Whitney U test was used to examine the difference between groups in terms of gene expression levels in statistical analyses, and Cohen d effect sizes and odds ratios were calculated. AdaBoost and XGBoost algorithms, which are tree-based methods, were used for BC distant metastasis prediction, and the results were compared by evaluating comprehensive performance criteria. Results: After Boruta + LASSO methods, 14 biomarker candidate genes were identified. These predictive genes were PIB5PA, SSX2, OR1F1, ALDH4A1, FGF18, WISP1, PRAME, CEGP1, AL080059, NMU, ATP5E, SMARCE1, FGD6, and SLC37A1 . In effect size results; in particular, show that the AL080059 (Cohen’s D: 1.318) gene is clinically predictive of BC Metastasis. The accuracy, F1-score, positive predictive value, sensitivity, and area under the ROC Curve (AUC) values obtained with the AdaBoost algorithm for BC metastasis prediction was 95%, 96.3%, 100%, 92.6%, and 98.8%, respectively. The model created with the XGBoost algorithm, on the other hand, obtained 90%, 92.9%, 92.9%, 92.9%, 97.6% accuracy, F1-score, positive predictive value, sensitivity, and AUC values, respectively. Conclusion: Identifying genes that successfully predict BC distant metastasis with AI methods in the study may be decisive for future therapeutic targets and help clinicians better adapt adjuvant chemotherapy to their patients. Additionally, the AdaBoost prediction model created can discriminate patients at risk of BC distant metastases.","PeriodicalId":42584,"journal":{"name":"Istanbul Medical Journal","volume":"1 1","pages":""},"PeriodicalIF":0.2000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Istanbul Medical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4274/imj.galenos.2022.62443","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 3

Abstract

Introduction: Despite significant advances in breast cancer (BC) management, the prognosis for most patients with distant metastasis remains poor. We predicted distant metastasis in BC patients with artificial intelligence (AI) methods based on genomic biomarkers. Methods: The dataset used in the study included 97 patients with BC, of whom 46 (47%) developed distant metastases, and 51 (53%) did not develop distant metastases, and the expression level of 24,481 genes of these patients. An approach combining Boruta + LASSO methods was applied to identify biomarker genes associated with BC distant metastasis. Mann-Whitney U test was used to examine the difference between groups in terms of gene expression levels in statistical analyses, and Cohen d effect sizes and odds ratios were calculated. AdaBoost and XGBoost algorithms, which are tree-based methods, were used for BC distant metastasis prediction, and the results were compared by evaluating comprehensive performance criteria. Results: After Boruta + LASSO methods, 14 biomarker candidate genes were identified. These predictive genes were PIB5PA, SSX2, OR1F1, ALDH4A1, FGF18, WISP1, PRAME, CEGP1, AL080059, NMU, ATP5E, SMARCE1, FGD6, and SLC37A1 . In effect size results; in particular, show that the AL080059 (Cohen’s D: 1.318) gene is clinically predictive of BC Metastasis. The accuracy, F1-score, positive predictive value, sensitivity, and area under the ROC Curve (AUC) values obtained with the AdaBoost algorithm for BC metastasis prediction was 95%, 96.3%, 100%, 92.6%, and 98.8%, respectively. The model created with the XGBoost algorithm, on the other hand, obtained 90%, 92.9%, 92.9%, 92.9%, 97.6% accuracy, F1-score, positive predictive value, sensitivity, and AUC values, respectively. Conclusion: Identifying genes that successfully predict BC distant metastasis with AI methods in the study may be decisive for future therapeutic targets and help clinicians better adapt adjuvant chemotherapy to their patients. Additionally, the AdaBoost prediction model created can discriminate patients at risk of BC distant metastases.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
流行病学视角下人工智能方法预测乳腺癌远处转移
导言:尽管乳腺癌(BC)的治疗取得了重大进展,但大多数远处转移患者的预后仍然很差。我们利用基于基因组生物标志物的人工智能(AI)方法预测BC患者的远处转移。方法:本研究使用的数据集包括97例BC患者,其中46例(47%)发生远处转移,51例(53%)未发生远处转移,这些患者的24,481个基因表达水平。采用Boruta + LASSO方法联合鉴定与BC远处转移相关的生物标志物基因。统计学分析中采用Mann-Whitney U检验检验组间基因表达水平的差异,计算Cohen d效应大小和优势比。采用基于树的AdaBoost和XGBoost算法预测BC远处转移,并通过评价综合性能标准对结果进行比较。结果:经Boruta + LASSO方法鉴定出14个生物标志物候选基因。这些预测基因为PIB5PA、SSX2、OR1F1、ALDH4A1、FGF18、WISP1、PRAME、CEGP1、AL080059、NMU、ATP5E、SMARCE1、FGD6和SLC37A1。效应大小结果;特别是AL080059 (Cohen’s D: 1.318)基因在临床上可预测BC转移。AdaBoost算法预测BC转移的准确率为95%,f1评分为96.3%,阳性预测值为100%,敏感性为92.6%,ROC曲线下面积为98.8%。而采用XGBoost算法建立的模型,准确率为90%,准确率为92.9%,准确率为92.9%,准确率为92.9%,准确率为92.9%,准确率为92.9%,准确率为97.6%,准确率为f1分,准确率为阳性预测值,灵敏度为92.9%,AUC为AUC。结论:在研究中,通过人工智能方法鉴定出成功预测BC远处转移的基因,可能对未来的治疗靶点具有决定性意义,并有助于临床医生更好地适应患者的辅助化疗。此外,创建的AdaBoost预测模型可以区分有BC远处转移风险的患者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Istanbul Medical Journal
Istanbul Medical Journal MEDICINE, GENERAL & INTERNAL-
CiteScore
0.30
自引率
0.00%
发文量
46
审稿时长
18 weeks
期刊最新文献
Concordance between Patient and Physician Predictions of PCR Results and Predictive Capacity of Presenting Complaints in Suspected Infectious Diseases Evaluation of the Systemic Immune-Inflammation Index and Systemic Inflammatory Response Index in Ankylosing Spondylitis Patients Hyperkalemia: A Cause of Non-adherence to Renin- Angiotensin-Aldosterone System Inhibitors in Chronic Kidney Disease: A Retrospective Study Impact of COVID-19 Lockdown on Glycemic Control in Patients with Type 2 Diabetes Mellitus Predictive Value of Neutrophil/Lymphocyte Ratio for Developing Acute Renal Failure in Patients with Sepsis Using Colistin in Intensive Care Units
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1