Discovering genetic biomarkers for targeted cancer therapeutics with eXplainable Artificial Intelligence

IF 20.1 1区 医学 Q1 ONCOLOGY Cancer Communications Pub Date : 2024-04-02 DOI:10.1002/cac2.12530
Debaditya Chakraborty, Elizabeth Gutierrez-Chakraborty, Cristian Rodriguez-Aguayo, Hakan Başağaoğlu, Gabriel Lopez-Berestein, Paola Amero
{"title":"Discovering genetic biomarkers for targeted cancer therapeutics with eXplainable Artificial Intelligence","authors":"Debaditya Chakraborty,&nbsp;Elizabeth Gutierrez-Chakraborty,&nbsp;Cristian Rodriguez-Aguayo,&nbsp;Hakan Başağaoğlu,&nbsp;Gabriel Lopez-Berestein,&nbsp;Paola Amero","doi":"10.1002/cac2.12530","DOIUrl":null,"url":null,"abstract":"<p>High-Grade Serous Ovarian Cancer (HGSC) is the most prevalent and lethal form of gynecologic malignancies [<span>1</span>], accounting for 70%-80% of ovarian cancer fatalities. Despite decades of research, the overall survival rate for HGSC has remained largely unchanged [<span>2</span>], and patients with advanced stages of the disease have only a 41% chance of surviving beyond five years [<span>3</span>]. Investigating the genomic and immune profiles of long-term HGSC survivors could offer valuable insights into the underlying tumor biology and inform potential therapeutic strategies [<span>4</span>]. This study advances upon prior research by employing an innovative eXplainable Artificial Intelligence (XAI) integrated with a hypothesis-driven probabilistic methodology to dissect the intricate genetic underpinnings linked to HGSC's survival outcomes in a cohort of 407 patients. The objective of this article was to uncover the most critical prognostic biomarkers from a pool of 655 potential targets through our distinctive data-driven approach and determine the impacts of potentially modulating the identified biomarkers on HGSC outcomes.</p><p>Recent studies indicate that AI models are often referred to as “black boxes” because their decision-making process lacks transparency [<span>5</span>]. The consensus is that the lack of inherent explainability is problematic as this produces biases, creates difficulties in detecting false positives and negatives, and conceals potential insights that may be derived from AI [<span>6</span>]. In this study, we provide evidence demonstrating how XAI can enhance biological explainability by revealing novel insights from the underlying data (Supplementary Materials and Methods). Our XAI approach distinctively predicts patient outcomes and survival duration based on genetic signatures (predictive AI aspect of the models) and discovers and helps visualize critical biomarkers (biological explainability aspect of the models) in HGSC. To ensure the viability of explanations generated by our XAI, we subsequently validated the most prominent HGSC-promoting biomarker identified by XAI using in vivo murine tumor models (Supplementary Figure S1). The XAI approach outlined in this study is a proof-of-concept that is not only intended to generate high predictive accuracy but also infer the cause-effect relations behind the predictions, identify counterfactuals that are useful for optimizing interventional therapies, and assess the resultant improvements in patients.</p><p>We report that our models predicted the ≥5-year overall survival probability based on the genetic features of patients (<i>n</i> = 407) with 97.52% accuracy, 100% precision, and 94.74% recall on the testing data that comprised 25% of the total samples that were hidden from the models during the training phase. Insights derived through XAI prioritized the biomarkers that are of utmost importance in determining prognosis for patients with HGSC, which we refer to as ‘global biological explanations’ (Figure 1A). The biomarkers are arranged in terms of their relative importance (most important at the top), and the variations in the expression of the biomarkers from high (red) to low (blue) together with the corresponding Shapley values on the x-axis were used to determine whether a particular biomarker is associated with poor (positive values on the x-axis) or good prognosis (negative values on the x-axis) in HGSC (Figure 1A). For example, we discovered that TATA-Box Binding Protein Associated Factor 10 (TAF10) is associated with good prognosis since higher concentrations (red) of this biomarker correspond to negative Shapley values on the x-axis; hence, it can be classified as a cancer suppressor for HGSC patients. In contrast, Interleukin 27 receptor subunit alpha (IL27RA) is associated with poor prognosis since higher concentrations (red) of this biomarker correspond to positive Shapley values on the x-axis; hence, it can be classified as a cancer promoter for HGSC patients. Furthermore, XAI unveiled the inflection points of each influential gene (local biological explanations) above or below which the prognosis would either improve or deteriorate (Figure 1B). These inflection points serve as guidance for genomic editing that could lead to the development of new gene therapeutics and enhanced prognosis of patients.</p><p>Drawing on the insights from XAI, our research identified IL27RA as a key biomarker linked to lower survival rates in HGSC patients. The XAI results led us to theorize that reducing IL27RA levels to below 2.2 transcripts per million (TPM) might improve patient outcomes (Figure 1B). To test this hypothesis, we employed an orthotopic ovarian cancer model in C57BL/6 mice, which were intraperitoneally inoculated with one million ID8 ovarian cancer cells. This model closely mirrors the extensive intraperitoneal spread observed in the human form of the disease. We divided twenty mice into two groups of ten each. Both groups were intraperitoneally injected with tumor cells. The first group received intravenous injections of 1,2-Dioleoyl-sn-glycero-3-phosphocholine (DOPC) nanoparticles containing scrambled siRNA, while the second group received DOPC nanoparticles loaded with IL27-siRNA (si-IL27RA-DOPC treatment) (Figure 1C). Over five weeks, these treatments were administered bi-weekly. Post-treatment, the mice were euthanized, and their peritoneal metastases were quantified. The si-IL27RA-DOPC treatment group exhibited a significant decrease in peritoneal metastasis (Figure 1D-E). These results indicate that targeting IL27RA may be a promising approach to improving prognoses in HGSC patients.</p><p>The biology of IL27 is multifaceted and varies depending on the context, exhibiting the potential to either mitigate or exacerbate various forms of inflammation and cancer [<span>7</span>]. This complexity underscores its dual role in immune modulation across different biological scenarios. Recently, an experimental study revealed that IL27 receptor signaling promoted hepatocellular carcinoma development in vivo and that its high expression correlated with poor prognosis for patients due to increased proliferative capacity of tumors and inflammation [<span>8</span>]. Thus, further validating that the explanations obtained from XAI could be in fact legitimate. We adopted a probabilistic approach to evaluate insights unraveled from the XAI models relating to the underlying relationships between the features and targets (Supplementary Figure S2). Our study revealed that strategic modulation of the top five biomarkers in combination (as illustrated in Figure 1A and Supplementary Figure S2) could potentially enhance the survival prospects of patients with HGSC by up to 100% for periods extending beyond five years. Notably, TAF10 emerged as a prominent HGSC suppressor according to our XAI analysis (Figure 1A and Supplementary Figures S2A and S2F), with its individual modulation potentially contributing to a 41.1% increase in survival likelihood. Further supporting these findings, Kaplan-Meier and box-plot analyses of TAF10 and IL27RA aligned with the predictions of our XAI models (Supplementary Figure S3). These analyses demonstrated a positive correlation between higher TAF10 expression and increased survival chances. Conversely, an elevated expression of IL27RA was linked to diminished survival prospects, as detailed in Supplementary Figure S3. Based on the evidence from this study, XAI approaches are amenable to generating biologically relevant testable hypotheses despite their limitations due to explanations originating from post hoc realizations [<span>9</span>]. Additionally, the application of probabilistic causality methods can enhance the insights gained from XAI. This synergistic use of both approaches offers a more streamlined pathway to propel advancements in precision oncology, yielding time and cost efficiencies.</p><p>In conclusion, this study presents a groundbreaking integrated approach combining XAI and probabilistic methods to advance the understanding and treatment of HGSC. Our research focused on deciphering the complex interactions between genetic biomarkers and the five-year survival probabilities of HGSC patients, utilizing a data set of 407 samples. The unique blend of XAI and probabilistic causality approaches has not only demonstrated high predictive accuracy but also provided valuable biological explanations, identifying key biomarkers that influence patient outcomes. Particularly, our findings highlight the roles of biomarkers like TAF10 and IL27RA in affecting survival rates, revealing their potential as therapeutic targets. Furthermore, our in vivo experiments using murine tumor models have validated the XAI-derived hypotheses, particularly regarding the role of IL27RA in HGSC prognosis. This study not only enhances our understanding of HGSC but also illustrates the potential of XAI in precision oncology, offering a more efficient pathway for developing targeted therapies. As we continue to explore and refine these approaches, the prospects for their application in diverse cancer types and clinical settings are promising, paving the way for more personalized and effective cancer treatments.</p><p><i>Conceived and designed the experiments</i>: Debaditya Chakraborty, Hakan Başağaoğlu and Gabriel Lopez-Berestein. <i>Performed the experiments</i>: Debaditya Chakraborty, Paola Amero and Cristian Rodriguez-Aguayo. <i>Analyzed the data</i>: All authors. <i>Contributed materials/analysis tools</i>: All authors. <i>Wrote the paper</i>: Debaditya Chakraborty and Elizabeth Gutierrez-Chakraborty primarily and all authors reviewed, revised, and commented on the manuscript.</p><p>The authors declare that there are no competing interests.</p><p>This study was supported in part by grants from the National Institutes of Health/National Cancer Institute (5U01CA213759-02, P30CA016672), and the American Cancer Society, National Science Foundation (CHE-1411859), and an endowment grant from the John P. Gaines Foundation. Dr. Cristian Rodriguez-Aguayo and Dr. Paola Amero were supported by the Brain SPORE Career Enhancement Program and NCI grant P50CA127001, as well as by the NIH through the Ovarian SPORE Career Enhancement Program and NCI grant P50CA217685.</p><p>All mouse studies were approved by the Institutional Animal Care and Use Committee at MD Anderson. All animal experiments were performed with 4- to 6-week-old female C57BL/6 mice obtained from Taconic Biosciences (Rensselaer, NY). Five mice per cage were housed under pathogen-free conditions at a constant temperature and humidity. All mice were fed a regular diet and water ad libitum according to American Association for Laboratory Animal Science guidelines and the US Public Health Service Policy on Human Care and Use of Laboratory Animals. Mice were euthanized via cervical dislocation if found moribund by the investigators, who have approval from the IACUC committee (IACUC Study #00001010-RN03).</p>","PeriodicalId":9495,"journal":{"name":"Cancer Communications","volume":null,"pages":null},"PeriodicalIF":20.1000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cac2.12530","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Communications","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cac2.12530","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

High-Grade Serous Ovarian Cancer (HGSC) is the most prevalent and lethal form of gynecologic malignancies [1], accounting for 70%-80% of ovarian cancer fatalities. Despite decades of research, the overall survival rate for HGSC has remained largely unchanged [2], and patients with advanced stages of the disease have only a 41% chance of surviving beyond five years [3]. Investigating the genomic and immune profiles of long-term HGSC survivors could offer valuable insights into the underlying tumor biology and inform potential therapeutic strategies [4]. This study advances upon prior research by employing an innovative eXplainable Artificial Intelligence (XAI) integrated with a hypothesis-driven probabilistic methodology to dissect the intricate genetic underpinnings linked to HGSC's survival outcomes in a cohort of 407 patients. The objective of this article was to uncover the most critical prognostic biomarkers from a pool of 655 potential targets through our distinctive data-driven approach and determine the impacts of potentially modulating the identified biomarkers on HGSC outcomes.

Recent studies indicate that AI models are often referred to as “black boxes” because their decision-making process lacks transparency [5]. The consensus is that the lack of inherent explainability is problematic as this produces biases, creates difficulties in detecting false positives and negatives, and conceals potential insights that may be derived from AI [6]. In this study, we provide evidence demonstrating how XAI can enhance biological explainability by revealing novel insights from the underlying data (Supplementary Materials and Methods). Our XAI approach distinctively predicts patient outcomes and survival duration based on genetic signatures (predictive AI aspect of the models) and discovers and helps visualize critical biomarkers (biological explainability aspect of the models) in HGSC. To ensure the viability of explanations generated by our XAI, we subsequently validated the most prominent HGSC-promoting biomarker identified by XAI using in vivo murine tumor models (Supplementary Figure S1). The XAI approach outlined in this study is a proof-of-concept that is not only intended to generate high predictive accuracy but also infer the cause-effect relations behind the predictions, identify counterfactuals that are useful for optimizing interventional therapies, and assess the resultant improvements in patients.

We report that our models predicted the ≥5-year overall survival probability based on the genetic features of patients (n = 407) with 97.52% accuracy, 100% precision, and 94.74% recall on the testing data that comprised 25% of the total samples that were hidden from the models during the training phase. Insights derived through XAI prioritized the biomarkers that are of utmost importance in determining prognosis for patients with HGSC, which we refer to as ‘global biological explanations’ (Figure 1A). The biomarkers are arranged in terms of their relative importance (most important at the top), and the variations in the expression of the biomarkers from high (red) to low (blue) together with the corresponding Shapley values on the x-axis were used to determine whether a particular biomarker is associated with poor (positive values on the x-axis) or good prognosis (negative values on the x-axis) in HGSC (Figure 1A). For example, we discovered that TATA-Box Binding Protein Associated Factor 10 (TAF10) is associated with good prognosis since higher concentrations (red) of this biomarker correspond to negative Shapley values on the x-axis; hence, it can be classified as a cancer suppressor for HGSC patients. In contrast, Interleukin 27 receptor subunit alpha (IL27RA) is associated with poor prognosis since higher concentrations (red) of this biomarker correspond to positive Shapley values on the x-axis; hence, it can be classified as a cancer promoter for HGSC patients. Furthermore, XAI unveiled the inflection points of each influential gene (local biological explanations) above or below which the prognosis would either improve or deteriorate (Figure 1B). These inflection points serve as guidance for genomic editing that could lead to the development of new gene therapeutics and enhanced prognosis of patients.

Drawing on the insights from XAI, our research identified IL27RA as a key biomarker linked to lower survival rates in HGSC patients. The XAI results led us to theorize that reducing IL27RA levels to below 2.2 transcripts per million (TPM) might improve patient outcomes (Figure 1B). To test this hypothesis, we employed an orthotopic ovarian cancer model in C57BL/6 mice, which were intraperitoneally inoculated with one million ID8 ovarian cancer cells. This model closely mirrors the extensive intraperitoneal spread observed in the human form of the disease. We divided twenty mice into two groups of ten each. Both groups were intraperitoneally injected with tumor cells. The first group received intravenous injections of 1,2-Dioleoyl-sn-glycero-3-phosphocholine (DOPC) nanoparticles containing scrambled siRNA, while the second group received DOPC nanoparticles loaded with IL27-siRNA (si-IL27RA-DOPC treatment) (Figure 1C). Over five weeks, these treatments were administered bi-weekly. Post-treatment, the mice were euthanized, and their peritoneal metastases were quantified. The si-IL27RA-DOPC treatment group exhibited a significant decrease in peritoneal metastasis (Figure 1D-E). These results indicate that targeting IL27RA may be a promising approach to improving prognoses in HGSC patients.

The biology of IL27 is multifaceted and varies depending on the context, exhibiting the potential to either mitigate or exacerbate various forms of inflammation and cancer [7]. This complexity underscores its dual role in immune modulation across different biological scenarios. Recently, an experimental study revealed that IL27 receptor signaling promoted hepatocellular carcinoma development in vivo and that its high expression correlated with poor prognosis for patients due to increased proliferative capacity of tumors and inflammation [8]. Thus, further validating that the explanations obtained from XAI could be in fact legitimate. We adopted a probabilistic approach to evaluate insights unraveled from the XAI models relating to the underlying relationships between the features and targets (Supplementary Figure S2). Our study revealed that strategic modulation of the top five biomarkers in combination (as illustrated in Figure 1A and Supplementary Figure S2) could potentially enhance the survival prospects of patients with HGSC by up to 100% for periods extending beyond five years. Notably, TAF10 emerged as a prominent HGSC suppressor according to our XAI analysis (Figure 1A and Supplementary Figures S2A and S2F), with its individual modulation potentially contributing to a 41.1% increase in survival likelihood. Further supporting these findings, Kaplan-Meier and box-plot analyses of TAF10 and IL27RA aligned with the predictions of our XAI models (Supplementary Figure S3). These analyses demonstrated a positive correlation between higher TAF10 expression and increased survival chances. Conversely, an elevated expression of IL27RA was linked to diminished survival prospects, as detailed in Supplementary Figure S3. Based on the evidence from this study, XAI approaches are amenable to generating biologically relevant testable hypotheses despite their limitations due to explanations originating from post hoc realizations [9]. Additionally, the application of probabilistic causality methods can enhance the insights gained from XAI. This synergistic use of both approaches offers a more streamlined pathway to propel advancements in precision oncology, yielding time and cost efficiencies.

In conclusion, this study presents a groundbreaking integrated approach combining XAI and probabilistic methods to advance the understanding and treatment of HGSC. Our research focused on deciphering the complex interactions between genetic biomarkers and the five-year survival probabilities of HGSC patients, utilizing a data set of 407 samples. The unique blend of XAI and probabilistic causality approaches has not only demonstrated high predictive accuracy but also provided valuable biological explanations, identifying key biomarkers that influence patient outcomes. Particularly, our findings highlight the roles of biomarkers like TAF10 and IL27RA in affecting survival rates, revealing their potential as therapeutic targets. Furthermore, our in vivo experiments using murine tumor models have validated the XAI-derived hypotheses, particularly regarding the role of IL27RA in HGSC prognosis. This study not only enhances our understanding of HGSC but also illustrates the potential of XAI in precision oncology, offering a more efficient pathway for developing targeted therapies. As we continue to explore and refine these approaches, the prospects for their application in diverse cancer types and clinical settings are promising, paving the way for more personalized and effective cancer treatments.

Conceived and designed the experiments: Debaditya Chakraborty, Hakan Başağaoğlu and Gabriel Lopez-Berestein. Performed the experiments: Debaditya Chakraborty, Paola Amero and Cristian Rodriguez-Aguayo. Analyzed the data: All authors. Contributed materials/analysis tools: All authors. Wrote the paper: Debaditya Chakraborty and Elizabeth Gutierrez-Chakraborty primarily and all authors reviewed, revised, and commented on the manuscript.

The authors declare that there are no competing interests.

This study was supported in part by grants from the National Institutes of Health/National Cancer Institute (5U01CA213759-02, P30CA016672), and the American Cancer Society, National Science Foundation (CHE-1411859), and an endowment grant from the John P. Gaines Foundation. Dr. Cristian Rodriguez-Aguayo and Dr. Paola Amero were supported by the Brain SPORE Career Enhancement Program and NCI grant P50CA127001, as well as by the NIH through the Ovarian SPORE Career Enhancement Program and NCI grant P50CA217685.

All mouse studies were approved by the Institutional Animal Care and Use Committee at MD Anderson. All animal experiments were performed with 4- to 6-week-old female C57BL/6 mice obtained from Taconic Biosciences (Rensselaer, NY). Five mice per cage were housed under pathogen-free conditions at a constant temperature and humidity. All mice were fed a regular diet and water ad libitum according to American Association for Laboratory Animal Science guidelines and the US Public Health Service Policy on Human Care and Use of Laboratory Animals. Mice were euthanized via cervical dislocation if found moribund by the investigators, who have approval from the IACUC committee (IACUC Study #00001010-RN03).

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 eXplainable 人工智能发现癌症靶向治疗的基因生物标志物
总之,本研究提出了一种开创性的综合方法,将 XAI 和概率方法相结合,以促进对 HGSC 的理解和治疗。我们的研究重点是利用 407 个样本的数据集,破译遗传生物标志物与 HGSC 患者五年生存概率之间复杂的相互作用。XAI 和概率因果关系方法的独特结合不仅证明了预测的高准确性,还提供了有价值的生物学解释,确定了影响患者预后的关键生物标志物。特别是,我们的研究结果强调了 TAF10 和 IL27RA 等生物标志物在影响生存率方面的作用,揭示了它们作为治疗靶点的潜力。此外,我们使用小鼠肿瘤模型进行的体内实验也验证了 XAI 衍生的假设,尤其是 IL27RA 在 HGSC 预后中的作用。这项研究不仅增进了我们对 HGSC 的了解,还说明了 XAI 在精准肿瘤学方面的潜力,为开发靶向疗法提供了更有效的途径。随着我们不断探索和完善这些方法,它们在不同癌症类型和临床环境中的应用前景十分广阔,为更个性化、更有效的癌症治疗铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cancer Communications
Cancer Communications Biochemistry, Genetics and Molecular Biology-Cancer Research
CiteScore
25.50
自引率
4.30%
发文量
153
审稿时长
4 weeks
期刊介绍: Cancer Communications is an open access, peer-reviewed online journal that encompasses basic, clinical, and translational cancer research. The journal welcomes submissions concerning clinical trials, epidemiology, molecular and cellular biology, and genetics.
期刊最新文献
Perioperative toripalimab plus neoadjuvant chemotherapy might improve outcomes in resectable esophageal cancer: an interim analysis of a phase III randomized clinical trial. Tryptophan 2,3-dioxygenase-positive matrix fibroblasts fuel breast cancer lung metastasis via kynurenine-mediated ferroptosis resistance of metastatic cells and T cell dysfunction. Targeting SRSF10 might inhibit M2 macrophage polarization and potentiate anti-PD-1 therapy in hepatocellular carcinoma. Unraveling the routes to distant metastases in colorectal cancer: Tumor deposits and lymph node metastases as the gateway. Issue Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1