{"title":"SGTCDA: Prediction of circRNA-drug sensitivity associations with interpretable graph transformers and effective assessment.","authors":"Hongwei Xia, Caiyue Dong, Xinxing Chen, Zhuoyu Wei, Lichuan Gu, Xiaolei Zhu","doi":"10.1186/s12864-024-11022-6","DOIUrl":null,"url":null,"abstract":"<p><p>CircRNAs are a type of circular non-coding RNA whose associations with drug sensitivities have been demonstrated in recent studies. Due to the high cost of biomedical experiments for detecting the associations between circRNAs and drug sensitivities, several computational methods have been developed. However, these methods were evaluated mainly based on 5- or tenfold cross-validation, which are often over-optimistic. Furthermore, there are technique issues with these models, such as over-smoothing and over-squashing. To address these issues, we propose a strategy to evaluate models based on independent test sets for association prediction-related studies. In the light of this effective assessment, we constructed a model, SGTCDA, by integrating structural deep network embedding (SDNE) and a graph transformer to predict the potential associations of circRNA-drug sensitivity, which can efficiently capture long-range dependencies and local structural information of nodes. Our results on the training sets and the independent test sets indicate that SGTCDA outperforms the other state-of-the-art models, demonstrating its capacity for accurate prediction of circRNA-drug sensitivity. Moreover, we leveraged EdgeSHAPer to explain the performance of the proposed SGTCDA model, which illustrates that the edges between drugs are more important than other edges for the performance of the model. The source code and dataset of SGTCDA are available at: https://github.com/hwxia/SGTCDA .</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"25 1","pages":"1113"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577602/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-024-11022-6","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
CircRNAs are a type of circular non-coding RNA whose associations with drug sensitivities have been demonstrated in recent studies. Due to the high cost of biomedical experiments for detecting the associations between circRNAs and drug sensitivities, several computational methods have been developed. However, these methods were evaluated mainly based on 5- or tenfold cross-validation, which are often over-optimistic. Furthermore, there are technique issues with these models, such as over-smoothing and over-squashing. To address these issues, we propose a strategy to evaluate models based on independent test sets for association prediction-related studies. In the light of this effective assessment, we constructed a model, SGTCDA, by integrating structural deep network embedding (SDNE) and a graph transformer to predict the potential associations of circRNA-drug sensitivity, which can efficiently capture long-range dependencies and local structural information of nodes. Our results on the training sets and the independent test sets indicate that SGTCDA outperforms the other state-of-the-art models, demonstrating its capacity for accurate prediction of circRNA-drug sensitivity. Moreover, we leveraged EdgeSHAPer to explain the performance of the proposed SGTCDA model, which illustrates that the edges between drugs are more important than other edges for the performance of the model. The source code and dataset of SGTCDA are available at: https://github.com/hwxia/SGTCDA .
期刊介绍:
BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics.
BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.