EpicPred: predicting phenotypes driven by epitope-binding TCRs using attention-based multiple instance learning.

Jaemin Jeon, Suwan Yu, Sangam Lee, Sang Cheol Kim, Hye-Yeong Jo, Inuk Jung, Kwangsoo Kim
{"title":"EpicPred: predicting phenotypes driven by epitope-binding TCRs using attention-based multiple instance learning.","authors":"Jaemin Jeon, Suwan Yu, Sangam Lee, Sang Cheol Kim, Hye-Yeong Jo, Inuk Jung, Kwangsoo Kim","doi":"10.1093/bioinformatics/btaf080","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Correctly identifying epitope-binding T-cell receptors (TCRs) is important to both understand their underlying biological mechanism in association to some phenotype and accordingly develop T-cell mediated immunotherapy treatments. Although the importance of the CDR3 region in TCRs for epitope recognition is well recognized, methods for profiling their interactions in association to a certain disease or phenotype remains less studied. We developed EpicPred to identify phenotype-specific TCR-epitope interactions. EpicPred first predicts and removes unlikely TCR-epitope interactions to reduce false positives using the Open-set Recognition (OSR). Subsequently, multiple instance learning was used to identify TCR-epitope interactions specific to a cancer type or severity levels of COVID-19 infected patients.</p><p><strong>Results: </strong>From six public TCR databases, 244 552 TCR sequences and 105 unique epitopes were used to predict epitope-binding TCRs and to filter out non-epitope-binding TCRs using the OSR method. The predicted interactions were used to further predict the phenotype groups in two cancer and four COVID-19 TCR-seq datasets of both bulk and single-cell resolution. EpicPred outperformed the competing methods in predicting the phenotypes, achieving an average AUROC of 0.80 ± 0.07.</p><p><strong>Availability and implementation: </strong>The EpicPred Software is available at https://github.com/jaeminjj/EpicPred.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11879650/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Correctly identifying epitope-binding T-cell receptors (TCRs) is important to both understand their underlying biological mechanism in association to some phenotype and accordingly develop T-cell mediated immunotherapy treatments. Although the importance of the CDR3 region in TCRs for epitope recognition is well recognized, methods for profiling their interactions in association to a certain disease or phenotype remains less studied. We developed EpicPred to identify phenotype-specific TCR-epitope interactions. EpicPred first predicts and removes unlikely TCR-epitope interactions to reduce false positives using the Open-set Recognition (OSR). Subsequently, multiple instance learning was used to identify TCR-epitope interactions specific to a cancer type or severity levels of COVID-19 infected patients.

Results: From six public TCR databases, 244 552 TCR sequences and 105 unique epitopes were used to predict epitope-binding TCRs and to filter out non-epitope-binding TCRs using the OSR method. The predicted interactions were used to further predict the phenotype groups in two cancer and four COVID-19 TCR-seq datasets of both bulk and single-cell resolution. EpicPred outperformed the competing methods in predicting the phenotypes, achieving an average AUROC of 0.80 ± 0.07.

Availability and implementation: The EpicPred Software is available at https://github.com/jaeminjj/EpicPred.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
EpicPred:使用基于注意的多实例学习预测表位结合tcr驱动的表型。
动机:正确识别表位结合TCRs对于了解其与某些表型相关的潜在生物学机制以及相应开发t细胞介导的免疫治疗非常重要。虽然tcr中CDR3区域对表位识别的重要性已得到充分认识,但分析它们与某种疾病或表型相关的相互作用的方法仍然较少研究。我们开发了EpicPred来鉴定表型特异性tcr -表位相互作用。EpicPred首先使用开放集识别预测并去除不可能的tcr -表位相互作用,以减少误报。随后,使用多实例学习来识别特定于癌症类型或COVID-19患者严重程度的tcr -表位相互作用。结果:从6个公共TCR数据库中,利用244,552个TCR序列和105个独特的表位,采用开放集识别方法预测表位结合的TCR,过滤掉非表位结合的TCR。预测的相互作用被用于进一步预测两种癌症和四种COVID-19 TCR-seq数据集的表型组,包括大量和单细胞分辨率。EpicPred在预测表型方面优于竞争对手的方法,平均AUROC为0.80±0.07。可用性和实施:EpicPred软件可在https://github.com/jaeminjj/EpicPred.Supplementary上获得信息;补充数据可在Bioinformatics在线获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
scDock: Streamlining drug discovery targeting cell-cell communication via scRNA-seq analysis and molecular docking. Dogme: A nextflow pipeline for reprocessing nanopore RNA and DNA modifications. GeneExt: a gene model extension tool for enhanced single-cell RNA-seq analysis. FishFeats: streamlined quantification of multimodal labeling at the single-cell level in 3D tissues. Statistical Methods to Harmonize Electronic Health Record Data Across Healthcare Systems: Case Study and Lessons Learned.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1