A comparison of clustering models for inference of T cell receptor antigen specificity

Immunoinformatics (Amsterdam, Netherlands) Pub Date : 2024-01-29 DOI:10.1016/j.immuno.2024.100033

Dan Hudson , Alex Lubbock , Mark Basham , Hashem Koohy

{"title":"A comparison of clustering models for inference of T cell receptor antigen specificity","authors":"Dan Hudson , Alex Lubbock , Mark Basham , Hashem Koohy","doi":"10.1016/j.immuno.2024.100033","DOIUrl":null,"url":null,"abstract":"<div><p>The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"13 ","pages":"Article 100033"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266711902400003X/pdfft?md5=99e7206f5457951bcd4047d5992bc528&pid=1-s2.0-S266711902400003X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266711902400003X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于推断 T 细胞受体抗原特异性的聚类模型比较

TCR 及其配体的潜在序列多样性巨大，这给计算预测 TCR 表位特异性（定量免疫学的圣杯）带来了历史性障碍。一种常见的方法是将序列聚类，假设相似的受体结合相似的表位。在这里，我们首次对广泛使用的 TCR 特异性推断聚类算法进行了独立评估，发现不同模型的预测性能存在一定差异，可扩展性也有明显不同。尽管存在这些差异，但我们发现不同的算法对识别相同表位的受体产生的聚类具有高度的相似性。我们的分析加强了使用聚类模型从大样本中识别共同特异性信号的理由，同时也突出了复杂模型比简单比较模型的改进空间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Immunoinformatics (Amsterdam, Netherlands) Immunology, Computer Science Applications

自引率

0.00%

发文量

审稿时长

60 days