NetMHCphosPan - Pan-specific prediction of MHC class I antigen presentation of phosphorylated ligands

Immunoinformatics (Amsterdam, Netherlands) Pub Date : 2021-10-01 DOI:10.1016/j.immuno.2021.100005

Carina Thusgaard Refsgaard , Carolina Barra , Xu Peng , Nicola Ternette , Morten Nielsen

{"title":"NetMHCphosPan - Pan-specific prediction of MHC class I antigen presentation of phosphorylated ligands","authors":"Carina Thusgaard Refsgaard , Carolina Barra , Xu Peng , Nicola Ternette , Morten Nielsen","doi":"10.1016/j.immuno.2021.100005","DOIUrl":null,"url":null,"abstract":"<div><p>Post-translational modifications of proteins play a crucial part in carcinogenesis. Phosphorylated peptides have shown to be presented by MHC class I molecules and recognised by cytotoxic T cells, making them a promising target for immunotherapy. Identification of phosphorylated MHC class I ligands has so far predominantly been done using bioinformatic tools trained on unmodified peptides. Only one tool, PhosMHCpred, has been developed specifically for the prediction of phosphorylated MHC class I ligands so far and this tool has been trained only on a limited number of alleles and provides a limited peptide length coverage (only including 9-mers).</p><p>Here we propose a method, termed NetMHCphosPan, for the prediction of MHC presented phosphopeptides. The method is trained using the NNAlign_MA framework, which allows incorporating mixed data types and information leverage between data sets resulting in a greatly improved MHC and peptide length coverage and an overall increased predictive power compared to PhosMHCpred. Motif deconvolution suggested a strong preference for phosphosites to be located in position 4 of the binding motif, and enrichment of proline at P5 and arginine at P1. The improved performance, driven by the extended length and allelic coverage, of NetMHCphosPan over current state-of-the-art methods, was further validated on a large benchmark data set independent from the model development.</p><p>In conclusion, we have confirmed the high power of NNAlign_MA for motif deconvolution of complex immuno-peptidomics data and have developed a novel method for prediction of MHC presented phosphopeptides with improved predictive power and a broader peptide length and MHC coverage compared to current state-of-the-art methods. The developed method is available at <span>http://www.cbs.dtu.dk/services/NetMHCphosPan-1.0</span><svg><path></path></svg>.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"1 ","pages":"Article 100005"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.immuno.2021.100005","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667119021000057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Post-translational modifications of proteins play a crucial part in carcinogenesis. Phosphorylated peptides have shown to be presented by MHC class I molecules and recognised by cytotoxic T cells, making them a promising target for immunotherapy. Identification of phosphorylated MHC class I ligands has so far predominantly been done using bioinformatic tools trained on unmodified peptides. Only one tool, PhosMHCpred, has been developed specifically for the prediction of phosphorylated MHC class I ligands so far and this tool has been trained only on a limited number of alleles and provides a limited peptide length coverage (only including 9-mers).

Here we propose a method, termed NetMHCphosPan, for the prediction of MHC presented phosphopeptides. The method is trained using the NNAlign_MA framework, which allows incorporating mixed data types and information leverage between data sets resulting in a greatly improved MHC and peptide length coverage and an overall increased predictive power compared to PhosMHCpred. Motif deconvolution suggested a strong preference for phosphosites to be located in position 4 of the binding motif, and enrichment of proline at P5 and arginine at P1. The improved performance, driven by the extended length and allelic coverage, of NetMHCphosPan over current state-of-the-art methods, was further validated on a large benchmark data set independent from the model development.

In conclusion, we have confirmed the high power of NNAlign_MA for motif deconvolution of complex immuno-peptidomics data and have developed a novel method for prediction of MHC presented phosphopeptides with improved predictive power and a broader peptide length and MHC coverage compared to current state-of-the-art methods. The developed method is available at http://www.cbs.dtu.dk/services/NetMHCphosPan-1.0.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

NetMHCphosPan-Pan特异性预测磷酸化配体的MHC I类抗原呈递

蛋白质的翻译后修饰在癌变中起着至关重要的作用。磷酸化肽已被证明由MHC I类分子呈现并被细胞毒性T细胞识别，使其成为免疫治疗的一个有希望的靶点。迄今为止，鉴定磷酸化MHC I类配体主要是使用未修饰肽训练的生物信息学工具完成的。到目前为止，只有一种工具PhosMHCpred专门用于预测磷酸化的MHC I类配体，该工具仅对有限数量的等位基因进行了训练，并提供了有限的肽长度覆盖范围(仅包括9-mers)。在这里，我们提出了一种方法，称为NetMHCphosPan，用于预测MHC呈现的磷酸化肽。该方法使用NNAlign_MA框架进行训练，该框架允许在数据集之间合并混合数据类型和信息杠杆，从而大大提高MHC和肽长度覆盖范围，并且与PhosMHCpred相比，总体上提高了预测能力。基序反褶积表明，结合基序的磷酸化位点强烈倾向于位于4号位置，脯氨酸在P5和精氨酸在P1富集。与目前最先进的方法相比，NetMHCphosPan的长度和等位基因覆盖范围更大，从而提高了性能，并在独立于模型开发的大型基准数据集上得到了进一步验证。总之，我们已经证实了NNAlign_MA对复杂免疫肽组学数据的基序反褶积的高功率，并开发了一种预测MHC的新方法，与目前最先进的方法相比，该方法具有更高的预测能力，更宽的肽长度和MHC覆盖范围。开发的方法可在http://www.cbs.dtu.dk/services/NetMHCphosPan-1.0上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊