NetMHCphosPan - Pan-specific prediction of MHC class I antigen presentation of phosphorylated ligands

Carina Thusgaard Refsgaard, Carolina Barra, Xu Peng, Nicola Ternette, Morten Nielsen
Immunoinformatics (Amsterdam, Netherlands), volume 1, Article 100005. Published 2021-10-01.
DOI: 10.1016/j.immuno.2021.100005
https://www.sciencedirect.com/science/article/pii/S2667119021000057
Citation count: 5

Abstract

Post-translational modifications of proteins play a crucial part in carcinogenesis. Phosphorylated peptides have been shown to be presented by MHC class I molecules and recognised by cytotoxic T cells, making them a promising target for immunotherapy. So far, phosphorylated MHC class I ligands have predominantly been identified using bioinformatic tools trained on unmodified peptides. Only one tool, PhosMHCpred, has been developed specifically for predicting phosphorylated MHC class I ligands, and it was trained on a limited number of alleles and covers a limited range of peptide lengths (9-mers only).

Here we propose a method, termed NetMHCphosPan, for the prediction of MHC-presented phosphopeptides. The method is trained using the NNAlign_MA framework, which allows mixed data types to be incorporated and information to be leveraged between data sets, resulting in greatly improved MHC and peptide length coverage and an overall increase in predictive power compared to PhosMHCpred. Motif deconvolution suggested a strong preference for phosphosites at position 4 (P4) of the binding motif, and enrichment of proline at P5 and arginine at P1. The improved performance of NetMHCphosPan over current state-of-the-art methods, driven by the extended length and allelic coverage, was further validated on a large benchmark data set independent of the model development.

In conclusion, we have confirmed the high power of NNAlign_MA for motif deconvolution of complex immuno-peptidomics data and have developed a novel method for the prediction of MHC-presented phosphopeptides with improved predictive power and broader peptide length and MHC coverage than current state-of-the-art methods. The developed method is available at http://www.cbs.dtu.dk/services/NetMHCphosPan-1.0.
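
The positional preferences reported above (phosphosite at P4, proline at P5, arginine at P1) are the kind of summary one can read off a set of aligned ligands. The sketch below is only an illustration of that tallying step, not the NNAlign_MA method itself: it assumes 9-mer peptides in which a phosphorylated residue is written in lowercase (s, t, y), and the function names and toy sequences are hypothetical and made up for this example.

```python
from collections import Counter

# Assumed notation: lowercase s/t/y marks a phosphorylated Ser/Thr/Tyr.
PHOSPHO = {"s", "t", "y"}


def phosphosite_positions(peptides, length=9):
    """Count 1-based positions of phosphorylated residues in peptides of one length."""
    counts = Counter()
    for pep in peptides:
        if len(pep) != length:
            continue  # only peptides of the requested length are tallied
        for pos, aa in enumerate(pep, start=1):
            if aa in PHOSPHO:
                counts[pos] += 1
    return counts


def residue_frequency(peptides, position, length=9):
    """Fraction of each residue (upper-cased) at a 1-based position."""
    column = [pep[position - 1].upper() for pep in peptides if len(pep) == length]
    total = len(column)
    if total == 0:
        return {}
    return {aa: n / total for aa, n in Counter(column).items()}


# Made-up toy 9-mers, for illustration only (not taken from the paper's data).
toy = ["RQAsPLVEL", "RLDsYVRSL", "KMDsFLDMQ", "RTFsPTYGL", "GLLGsPVRA"]

positions = phosphosite_positions(toy)   # phosphosite counts per position
p1 = residue_frequency(toy, 1)           # residue fractions at P1
p5 = residue_frequency(toy, 5)           # residue fractions at P5
```

On real data the peptides would first have to be assigned to an allele and aligned to a common binding core; that deconvolution step is the part NNAlign_MA handles and is deliberately left out here.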

Source journal: Immunoinformatics (Amsterdam, Netherlands)
Subject areas: Immunology, Computer Science Applications