Shikhar Saxena, Sambhavi Animesh, M. Fullwood, Y. Mu
Journal of Micromechanics and Molecular Physics, published 2020-09-01. DOI: https://doi.org/10.1142/s2424913020500095
OnionMHC: A deep learning model for peptide — HLA-A*02:01 binding predictions using both structure and sequence feature sets
Peptide binding to Major Histocompatibility Complex (MHC) proteins is an important step in the antigen-presentation pathway, so predicting the binding potential of peptides with MHC is essential for the design of peptide-based therapeutics. Most available machine learning-based models predict peptide-MHC binding from the amino acid sequence alone. Given the importance of structural information in determining the stability of the complex, we use both the structure of the complex and peptide sequence features to predict the binding affinity of peptides to the human receptor HLA-A*02:01. To our knowledge, no previous model for the human HLA receptor incorporates both structure-based and sequence-based features.
Results: We applied machine learning techniques from natural language processing (NLP) together with a convolutional neural network to design a model that performs comparably with existing state-of-the-art models. Our model shows that combining information from the sequence and structure domains improves binding prediction compared with information from either domain alone. Testing on 18 weekly benchmark datasets provided by the Immune Epitope Database (IEDB), and on experimentally validated peptides from whole-exome sequencing analysis of breast cancer patients, indicates that our model achieves state-of-the-art performance.
Conclusion: We have developed a deep-learning model, OnionMHC, that incorporates both structure-based and sequence-based features to predict the binding affinity of peptides to the human receptor HLA-A*02:01. The model demonstrates state-of-the-art performance on the IEDB benchmark datasets as well as on the experimentally validated peptides. It can be used to screen potential neo-epitopes for cancer vaccine development or to design peptides for peptide-based therapeutics. OnionMHC is freely available at https://github.com/shikhar249/OnionMHC.
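To make the idea of combining sequence and structure features concrete, the following is a minimal sketch and not the OnionMHC implementation. It assumes a one-hot sequence encoding, a placeholder structure descriptor vector, and fusion by simple concatenation; the abstract does not specify any of these. The function names (`one_hot_peptide`, `fuse_features`, `ic50_to_target`) are hypothetical. The IC50 transform shown is one commonly used in the MHC binding prediction literature; whether OnionMHC uses it is not stated here.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def one_hot_peptide(peptide):
    """Flatten a peptide into a one-hot vector of length 20 * len(peptide)."""
    vector = []
    for residue in peptide.upper():
        if residue not in AMINO_ACIDS:
            raise ValueError(f"unknown residue: {residue!r}")
        row = [0.0] * len(AMINO_ACIDS)
        row[AMINO_ACIDS.index(residue)] = 1.0
        vector.extend(row)
    return vector


def fuse_features(sequence_vec, structure_vec):
    """Combine the two feature sets by concatenation (an assumed fusion scheme)."""
    return list(sequence_vec) + list(structure_vec)


def ic50_to_target(ic50_nm, max_ic50=50000.0):
    """Map an IC50 in nM to [0, 1]; higher values mean stronger binding."""
    score = 1.0 - math.log(ic50_nm) / math.log(max_ic50)
    return min(1.0, max(0.0, score))


# Hypothetical inputs: a 9-mer peptide and made-up structure descriptors.
peptide = "GILGFVFTL"
structure_vec = [0.12, 0.40, 0.05]  # placeholder values, not real data
features = fuse_features(one_hot_peptide(peptide), structure_vec)
print(len(features))          # 9 residues * 20 + 3 structure values
print(ic50_to_target(500.0))  # regression target for a 500 nM binder
```

In a real pipeline the fused vector (or the two feature sets kept as separate inputs) would feed a trained network; concatenation is used here only because it is the simplest way to show both domains contributing to one input.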