Pub Date : 2024-01-29DOI: 10.1016/j.immuno.2024.100033
Dan Hudson , Alex Lubbock , Mark Basham , Hashem Koohy
The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.
{"title":"A comparison of clustering models for inference of T cell receptor antigen specificity","authors":"Dan Hudson , Alex Lubbock , Mark Basham , Hashem Koohy","doi":"10.1016/j.immuno.2024.100033","DOIUrl":"https://doi.org/10.1016/j.immuno.2024.100033","url":null,"abstract":"<div><p>The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"13 ","pages":"Article 100033"},"PeriodicalIF":0.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266711902400003X/pdfft?md5=99e7206f5457951bcd4047d5992bc528&pid=1-s2.0-S266711902400003X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139682406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-25DOI: 10.1016/j.immuno.2024.100032
Jonas Birkelund Nilsson, Morten Nielsen
{"title":"The journey towards complete and accurate prediction of HLA antigen presentation","authors":"Jonas Birkelund Nilsson, Morten Nielsen","doi":"10.1016/j.immuno.2024.100032","DOIUrl":"10.1016/j.immuno.2024.100032","url":null,"abstract":"","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"13 ","pages":"Article 100032"},"PeriodicalIF":0.0,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667119024000028/pdfft?md5=2f6cf5dc881387d0b5903030768d291a&pid=1-s2.0-S2667119024000028-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139639896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-20DOI: 10.1016/j.immuno.2024.100031
Thalia Newman , Annarose Taylor , Sakhi Naik , Swati Pandey , Kimberly Manalang , Robert A. Kurt , Chun Wai Liew
Modeling and experimental data were used to evaluate how monocytes would respond to dual TLR4/TLR5 and dual TLR4/TLR7 signaling analogous to how the cells would respond to simultaneously encountering different types of pathogens. Both TLR4/TLR5 and TLR4/TLR7 signaling resulted in a decreased NFkB response relative to signaling through a single TLR. The NFkB response also decreased when all three signaling cascades were triggered. The model suggested that competition between the signaling pathways led to the impaired response when the cells were exposed to multiple TLR agonists, however adjusting the level of IRAKs and TABs in the model was insufficient to overcome competition between the signaling pathways. To experimentally examine how modifying TLR signaling proteins would impact the NFkB response to multiple TLR agonists, cells were pre-conditioned with lipopolysaccharide and the response to single, dual, and triple TLR signaling was followed. Pre-conditioning led to a reduction in the NFkB response to all three agonists, likely a consequence of decreased tlr4, tlr5, tlr7, nfkb, tab1, tab2, and tab3 expression. Collectively, the model supported exploration of the effects of multiple agonists on the signaling pathways and the effectiveness of adjusting the level of TLR signaling proteins in improving the NFkB response. These experiments and data show the importance of having a model capable of integrating multiple TLR signaling cascades since data generated by the model of a single TLR signaling cascade could not predict how the cells would respond when multiple TLR signaling cascades were activated.
{"title":"A computational and experimental approach to studying NFkB signaling in response to single, dual, and triple TLR signaling","authors":"Thalia Newman , Annarose Taylor , Sakhi Naik , Swati Pandey , Kimberly Manalang , Robert A. Kurt , Chun Wai Liew","doi":"10.1016/j.immuno.2024.100031","DOIUrl":"https://doi.org/10.1016/j.immuno.2024.100031","url":null,"abstract":"<div><p>Modeling and experimental data were used to evaluate how monocytes would respond to dual TLR4/TLR5 and dual TLR4/TLR7 signaling analogous to how the cells would respond to simultaneously encountering different types of pathogens. Both TLR4/TLR5 and TLR4/TLR7 signaling resulted in a decreased NFkB response relative to signaling through a single TLR. The NFkB response also decreased when all three signaling cascades were triggered. The model suggested that competition between the signaling pathways led to the impaired response when the cells were exposed to multiple TLR agonists, however adjusting the level of IRAKs and TABs in the model was insufficient to overcome competition between the signaling pathways. To experimentally examine how modifying TLR signaling proteins would impact the NFkB response to multiple TLR agonists, cells were pre-conditioned with lipopolysaccharide and the response to single, dual, and triple TLR signaling was followed. Pre-conditioning led to a reduction in the NFkB response to all three agonists, likely a consequence of decreased <em>tlr4, tlr5, tlr7, nfkb, tab1, tab2,</em> and <em>tab3</em> expression. Collectively, the model supported exploration of the effects of multiple agonists on the signaling pathways and the effectiveness of adjusting the level of TLR signaling proteins in improving the NFkB response. These experiments and data show the importance of having a model capable of integrating multiple TLR signaling cascades since data generated by the model of a single TLR signaling cascade could not predict how the cells would respond when multiple TLR signaling cascades were activated.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"13 ","pages":"Article 100031"},"PeriodicalIF":0.0,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667119024000016/pdfft?md5=e656e8f0367c7afdca5cc7ef50946ade&pid=1-s2.0-S2667119024000016-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139550047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-21DOI: 10.1016/j.immuno.2023.100030
Romanos Fasoulis , Mauricio Menegatti Rigo , Dinler Amaral Antunes , Georgios Paliouras , Lydia E. Kavraki
The cellular immune response comprises several processes, with the most notable ones being the binding of the peptide to the Major Histocompability Complex (MHC), the peptide-MHC (pMHC) presentation to the surface of the cell, and the recognition of the pMHC by the T-Cell Receptor. Identifying the most potent peptide targets for MHC binding, presentation and T-cell recognition is vital for developing peptide-based vaccines and T-cell-based immunotherapies. Data-driven tools that predict each of these steps have been developed, and the availability of mass spectrometry (MS) datasets has facilitated the development of accurate Machine Learning (ML) methods for class-I pMHC binding prediction. However, the accuracy of ML-based tools for pMHC kinetic stability prediction and peptide immunogenicity prediction is uncertain, as stability and immunogenicity datasets are not abundant. Here, we use transfer learning techniques to improve stability and immunogenicity predictions, by taking advantage of a large number of binding affinity and MS datasets. The resulting models, TLStab and TLImm, exhibit comparable or better performance than state-of-the-art approaches on different stability and immunogenicity test sets respectively. Our approach demonstrates the promise of learning from the task of peptide binding to improve predictions on downstream tasks. The source code of TLStab and TLImm is publicly available at https://github.com/KavrakiLab/TL-MHC.
细胞免疫反应包括几个过程,其中最显著的是多肽与主要组织相容性复合物(MHC)结合、多肽-MHC(pMHC)呈递到细胞表面以及 T 细胞受体识别 pMHC。确定与 MHC 结合、呈递和 T 细胞识别的最有效多肽靶标对于开发多肽疫苗和 T 细胞免疫疗法至关重要。目前已开发出能预测其中每个步骤的数据驱动工具,质谱(MS)数据集的可用性也促进了用于 I 类 pMHC 结合预测的精确机器学习(ML)方法的开发。然而,由于稳定性和免疫原性数据集并不丰富,基于 ML 的 pMHC 动力稳定性预测和多肽免疫原性预测工具的准确性尚不确定。在此,我们利用迁移学习技术,利用大量的结合亲和力和质谱数据集来改进稳定性和免疫原性预测。由此产生的 TLStab 和 TLImm 模型分别在不同的稳定性和免疫原性测试集上表现出与最先进方法相当甚至更好的性能。我们的方法证明了从多肽结合任务中学习以改进下游任务预测的前景。TLStab 和 TLImm 的源代码可在 https://github.com/KavrakiLab/TL-MHC 公开获取。
{"title":"Transfer learning improves pMHC kinetic stability and immunogenicity predictions","authors":"Romanos Fasoulis , Mauricio Menegatti Rigo , Dinler Amaral Antunes , Georgios Paliouras , Lydia E. Kavraki","doi":"10.1016/j.immuno.2023.100030","DOIUrl":"https://doi.org/10.1016/j.immuno.2023.100030","url":null,"abstract":"<div><p>The cellular immune response comprises several processes, with the most notable ones being the binding of the peptide to the Major Histocompability Complex (MHC), the peptide-MHC (pMHC) presentation to the surface of the cell, and the recognition of the pMHC by the T-Cell Receptor. Identifying the most potent peptide targets for MHC binding, presentation and T-cell recognition is vital for developing peptide-based vaccines and T-cell-based immunotherapies. Data-driven tools that predict each of these steps have been developed, and the availability of mass spectrometry (MS) datasets has facilitated the development of accurate Machine Learning (ML) methods for class-I pMHC binding prediction. However, the accuracy of ML-based tools for pMHC kinetic stability prediction and peptide immunogenicity prediction is uncertain, as stability and immunogenicity datasets are not abundant. Here, we use transfer learning techniques to improve stability and immunogenicity predictions, by taking advantage of a large number of binding affinity and MS datasets. The resulting models, TLStab and TLImm, exhibit comparable or better performance than state-of-the-art approaches on different stability and immunogenicity test sets respectively. Our approach demonstrates the promise of learning from the task of peptide binding to improve predictions on downstream tasks. The source code of TLStab and TLImm is publicly available at <span>https://github.com/KavrakiLab/TL-MHC</span><svg><path></path></svg>.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"13 ","pages":"Article 100030"},"PeriodicalIF":0.0,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667119023000101/pdfft?md5=8b373c4d3341fd69e7933198d284cc77&pid=1-s2.0-S2667119023000101-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139100653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-01DOI: 10.1016/j.immuno.2023.100029
María Rodríguez Martínez , Matteo Barberis , Anna Niarakis
The immune system is highly complex, and its malfunctioning can result in many complex disorders. Understanding its inner workings is crucial to designing optimal immunotherapies, developing new vaccines, or understanding autoimmune diseases, just to name a few. Immune-related diseases present unique challenges due to our limited understanding of the complex molecular and cellular interactions involved, as well as the scarcity of available therapeutic options. Recent years have witnessed the progressive development of high-throughput experimental technologies to probe the immune system. This large amount of data has facilitated the emergence of statistical and machine-learning models focused on unravelling the intricate complexities of the immune system. With this vision in mind, a workshop titled "Computational modelling of immunological mechanisms: From statistical approaches to interpretable machine learning" was organized on Sunday, September 18th, 2022 at the 21st European Conference on Computational Biology (ECCB) in Sitges, Spain. The workshop, led by María Rodríguez Martínez, Anna Niarakis, and Matteo Barberis, explored recent statistical models, high-throughput data analyses, and machine learning models to understand immunological mechanisms. More than 60 participants attended the workshop, comprising students, early-career and senior researchers, as well as professionals from diverse domains including Immunology, Systems Biology, Computational Biology, Computer Science, and Bioinformatics. To conclude the workshop, a round table was organized to foster discussions on the existing challenges and chart a roadmap for the development of the next generation of computational models dedicated to investigating the cellular and molecular functions that underlie the immune system.
{"title":"Computational modelling of immunological mechanisms: From statistical approaches to interpretable machine learning","authors":"María Rodríguez Martínez , Matteo Barberis , Anna Niarakis","doi":"10.1016/j.immuno.2023.100029","DOIUrl":"10.1016/j.immuno.2023.100029","url":null,"abstract":"<div><p>The immune system is highly complex, and its malfunctioning can result in many complex disorders. Understanding its inner workings is crucial to designing optimal immunotherapies, developing new vaccines, or understanding autoimmune diseases, just to name a few. Immune-related diseases present unique challenges due to our limited understanding of the complex molecular and cellular interactions involved, as well as the scarcity of available therapeutic options. Recent years have witnessed the progressive development of high-throughput experimental technologies to probe the immune system. This large amount of data has facilitated the emergence of statistical and machine-learning models focused on unravelling the intricate complexities of the immune system. With this vision in mind, a workshop titled \"Computational modelling of immunological mechanisms: From statistical approaches to interpretable machine learning\" was organized on Sunday, September 18th, 2022 at the 21st European Conference on Computational Biology (ECCB) in Sitges, Spain. The workshop, led by María Rodríguez Martínez, Anna Niarakis, and Matteo Barberis, explored recent statistical models, high-throughput data analyses, and machine learning models to understand immunological mechanisms. More than 60 participants attended the workshop, comprising students, early-career and senior researchers, as well as professionals from diverse domains including Immunology, Systems Biology, Computational Biology, Computer Science, and Bioinformatics. To conclude the workshop, a round table was organized to foster discussions on the existing challenges and chart a roadmap for the development of the next generation of computational models dedicated to investigating the cellular and molecular functions that underlie the immune system.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"12 ","pages":"Article 100029"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667119023000095/pdfft?md5=1bd294dd942e97f1e67d4a8e7ca5da39&pid=1-s2.0-S2667119023000095-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47600835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.1016/j.immuno.2023.100028
Jarosław Kończak, Bartosz Janusz, Jakub Młokosiewicz, Tadeusz Satława, Sonia Wróbel, Paweł Dudzic, Konrad Krawczyk
Protein folding problem obtained a practical solution recently, owing to advances in deep learning. There are classes of proteins though, such as antibodies, that are structurally unique, where the general solution still lacks. In particular, the prediction of the CDR-H3 loop, which is an instrumental part of an antibody in its antigen recognition abilities, remains a challenge. Antibody-specific deep learning frameworks were proposed to tackle this problem noting great progress, both on accuracy and speed fronts. Oftentimes though, the original networks produce physically implausible bond geometries that then need to undergo a time-consuming energy minimization process. Here we hypothesized that pre-training the network on a large, augmented set of models with correct physical geometries, rather than a small set of real antibody X-ray structures, would allow the network to learn better bond geometries. We show that fine-tuning such a pre-trained network on a task of shape prediction on real X-ray structures improves the number of correct peptide bond distances, abstracted as the Cα distances. We further demonstrate that pre-training allows the network to produce physically plausible shapes on an artificial set of CDR-H3s, showing the ability to generalize to the vast antibody sequence space. We hope that our strategy will benefit the development of deep learning antibody models that rapidly generate physically plausible geometries, without the burden of time-consuming energy minimization.
{"title":"Structural pre-training improves physical accuracy of antibody structure prediction using deep learning.","authors":"Jarosław Kończak, Bartosz Janusz, Jakub Młokosiewicz, Tadeusz Satława, Sonia Wróbel, Paweł Dudzic, Konrad Krawczyk","doi":"10.1016/j.immuno.2023.100028","DOIUrl":"https://doi.org/10.1016/j.immuno.2023.100028","url":null,"abstract":"<div><p>Protein folding problem obtained a practical solution recently, owing to advances in deep learning. There are classes of proteins though, such as antibodies, that are structurally unique, where the general solution still lacks. In particular, the prediction of the CDR-H3 loop, which is an instrumental part of an antibody in its antigen recognition abilities, remains a challenge. Antibody-specific deep learning frameworks were proposed to tackle this problem noting great progress, both on accuracy and speed fronts. Oftentimes though, the original networks produce physically implausible bond geometries that then need to undergo a time-consuming energy minimization process. Here we hypothesized that pre-training the network on a large, augmented set of models with correct physical geometries, rather than a small set of real antibody X-ray structures, would allow the network to learn better bond geometries. We show that fine-tuning such a pre-trained network on a task of shape prediction on real X-ray structures improves the number of correct peptide bond distances, abstracted as the Cα distances. We further demonstrate that pre-training allows the network to produce physically plausible shapes on an artificial set of CDR-H3s, showing the ability to generalize to the vast antibody sequence space. We hope that our strategy will benefit the development of deep learning antibody models that rapidly generate physically plausible geometries, without the burden of time-consuming energy minimization.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"11 ","pages":"Article 100028"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49865553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The recognition of an epitope by a T-cell receptor (TCR) is crucial for eliminating pathogens and establishing immunological memory. Prediction of the binding of any TCR–epitope pair is still a challenging task, especially for novel epitopes, because the underlying patterns are largely unknown to domain experts and machine learning models. To achieve a deeper understanding of TCR–epitope interactions, we have used interpretable deep learning techniques to gain insights into the performance of TCR–epitope binding machine learning models. We demonstrate how interpretable AI techniques can be linked to the three-dimensional structure of molecules to offer novel insights into the factors that determine TCR affinity on a molecular level. Additionally, our results show the importance of using interpretability techniques to verify the predictions of machine learning models for challenging molecular biology problems where small hard-to-detect problems can accumulate to inaccurate results.
{"title":"Interpretable deep learning to uncover the molecular binding patterns determining TCR–epitope interaction predictions","authors":"Ceder Dens, Wout Bittremieux, Fabio Affaticati, Kris Laukens, Pieter Meysman","doi":"10.1016/j.immuno.2023.100027","DOIUrl":"10.1016/j.immuno.2023.100027","url":null,"abstract":"<div><p>The recognition of an epitope by a T-cell receptor (TCR) is crucial for eliminating pathogens and establishing immunological memory. Prediction of the binding of any TCR–epitope pair is still a challenging task, especially for novel epitopes, because the underlying patterns are largely unknown to domain experts and machine learning models. To achieve a deeper understanding of TCR–epitope interactions, we have used interpretable deep learning techniques to gain insights into the performance of TCR–epitope binding machine learning models. We demonstrate how interpretable AI techniques can be linked to the three-dimensional structure of molecules to offer novel insights into the factors that determine TCR affinity on a molecular level. Additionally, our results show the importance of using interpretability techniques to verify the predictions of machine learning models for challenging molecular biology problems where small hard-to-detect problems can accumulate to inaccurate results.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"11 ","pages":"Article 100027"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45830738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-06-01DOI: 10.1016/j.immuno.2023.100025
William D. Lees , Scott Christley , Ayelet Peres , Justin T. Kos , Brian Corrie , Duncan Ralph , Felix Breden , Lindsay G. Cowell , Gur Yaari , Martin Corcoran , Gunilla B. Karlsson Hedestam , Mats Ohlin , Andrew M. Collins , Corey T. Watson , Christian E. Busse , The AIRR Community
Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.
{"title":"AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets","authors":"William D. Lees , Scott Christley , Ayelet Peres , Justin T. Kos , Brian Corrie , Duncan Ralph , Felix Breden , Lindsay G. Cowell , Gur Yaari , Martin Corcoran , Gunilla B. Karlsson Hedestam , Mats Ohlin , Andrew M. Collins , Corey T. Watson , Christian E. Busse , The AIRR Community","doi":"10.1016/j.immuno.2023.100025","DOIUrl":"10.1016/j.immuno.2023.100025","url":null,"abstract":"<div><p>Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"10 ","pages":"Article 100025"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310305/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9734901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-03-01DOI: 10.1016/j.immuno.2023.100024
Pieter Meysman , Justin Barton , Barbara Bravi , Liel Cohen-Lavi , Vadim Karnaukhov , Elias Lilleskov , Alessandro Montemurro , Morten Nielsen , Thierry Mora , Paul Pereira , Anna Postovskaya , María Rodríguez Martínez , Jorge Fernandez-de-Cossio-Diaz , Alexandra Vujkovic , Aleksandra M. Walczak , Anna Weber , Rose Yin , Anne Eugster , Virag Sharma
Many different solutions to predicting the cognate epitope target of a T-cell receptor (TCR) have been proposed. However several questions on the advantages and disadvantages of these different approaches remain unresolved, as most methods have only been evaluated within the context of their initial publications and data sets. Here, we report the findings of the first public TCR-epitope prediction benchmark performed on 23 prediction models in the context of the ImmRep 2022 TCR-epitope specificity workshop. This benchmark revealed that the use of paired-chain alpha-beta, as well as CDR1/2 or V/J information, when available, improves classification obtained with CDR3 data, independent of the underlying approach. In addition, we found that straight-forward distance-based approaches can achieve a respectable performance when compared to more complex machine-learning models. Finally, we highlight the need for a truly independent follow-up benchmark and provide recommendations for the design of such a next benchmark.
{"title":"Benchmarking solutions to the T-cell receptor epitope prediction problem: IMMREP22 workshop report","authors":"Pieter Meysman , Justin Barton , Barbara Bravi , Liel Cohen-Lavi , Vadim Karnaukhov , Elias Lilleskov , Alessandro Montemurro , Morten Nielsen , Thierry Mora , Paul Pereira , Anna Postovskaya , María Rodríguez Martínez , Jorge Fernandez-de-Cossio-Diaz , Alexandra Vujkovic , Aleksandra M. Walczak , Anna Weber , Rose Yin , Anne Eugster , Virag Sharma","doi":"10.1016/j.immuno.2023.100024","DOIUrl":"https://doi.org/10.1016/j.immuno.2023.100024","url":null,"abstract":"<div><p>Many different solutions to predicting the cognate epitope target of a T-cell receptor (TCR) have been proposed. However several questions on the advantages and disadvantages of these different approaches remain unresolved, as most methods have only been evaluated within the context of their initial publications and data sets. Here, we report the findings of the first public TCR-epitope prediction benchmark performed on 23 prediction models in the context of the ImmRep 2022 TCR-epitope specificity workshop. This benchmark revealed that the use of paired-chain alpha-beta, as well as CDR1/2 or V/J information, when available, improves classification obtained with CDR3 data, independent of the underlying approach. In addition, we found that straight-forward distance-based approaches can achieve a respectable performance when compared to more complex machine-learning models. Finally, we highlight the need for a truly independent follow-up benchmark and provide recommendations for the design of such a next benchmark.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"9 ","pages":"Article 100024"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49858643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-03-01DOI: 10.1016/j.immuno.2023.100021
Sonia Gazeau , Xiaoyan Deng , Hsu Kiang Ooi , Fatima Mostefai , Julie Hussin , Jane Heffernan , Adrianne L. Jenner , Morgan Craig
The COVID-19 pandemic has revealed the need for the increased integration of modelling and data analysis to public health, experimental, and clinical studies. Throughout the first two years of the pandemic, there has been a concerted effort to improve our understanding of the within-host immune response to the SARS-CoV-2 virus to provide better predictions of COVID-19 severity, treatment and vaccine development questions, and insights into viral evolution and the impacts of variants on immunopathology. Here we provide perspectives on what has been accomplished using quantitative methods, including predictive modelling, population genetics, machine learning, and dimensionality reduction techniques, in the first 26 months of the COVID-19 pandemic approaches, and where we go from here to improve our responses to this and future pandemics.
{"title":"The race to understand immunopathology in COVID-19: Perspectives on the impact of quantitative approaches to understand within-host interactions","authors":"Sonia Gazeau , Xiaoyan Deng , Hsu Kiang Ooi , Fatima Mostefai , Julie Hussin , Jane Heffernan , Adrianne L. Jenner , Morgan Craig","doi":"10.1016/j.immuno.2023.100021","DOIUrl":"10.1016/j.immuno.2023.100021","url":null,"abstract":"<div><p>The COVID-19 pandemic has revealed the need for the increased integration of modelling and data analysis to public health, experimental, and clinical studies. Throughout the first two years of the pandemic, there has been a concerted effort to improve our understanding of the within-host immune response to the SARS-CoV-2 virus to provide better predictions of COVID-19 severity, treatment and vaccine development questions, and insights into viral evolution and the impacts of variants on immunopathology. Here we provide perspectives on what has been accomplished using quantitative methods, including predictive modelling, population genetics, machine learning, and dimensionality reduction techniques, in the first 26 months of the COVID-19 pandemic approaches, and where we go from here to improve our responses to this and future pandemics.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"9 ","pages":"Article 100021"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9826539/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10741906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}