首页 > 最新文献

Immunoinformatics (Amsterdam, Netherlands)最新文献

英文 中文
Active learning for improving out-of-distribution lab-in-the-loop experimental design 主动学习改进非分布实验室在环实验设计
Pub Date : 2026-03-01 Epub Date: 2026-01-21 DOI: 10.1016/j.immuno.2026.100065
Daria Balashova , Robert Frank , Svetlana Kuzyakina , Dominique Weltevreden , Philippe A. Robert , Geir Kjetil Sandve , Victor Greiff
The accurate prediction of antibody-antigen binding is crucial for developing antibody-based therapeutics and advancing immunological research. Library-on-library approaches, where many antigens are probed against many antibodies, can identify specific interacting pairs. Machine learning models can predict target binding by analyzing many-to-many relationships between antibodies and antigens. However, these models face challenges when predicting interactions when test antibodies and antigens are not represented in the training data, a scenario known as out-of-distribution prediction. Generating experimental binding data is costly, limiting the availability of comprehensive datasets. Active learning can reduce costs by starting with a small labeled subset of data and iteratively expanding the labeled dataset. Few active learning approaches are available to handle data with many-to-many relationships as, for example, obtained from library-on-library screening approaches. In this study, we adapted twelve active learning strategies for antibody-antigen binding prediction in a library-on-library setting and evaluated their out-of-distribution performance using the Absolut! simulation framework. We found that three of the twelve algorithms tested, modestly but significantly, outperformed the baseline where random data are iteratively labeled. The best algorithm reduced the number of required antigen mutant variants by up to 12.5% compared to the random baseline. These findings demonstrate that active learning can improve experimental efficiency in a library-on-library setting and advance antibody-antigen binding prediction.
准确预测抗体-抗原结合对于开发基于抗体的治疗方法和推进免疫学研究至关重要。文库对文库的方法,其中许多抗原针对许多抗体进行探测,可以识别特定的相互作用对。机器学习模型可以通过分析抗体和抗原之间的多对多关系来预测目标结合。然而,当测试抗体和抗原没有在训练数据中表示时,这些模型在预测相互作用时面临挑战,这种情况被称为分布外预测。生成实验绑定数据是昂贵的,限制了综合数据集的可用性。主动学习可以通过从一个小的标记数据子集开始,迭代地扩展标记数据集来降低成本。很少有主动学习方法可用于处理具有多对多关系的数据,例如,从图书馆对图书馆的筛选方法中获得的数据。在这项研究中,我们采用了12种主动学习策略来预测库间的抗体-抗原结合,并使用Absolut!仿真框架。我们发现,在测试的12种算法中,有3种算法的表现,适度但显著地优于随机数据迭代标记的基线。与随机基线相比,最佳算法将所需抗原突变变体的数量减少了12.5%。这些发现表明,主动学习可以提高库对库环境下的实验效率,并推进抗体-抗原结合预测。
{"title":"Active learning for improving out-of-distribution lab-in-the-loop experimental design","authors":"Daria Balashova ,&nbsp;Robert Frank ,&nbsp;Svetlana Kuzyakina ,&nbsp;Dominique Weltevreden ,&nbsp;Philippe A. Robert ,&nbsp;Geir Kjetil Sandve ,&nbsp;Victor Greiff","doi":"10.1016/j.immuno.2026.100065","DOIUrl":"10.1016/j.immuno.2026.100065","url":null,"abstract":"<div><div>The accurate prediction of antibody-antigen binding is crucial for developing antibody-based therapeutics and advancing immunological research. Library-on-library approaches, where many antigens are probed against many antibodies, can identify specific interacting pairs. Machine learning models can predict target binding by analyzing many-to-many relationships between antibodies and antigens. However, these models face challenges when predicting interactions when test antibodies and antigens are not represented in the training data, a scenario known as out-of-distribution prediction. Generating experimental binding data is costly, limiting the availability of comprehensive datasets. Active learning can reduce costs by starting with a small labeled subset of data and iteratively expanding the labeled dataset. Few active learning approaches are available to handle data with many-to-many relationships as, for example, obtained from library-on-library screening approaches. In this study, we adapted twelve active learning strategies for antibody-antigen binding prediction in a library-on-library setting and evaluated their out-of-distribution performance using the Absolut! simulation framework. We found that three of the twelve algorithms tested, modestly but significantly, outperformed the baseline where random data are iteratively labeled. The best algorithm reduced the number of required antigen mutant variants by up to 12.5% compared to the random baseline. These findings demonstrate that active learning can improve experimental efficiency in a library-on-library setting and advance antibody-antigen binding prediction.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"21 ","pages":"Article 100065"},"PeriodicalIF":0.0,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AMULETY: A Python package to embed adaptive immune receptor sequences AMULETY:嵌入适应性免疫受体序列的Python包
Pub Date : 2026-03-01 Epub Date: 2026-02-03 DOI: 10.1016/j.immuno.2026.100066
Meng Wang , Wengyao Jiang , Yuval Kluger , Steven H. Kleinstein , Gisela Gabernet
Large language models have been developed to capture relevant features of adaptive immune receptors, each with unique potential applications. However, the diversity in available models presents challenges in accessibility and usability for downstream applications. Here we present AMULETY (Adaptive imMUne receptor Language model Embedding Tool), a Python-based software package to generate language model embeddings for adaptive immune receptor sequences, enabling users to leverage the strengths of different models without the need for complex configuration. AMULETY offers functions for embedding adaptive immune receptor amino acid sequences using pre-trained protein or antibody language models for paired B-cell receptor heavy-light, T-cell receptor alpha-beta or gamma-delta chains, or single chain sequences. We showcase the variability on the embedding space for several embeddings on a dataset of antibody binders to several SARS-CoV-2 epitopes as well as T-cell receptors binding to several epitopes and showed that different models may be effective at capturing different aspects of the distinctions between epitope groups. AMULETY is freely available under GPLv3 license from https://github.com/immcantation/amulety or via pip from the Python Package Index (PyPI) from https://pypi.org/project/amulety/.
已经开发出大型语言模型来捕捉适应性免疫受体的相关特征,每个都具有独特的潜在应用。然而,可用模型的多样性对下游应用程序的可访问性和可用性提出了挑战。在这里,我们提出了AMULETY(适应性免疫受体语言模型嵌入工具),这是一个基于python的软件包,用于生成适应性免疫受体序列的语言模型嵌入,使用户能够在不需要复杂配置的情况下利用不同模型的优势。AMULETY提供嵌入适应性免疫受体氨基酸序列的功能,使用预先训练的蛋白质或抗体语言模型,用于配对b细胞受体重-轻,t细胞受体α - β或γ - δ链或单链序列。我们展示了几种嵌入到几种SARS-CoV-2表位的抗体结合物以及结合到几种表位的t细胞受体数据集的嵌入空间的可变性,并表明不同的模型可能有效地捕获表位组之间差异的不同方面。AMULETY在GPLv3许可下可从https://github.com/immcantation/amulety或通过pip从https://pypi.org/project/amulety/获得Python包索引(PyPI)。
{"title":"AMULETY: A Python package to embed adaptive immune receptor sequences","authors":"Meng Wang ,&nbsp;Wengyao Jiang ,&nbsp;Yuval Kluger ,&nbsp;Steven H. Kleinstein ,&nbsp;Gisela Gabernet","doi":"10.1016/j.immuno.2026.100066","DOIUrl":"10.1016/j.immuno.2026.100066","url":null,"abstract":"<div><div>Large language models have been developed to capture relevant features of adaptive immune receptors, each with unique potential applications. However, the diversity in available models presents challenges in accessibility and usability for downstream applications. Here we present AMULETY (Adaptive imMUne receptor Language model Embedding Tool), a Python-based software package to generate language model embeddings for adaptive immune receptor sequences, enabling users to leverage the strengths of different models without the need for complex configuration. AMULETY offers functions for embedding adaptive immune receptor amino acid sequences using pre-trained protein or antibody language models for paired B-cell receptor heavy-light, T-cell receptor alpha-beta or gamma-delta chains, or single chain sequences. We showcase the variability on the embedding space for several embeddings on a dataset of antibody binders to several SARS-CoV-2 epitopes as well as T-cell receptors binding to several epitopes and showed that different models may be effective at capturing different aspects of the distinctions between epitope groups. AMULETY is freely available under GPLv3 license from <span><span>https://github.com/immcantation/amulety</span><svg><path></path></svg></span> or via pip from the Python Package Index (PyPI) from <span><span>https://pypi.org/project/amulety/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"21 ","pages":"Article 100066"},"PeriodicalIF":0.0,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146190588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Where single-cell transcriptomics fails T cells: The misuse of unsupervised clustering for T-cell annotation 单细胞转录组学在T细胞失败的地方:对T细胞注释滥用无监督聚类
Pub Date : 2025-12-01 Epub Date: 2025-10-21 DOI: 10.1016/j.immuno.2025.100063
Kerry A. Mullan , Sebastiaan Valkiers , Nicky de Vrij , Chen Li , Sara Verbandt , Ting Pu , Pieter Meysman
The current state of single-cell transcriptomic interrogation typically consists of using an unsupervised clustering approach followed by expert opinion-based annotation. The underlying assumption is that this process will identify transcriptional differences between cellular subsets accurately, and thus be able to cluster for example CD8+ T cells apart from CD4+ T cells. However, this widely applied assumption that the clustering reflects T-cell biology has never been validated. We used a large T-cell atlas (V2) that combined twelve 10x Genomics single T-cell transcriptomics datasets (∼500 K cells) as well as an independent CITE-seq dataset to qualify if the unsupervised clustering produced by Seurat reflected the biology. Annotations were then evaluated using the expression of key marker genes. The main T-cell markers CD8 and CD4 were mixed in most clusters, regardless of the feature selection and either principal/harmony components or features. The factors driving the clustering were also related to cellular functions (glucose metabolism), T-cell receptor (TCR), immunoglobulin and HLA transcripts, and not typical markers. Against current assumptions, the clustering was not being driven by the T-cell phenotypes and could not accurately segregate the CD4+ from CD8+ T cells, let alone the sub-classifications. This implicated many of the T cells would be incorrectly classified if using the standard cluster-based annotation approach. Methods relying on unsupervised clustering should be used with care, as improper handling can misrepresent the data, and alternatives such as semi-supervised approaches with TCR-seq or protein-based annotations should be preferred.
目前的单细胞转录组询问通常包括使用无监督聚类方法,然后是基于专家意见的注释。潜在的假设是,这一过程将准确地识别细胞亚群之间的转录差异,从而能够将CD8+ T细胞与CD4+ T细胞分开聚集。然而,这种广泛应用的聚类反应t细胞生物学的假设从未得到验证。我们使用了一个大型t细胞图谱(V2),该图谱结合了12个10x Genomics单个t细胞转录组学数据集(~ 500 K细胞)以及一个独立的CITE-seq数据集,以确定Seurat产生的无监督聚类是否反映了生物学。然后使用关键标记基因的表达来评估注释。主要的t细胞标记CD8和CD4在大多数集群中是混合的,无论特征选择和主要/和谐成分或特征。驱动聚类的因素还与细胞功能(葡萄糖代谢)、t细胞受体(TCR)、免疫球蛋白和HLA转录物有关,而不是典型的标志物。与目前的假设相反,这种聚类并不是由T细胞表型驱动的,也不能准确地分离CD4+和CD8+ T细胞,更不用说亚分类了。这意味着如果使用标准的基于簇的注释方法,许多T细胞将被错误地分类。应该谨慎使用依赖于无监督聚类的方法,因为不当的处理可能会歪曲数据,并且应该优先选择使用TCR-seq或基于蛋白质的注释的半监督方法。
{"title":"Where single-cell transcriptomics fails T cells: The misuse of unsupervised clustering for T-cell annotation","authors":"Kerry A. Mullan ,&nbsp;Sebastiaan Valkiers ,&nbsp;Nicky de Vrij ,&nbsp;Chen Li ,&nbsp;Sara Verbandt ,&nbsp;Ting Pu ,&nbsp;Pieter Meysman","doi":"10.1016/j.immuno.2025.100063","DOIUrl":"10.1016/j.immuno.2025.100063","url":null,"abstract":"<div><div>The current state of single-cell transcriptomic interrogation typically consists of using an unsupervised clustering approach followed by expert opinion-based annotation. The underlying assumption is that this process will identify transcriptional differences between cellular subsets accurately, and thus be able to cluster for example CD8+ <em>T</em> cells apart from CD4+ <em>T</em> cells. However, this widely applied assumption that the clustering reflects T-cell biology has never been validated. We used a large T-cell atlas (V2) that combined twelve 10x Genomics single T-cell transcriptomics datasets (∼500 K cells) as well as an independent CITE-seq dataset to qualify if the unsupervised clustering produced by Seurat reflected the biology. Annotations were then evaluated using the expression of key marker genes. The main T-cell markers CD8 and CD4 were mixed in most clusters, regardless of the feature selection and either principal/harmony components or features. The factors driving the clustering were also related to cellular functions (glucose metabolism), T-cell receptor (TCR), immunoglobulin and HLA transcripts, and not typical markers. Against current assumptions, the clustering was not being driven by the T-cell phenotypes and could not accurately segregate the CD4+ from CD8+ <em>T</em> cells, let alone the sub-classifications. This implicated many of the T cells would be incorrectly classified if using the standard cluster-based annotation approach. Methods relying on unsupervised clustering should be used with care, as improper handling can misrepresent the data, and alternatives such as semi-supervised approaches with TCR-seq or protein-based annotations should be preferred.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100063"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145417677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building immune digital twins: An international and transdisciplinary community effort 建立免疫数字双胞胎:一个国际和跨学科的社区努力
Pub Date : 2025-12-01 Epub Date: 2025-09-16 DOI: 10.1016/j.immuno.2025.100060
Anna Niarakis , Gary An , Luiz Ladeira , Noriko F. Hiroi , Athina Papadopoulou , Francis P. Crawley , Niloofar Nikaein , Laurence Calzone , Eirini Tsirvouli , Hasan Balci , Marina Esteban Medina , Lorenzo Veschini , Ozan Ozisik , Francesco Messina , Malvina Marku , Van Du T. Tran , Arnau Montagud , Nikola Schlosserova , Yashwanth Subbannayya , Martina Kutmon , Reinhard Laubenbacher
Digital twins, initially developed for industrial applications, are set to make significant advancements in medicine and healthcare. They have demonstrated promising potential for drug development and personalised care, especially in cardiovascular diagnostics and insulin-dependent diabetes management. A particularly compelling application lies in immune responses and immune-mediated diseases, given the immune system’s essential role in preserving human health, from fighting infections to managing autoimmune diseases. Creating Immune Digital Twins (IDTs) holds great promise for medicine and healthcare. At the same time, the development of a reliable and robust IDT presents significant challenges due to the inherent complexity and polymorphism of the human immune system, the difficulties in measuring patients’ immune state in vivo, and the intrinsic difficulties associated with modelling complex biological systems and processes.
The Working Group “Building Immune Digital Twins” (BIDT WG) aims to address these challenges by fostering transdisciplinary collaborations among immunologists, clinicians, experimentalists, computational biologists, and engineers. The international network is leveraging its cross-disciplinary expertise to build the components required for a working IDT model. Moreover, the BIDT WG focuses on creating an open-access model repository for publicly available immune-related computational models and their required metadata. The group is also active in cataloguing open-access tools, methodologies, and software to identify interoperability gaps in the current modelling landscape.
Consequently, this work can drive transformative innovations in precision medicine, unlocking new possibilities for the diagnosis, treatment, and management of immune-mediated diseases.
最初为工业应用而开发的数字双胞胎将在医学和医疗保健领域取得重大进展。它们在药物开发和个性化护理方面,特别是在心血管诊断和胰岛素依赖型糖尿病管理方面,显示出了巨大的潜力。一个特别引人注目的应用是免疫反应和免疫介导的疾病,考虑到免疫系统在保护人类健康方面的重要作用,从对抗感染到管理自身免疫性疾病。创造免疫数字双胞胎(IDTs)在医药和医疗保健领域有着巨大的前景。与此同时,由于人类免疫系统固有的复杂性和多态性,在体内测量患者免疫状态的困难,以及与复杂生物系统和过程建模相关的内在困难,开发可靠且强大的IDT面临着重大挑战。“构建免疫数字双胞胎”工作组(BIDT WG)旨在通过促进免疫学家、临床医生、实验家、计算生物学家和工程师之间的跨学科合作来应对这些挑战。国际网络正在利用其跨学科的专门知识来建立一个有效的IDT模型所需的组成部分。此外,BIDT工作组侧重于为公开可用的免疫相关计算模型及其所需元数据创建开放访问模型存储库。该小组还积极对开放获取工具、方法和软件进行编目,以确定当前建模领域的互操作性差距。因此,这项工作可以推动精准医学的变革性创新,为免疫介导疾病的诊断、治疗和管理开辟新的可能性。
{"title":"Building immune digital twins: An international and transdisciplinary community effort","authors":"Anna Niarakis ,&nbsp;Gary An ,&nbsp;Luiz Ladeira ,&nbsp;Noriko F. Hiroi ,&nbsp;Athina Papadopoulou ,&nbsp;Francis P. Crawley ,&nbsp;Niloofar Nikaein ,&nbsp;Laurence Calzone ,&nbsp;Eirini Tsirvouli ,&nbsp;Hasan Balci ,&nbsp;Marina Esteban Medina ,&nbsp;Lorenzo Veschini ,&nbsp;Ozan Ozisik ,&nbsp;Francesco Messina ,&nbsp;Malvina Marku ,&nbsp;Van Du T. Tran ,&nbsp;Arnau Montagud ,&nbsp;Nikola Schlosserova ,&nbsp;Yashwanth Subbannayya ,&nbsp;Martina Kutmon ,&nbsp;Reinhard Laubenbacher","doi":"10.1016/j.immuno.2025.100060","DOIUrl":"10.1016/j.immuno.2025.100060","url":null,"abstract":"<div><div>Digital twins, initially developed for industrial applications, are set to make significant advancements in medicine and healthcare. They have demonstrated promising potential for drug development and personalised care, especially in cardiovascular diagnostics and insulin-dependent diabetes management. A particularly compelling application lies in immune responses and immune-mediated diseases, given the immune system’s essential role in preserving human health, from fighting infections to managing autoimmune diseases. Creating Immune Digital Twins (IDTs) holds great promise for medicine and healthcare. At the same time, the development of a reliable and robust IDT presents significant challenges due to the inherent complexity and polymorphism of the human immune system, the difficulties in measuring patients’ immune state in vivo, and the intrinsic difficulties associated with modelling complex biological systems and processes.</div><div>The Working Group “Building Immune Digital Twins” (BIDT WG) aims to address these challenges by fostering transdisciplinary collaborations among immunologists, clinicians, experimentalists, computational biologists, and engineers. The international network is leveraging its cross-disciplinary expertise to build the components required for a working IDT model. Moreover, the BIDT WG focuses on creating an open-access model repository for publicly available immune-related computational models and their required metadata. The group is also active in cataloguing open-access tools, methodologies, and software to identify interoperability gaps in the current modelling landscape.</div><div>Consequently, this work can drive transformative innovations in precision medicine, unlocking new possibilities for the diagnosis, treatment, and management of immune-mediated diseases.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100060"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145417676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The gremlin in the works: why T cell receptor researchers need to pay more attention to germline reference sequences 工作中的小妖精:为什么T细胞受体研究人员需要更多地关注种系参考序列
Pub Date : 2025-12-01 Epub Date: 2025-08-28 DOI: 10.1016/j.immuno.2025.100058
James M. Heather , Ayelet Peres , Gur Yaari , William Lees
The rise of T cell receptor (TCR) sequencing technologies is driving both new understandings of the immune system and the development of novel clinical platforms. Such analyses rely on comparing recombined TCR sequences to unrearranged germline reference sequences during V(D)J annotation. In this study we observed that, despite the importance of this step in TCR analysis, most published studies do not properly report the reference used. We use public datasets to illustrate why references should be explicitly specified: using IMGT/GENE-DB as an example, we document how the reference set changes over time. Furthermore we illustrate how prescriptivist interpretations of reference metadata may be obscuring rather than illuminating TCR biology, and demonstrate the need to perform full V gene sequencing in order to unambiguously determine the final translated TCR polypeptide sequence. In summary, we argue that in order to ensure the accuracy and reproducibility of TCR sequencing – an ever more pressing task as more TCR-based diagnostics and therapeutics are developed – we should all take more care with the development, use, and reporting of the TCR germline references used in our science.
T细胞受体(TCR)测序技术的兴起正在推动对免疫系统的新理解和新的临床平台的发展。这种分析依赖于在V(D)J注释期间将重组的TCR序列与未重组的种系参考序列进行比较。在本研究中,我们观察到,尽管这一步在TCR分析中很重要,但大多数已发表的研究并未正确报告所使用的参考文献。我们使用公共数据集来说明为什么应该明确指定引用:以IMGT/GENE-DB为例,我们记录了引用集如何随时间变化。此外,我们说明了参考元数据的规范解释如何模糊而不是阐明TCR生物学,并证明需要执行全V基因测序,以便明确确定最终翻译的TCR多肽序列。总之,我们认为,为了确保TCR测序的准确性和可重复性——随着越来越多基于TCR的诊断和治疗方法的开发,这是一项越来越紧迫的任务——我们都应该更加注意在我们的科学中使用的TCR生殖系参考文献的开发、使用和报告。
{"title":"The gremlin in the works: why T cell receptor researchers need to pay more attention to germline reference sequences","authors":"James M. Heather ,&nbsp;Ayelet Peres ,&nbsp;Gur Yaari ,&nbsp;William Lees","doi":"10.1016/j.immuno.2025.100058","DOIUrl":"10.1016/j.immuno.2025.100058","url":null,"abstract":"<div><div>The rise of T cell receptor (TCR) sequencing technologies is driving both new understandings of the immune system and the development of novel clinical platforms. Such analyses rely on comparing recombined TCR sequences to unrearranged germline reference sequences during V(D)J annotation. In this study we observed that, despite the importance of this step in TCR analysis, most published studies do not properly report the reference used. We use public datasets to illustrate why references should be explicitly specified: using IMGT/GENE-DB as an example, we document how the reference set changes over time. Furthermore we illustrate how prescriptivist interpretations of reference metadata may be obscuring rather than illuminating TCR biology, and demonstrate the need to perform full V gene sequencing in order to unambiguously determine the final translated TCR polypeptide sequence. In summary, we argue that in order to ensure the accuracy and reproducibility of TCR sequencing – an ever more pressing task as more TCR-based diagnostics and therapeutics are developed – we should all take more care with the development, use, and reporting of the TCR germline references used in our science.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100058"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145109926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DoggifAI: A transformer based approach for antibody caninisation DoggifAI:一种基于变压器的抗体犬化方法
Pub Date : 2025-12-01 Epub Date: 2025-11-07 DOI: 10.1016/j.immuno.2025.100064
Dominik Grabarczyk , Mikołaj Kocikowski , Maciej Parys , Douglas R. Houston , Ted Hupp , Javier Antonio Alfaro , Shay B. Cohen
Antibody translation across species offers a compelling strategy to extend the vast and expensive investments in human therapeutic antibodies to veterinary oncology, with applications in both veterinary medicine and comparative oncology.
While precise, low-immunogenic treatments are essential for canine cancer care, traditional species conversion methods rely on ad hoc bioinformatics modifications. These methods often implicitly decouple the framework (FR) and complementarity-determining regions (CDRs), ignoring how structural changes in FRs can affect the conformation and function of CDRs. This can compromise binding specificity and require costly high-throughput in vitro screening.
To address this, we present DoggifAI, a transformer model that translates non-canine antibody sequences into canine ones by generating species-appropriate framework regions (FRs) based on desired CDRs. This allows the model to better preserve structural compatibility between FRs and CDRs. The model is pretrained in a T5-style text-to-text denoising task on a large multispecies antibody dataset, which allows further finetuning on a much smaller species-specific dataset.
DoggifAI generates highly canine-like antibodies and shows promising results in preserving binding specificity. To support further progress in this field, we also release a curated dataset of over 430,000 unique canine antibody chain sequences, significantly expanding the public sequence repertoire.
跨物种抗体翻译提供了一个令人信服的策略,将人类治疗性抗体的巨大而昂贵的投资扩展到兽医肿瘤学,在兽医医学和比较肿瘤学中都有应用。虽然精确的低免疫原性治疗对犬类癌症治疗至关重要,但传统的物种转换方法依赖于特别的生物信息学修饰。这些方法通常隐式解耦框架(FR)和互补决定区(cdr),忽略了FRs的结构变化如何影响cdr的构象和功能。这可能会损害结合特异性,需要昂贵的高通量体外筛选。为了解决这个问题,我们提出了DoggifAI,这是一个变压器模型,通过基于所需的cdr生成物种合适的框架区域(FRs),将非犬类抗体序列转化为犬类抗体序列。这使得模型可以更好地保持fr和cdr之间的结构兼容性。该模型在大型多物种抗体数据集上以t5风格的文本到文本去噪任务进行预训练,这允许在更小的物种特异性数据集上进一步微调。DoggifAI产生高度类似犬的抗体,并在保持结合特异性方面显示出有希望的结果。为了支持这一领域的进一步发展,我们还发布了一个超过43万个独特犬抗体链序列的精选数据集,大大扩展了公共序列库。
{"title":"DoggifAI: A transformer based approach for antibody caninisation","authors":"Dominik Grabarczyk ,&nbsp;Mikołaj Kocikowski ,&nbsp;Maciej Parys ,&nbsp;Douglas R. Houston ,&nbsp;Ted Hupp ,&nbsp;Javier Antonio Alfaro ,&nbsp;Shay B. Cohen","doi":"10.1016/j.immuno.2025.100064","DOIUrl":"10.1016/j.immuno.2025.100064","url":null,"abstract":"<div><div>Antibody translation across species offers a compelling strategy to extend the vast and expensive investments in human therapeutic antibodies to veterinary oncology, with applications in both veterinary medicine and comparative oncology.</div><div>While precise, low-immunogenic treatments are essential for canine cancer care, traditional species conversion methods rely on ad hoc bioinformatics modifications. These methods often implicitly decouple the framework (FR) and complementarity-determining regions (CDRs), ignoring how structural changes in FRs can affect the conformation and function of CDRs. This can compromise binding specificity and require costly high-throughput <em>in vitro</em> screening.</div><div>To address this, we present DoggifAI, a transformer model that translates non-canine antibody sequences into canine ones by generating species-appropriate framework regions (FRs) based on desired CDRs. This allows the model to better preserve structural compatibility between FRs and CDRs. The model is pretrained in a T5-style text-to-text denoising task on a large multispecies antibody dataset, which allows further finetuning on a much smaller species-specific dataset.</div><div>DoggifAI generates highly canine-like antibodies and shows promising results in preserving binding specificity. To support further progress in this field, we also release a curated dataset of over 430,000 unique canine antibody chain sequences, significantly expanding the public sequence repertoire.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100064"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145579206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning in AIRR diagnostics: Advances and applications AIRR诊断中的机器学习:进展和应用
Pub Date : 2025-12-01 Epub Date: 2025-10-21 DOI: 10.1016/j.immuno.2025.100062
Aslı Semerci , Celine AlBalaa , Brian Corrie , Dylan Duchen , Gisela Gabernet , Jinwoo Leem , Enkelejda Miho , Ulrik Stervbo , Justin Barton , Pieter Meysman , AIRR-Community
Recent advancements in sequencing technologies have led to an exponential increase in adaptive immune receptor repertoire (AIRR) data. These receptors, crucial to the adaptive immune system, are believed to have strong potential for diagnostic applications. The immune repertoires represent a wealth of data, creating a growing demand for robust computational methods to analyze and interpret this vast amount of information.
In this review, we examine the application of machine learning algorithms for the classification and analysis of AIRR-seq data for different diagnostic applications. We provide a high-level division of current approaches based on their focus on repertoire-level or sequence-level features. We provide an overview of the current state of public AIRR data sets available for model training. Finally, we briefly highlight what lessons can be learned from successful AIRR diagnostic approaches and what hurdles still must be overcome.
最近测序技术的进步导致适应性免疫受体库(AIRR)数据呈指数级增长。这些受体对适应性免疫系统至关重要,被认为具有很强的诊断应用潜力。免疫库代表了丰富的数据,创造了对强大的计算方法来分析和解释这大量信息的日益增长的需求。在这篇综述中,我们研究了机器学习算法在AIRR-seq数据分类和分析中的应用,以用于不同的诊断应用。我们对当前的方法进行了高层次的划分,基于它们对曲目级别或序列级别特征的关注。我们概述了可用于模型训练的公共AIRR数据集的当前状态。最后,我们简要地强调了从成功的AIRR诊断方法中可以吸取的经验教训以及仍然需要克服的障碍。
{"title":"Machine learning in AIRR diagnostics: Advances and applications","authors":"Aslı Semerci ,&nbsp;Celine AlBalaa ,&nbsp;Brian Corrie ,&nbsp;Dylan Duchen ,&nbsp;Gisela Gabernet ,&nbsp;Jinwoo Leem ,&nbsp;Enkelejda Miho ,&nbsp;Ulrik Stervbo ,&nbsp;Justin Barton ,&nbsp;Pieter Meysman ,&nbsp;AIRR-Community","doi":"10.1016/j.immuno.2025.100062","DOIUrl":"10.1016/j.immuno.2025.100062","url":null,"abstract":"<div><div>Recent advancements in sequencing technologies have led to an exponential increase in adaptive immune receptor repertoire (AIRR) data. These receptors, crucial to the adaptive immune system, are believed to have strong potential for diagnostic applications. The immune repertoires represent a wealth of data, creating a growing demand for robust computational methods to analyze and interpret this vast amount of information.</div><div>In this review, we examine the application of machine learning algorithms for the classification and analysis of AIRR-seq data for different diagnostic applications. We provide a high-level division of current approaches based on their focus on repertoire-level or sequence-level features. We provide an overview of the current state of public AIRR data sets available for model training. Finally, we briefly highlight what lessons can be learned from successful AIRR diagnostic approaches and what hurdles still must be overcome.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100062"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145417675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IApred: A versatile open-source tool for predicting protein antigenicity across diverse pathogens IApred:一个多功能的开源工具,用于预测不同病原体的蛋白质抗原性
Pub Date : 2025-12-01 Epub Date: 2025-10-09 DOI: 10.1016/j.immuno.2025.100061
Sebastian Miles, Gonzalo Menafra, Andrés Iriarte, Jose Alejandro Chabalgoity
Accurate prediction of protein antigenicity is crucial for vaccine development, diagnostic test design, and therapeutic protein engineering. However, existing tools face limitations in accessibility, computational efficiency, and pathogen diversity. Here, we present IApred, an open-source intrinsic antigenicity predictor that addresses these challenges. IApred employs a Support Vector Machine (SVM) model trained on a comprehensive dataset of 918 high-antigenicity proteins from diverse pathogens, including Gram-positive and Gram-negative bacteria, viruses, fungi, protozoa, and helminths. The model incorporates features derived from physicochemical properties, E-descriptors, amino acid dimers and small linear motifs (SLiMs) to predict the probability of a protein eliciting a humoral immune response. In external validation, IApred demonstrated superior balanced performance (ROC AUC = 0.761, sensitivity = 0.702, specificity = 0.706) compared to existing tools (VaxiJen 2.0, VaxiJen 3.0 and ANTIGENpro), while maintaining high computational efficiency (approximately 1000 sequences per minute). IApred's host-and-pathogen-agnostic nature and integration capability into bioinformatic pipelines makes it versatile for diverse applications. A web-based version of the software is available at https://smilesinformatics.com/iapred, while the software and training code are freely available on GitHub (https://github.com/sebamiles/IAPred) and Zenodo (https://doi.org/10.5281/zenodo.14578279)
准确预测蛋白质抗原性对疫苗开发、诊断试验设计和治疗性蛋白质工程至关重要。然而,现有的工具在可及性、计算效率和病原体多样性方面面临限制。在这里,我们提出了IApred,一个开源的内在抗原性预测器,解决了这些挑战。IApred采用了一个支持向量机(SVM)模型,该模型训练了918个高抗原性蛋白质的综合数据集,这些蛋白质来自不同的病原体,包括革兰氏阳性和革兰氏阴性细菌、病毒、真菌、原生动物和蠕虫。该模型结合了来自物理化学性质、e -描述符、氨基酸二聚体和小线性基序(SLiMs)的特征,以预测蛋白质引发体液免疫反应的概率。在外部验证中,与现有工具(VaxiJen 2.0、VaxiJen 3.0和ANTIGENpro)相比,IApred表现出更好的平衡性能(ROC AUC = 0.761,灵敏度= 0.702,特异性= 0.706),同时保持较高的计算效率(每分钟约1000个序列)。IApred的宿主和病原体不可知性以及与生物信息管道的集成能力使其适用于各种应用。该软件的网络版本可在https://smilesinformatics.com/iapred上获得,而软件和培训代码可在GitHub (https://github.com/sebamiles/IAPred)和Zenodo (https://doi.org/10.5281/zenodo.14578279)上免费获得。
{"title":"IApred: A versatile open-source tool for predicting protein antigenicity across diverse pathogens","authors":"Sebastian Miles,&nbsp;Gonzalo Menafra,&nbsp;Andrés Iriarte,&nbsp;Jose Alejandro Chabalgoity","doi":"10.1016/j.immuno.2025.100061","DOIUrl":"10.1016/j.immuno.2025.100061","url":null,"abstract":"<div><div>Accurate prediction of protein antigenicity is crucial for vaccine development, diagnostic test design, and therapeutic protein engineering. However, existing tools face limitations in accessibility, computational efficiency, and pathogen diversity. Here, we present IApred, an open-source intrinsic antigenicity predictor that addresses these challenges. IApred employs a Support Vector Machine (SVM) model trained on a comprehensive dataset of 918 high-antigenicity proteins from diverse pathogens, including Gram-positive and Gram-negative bacteria, viruses, fungi, protozoa, and helminths. The model incorporates features derived from physicochemical properties, <em>E</em>-descriptors, amino acid dimers and small linear motifs (SLiMs) to predict the probability of a protein eliciting a humoral immune response. In external validation, IApred demonstrated superior balanced performance (ROC AUC = 0.761, sensitivity = 0.702, specificity = 0.706) compared to existing tools (VaxiJen 2.0, VaxiJen 3.0 and ANTIGENpro), while maintaining high computational efficiency (approximately 1000 sequences per minute). IApred's host-and-pathogen-agnostic nature and integration capability into bioinformatic pipelines makes it versatile for diverse applications. A web-based version of the software is available at <span><span>https://smilesinformatics.com/iapred</span><svg><path></path></svg></span>, while the software and training code are freely available on GitHub (<span><span>https://github.com/sebamiles/IAPred</span><svg><path></path></svg></span>) and Zenodo (<span><span>https://doi.org/10.5281/zenodo.14578279</span><svg><path></path></svg></span><strong>)</strong></div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"20 ","pages":"Article 100061"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145269685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of different substitution matrices for distance based T-cell receptor epitope predictions using tcrdist3 使用tcrdist3进行基于距离的t细胞受体表位预测的不同替代矩阵的比较
Pub Date : 2025-09-01 Epub Date: 2025-07-07 DOI: 10.1016/j.immuno.2025.100051
Marc Hoffstedt, Hermann Wätzig, Knut Baumann
Various methods, differing in complexity, have been developed to predict T-cell receptor epitopes. tcrdist3, which implements an easy-to-interpret distance-based approach, has demonstrated performance comparable to the best feature-based methods. Here, a new substitution matrix for tcrdist3 is proposed and its performance is compared to various other substitution matrices. Small performance gains were possible; however tcrdist3 was found to perform reliably well with most substitution matrices. Randomly generated substitution matrices were used as a baseline and resulted in good classification results. It was observed that the prediction quality was negatively correlated with the relative standard deviation of the matrix used (i.e. a larger variance of the weights resulted in poorer predictivity). The most important factor of the tcrdist3-distance between two sequences that could be singled out is the number of substitutions. tcrdist3 implicitly considers the number of substitutions and the type of substitution simultaneously. Using substitution matrices with larger variance penalizes certain substitutions more strongly, which blurs the clusters of sequences with the same number of substitutions. Since the number of substitutions was a key predictor, this resulted in decreased prediction performance.
各种方法,不同的复杂性,已经开发预测t细胞受体表位。Tcrdist3实现了一种易于解释的基于距离的方法,其性能可与最佳的基于特征的方法相媲美。本文提出了一种新的tcrdist3替换矩阵,并将其性能与其他替换矩阵进行了比较。小的性能提升是可能的;然而,我们发现tcrdist3在大多数替换矩阵中都表现得很好。随机生成的替代矩阵作为基线,得到了良好的分类结果。我们观察到,预测质量与所用矩阵的相对标准偏差呈负相关(即权重方差越大,预测能力越差)。两个序列之间的tcrdist3距离最重要的因素是替换的数量。Tcrdist3隐式地同时考虑替换的数量和替换的类型。使用方差较大的替换矩阵对某些替换的惩罚更强烈,这使得具有相同替换次数的序列簇变得模糊。由于替换次数是一个关键的预测因素,这将导致预测性能下降。
{"title":"Comparison of different substitution matrices for distance based T-cell receptor epitope predictions using tcrdist3","authors":"Marc Hoffstedt,&nbsp;Hermann Wätzig,&nbsp;Knut Baumann","doi":"10.1016/j.immuno.2025.100051","DOIUrl":"10.1016/j.immuno.2025.100051","url":null,"abstract":"<div><div>Various methods, differing in complexity, have been developed to predict T-cell receptor epitopes. tcrdist3, which implements an easy-to-interpret distance-based approach, has demonstrated performance comparable to the best feature-based methods. Here, a new substitution matrix for tcrdist3 is proposed and its performance is compared to various other substitution matrices. Small performance gains were possible; however tcrdist3 was found to perform reliably well with most substitution matrices. Randomly generated substitution matrices were used as a baseline and resulted in good classification results. It was observed that the prediction quality was negatively correlated with the relative standard deviation of the matrix used (i.e. a larger variance of the weights resulted in poorer predictivity). The most important factor of the tcrdist3-distance between two sequences that could be singled out is the number of substitutions. tcrdist3 implicitly considers the number of substitutions and the type of substitution simultaneously. Using substitution matrices with larger variance penalizes certain substitutions more strongly, which blurs the clusters of sequences with the same number of substitutions. Since the number of substitutions was a key predictor, this resulted in decreased prediction performance.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"19 ","pages":"Article 100051"},"PeriodicalIF":0.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144634320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural insights on the differentiation and reversion of conformational changes in SARS-CoV-2 spike protein models across variants occurring from December, 2019 to November, 2021 2019年12月至2021年11月发生变异的SARS-CoV-2刺突蛋白模型构象变化分化和逆转的结构见解
Pub Date : 2025-09-01 Epub Date: 2025-06-27 DOI: 10.1016/j.immuno.2025.100055
Marni E. Cueno, Kenichi Imai
Conformational changes in the SARS-CoV-2 spike protein are critical for understanding viral evolution. In this study, we provide comparative structural and electrostatic analyses across variants, revealing both differentiation and reversion patterns not previously described in locked and activated spike conformations. More specifically, we generated SARS2 spike protein models from the various recorded variants between December, 2019 and November 2021, and performed structural superimposition, dendrogram analyses, and electrostatic mapping. We confirmed which locked and activated conformations differed and reversed between the Original spike protein model and subsequent SARS2 variants and subvariants. Additionally, among the spike protein models of subsequent SARS2 variants and subvariants during December, 2019-November, 2021, we likewise established structural variations and reversions among the locked and activated conformations. Moreover, we established the structural relationship and clustering among the locked and activated conformations of the SARS2 spike protein models. Furthermore, we determined the electrostatic potential of all generated SARS2 spike protein models to establish the surface charge distribution. Taken together, we found that certain locked and activated conformations of the Original SARS2 spike protein models exhibited both structural differences and, surprisingly, reversion when compared to subsequent variants and subvariants. Similarly, structural differentiation and reversion were also observed in the locked and activated conformations across the spike protein models. Additionally, we identified distinct structural clusters within the locked and activated conformations, establishing a structural relationship among certain SARS2 spike protein models. Moreover, we found that during spike evolution reorganization of the surface charge distribution occurs during structural differentiation and reversion.
SARS-CoV-2刺突蛋白的构象变化对于理解病毒进化至关重要。在这项研究中,我们提供了跨变体的比较结构和静电分析,揭示了之前未在锁定和激活的尖峰构象中描述的分化和逆转模式。更具体地说,我们从2019年12月至2021年11月期间记录的各种变体中生成了SARS2刺突蛋白模型,并进行了结构叠加、树突图分析和静电作图。我们确认了在原始刺突蛋白模型和随后的SARS2变异体和亚变异体之间锁定和激活的构象不同和逆转。此外,在2019年12月至2021年11月期间的后续SARS2变异体和亚变异体的刺突蛋白模型中,我们同样建立了锁定和激活构象之间的结构变化和逆转。此外,我们建立了SARS2刺突蛋白模型的锁定和激活构象之间的结构关系和聚类。此外,我们测定了所有生成的SARS2刺突蛋白模型的静电电位,以建立表面电荷分布。综上所述,我们发现,与随后的变体和亚变体相比,原始SARS2刺突蛋白模型的某些锁定和激活构象既表现出结构差异,又表现出令人惊讶的逆转。同样,在钉突蛋白模型的锁定和激活构象中也观察到结构分化和逆转。此外,我们在锁定和激活的构象中发现了不同的结构簇,建立了某些SARS2刺突蛋白模型之间的结构关系。此外,我们还发现,在尖刺演化过程中,表面电荷分布的重组发生在结构分化和逆转过程中。
{"title":"Structural insights on the differentiation and reversion of conformational changes in SARS-CoV-2 spike protein models across variants occurring from December, 2019 to November, 2021","authors":"Marni E. Cueno,&nbsp;Kenichi Imai","doi":"10.1016/j.immuno.2025.100055","DOIUrl":"10.1016/j.immuno.2025.100055","url":null,"abstract":"<div><div>Conformational changes in the SARS-CoV-2 spike protein are critical for understanding viral evolution. In this study, we provide comparative structural and electrostatic analyses across variants, revealing both differentiation and reversion patterns not previously described in locked and activated spike conformations. More specifically, we generated SARS2 spike protein models from the various recorded variants between December, 2019 and November 2021, and performed structural superimposition, dendrogram analyses, and electrostatic mapping. We confirmed which locked and activated conformations differed and reversed between the Original spike protein model and subsequent SARS2 variants and subvariants. Additionally, among the spike protein models of subsequent SARS2 variants and subvariants during December, 2019-November, 2021, we likewise established structural variations and reversions among the locked and activated conformations. Moreover, we established the structural relationship and clustering among the locked and activated conformations of the SARS2 spike protein models. Furthermore, we determined the electrostatic potential of all generated SARS2 spike protein models to establish the surface charge distribution. Taken together, we found that certain locked and activated conformations of the Original SARS2 spike protein models exhibited both structural differences and, surprisingly, reversion when compared to subsequent variants and subvariants. Similarly, structural differentiation and reversion were also observed in the locked and activated conformations across the spike protein models. Additionally, we identified distinct structural clusters within the locked and activated conformations, establishing a structural relationship among certain SARS2 spike protein models. Moreover, we found that during spike evolution reorganization of the surface charge distribution occurs during structural differentiation and reversion.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"19 ","pages":"Article 100055"},"PeriodicalIF":0.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Immunoinformatics (Amsterdam, Netherlands)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1