Protein language models enable prediction of polyreactivity of monospecific, bispecific, and heavy-chain-only antibodies.

IF 4.5 Q2 Medicine Antibody Therapeutics Pub Date : 2024-05-30 eCollection Date: 2024-07-01 DOI:10.1093/abt/tbae012

Xin Yu, Kostika Vangjeli, Anusha Prakash, Meha Chhaya, Samantha J Stanley, Noah Cohen, Lili Huang

{"title":"Protein language models enable prediction of polyreactivity of monospecific, bispecific, and heavy-chain-only antibodies.","authors":"Xin Yu, Kostika Vangjeli, Anusha Prakash, Meha Chhaya, Samantha J Stanley, Noah Cohen, Lili Huang","doi":"10.1093/abt/tbae012","DOIUrl":null,"url":null,"abstract":"Background: Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies. As a complementary approach, computational prediction of polyreactivity is desirable for counter-screening antibodies from in silico discovery campaigns. However, there is a lack of such models.Methods: Herein, we present the development of an ensemble of three deep learning models based on two pan-protein foundational protein language models (ESM2 and ProtT5) and an antibody-specific protein language model (PLM) (Antiberty). These models were trained in a transfer learning network to predict the outcomes in the BVP assay and the bovine serum albumin binding assay, which was developed as a complement to the BVP assay. The training was conducted on a large dataset of antibody sequences augmented with experimental conditions, which were collected through a highly efficient application system.Results: The resulting models demonstrated robust performance on canonical mAbs (monospecific with heavy and light chain), bispecific Abs, and single-domain Fc (VHH-Fc). PLMs outperformed a model built using molecular descriptors calculated from AlphaFold 2 predicted structures. Embeddings from the antibody-specific and foundational PLMs resulted in similar performance.Conclusion: To our knowledge, this represents the first application of PLMs to predict assay data on bispecifics and VHH-Fcs.","PeriodicalId":36655,"journal":{"name":"Antibody Therapeutics","volume":"7 3","pages":"199-208"},"PeriodicalIF":4.5000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11259759/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Antibody Therapeutics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/abt/tbae012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies. As a complementary approach, computational prediction of polyreactivity is desirable for counter-screening antibodies from in silico discovery campaigns. However, there is a lack of such models.

Methods: Herein, we present the development of an ensemble of three deep learning models based on two pan-protein foundational protein language models (ESM2 and ProtT5) and an antibody-specific protein language model (PLM) (Antiberty). These models were trained in a transfer learning network to predict the outcomes in the BVP assay and the bovine serum albumin binding assay, which was developed as a complement to the BVP assay. The training was conducted on a large dataset of antibody sequences augmented with experimental conditions, which were collected through a highly efficient application system.

Results: The resulting models demonstrated robust performance on canonical mAbs (monospecific with heavy and light chain), bispecific Abs, and single-domain Fc (VHH-Fc). PLMs outperformed a model built using molecular descriptors calculated from AlphaFold 2 predicted structures. Embeddings from the antibody-specific and foundational PLMs resulted in similar performance.

Conclusion: To our knowledge, this represents the first application of PLMs to predict assay data on bispecifics and VHH-Fcs.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

蛋白质语言模型可预测单特异性、双特异性和纯重链抗体的多反应性。

背景：抗体脱靶结合的早期评估对于降低开发风险（如快速清除、疗效降低、毒性和免疫原性）至关重要。杆状病毒颗粒（BVP）结合试验已被广泛用于评估抗体的多反应性。作为一种补充方法，多反应性的计算预测是反筛选硅学发现活动中抗体的理想方法。方法：在此，我们基于两个泛蛋白质基础蛋白质语言模型（ESM2 和 ProtT5）和一个抗体特异性蛋白质语言模型（PLM）（Antiberty），开发了三个深度学习模型的集合。在迁移学习网络中对这些模型进行了训练，以预测 BVP 检测和牛血清白蛋白结合检测的结果，牛血清白蛋白结合检测是作为 BVP 检测的补充而开发的。训练是在一个大型的抗体序列数据集上进行的，该数据集通过一个高效的应用系统收集了大量的实验条件：结果：由此产生的模型在典型 mAbs（重链和轻链的单特异性抗体）、双特异性抗体和单域 Fc（VHH-Fc）上表现出强大的性能。PLM 的表现优于使用 AlphaFold 2 预测结构计算的分子描述符建立的模型。来自抗体特异性和基础 PLM 的嵌入结果具有相似的性能：据我们所知，这是首次应用 PLM 预测双特异性抗体和 VHH-Fcs 的检测数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊