{"title":"Protein language models enable prediction of polyreactivity of monospecific, bispecific, and heavy-chain-only antibodies.","authors":"Xin Yu, Kostika Vangjeli, Anusha Prakash, Meha Chhaya, Samantha J Stanley, Noah Cohen, Lili Huang","doi":"10.1093/abt/tbae012","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies. As a complementary approach, computational prediction of polyreactivity is desirable for counter-screening antibodies from <i>in silico</i> discovery campaigns. However, there is a lack of such models.</p><p><strong>Methods: </strong>Herein, we present the development of an ensemble of three deep learning models based on two pan-protein foundational protein language models (ESM2 and ProtT5) and an antibody-specific protein language model (PLM) (Antiberty). These models were trained in a transfer learning network to predict the outcomes in the BVP assay and the bovine serum albumin binding assay, which was developed as a complement to the BVP assay. The training was conducted on a large dataset of antibody sequences augmented with experimental conditions, which were collected through a highly efficient application system.</p><p><strong>Results: </strong>The resulting models demonstrated robust performance on canonical mAbs (monospecific with heavy and light chain), bispecific Abs, and single-domain Fc (VHH-Fc). PLMs outperformed a model built using molecular descriptors calculated from AlphaFold 2 predicted structures. Embeddings from the antibody-specific and foundational PLMs resulted in similar performance.</p><p><strong>Conclusion: </strong>To our knowledge, this represents the first application of PLMs to predict assay data on bispecifics and VHH-Fcs.</p>","PeriodicalId":36655,"journal":{"name":"Antibody Therapeutics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11259759/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Antibody Therapeutics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/abt/tbae012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Early assessment of antibody off-target binding is essential for mitigating developability risks such as fast clearance, reduced efficacy, toxicity, and immunogenicity. The baculovirus particle (BVP) binding assay has been widely utilized to evaluate polyreactivity of antibodies. As a complementary approach, computational prediction of polyreactivity is desirable for counter-screening antibodies from in silico discovery campaigns. However, there is a lack of such models.
Methods: Herein, we present the development of an ensemble of three deep learning models based on two pan-protein foundational protein language models (ESM2 and ProtT5) and an antibody-specific protein language model (PLM) (Antiberty). These models were trained in a transfer learning network to predict the outcomes in the BVP assay and the bovine serum albumin binding assay, which was developed as a complement to the BVP assay. The training was conducted on a large dataset of antibody sequences augmented with experimental conditions, which were collected through a highly efficient application system.
Results: The resulting models demonstrated robust performance on canonical mAbs (monospecific with heavy and light chain), bispecific Abs, and single-domain Fc (VHH-Fc). PLMs outperformed a model built using molecular descriptors calculated from AlphaFold 2 predicted structures. Embeddings from the antibody-specific and foundational PLMs resulted in similar performance.
Conclusion: To our knowledge, this represents the first application of PLMs to predict assay data on bispecifics and VHH-Fcs.