Dan Sindhikara, Sabrina E Iskandar, Lindsey Guan, Rumit Maini, Christopher J Hipolito, Congliang Sun, Lisa A Vasicek, Adam Weinglass, S Adrian Saldanha
{"title":"通过无目标 mRNA 显示的大数据采集分析和预测糜蛋白酶底物偏好。","authors":"Dan Sindhikara, Sabrina E Iskandar, Lindsey Guan, Rumit Maini, Christopher J Hipolito, Congliang Sun, Lisa A Vasicek, Adam Weinglass, S Adrian Saldanha","doi":"10.1002/cbic.202400760","DOIUrl":null,"url":null,"abstract":"<p><p>Oral delivery of peptide therapeutics is limited by degradation by gut proteases like chymotrypsin. Existing databases of peptidases are limited in size and do not enable systematic analyses of protease substrate preferences, especially for non-natural amino acids. Thus, stability optimization of hit compounds is time and resource intensive. To accelerate the stability optimization of peptide ligands, we generated large datasets of chymotrypsin-resistant peptides via mRNA display to create a predictive model for chymotrypsin-resistant sequences. Through analysis of enriched motifs, we recapitulate known chymotrypsin cleavage sites, reveal positionally dependent effects of monomers on peptide cleavage, and report previously unidentified protective and destabilizing residues. We then developed a machine-learning-based model predicting peptide resistance to chymotrypsin cleavage and validated both model performance and the NGS experimental data by measuring chymotrypsin half-lives for a subset of peptides. Finally, we simulated stability predictions on non-natural amino acids through a leucine hold-out model and observed robust performance. Overall, we demonstrate the utility of mRNA display as a tool for big data generation and show that pairing mRNA display with machine learning yields valuable predictions for chymotrypsin cleavage. Expansion of this workflow to additional proteases could provide complementary predictive models that focus future peptide drug discovery efforts.</p>","PeriodicalId":140,"journal":{"name":"ChemBioChem","volume":" ","pages":"e202400760"},"PeriodicalIF":2.6000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis and Prediction of Chymotrypsin Substrate Preferences through Large Data Acquisition with Target-Free mRNA Display.\",\"authors\":\"Dan Sindhikara, Sabrina E Iskandar, Lindsey Guan, Rumit Maini, Christopher J Hipolito, Congliang Sun, Lisa A Vasicek, Adam Weinglass, S Adrian Saldanha\",\"doi\":\"10.1002/cbic.202400760\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Oral delivery of peptide therapeutics is limited by degradation by gut proteases like chymotrypsin. Existing databases of peptidases are limited in size and do not enable systematic analyses of protease substrate preferences, especially for non-natural amino acids. Thus, stability optimization of hit compounds is time and resource intensive. To accelerate the stability optimization of peptide ligands, we generated large datasets of chymotrypsin-resistant peptides via mRNA display to create a predictive model for chymotrypsin-resistant sequences. Through analysis of enriched motifs, we recapitulate known chymotrypsin cleavage sites, reveal positionally dependent effects of monomers on peptide cleavage, and report previously unidentified protective and destabilizing residues. We then developed a machine-learning-based model predicting peptide resistance to chymotrypsin cleavage and validated both model performance and the NGS experimental data by measuring chymotrypsin half-lives for a subset of peptides. Finally, we simulated stability predictions on non-natural amino acids through a leucine hold-out model and observed robust performance. Overall, we demonstrate the utility of mRNA display as a tool for big data generation and show that pairing mRNA display with machine learning yields valuable predictions for chymotrypsin cleavage. Expansion of this workflow to additional proteases could provide complementary predictive models that focus future peptide drug discovery efforts.</p>\",\"PeriodicalId\":140,\"journal\":{\"name\":\"ChemBioChem\",\"volume\":\" \",\"pages\":\"e202400760\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ChemBioChem\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/cbic.202400760\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ChemBioChem","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/cbic.202400760","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Analysis and Prediction of Chymotrypsin Substrate Preferences through Large Data Acquisition with Target-Free mRNA Display.
Oral delivery of peptide therapeutics is limited by degradation by gut proteases like chymotrypsin. Existing databases of peptidases are limited in size and do not enable systematic analyses of protease substrate preferences, especially for non-natural amino acids. Thus, stability optimization of hit compounds is time and resource intensive. To accelerate the stability optimization of peptide ligands, we generated large datasets of chymotrypsin-resistant peptides via mRNA display to create a predictive model for chymotrypsin-resistant sequences. Through analysis of enriched motifs, we recapitulate known chymotrypsin cleavage sites, reveal positionally dependent effects of monomers on peptide cleavage, and report previously unidentified protective and destabilizing residues. We then developed a machine-learning-based model predicting peptide resistance to chymotrypsin cleavage and validated both model performance and the NGS experimental data by measuring chymotrypsin half-lives for a subset of peptides. Finally, we simulated stability predictions on non-natural amino acids through a leucine hold-out model and observed robust performance. Overall, we demonstrate the utility of mRNA display as a tool for big data generation and show that pairing mRNA display with machine learning yields valuable predictions for chymotrypsin cleavage. Expansion of this workflow to additional proteases could provide complementary predictive models that focus future peptide drug discovery efforts.
期刊介绍:
ChemBioChem (Impact Factor 2018: 2.641) publishes important breakthroughs across all areas at the interface of chemistry and biology, including the fields of chemical biology, bioorganic chemistry, bioinorganic chemistry, synthetic biology, biocatalysis, bionanotechnology, and biomaterials. It is published on behalf of Chemistry Europe, an association of 16 European chemical societies, and supported by the Asian Chemical Editorial Society (ACES).