Integrated data-driven and experimental approaches to accelerate lead optimization targeting SARS-CoV-2 main protease

IF 4.3 3区 材料科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC ACS Applied Electronic Materials Pub Date : 2023-06-14 DOI:10.1007/s10822-023-00509-1
Rohith Anand Varikoti, Katherine J. Schultz, Chathuri J. Kombala, Agustin Kruel, Kristoffer R. Brandvold, Mowei Zhou, Neeraj Kumar
{"title":"Integrated data-driven and experimental approaches to accelerate lead optimization targeting SARS-CoV-2 main protease","authors":"Rohith Anand Varikoti,&nbsp;Katherine J. Schultz,&nbsp;Chathuri J. Kombala,&nbsp;Agustin Kruel,&nbsp;Kristoffer R. Brandvold,&nbsp;Mowei Zhou,&nbsp;Neeraj Kumar","doi":"10.1007/s10822-023-00509-1","DOIUrl":null,"url":null,"abstract":"<div><p>Identification of potential therapeutic candidates can be expedited by integrating computational modeling with domain aware machine learning (ML) models followed by experimental validation in an iterative manner. Generative deep learning models can generate thousands of new candidates, however, their physiochemical and biochemical properties are typically not fully optimized. Using our recently developed deep learning models and a scaffold as a starting point, we generated tens of thousands of compounds for SARS-CoV-2 M<sup>pro</sup> that preserve the core scaffold. We utilized and implemented several computational tools such as structural alert and toxicity analysis, high throughput virtual screening, ML-based 3D quantitative structure-activity relationships, multi-parameter optimization, and graph neural networks on generated candidates to predict biological activity and binding affinity in advance. As a result of these combined computational endeavors, eight promising candidates were singled out and put through experimental testing using Native Mass Spectrometry and FRET-based functional assays. Two of the tested compounds with quinazoline-2-thiol and acetylpiperidine core moieties showed IC<span>\\(_{50}\\)</span> values in the low micromolar range: <span>\\(2.95\\pm 0.0017\\)</span> <span>\\(\\upmu\\)</span>M and 3.41±0.0015 <span>\\(\\upmu\\)</span>M, respectively. Molecular dynamics simulations further highlight that binding of these compounds results in allosteric modulations within the chain B and the interface domains of the M<sup>pro</sup>. Our integrated approach provides a platform for data driven lead optimization with rapid characterization and experimental validation in a closed loop that could be applied to other potential protein targets.</p></div>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-023-00509-1","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Identification of potential therapeutic candidates can be expedited by integrating computational modeling with domain aware machine learning (ML) models followed by experimental validation in an iterative manner. Generative deep learning models can generate thousands of new candidates, however, their physiochemical and biochemical properties are typically not fully optimized. Using our recently developed deep learning models and a scaffold as a starting point, we generated tens of thousands of compounds for SARS-CoV-2 Mpro that preserve the core scaffold. We utilized and implemented several computational tools such as structural alert and toxicity analysis, high throughput virtual screening, ML-based 3D quantitative structure-activity relationships, multi-parameter optimization, and graph neural networks on generated candidates to predict biological activity and binding affinity in advance. As a result of these combined computational endeavors, eight promising candidates were singled out and put through experimental testing using Native Mass Spectrometry and FRET-based functional assays. Two of the tested compounds with quinazoline-2-thiol and acetylpiperidine core moieties showed IC\(_{50}\) values in the low micromolar range: \(2.95\pm 0.0017\) \(\upmu\)M and 3.41±0.0015 \(\upmu\)M, respectively. Molecular dynamics simulations further highlight that binding of these compounds results in allosteric modulations within the chain B and the interface domains of the Mpro. Our integrated approach provides a platform for data driven lead optimization with rapid characterization and experimental validation in a closed loop that could be applied to other potential protein targets.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
整合数据驱动和实验方法,加速针对SARS-CoV-2主要蛋白酶的导联优化
通过将计算建模与领域感知机器学习(ML)模型相结合,然后以迭代的方式进行实验验证,可以加快潜在治疗候选者的识别。生成式深度学习模型可以生成数千个新的候选对象,然而,它们的物理化学和生物化学特性通常没有得到充分优化。使用我们最近开发的深度学习模型和支架作为起点,我们为SARS-CoV-2 Mpro生成了数万种化合物,这些化合物保留了核心支架。我们利用并实现了几种计算工具,如结构警报和毒性分析,高通量虚拟筛选,基于ml的3D定量结构-活性关系,多参数优化和图神经网络对生成的候选物进行提前预测生物活性和结合亲和力。作为这些综合计算努力的结果,8个有希望的候选者被挑选出来,并使用Native质谱法和基于fret的功能分析进行实验测试。两种具有喹唑啉-2-硫醇和乙酰胡椒啶核心部分的化合物显示IC\(_{50}\) 低微摩尔范围内的值: \(2.95\pm 0.0017\) \(\upmu\)M和3.41±0.0015 \(\upmu\)分别为M。分子动力学模拟进一步强调,这些化合物的结合导致B链和Mpro界面域内的变构调节。我们的集成方法为数据驱动先导优化提供了一个平台,可以在闭环中快速表征和实验验证,可应用于其他潜在的蛋白质靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.20
自引率
4.30%
发文量
567
期刊最新文献
Vitamin B12: prevention of human beings from lethal diseases and its food application. Current status and obstacles of narrowing yield gaps of four major crops. Cold shock treatment alleviates pitting in sweet cherry fruit by enhancing antioxidant enzymes activity and regulating membrane lipid metabolism. Removal of proteins and lipids affects structure, in vitro digestion and physicochemical properties of rice flour modified by heat-moisture treatment. Investigating the impact of climate variables on the organic honey yield in Turkey using XGBoost machine learning.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1