Ligandability and druggability assessment via machine learning

IF 27 2区化学 Q1 CHEMISTRY, MULTIDISCIPLINARY Wiley Interdisciplinary Reviews: Computational Molecular Science Pub Date : 2023-06-04 DOI:10.1002/wcms.1676

Francesco Di Palma, Carlo Abate, Sergio Decherchi, Andrea Cavalli

{"title":"Ligandability and druggability assessment via machine learning","authors":"Francesco Di Palma, Carlo Abate, Sergio Decherchi, Andrea Cavalli","doi":"10.1002/wcms.1676","DOIUrl":null,"url":null,"abstract":"<p>Drug discovery is a daunting and failure-prone task. A critical process in this research field is represented by the biological target and pocket identification steps as they heavily determine the subsequent efforts in selecting a putative ligand, most often a small molecule. Finding “ligandable” pockets, namely protein cavities that may accept a drug-like binder is instrumental to the more general and drug discovery oriented “druggability” estimation process. While high-throughput experimental techniques exist to identify putative binding sites other than the orthosteric one, these techniques are relatively expensive and not so commonly available in labs. In this regard, computational means of detecting ligandable pockets are advisable for their inexpensiveness and speed. These methods can become, in principle, particularly predictive when supported by machine learning methodologies that provide the modeling framework. As with any data-driven effort, the outcome critically depends on the input data, its featurization process and possible associated biases. Also, the machine learning task, (supervised/unsupervised) the learning method, and the possible usage of molecular dynamics data considerably shape the inherent assumptions of the modeling step. Defining a proper quantitative thermodynamic and/or kinetic score (or label) is key to the modeling process; here we revise literature and propose residence time as a novel ideal indicator of ligandability. Interestingly the vast majority of the methods does not keep into consideration kinetics nor thermodynamics when devising predictors.</p><p>This article is categorized under:\n </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"13 5","pages":""},"PeriodicalIF":27.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1676","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews: Computational Molecular Science","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/wcms.1676","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 1

Abstract

Drug discovery is a daunting and failure-prone task. A critical process in this research field is represented by the biological target and pocket identification steps as they heavily determine the subsequent efforts in selecting a putative ligand, most often a small molecule. Finding “ligandable” pockets, namely protein cavities that may accept a drug-like binder is instrumental to the more general and drug discovery oriented “druggability” estimation process. While high-throughput experimental techniques exist to identify putative binding sites other than the orthosteric one, these techniques are relatively expensive and not so commonly available in labs. In this regard, computational means of detecting ligandable pockets are advisable for their inexpensiveness and speed. These methods can become, in principle, particularly predictive when supported by machine learning methodologies that provide the modeling framework. As with any data-driven effort, the outcome critically depends on the input data, its featurization process and possible associated biases. Also, the machine learning task, (supervised/unsupervised) the learning method, and the possible usage of molecular dynamics data considerably shape the inherent assumptions of the modeling step. Defining a proper quantitative thermodynamic and/or kinetic score (or label) is key to the modeling process; here we revise literature and propose residence time as a novel ideal indicator of ligandability. Interestingly the vast majority of the methods does not keep into consideration kinetics nor thermodynamics when devising predictors.

This article is categorized under:

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过机器学习进行可连接性和可药用性评估

药物发现是一项艰巨且容易失败的任务。该研究领域的一个关键过程是生物靶标和口袋识别步骤，因为它们在很大程度上决定了随后选择推定配体（通常是小分子）的努力。找到“可连接”的口袋，即可能接受类药物粘合剂的蛋白质腔，有助于更通用和以药物发现为导向的“可药用性”估计过程。虽然存在高通量实验技术来鉴定除原位结合位点之外的假定结合位点，但这些技术相对昂贵，在实验室中并不常见。在这方面，检测可连接物口袋的计算方法是可取的，因为它们的成本和速度都很低。原则上，当得到提供建模框架的机器学习方法的支持时，这些方法可以变得特别具有预测性。与任何数据驱动的努力一样，结果在很大程度上取决于输入数据、其特征化过程和可能的相关偏差。此外，机器学习任务、（有监督/无监督）学习方法以及分子动力学数据的可能使用极大地影响了建模步骤的固有假设。定义适当的定量热力学和/或动力学评分（或标签）是建模过程的关键；在这里，我们对文献进行了修订，并提出将停留时间作为一种新颖的理想的可结合性指标。有趣的是，在设计预测因子时，绝大多数方法都没有考虑动力学或热力学。本文分类如下：

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Wiley Interdisciplinary Reviews: Computational Molecular Science CHEMISTRY, MULTIDISCIPLINARY-MATHEMATICAL & COMPUTATIONAL BIOLOGY

CiteScore

28.90

自引率

1.80%

发文量

审稿时长

6-12 weeks

期刊介绍： Computational molecular sciences harness the power of rigorous chemical and physical theories, employing computer-based modeling, specialized hardware, software development, algorithm design, and database management to explore and illuminate every facet of molecular sciences. These interdisciplinary approaches form a bridge between chemistry, biology, and materials sciences, establishing connections with adjacent application-driven fields in both chemistry and biology. WIREs Computational Molecular Science stands as a platform to comprehensively review and spotlight research from these dynamic and interconnected fields.