Sandra De La Torre , Sebastián A. Cuesta , Luis Calle , José R. Mora , Jose L. Paz , Patricio J. Espinoza-Montero , Máryury Flores-Sumoza , Edgar A. Márquez
{"title":"Computational approaches for lead compound discovery in dipeptidyl peptidase-4 inhibition using machine learning and molecular dynamics techniques","authors":"Sandra De La Torre , Sebastián A. Cuesta , Luis Calle , José R. Mora , Jose L. Paz , Patricio J. Espinoza-Montero , Máryury Flores-Sumoza , Edgar A. Márquez","doi":"10.1016/j.compbiolchem.2024.108145","DOIUrl":null,"url":null,"abstract":"<div><p>The prediction of possible lead compounds from already-known drugs that may present DPP-4 inhibition activity imply a advantage in the drug development in terms of time and cost to find alternative medicines for the treatment of Type 2 Diabetes Mellitus (T2DM). The inhibition of dipeptidyl peptidase-4 (DPP-4) has been one of the most explored strategies to develop potential drugs against this condition. A diverse dataset of molecules with known experimental inhibitory activity against DPP-4 was constructed and used to develop predictive models using different machine-learning algorithms. Model M36 is the most promising one based on the internal and external performance showing values of Q<sup>2</sup> <sub>CV</sub> = 0.813, and Q<sup>2</sup> <sub>EXT</sub> = 0.803. The applicability domain evaluation and Tropsha's analysis were conducted to validate M36, indicating its robustness and accuracy in predicting pIC<sub>50</sub> values for organic molecules within the established domain. The physicochemical properties of the ligands, including electronegativity, polarizability, and van der Waals volume were relevant to predict the inhibition process. The model was then employed in the virtual screening of potential DPP4 inhibitors, finding 448 compounds from the DrugBank and 9 from DiaNat with potential inhibitory activity. Molecular docking and molecular dynamics simulations were used to get insight into the ligand-protein interaction. From the screening and the favorable molecular dynamic results, several compounds including Skimmin (pIC<sub>50</sub> = 3.54, Binding energy = −8.86 kcal/mol), bergenin (pIC<sub>50</sub> = 2.69, Binding energy = −13.90 kcal/mol), and DB07272 (pIC<sub>50</sub> = 3.97, Binding energy = −25.28 kcal/mol) seem to be promising hits to be tested and optimized in the treatment of T2DM. This results imply a important reduction in cost and time on the application of this drugs because all the information about the its metabolism is already available.</p></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927124001336","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The prediction of possible lead compounds from already-known drugs that may present DPP-4 inhibition activity imply a advantage in the drug development in terms of time and cost to find alternative medicines for the treatment of Type 2 Diabetes Mellitus (T2DM). The inhibition of dipeptidyl peptidase-4 (DPP-4) has been one of the most explored strategies to develop potential drugs against this condition. A diverse dataset of molecules with known experimental inhibitory activity against DPP-4 was constructed and used to develop predictive models using different machine-learning algorithms. Model M36 is the most promising one based on the internal and external performance showing values of Q2CV = 0.813, and Q2EXT = 0.803. The applicability domain evaluation and Tropsha's analysis were conducted to validate M36, indicating its robustness and accuracy in predicting pIC50 values for organic molecules within the established domain. The physicochemical properties of the ligands, including electronegativity, polarizability, and van der Waals volume were relevant to predict the inhibition process. The model was then employed in the virtual screening of potential DPP4 inhibitors, finding 448 compounds from the DrugBank and 9 from DiaNat with potential inhibitory activity. Molecular docking and molecular dynamics simulations were used to get insight into the ligand-protein interaction. From the screening and the favorable molecular dynamic results, several compounds including Skimmin (pIC50 = 3.54, Binding energy = −8.86 kcal/mol), bergenin (pIC50 = 2.69, Binding energy = −13.90 kcal/mol), and DB07272 (pIC50 = 3.97, Binding energy = −25.28 kcal/mol) seem to be promising hits to be tested and optimized in the treatment of T2DM. This results imply a important reduction in cost and time on the application of this drugs because all the information about the its metabolism is already available.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.