{"title":"A Robust Machine Learning Framework Built Upon Molecular Representations Predicts CYP450 Inhibition: Toward Precision in Drug Repurposing.","authors":"Sotiris Ouzounis, Vasilis Panagiotopoulos, Vivi Bafiti, Panagiotis Zoumpoulakis, Dionisis Cavouras, Ioannis Kalatzis, Minos-Timotheos Matsoukas, Theodora Katsila","doi":"10.1089/omi.2023.0075","DOIUrl":null,"url":null,"abstract":"<p><p>Human cytochrome P450 (CYP450) enzymes play a crucial role in drug metabolism and pharmacokinetics. CYP450 inhibition can lead to toxicity, in particular when drugs are co-administered with other drugs and xenobiotics or in the case of polypharmacy. Predicting CYP450 inhibition is also important for rational drug discovery and development, and precision in drug repurposing. In this overarching context, digital transformation of drug discovery and development, for example, using machine and deep learning approaches, offers prospects for prediction of CYP450 inhibition through computational models. We report here the development of a majority-voting machine learning framework to classify inhibitors and noninhibitors for seven major human liver CYP450 isoforms (CYP1A2, CYP2A6, CYP2B6, CYP2C9, CYP2C19, CYP2D6, and CYP3A4). For the machine learning models reported herein, we employed interaction fingerprints that were derived from molecular docking simulations, thus adding an additional layer of information for protein-ligand interactions. The proposed machine learning framework is based on the structure of the binding site of isoforms to produce predictions beyond previously reported approaches. Also, we carried out a comparative analysis so as to identify which representation of test compounds (molecular descriptors, molecular fingerprints, or protein-ligand interaction fingerprints) affects the predictive performance of the models. This work underlines the ways in which the structure of the enzyme catalytic site influences machine learning predictions and the need for robust frameworks toward better-informed predictions.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10357106/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1089/omi.2023.0075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 1
Abstract
Human cytochrome P450 (CYP450) enzymes play a crucial role in drug metabolism and pharmacokinetics. CYP450 inhibition can lead to toxicity, in particular when drugs are co-administered with other drugs and xenobiotics or in the case of polypharmacy. Predicting CYP450 inhibition is also important for rational drug discovery and development, and precision in drug repurposing. In this overarching context, digital transformation of drug discovery and development, for example, using machine and deep learning approaches, offers prospects for prediction of CYP450 inhibition through computational models. We report here the development of a majority-voting machine learning framework to classify inhibitors and noninhibitors for seven major human liver CYP450 isoforms (CYP1A2, CYP2A6, CYP2B6, CYP2C9, CYP2C19, CYP2D6, and CYP3A4). For the machine learning models reported herein, we employed interaction fingerprints that were derived from molecular docking simulations, thus adding an additional layer of information for protein-ligand interactions. The proposed machine learning framework is based on the structure of the binding site of isoforms to produce predictions beyond previously reported approaches. Also, we carried out a comparative analysis so as to identify which representation of test compounds (molecular descriptors, molecular fingerprints, or protein-ligand interaction fingerprints) affects the predictive performance of the models. This work underlines the ways in which the structure of the enzyme catalytic site influences machine learning predictions and the need for robust frameworks toward better-informed predictions.