Paulina Anna Wojtyło, Natalia Łapińska, Lucia Bellagamba, Emidio Camaioni, Aleksander Mendyk, Stefano Giovagnoli
{"title":"针对芳基烃受体激活剂的机器学习辅助自动预测工具的初步开发。","authors":"Paulina Anna Wojtyło, Natalia Łapińska, Lucia Bellagamba, Emidio Camaioni, Aleksander Mendyk, Stefano Giovagnoli","doi":"10.3390/pharmaceutics16111456","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background</b>: The aryl hydrocarbon receptor (AhR) plays a crucial role in immune and metabolic processes. The large molecular diversity of ligands capable of activating AhR makes it impossible to determine the structural features useful for the design of new potent modulators. Thus, in the field of drug discovery, the intricate nature of AhR activation necessitates the development of novel tools to address related challenges. <b>Methods</b>: In this study, quantitative structure-activity relationship (QSAR) models of classification and regression were developed with the objective of identifying the most effective method for predicting AhR activity. The initial dataset was obtained by combining the ChEMBL and WIPO databases which contained 978 molecules with EC<sub>50</sub> values. The predictive models were developed using the automated machine learning platform mljar according to a 10-fold cross validation (10-CV) testing procedure. <b>Results</b>: The classification model demonstrated an accuracy value of 0.760 and F1 value of 0.789 for the test set. The root-mean-squared error (RMSE) was 5444, and the coefficient of determination (R<sup>2</sup>) was 0.208 for the regression model. The Shapley Additive Explanations (SHAP) method was then employed for a deeper comprehension of the impact of the variables on the model's predictions. As a practical application for scientific purposes, the best performing classification model was then used to develop an AhR web application. This application is accessible online and has been implemented in Streamlit. <b>Conclusions</b>: The findings may serve as a foundation in prompting further research into the development of a QSAR model, which could enhance comprehension of the influence of ligand structure on the modulation of AhR activity.</p>","PeriodicalId":19894,"journal":{"name":"Pharmaceutics","volume":"16 11","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11597659/pdf/","citationCount":"0","resultStr":"{\"title\":\"Initial Development of Automated Machine Learning-Assisted Prediction Tools for Aryl Hydrocarbon Receptor Activators.\",\"authors\":\"Paulina Anna Wojtyło, Natalia Łapińska, Lucia Bellagamba, Emidio Camaioni, Aleksander Mendyk, Stefano Giovagnoli\",\"doi\":\"10.3390/pharmaceutics16111456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><b>Background</b>: The aryl hydrocarbon receptor (AhR) plays a crucial role in immune and metabolic processes. The large molecular diversity of ligands capable of activating AhR makes it impossible to determine the structural features useful for the design of new potent modulators. Thus, in the field of drug discovery, the intricate nature of AhR activation necessitates the development of novel tools to address related challenges. <b>Methods</b>: In this study, quantitative structure-activity relationship (QSAR) models of classification and regression were developed with the objective of identifying the most effective method for predicting AhR activity. The initial dataset was obtained by combining the ChEMBL and WIPO databases which contained 978 molecules with EC<sub>50</sub> values. The predictive models were developed using the automated machine learning platform mljar according to a 10-fold cross validation (10-CV) testing procedure. <b>Results</b>: The classification model demonstrated an accuracy value of 0.760 and F1 value of 0.789 for the test set. The root-mean-squared error (RMSE) was 5444, and the coefficient of determination (R<sup>2</sup>) was 0.208 for the regression model. The Shapley Additive Explanations (SHAP) method was then employed for a deeper comprehension of the impact of the variables on the model's predictions. As a practical application for scientific purposes, the best performing classification model was then used to develop an AhR web application. This application is accessible online and has been implemented in Streamlit. <b>Conclusions</b>: The findings may serve as a foundation in prompting further research into the development of a QSAR model, which could enhance comprehension of the influence of ligand structure on the modulation of AhR activity.</p>\",\"PeriodicalId\":19894,\"journal\":{\"name\":\"Pharmaceutics\",\"volume\":\"16 11\",\"pages\":\"\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11597659/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pharmaceutics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3390/pharmaceutics16111456\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/pharmaceutics16111456","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
Initial Development of Automated Machine Learning-Assisted Prediction Tools for Aryl Hydrocarbon Receptor Activators.
Background: The aryl hydrocarbon receptor (AhR) plays a crucial role in immune and metabolic processes. The large molecular diversity of ligands capable of activating AhR makes it impossible to determine the structural features useful for the design of new potent modulators. Thus, in the field of drug discovery, the intricate nature of AhR activation necessitates the development of novel tools to address related challenges. Methods: In this study, quantitative structure-activity relationship (QSAR) models of classification and regression were developed with the objective of identifying the most effective method for predicting AhR activity. The initial dataset was obtained by combining the ChEMBL and WIPO databases which contained 978 molecules with EC50 values. The predictive models were developed using the automated machine learning platform mljar according to a 10-fold cross validation (10-CV) testing procedure. Results: The classification model demonstrated an accuracy value of 0.760 and F1 value of 0.789 for the test set. The root-mean-squared error (RMSE) was 5444, and the coefficient of determination (R2) was 0.208 for the regression model. The Shapley Additive Explanations (SHAP) method was then employed for a deeper comprehension of the impact of the variables on the model's predictions. As a practical application for scientific purposes, the best performing classification model was then used to develop an AhR web application. This application is accessible online and has been implemented in Streamlit. Conclusions: The findings may serve as a foundation in prompting further research into the development of a QSAR model, which could enhance comprehension of the influence of ligand structure on the modulation of AhR activity.
PharmaceuticsPharmacology, Toxicology and Pharmaceutics-Pharmaceutical Science
CiteScore
7.90
自引率
11.10%
发文量
2379
审稿时长
16.41 days
期刊介绍:
Pharmaceutics (ISSN 1999-4923) is an open access journal which provides an advanced forum for the science and technology of pharmaceutics and biopharmaceutics. It publishes reviews, regular research papers, communications, and short notes. Covered topics include pharmacokinetics, toxicokinetics, pharmacodynamics, pharmacogenetics and pharmacogenomics, and pharmaceutical formulation. Our aim is to encourage scientists to publish their experimental and theoretical details in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced.