Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga, Juan A. Castillo-Garit, Ansel Y. Rodríguez-Gonzalez, Oscar Martínez-Santiago, Stephen J. Barigye, Julio Madera, Noel Enrique Rodríguez-Maya, Pablo Duchowicz
{"title":"Exploring blood–brain barrier passage using atomic weighted vector and machine learning","authors":"Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga, Juan A. Castillo-Garit, Ansel Y. Rodríguez-Gonzalez, Oscar Martínez-Santiago, Stephen J. Barigye, Julio Madera, Noel Enrique Rodríguez-Maya, Pablo Duchowicz","doi":"10.1007/s00894-024-06188-5","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><p>This study investigates the potential of leveraging molecular properties, as determined by MD-LOVIs software and machine learning techniques, to predict the ability of compounds to cross the blood–brain barrier (BBB). Accurate prediction of BBB permeation is critical for the development of central nervous system (CNS) drugs. The study applies various machine learning models, including both classification and regression techniques, to predict BBB passage and molecular activity. Notably, classification models such as GBM-AWV (accuracy = 0.801), GLM-CN (accuracy = 0.808), SVMPoly-means (accuracy = 0.980), SVMPoly-AC (accuracy = 0.980), SVMPoly-MI_TI_SI (accuracy = 0.900), SVMPoly-GI (accuracy = 0.900), RF-means (accuracy = 0.870), and GLM-means (accuracy = 0.818) demonstrate high accuracy in predicting BBB passage. In contrast, regression models like ES-RLM-AG (<i>R</i><sup>2</sup> = 0.902), IB-IBK (<i>R</i><sup>2</sup> = 0.82), IB-Kstar (<i>R</i><sup>2</sup> = 0.834), IB-MLP (<i>R</i><sup>2</sup> = 0.843), and DRF-AWV (<i>R</i><sup>2</sup> = 0.810) exhibit strong performance in predicting molecular activity. The results show that classification models like GBM-AWV, GLM-CN, and SVMPoly variants, as well as regression models like ES-RLM-AG and IB-MLP, achieve high performance, demonstrating the effectiveness of machine learning in predicting BBB permeability.</p><h3>Methods</h3><p>The computational methods employed in this study include the MD-LOVIs software for generating molecular descriptors and several machine learning algorithms, including gradient boosting machines (GBM), generalized linear models (GLM), support vector machines (SVM) with polynomial kernels, random forests (RF), ensemble regression models, and instance-based learning algorithms. These models were trained and validated using various datasets to predict BBB passage and molecular activity, with the performance metrics reported for each model. Standard computational techniques were employed throughout, ensuring the reliability of the predictions.</p><h3>Graphical Abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":651,"journal":{"name":"Journal of Molecular Modeling","volume":"30 11","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Modeling","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s00894-024-06188-5","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Context
This study investigates the potential of leveraging molecular properties, as determined by MD-LOVIs software and machine learning techniques, to predict the ability of compounds to cross the blood–brain barrier (BBB). Accurate prediction of BBB permeation is critical for the development of central nervous system (CNS) drugs. The study applies various machine learning models, including both classification and regression techniques, to predict BBB passage and molecular activity. Notably, classification models such as GBM-AWV (accuracy = 0.801), GLM-CN (accuracy = 0.808), SVMPoly-means (accuracy = 0.980), SVMPoly-AC (accuracy = 0.980), SVMPoly-MI_TI_SI (accuracy = 0.900), SVMPoly-GI (accuracy = 0.900), RF-means (accuracy = 0.870), and GLM-means (accuracy = 0.818) demonstrate high accuracy in predicting BBB passage. In contrast, regression models like ES-RLM-AG (R2 = 0.902), IB-IBK (R2 = 0.82), IB-Kstar (R2 = 0.834), IB-MLP (R2 = 0.843), and DRF-AWV (R2 = 0.810) exhibit strong performance in predicting molecular activity. The results show that classification models like GBM-AWV, GLM-CN, and SVMPoly variants, as well as regression models like ES-RLM-AG and IB-MLP, achieve high performance, demonstrating the effectiveness of machine learning in predicting BBB permeability.
Methods
The computational methods employed in this study include the MD-LOVIs software for generating molecular descriptors and several machine learning algorithms, including gradient boosting machines (GBM), generalized linear models (GLM), support vector machines (SVM) with polynomial kernels, random forests (RF), ensemble regression models, and instance-based learning algorithms. These models were trained and validated using various datasets to predict BBB passage and molecular activity, with the performance metrics reported for each model. Standard computational techniques were employed throughout, ensuring the reliability of the predictions.
期刊介绍:
The Journal of Molecular Modeling focuses on "hardcore" modeling, publishing high-quality research and reports. Founded in 1995 as a purely electronic journal, it has adapted its format to include a full-color print edition, and adjusted its aims and scope fit the fast-changing field of molecular modeling, with a particular focus on three-dimensional modeling.
Today, the journal covers all aspects of molecular modeling including life science modeling; materials modeling; new methods; and computational chemistry.
Topics include computer-aided molecular design; rational drug design, de novo ligand design, receptor modeling and docking; cheminformatics, data analysis, visualization and mining; computational medicinal chemistry; homology modeling; simulation of peptides, DNA and other biopolymers; quantitative structure-activity relationships (QSAR) and ADME-modeling; modeling of biological reaction mechanisms; and combined experimental and computational studies in which calculations play a major role.