{"title":"AChE和MAO-B在阿尔茨海默病中的双重抑制:机器学习方法和模型解释。","authors":"Qinghe Hou, Yan Li","doi":"10.1007/s11030-024-11061-x","DOIUrl":null,"url":null,"abstract":"<p><p>Alzheimer's disease (AD) is one of the most prevalent neurodegenerative diseases. Given the multifactorial pathophysiology of AD, monotargeted agents can only alleviate symptoms but not cure AD. Acetylcholinesterase (AChE) and Monoamine oxidase B (MAO-B) are two key targets in the treatment of AD, molecules that inhibiting both targets are considered promising avenue to develop more effective AD therapies. In the present work, a dual inhibition dataset containing 449 molecules was established, based on which five machine learning algorithms (KNN, SVM, RF, GBDT, and LGBM) four fingerprints (MACCS, ECFP4, RDKitFP, PubChemFP) and DRAGON descriptors were combined to develop 25 classification models in which GBDT paired with ECFP4 and RF paired with PubchemFP achieved the same best performance across multiple metrics (Accuracy = 0.92, F1 Score = 0.94, MCC = 0.81). Moreover, based on the curated bioactivity datasets of AChE and MAO-B, regression models were developed to predict pIC<sub>50</sub> values. For the AChE inhibition task, GBDT demonstrated the best performance (RMSE = 0.683, MAE = 0.500, R<sup>2</sup> = 0.721). The SVM algorithm emerged as the most effective for MAO-B inhibition (RMSE = 0.668, MAE = 0.507, R<sup>2</sup> = 0.675). The SHAP algorithm was used to interpret the optimal models, identifying and analyzing the key substructures and properties for both dual-target and single-target inhibitors. Moreover, molecules docking process provided potential mechanism and Structure-Activity Relationships (SAR) of dual-target inhibition further.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual inhibition of AChE and MAO-B in Alzheimer's disease: machine learning approaches and model interpretations.\",\"authors\":\"Qinghe Hou, Yan Li\",\"doi\":\"10.1007/s11030-024-11061-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Alzheimer's disease (AD) is one of the most prevalent neurodegenerative diseases. Given the multifactorial pathophysiology of AD, monotargeted agents can only alleviate symptoms but not cure AD. Acetylcholinesterase (AChE) and Monoamine oxidase B (MAO-B) are two key targets in the treatment of AD, molecules that inhibiting both targets are considered promising avenue to develop more effective AD therapies. In the present work, a dual inhibition dataset containing 449 molecules was established, based on which five machine learning algorithms (KNN, SVM, RF, GBDT, and LGBM) four fingerprints (MACCS, ECFP4, RDKitFP, PubChemFP) and DRAGON descriptors were combined to develop 25 classification models in which GBDT paired with ECFP4 and RF paired with PubchemFP achieved the same best performance across multiple metrics (Accuracy = 0.92, F1 Score = 0.94, MCC = 0.81). Moreover, based on the curated bioactivity datasets of AChE and MAO-B, regression models were developed to predict pIC<sub>50</sub> values. For the AChE inhibition task, GBDT demonstrated the best performance (RMSE = 0.683, MAE = 0.500, R<sup>2</sup> = 0.721). The SVM algorithm emerged as the most effective for MAO-B inhibition (RMSE = 0.668, MAE = 0.507, R<sup>2</sup> = 0.675). The SHAP algorithm was used to interpret the optimal models, identifying and analyzing the key substructures and properties for both dual-target and single-target inhibitors. Moreover, molecules docking process provided potential mechanism and Structure-Activity Relationships (SAR) of dual-target inhibition further.</p>\",\"PeriodicalId\":708,\"journal\":{\"name\":\"Molecular Diversity\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Diversity\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1007/s11030-024-11061-x\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-024-11061-x","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
摘要
阿尔茨海默病(AD)是最常见的神经退行性疾病之一。鉴于阿尔茨海默病的多因素病理生理,单靶向药物只能缓解症状而不能治愈阿尔茨海默病。乙酰胆碱酯酶(AChE)和单胺氧化酶B (MAO-B)是治疗AD的两个关键靶点,抑制这两个靶点的分子被认为是开发更有效的AD治疗方法的有希望的途径。本文建立了包含449个分子的双抑制数据集,在此基础上,结合5种机器学习算法(KNN、SVM、RF、GBDT和LGBM)、4种指纹(MACCS、ECFP4、RDKitFP、PubChemFP)和DRAGON描述符,建立了25个分类模型,其中GBDT与ECFP4配对、RF与PubChemFP配对在多个指标上取得了相同的最佳性能(Accuracy = 0.92, F1 Score = 0.94, MCC = 0.81)。此外,基于整理的AChE和MAO-B生物活性数据集,建立回归模型预测pIC50值。对于AChE抑制任务,GBDT表现最好(RMSE = 0.683, MAE = 0.500, R2 = 0.721)。SVM算法对MAO-B的抑制效果最好(RMSE = 0.668, MAE = 0.507, R2 = 0.675)。利用SHAP算法对优化模型进行解释,识别并分析了双靶点和单靶点抑制剂的关键子结构和性能。此外,分子对接过程进一步提供了双靶点抑制的潜在机制和构效关系。
Dual inhibition of AChE and MAO-B in Alzheimer's disease: machine learning approaches and model interpretations.
Alzheimer's disease (AD) is one of the most prevalent neurodegenerative diseases. Given the multifactorial pathophysiology of AD, monotargeted agents can only alleviate symptoms but not cure AD. Acetylcholinesterase (AChE) and Monoamine oxidase B (MAO-B) are two key targets in the treatment of AD, molecules that inhibiting both targets are considered promising avenue to develop more effective AD therapies. In the present work, a dual inhibition dataset containing 449 molecules was established, based on which five machine learning algorithms (KNN, SVM, RF, GBDT, and LGBM) four fingerprints (MACCS, ECFP4, RDKitFP, PubChemFP) and DRAGON descriptors were combined to develop 25 classification models in which GBDT paired with ECFP4 and RF paired with PubchemFP achieved the same best performance across multiple metrics (Accuracy = 0.92, F1 Score = 0.94, MCC = 0.81). Moreover, based on the curated bioactivity datasets of AChE and MAO-B, regression models were developed to predict pIC50 values. For the AChE inhibition task, GBDT demonstrated the best performance (RMSE = 0.683, MAE = 0.500, R2 = 0.721). The SVM algorithm emerged as the most effective for MAO-B inhibition (RMSE = 0.668, MAE = 0.507, R2 = 0.675). The SHAP algorithm was used to interpret the optimal models, identifying and analyzing the key substructures and properties for both dual-target and single-target inhibitors. Moreover, molecules docking process provided potential mechanism and Structure-Activity Relationships (SAR) of dual-target inhibition further.
期刊介绍:
Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including:
combinatorial chemistry and parallel synthesis;
small molecule libraries;
microwave synthesis;
flow synthesis;
fluorous synthesis;
diversity oriented synthesis (DOS);
nanoreactors;
click chemistry;
multiplex technologies;
fragment- and ligand-based design;
structure/function/SAR;
computational chemistry and molecular design;
chemoinformatics;
screening techniques and screening interfaces;
analytical and purification methods;
robotics, automation and miniaturization;
targeted libraries;
display libraries;
peptides and peptoids;
proteins;
oligonucleotides;
carbohydrates;
natural diversity;
new methods of library formulation and deconvolution;
directed evolution, origin of life and recombination;
search techniques, landscapes, random chemistry and more;