Bo Feng, Hui Yu, Xu Dong, Alejandro Díaz-Holguín, Huabin Hu
{"title":"Identification of bioactive compounds with popular single-atom modifications: comprehensive analysis and implications for compound design","authors":"Bo Feng, Hui Yu, Xu Dong, Alejandro Díaz-Holguín, Huabin Hu","doi":"10.1016/j.ejmech.2024.117051","DOIUrl":null,"url":null,"abstract":"The extensive bioactivity data available in public databases, such as ChEMBL, has facilitated in-depth structure-activity relationship (SAR) analyses, which are essential for understanding the impact of molecular modifications on biological activity. A central strategy in SAR analysis is the assessment of molecular similarity. Several approaches preferred by medicinal chemists have been developed to efficiently capture structurally related compounds on a large scale. Represented as a popular molecular editing strategy in hit-to-lead and lead optimization processes, we previously introduced four types of single-atom modifications (SAMs) and conducted a systematic analysis of their application in compound design. In this study, we expanded the analysis to cover 10 common SAMs, including carbon-nitrogen (N↔C), O↔C, N↔O, S↔O, as well as simpler modifications such as OH↔H, CH<sub>3</sub>↔H, and halogen-hydrogen (F, Cl, Br, I ↔ H) exchanges. Leveraging high-confidence bioactivity data from ChEMBL (version 34), we assembled the largest dataset of SAM pairs to date, comprising 374,979 pairs. Following an evaluation of the frequency of these SAM types in medicinal chemistry, we focused on SAM-induced activity cliffs (ACs), yielding over 7,400 ACs involving SAMs, substantially expanding the current knowledgebase of ACs associated with single-atom changes. Furthermore, structural analysis of these ACs, supported by experimental data, provides critical insights into the role of single-atom modifications in modulating compound activity, offering practical guidance for the structure-based optimization of molecular properties in drug development. As a result, we are providing open access to all identified ACs along with their associated structural information.","PeriodicalId":314,"journal":{"name":"European Journal of Medicinal Chemistry","volume":"16 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ejmech.2024.117051","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
The extensive bioactivity data available in public databases, such as ChEMBL, has facilitated in-depth structure-activity relationship (SAR) analyses, which are essential for understanding the impact of molecular modifications on biological activity. A central strategy in SAR analysis is the assessment of molecular similarity. Several approaches preferred by medicinal chemists have been developed to efficiently capture structurally related compounds on a large scale. Represented as a popular molecular editing strategy in hit-to-lead and lead optimization processes, we previously introduced four types of single-atom modifications (SAMs) and conducted a systematic analysis of their application in compound design. In this study, we expanded the analysis to cover 10 common SAMs, including carbon-nitrogen (N↔C), O↔C, N↔O, S↔O, as well as simpler modifications such as OH↔H, CH3↔H, and halogen-hydrogen (F, Cl, Br, I ↔ H) exchanges. Leveraging high-confidence bioactivity data from ChEMBL (version 34), we assembled the largest dataset of SAM pairs to date, comprising 374,979 pairs. Following an evaluation of the frequency of these SAM types in medicinal chemistry, we focused on SAM-induced activity cliffs (ACs), yielding over 7,400 ACs involving SAMs, substantially expanding the current knowledgebase of ACs associated with single-atom changes. Furthermore, structural analysis of these ACs, supported by experimental data, provides critical insights into the role of single-atom modifications in modulating compound activity, offering practical guidance for the structure-based optimization of molecular properties in drug development. As a result, we are providing open access to all identified ACs along with their associated structural information.
期刊介绍:
The European Journal of Medicinal Chemistry is a global journal that publishes studies on all aspects of medicinal chemistry. It provides a medium for publication of original papers and also welcomes critical review papers.
A typical paper would report on the organic synthesis, characterization and pharmacological evaluation of compounds. Other topics of interest are drug design, QSAR, molecular modeling, drug-receptor interactions, molecular aspects of drug metabolism, prodrug synthesis and drug targeting. The journal expects manuscripts to present the rational for a study, provide insight into the design of compounds or understanding of mechanism, or clarify the targets.