首页 > 最新文献

SAR and QSAR in Environmental Research最新文献

英文 中文
Network-based clustering and statistical evaluation to elucidate structure-activity relationships of EZH2 inhibitors. 基于网络聚类和统计评价阐明EZH2抑制剂的构效关系。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-09-01 Epub Date: 2025-10-20 DOI: 10.1080/1062936X.2025.2569865
Danishuddin, M A Haque, G Madhukar, S Khan, Q M S Jamal, S Srivastava, J J Kim, K Ahmad

Enhancer of Zeste Homolog 2 (EZH2) inhibitors have demonstrated selective efficacy, but their broader therapeutic potential remains limited, highlighting the need to clarify the structural basis of their activity. The central aim of our study is to systematically analyse the structural diversity and activity patterns of known EZH2 inhibitors to provide insights that may guide incremental scaffold optimization. We examined 531 potential EZH2 inhibitors retrieved from ChEMBL through a cheminformatics workflow encompassing clustering, scaffold identification, activity cliff detection, and chemical space visualization. Using RDKit and NetworkX, 94 clusters were generated, of which 13 contained ten or more compounds. Notably, clusters 6, 16, 20, 21, and 31 exhibited favourable balances of structural homogeneity and enrichment scores, suggesting chemical cohesiveness and biological relevance for structure - activity relationship (SAR) prioritization. Statistical analyses revealed significant differences in mean pIC50 values across clusters, underscoring distinct activity distributions linked to structural groups. Scaffold analysis highlighted pyrrole - benzamide derivatives, particularly those incorporating morpholine and piperidine motifs, as enriched among potent inhibitors. Substructure evaluation further indicated that aromatic rings and aromatic amine groups were positively correlated with bioactivity. These findings delineate key SAR features of EZH2 inhibitors and provide guidance for scaffold refinement, hit identification, and lead optimization.

Zeste Homolog 2的增强子(Enhancer of Zeste Homolog 2, EZH2)抑制剂已显示出选择性疗效,但其更广泛的治疗潜力仍然有限,这突出表明需要澄清其活性的结构基础。我们研究的中心目标是系统地分析已知EZH2抑制剂的结构多样性和活性模式,以提供可能指导增量支架优化的见解。我们通过化学信息学工作流程,包括聚类、支架鉴定、活性悬崖检测和化学空间可视化,研究了从ChEMBL中检索到的531种潜在的EZH2抑制剂。使用RDKit和NetworkX,生成了94个簇,其中13个包含10个或更多的化合物。值得注意的是,集群6、16、20、21和31表现出良好的结构均匀性和富集分数平衡,表明结构-活性关系(SAR)优先级的化学内聚性和生物学相关性。统计分析显示,聚类之间的平均pIC50值存在显著差异,强调了与结构组相关的不同活动分布。脚手架分析强调了吡咯-苯酰胺衍生物,特别是那些含有morpholine和哌啶基序的衍生物,在强效抑制剂中富集。亚结构评价进一步表明,芳香环和芳香胺基团与生物活性呈正相关。这些发现描述了EZH2抑制剂的关键SAR特征,并为支架优化、命中识别和导联优化提供了指导。
{"title":"Network-based clustering and statistical evaluation to elucidate structure-activity relationships of EZH2 inhibitors.","authors":"Danishuddin, M A Haque, G Madhukar, S Khan, Q M S Jamal, S Srivastava, J J Kim, K Ahmad","doi":"10.1080/1062936X.2025.2569865","DOIUrl":"10.1080/1062936X.2025.2569865","url":null,"abstract":"<p><p>Enhancer of Zeste Homolog 2 (EZH2) inhibitors have demonstrated selective efficacy, but their broader therapeutic potential remains limited, highlighting the need to clarify the structural basis of their activity. The central aim of our study is to systematically analyse the structural diversity and activity patterns of known EZH2 inhibitors to provide insights that may guide incremental scaffold optimization. We examined 531 potential EZH2 inhibitors retrieved from ChEMBL through a cheminformatics workflow encompassing clustering, scaffold identification, activity cliff detection, and chemical space visualization. Using RDKit and NetworkX, 94 clusters were generated, of which 13 contained ten or more compounds. Notably, clusters 6, 16, 20, 21, and 31 exhibited favourable balances of structural homogeneity and enrichment scores, suggesting chemical cohesiveness and biological relevance for structure - activity relationship (SAR) prioritization. Statistical analyses revealed significant differences in mean pIC<sub>50</sub> values across clusters, underscoring distinct activity distributions linked to structural groups. Scaffold analysis highlighted pyrrole - benzamide derivatives, particularly those incorporating morpholine and piperidine motifs, as enriched among potent inhibitors. Substructure evaluation further indicated that aromatic rings and aromatic amine groups were positively correlated with bioactivity. These findings delineate key SAR features of EZH2 inhibitors and provide guidance for scaffold refinement, hit identification, and lead optimization.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"827-851"},"PeriodicalIF":2.3,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145329837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural characterization of length-varying peptide sequences for peptide quantitative structure-activity relationship. 长度变化多肽序列的结构表征,用于多肽定量构效关系。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-08-01 Epub Date: 2025-09-10 DOI: 10.1080/1062936X.2025.2552141
Y Zhang, K Li, Y Gan, P Zhou

Peptide quantitative structure-activity relationship (pQSAR) has been widely used in the computational peptidology community to model, predict and explain the activity and function of bioactive peptides. Various amino acid descriptors (AADs) have been developed to characterize the residue building blocks of peptides at sequence level. However, a significant issue is that the total number of AAD-characterized descriptors is proportional to peptide length, thus causing inconsistency in the resulting descriptor vector matrix for a panel of length-varying peptide sequences (LVPSs), which cannot be engaged in pQSAR modelling. Currently, only one AAD-based scaling approach, termed auto-cross covariance (ACC) that was proposed thirty years ago, is available for treating such issue. In this study, we described the second AAD-based multivariate method to do so, namely Residue Descriptor-Distance Vector (RDDV). The strategy characterizes a peptide sequence by using an inter-residue pseudo-interaction potential between different pre-assigned amino acid types involved in the sequence, which results in a given (invariable) number of descriptor parameters for different LVPSs. Here, the RDDV was tested, examined and validated in an in-house pQSAR-oriented bioactive peptide data cluster, which was explored systematically with combinations of different AADs and regression tools. We also compared RDDV with the traditional ACC in multiple aspects.

肽定量构效关系(pQSAR)已广泛应用于计算肽学界,用于模拟、预测和解释生物活性肽的活性和功能。各种氨基酸描述符(AADs)已被开发用于在序列水平上表征肽的残基构建块。然而,一个重要的问题是,aad特征描述符的总数与肽长度成正比,从而导致长度变化肽序列(lvps)面板的描述符向量矩阵不一致,这不能用于pQSAR建模。目前,只有一种基于aad的标度方法,即30年前提出的自动交叉协方差(ACC),可用于处理此类问题。在本研究中,我们描述了第二种基于aad的多变量方法,即残差描述-距离向量(RDDV)。该策略通过在序列中涉及的不同预先分配的氨基酸类型之间使用残基间伪相互作用势来表征肽序列,从而为不同的lvps提供给定(不变)数量的描述符参数。在这里,RDDV在内部面向pqsar的生物活性肽数据簇中进行了测试、检验和验证,并通过不同的AADs和回归工具的组合进行了系统的探索。我们还从多个方面对RDDV与传统ACC进行了比较。
{"title":"Structural characterization of length-varying peptide sequences for peptide quantitative structure-activity relationship.","authors":"Y Zhang, K Li, Y Gan, P Zhou","doi":"10.1080/1062936X.2025.2552141","DOIUrl":"https://doi.org/10.1080/1062936X.2025.2552141","url":null,"abstract":"<p><p>Peptide quantitative structure-activity relationship (pQSAR) has been widely used in the computational peptidology community to model, predict and explain the activity and function of bioactive peptides. Various amino acid descriptors (AADs) have been developed to characterize the residue building blocks of peptides at sequence level. However, a significant issue is that the total number of AAD-characterized descriptors is proportional to peptide length, thus causing inconsistency in the resulting descriptor vector matrix for a panel of length-varying peptide sequences (LVPSs), which cannot be engaged in pQSAR modelling. Currently, only one AAD-based scaling approach, termed auto-cross covariance (ACC) that was proposed thirty years ago, is available for treating such issue. In this study, we described the second AAD-based multivariate method to do so, namely Residue Descriptor-Distance Vector (RDDV). The strategy characterizes a peptide sequence by using an inter-residue pseudo-interaction potential between different pre-assigned amino acid types involved in the sequence, which results in a given (invariable) number of descriptor parameters for different LVPSs. Here, the RDDV was tested, examined and validated in an in-house pQSAR-oriented bioactive peptide data cluster, which was explored systematically with combinations of different AADs and regression tools. We also compared RDDV with the traditional ACC in multiple aspects.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 8","pages":"727-751"},"PeriodicalIF":2.3,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145030613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unravelling phosphorylation-induced impacts on inhibitor-CDK2 through multiple independent molecular dynamics simulations and deep learning. 通过多个独立的分子动力学模拟和深度学习揭示磷酸化诱导对cdk2抑制剂的影响。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-08-01 Epub Date: 2025-09-10 DOI: 10.1080/1062936X.2025.2552131
W Zhang, G Xu, X Li, J Cong, P Wang, Y Xu, B Wei

Phosphorylation plays an important role in the activity of CDK2 and inhibitor binding, but the corresponding molecular mechanism is still insufficiently known. To address this gap, the current study innovatively integrates molecular dynamics (MD) simulations, deep learning (DL) techniques, and free energy landscape (FEL) analysis to systematically explore the action mechanisms of two inhibitors (SCH and CYC) when CDK2 is in a phosphorylated state and bound state of CyclinE. With the help of MD trajectory-based DL, key functional domains such as the loops L3 loop and L7 are successfully identified. The results of FEL analysis show that the binding of CyclinE significantly enhances conformational stability of key functional regions of CDK2 (such as the L3 loop, L7 loop, and αC helix), while phosphorylation modification increases conformational diversity of the CDK2-related system. Further verification by quantum mechanics/molecular mechanics-generalized Born surface area (QM/MM-GBSA) calculations shows that binding of CyclinE can enhance the binding ability of inhibitors, while phosphorylation weakens this binding effect. Residue-based free energy estimation reveals the hot spot regions of inhibitor-CDK2 binding, providing crucial target information for structure-based drug design. This study provides theoretical foundations for the development of highly selective CDK2 inhibitors and might be of great significance for cancer targeted therapy.

磷酸化在CDK2活性和抑制剂结合中起重要作用,但其分子机制尚不清楚。为了解决这一空白,本研究创新性地整合了分子动力学(MD)模拟、深度学习(DL)技术和自由能景观(FEL)分析,系统地探索了CDK2处于磷酸化状态和CyclinE结合状态时两种抑制剂(SCH和CYC)的作用机制。借助基于MD轨迹的深度学习,成功地识别出了环L3环和环L7等关键功能域。FEL分析结果表明,CyclinE的结合显著增强了CDK2关键功能区(如L3环、L7环和αC螺旋)的构象稳定性,而磷酸化修饰增加了CDK2相关系统的构象多样性。通过量子力学/分子力学-广义Born表面积(QM/MM-GBSA)计算进一步验证,CyclinE的结合可以增强抑制剂的结合能力,而磷酸化会减弱这种结合作用。基于残基的自由能估计揭示了抑制剂- cdk2结合的热点区域,为基于结构的药物设计提供了关键的靶点信息。本研究为开发高选择性CDK2抑制剂提供了理论基础,对肿瘤靶向治疗具有重要意义。
{"title":"Unravelling phosphorylation-induced impacts on inhibitor-CDK2 through multiple independent molecular dynamics simulations and deep learning.","authors":"W Zhang, G Xu, X Li, J Cong, P Wang, Y Xu, B Wei","doi":"10.1080/1062936X.2025.2552131","DOIUrl":"https://doi.org/10.1080/1062936X.2025.2552131","url":null,"abstract":"<p><p>Phosphorylation plays an important role in the activity of CDK2 and inhibitor binding, but the corresponding molecular mechanism is still insufficiently known. To address this gap, the current study innovatively integrates molecular dynamics (MD) simulations, deep learning (DL) techniques, and free energy landscape (FEL) analysis to systematically explore the action mechanisms of two inhibitors (SCH and CYC) when CDK2 is in a phosphorylated state and bound state of CyclinE. With the help of MD trajectory-based DL, key functional domains such as the loops L3 loop and L7 are successfully identified. The results of FEL analysis show that the binding of CyclinE significantly enhances conformational stability of key functional regions of CDK2 (such as the L3 loop, L7 loop, and αC helix), while phosphorylation modification increases conformational diversity of the CDK2-related system. Further verification by quantum mechanics/molecular mechanics-generalized Born surface area (QM/MM-GBSA) calculations shows that binding of CyclinE can enhance the binding ability of inhibitors, while phosphorylation weakens this binding effect. Residue-based free energy estimation reveals the hot spot regions of inhibitor-CDK2 binding, providing crucial target information for structure-based drug design. This study provides theoretical foundations for the development of highly selective CDK2 inhibitors and might be of great significance for cancer targeted therapy.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 8","pages":"673-700"},"PeriodicalIF":2.3,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145030620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing the potential of phytochemicals to design anti-filarial molecules targeting the MurE enzyme of Brugia malayi: a hierarchical virtual screening and molecular dynamics simulation study. 利用植物化学物质的潜力设计针对马来棕树MurE酶的抗丝虫分子:分层虚拟筛选和分子动力学模拟研究。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-08-01 Epub Date: 2025-09-10 DOI: 10.1080/1062936X.2025.2556512
D Prabhu, M Sureshan, S Rajamanikandan, J Jeyakanthan

Brugia malayi, a causative agent of lymphatic filariasis, relies on its endosymbiont Wolbachia for survival. MurE ligase, a key enzyme in Wolbachia peptidoglycan biosynthesis, serves as a promising drug target for anti-filarial therapy. In this study, we employed a hierarchical virtual screening pipeline to identify phytochemical inhibitors targeting the MurE enzyme of the Wolbachia endosymbiont of B. malayi (wBmMurE). A validated high-quality model of wBmMurE was used to screen 17,967 phytochemicals, and the identified hits were subjected to toxicity profiling, and ADME filters to select potent drug-like candidates. Five phytochemicals such as biotin, quisqualic acid, succinic acid, 9,14-dihydroxyoctadecanoic acid, and N-isovaleroylglycine with permissible ADME profiles showed favourable binding affinities (GlideScore range: -12.86 to -10.57 kcal/mol), and stable interactions with catalytically important residues were selected from screened hits. Comparative analysis with reported MurE inhibitors validated the superior affinity and drug-like behaviour of our identified leads. Molecular dynamics simulations of 300 ns confirmed the conformational stability of ligand-bound complexes, while MM-GBSA analysis supported their favourable binding free energies. The results revealed that the identified compounds have the tendency of binding within substrate binding cavity of wBmMurE. These findings suggest that selected phytochemicals could serve as starting points for the development of novel anti-filarial agents.

马来布鲁贾菌是淋巴丝虫病的病原体,依靠其内共生体沃尔巴克氏体生存。MurE连接酶是沃尔巴克氏菌肽聚糖生物合成的关键酶,是抗丝虫治疗的一个有前景的药物靶点。在这项研究中,我们采用分层虚拟筛选管道来鉴定针对马来芽孢杆菌沃尔巴克氏体内共生菌(wBmMurE)的MurE酶的植物化学抑制剂。一个经过验证的高质量wBmMurE模型被用于筛选17,967种植物化学物质,并对确定的命中进行毒性分析,并通过ADME过滤器选择有效的药物样候选物。生物素、准质酸、琥珀酸、9,14-二羟基十八烷酸和n -异戊酰甘氨酸等5种具有允许ADME谱的植物化学物质显示出良好的结合亲和力(GlideScore范围:-12.86至-10.57 kcal/mol),并且从筛选的位点中选择出与催化重要残基稳定的相互作用。与已报道的MurE抑制剂的比较分析证实了我们确定的线索具有优越的亲和力和药物样行为。300 ns的分子动力学模拟证实了配体结合配合物的构象稳定性,而MM-GBSA分析则支持其良好的结合自由能。结果表明,所鉴定的化合物在wBmMurE的底物结合腔内具有结合倾向。这些发现表明,选定的植物化学物质可以作为开发新型抗丝虫药的起点。
{"title":"Harnessing the potential of phytochemicals to design anti-filarial molecules targeting the MurE enzyme of <i>Brugia malayi</i>: a hierarchical virtual screening and molecular dynamics simulation study.","authors":"D Prabhu, M Sureshan, S Rajamanikandan, J Jeyakanthan","doi":"10.1080/1062936X.2025.2556512","DOIUrl":"https://doi.org/10.1080/1062936X.2025.2556512","url":null,"abstract":"<p><p><i>Brugia malayi</i>, a causative agent of lymphatic filariasis, relies on its endosymbiont <i>Wolbachia</i> for survival. MurE ligase, a key enzyme in <i>Wolbachia</i> peptidoglycan biosynthesis, serves as a promising drug target for anti-filarial therapy. In this study, we employed a hierarchical virtual screening pipeline to identify phytochemical inhibitors targeting the MurE enzyme of the <i>Wolbachia</i> endosymbiont of <i>B. malayi</i> (<i>wBm</i>MurE). A validated high-quality model of <i>wBm</i>MurE was used to screen 17,967 phytochemicals, and the identified hits were subjected to toxicity profiling, and ADME filters to select potent drug-like candidates. Five phytochemicals such as biotin, quisqualic acid, succinic acid, 9,14-dihydroxyoctadecanoic acid, and <i>N</i>-isovaleroylglycine with permissible ADME profiles showed favourable binding affinities (GlideScore range: -12.86 to -10.57 kcal/mol), and stable interactions with catalytically important residues were selected from screened hits. Comparative analysis with reported MurE inhibitors validated the superior affinity and drug-like behaviour of our identified leads. Molecular dynamics simulations of 300 ns confirmed the conformational stability of ligand-bound complexes, while MM-GBSA analysis supported their favourable binding free energies. The results revealed that the identified compounds have the tendency of binding within substrate binding cavity of <i>wBm</i>MurE. These findings suggest that selected phytochemicals could serve as starting points for the development of novel anti-filarial agents.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 8","pages":"753-773"},"PeriodicalIF":2.3,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145030666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
First report on machine learning based multiclass classification of Caco-2 permeability using different balancing strategies. 基于机器学习的Caco-2渗透率多类分类研究
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-08-01 Epub Date: 2025-09-08 DOI: 10.1080/1062936X.2025.2552134
I Dasgupta, S Gayen

Evaluating the permeability of different molecular structures across the Caco-2 cell line is crucial for drug discovery and development. The present study primarily focuses on developing machine learning-based multiclass classification models for predicting the permeability of molecules across the Caco-2 cell line. However, the class imbalance in permeability datasets poses a significant challenge for developing predictive models in the case of multiclass analysis. To address the class imbalance issue, we employed different balancing strategies, including oversampling, undersampling, and hybrid approaches, to balance the training set. A five-fold cross-validation approach was employed for optimizing the hyperparameters. After completion of the evaluation process, we concluded that the XGBoost multiclass classifier trained with ADASYN oversampling achieved the best performance (accuracy, 0.717; MCC, 0.512 on the test set). Additionally, extreme permeability classes were also classified separately, and the best model exhibited strong predictive performance (accuracy, 0.853; MCC, 0.703 on the test set). To enhance the interpretability of the best-performing models, we performed SHAP analysis to elucidate descriptor importance and provide explainability. Our findings demonstrate that appropriate data balancing strategies can significantly improve predictive performance in multiclass permeability classification, offering a valuable framework for drug permeability assessment.

评估不同分子结构在Caco-2细胞系上的通透性对药物的发现和开发至关重要。目前的研究主要集中在开发基于机器学习的多类分类模型,用于预测Caco-2细胞系中分子的通透性。然而,渗透率数据集的类别不平衡对开发多类别分析的预测模型提出了重大挑战。为了解决类不平衡问题,我们采用了不同的平衡策略,包括过采样、欠采样和混合方法来平衡训练集。采用五重交叉验证法对超参数进行优化。在完成评估过程后,我们得出结论,使用ADASYN过采样训练的XGBoost多类分类器的性能最好(准确率为0.717;MCC在测试集上为0.512)。此外,还对极端渗透率进行了分类,最佳模型具有较强的预测性能(准确率为0.853,MCC为0.703)。为了提高最佳表现模型的可解释性,我们进行了SHAP分析,以阐明描述符的重要性并提供可解释性。我们的研究结果表明,适当的数据平衡策略可以显著提高多类别渗透率分类的预测性能,为药物渗透率评估提供了一个有价值的框架。
{"title":"First report on machine learning based multiclass classification of Caco-2 permeability using different balancing strategies.","authors":"I Dasgupta, S Gayen","doi":"10.1080/1062936X.2025.2552134","DOIUrl":"10.1080/1062936X.2025.2552134","url":null,"abstract":"<p><p>Evaluating the permeability of different molecular structures across the Caco-2 cell line is crucial for drug discovery and development. The present study primarily focuses on developing machine learning-based multiclass classification models for predicting the permeability of molecules across the Caco-2 cell line. However, the class imbalance in permeability datasets poses a significant challenge for developing predictive models in the case of multiclass analysis. To address the class imbalance issue, we employed different balancing strategies, including oversampling, undersampling, and hybrid approaches, to balance the training set. A five-fold cross-validation approach was employed for optimizing the hyperparameters. After completion of the evaluation process, we concluded that the XGBoost multiclass classifier trained with ADASYN oversampling achieved the best performance (accuracy, 0.717; MCC, 0.512 on the test set). Additionally, extreme permeability classes were also classified separately, and the best model exhibited strong predictive performance (accuracy, 0.853; MCC, 0.703 on the test set). To enhance the interpretability of the best-performing models, we performed SHAP analysis to elucidate descriptor importance and provide explainability. Our findings demonstrate that appropriate data balancing strategies can significantly improve predictive performance in multiclass permeability classification, offering a valuable framework for drug permeability assessment.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"701-725"},"PeriodicalIF":2.3,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145016134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ToxAI_assistant: a web platform for a comprehensive study of the acute toxicity of xenobiotics following oral and intravenous administration in rats. ToxAI_assistant:一个综合研究大鼠口服和静脉给药后外源性药物急性毒性的网络平台。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-08-05 DOI: 10.1080/1062936X.2025.2535606
O V Tinkov, V Y Grigorev

The existing QSAR approaches for mammalian acute toxicity have been limited in scope, often relying on small or narrowly focused datasets and on classification endpoints. In contrast, our work leverages a sufficiently large curated dataset (9843 rat oral and 2323 intravenous LD50 values) to build regression models of acute toxicity. The best-performing QSAR models developed using 2D RDKit descriptors and the Cat Boost method achieve Q2 test = 0.66 at a data coverage of at least 77% within the applicability domain (AD) during validation of test sets. All models were rigorously validated according to OECD QSAR principles with clearly defined endpoints, explicit algorithms, and a well-characterized AD. The best QSAR models are integrated into the ToxAI_assistant web platform (https://tox-ai-assistant.streamlit.app/), which includes toxicity-level prediction with allowance for AD in terms of World Health Organization (WHO). We also provide mechanistic insight by identifying key toxicophores - substructural features statistically associated with high toxicity - thereby offering a structural interpretation. In sum, these elements (large and diverse data, regression modelling, WHO-based categorization, detailed fragment analysis, and AD assessment) together address the gaps of earlier studies and constitute the core novelty of our approach.

哺乳动物急性毒性的现有QSAR方法范围有限,通常依赖于小型或狭窄的数据集和分类终点。相比之下,我们的工作利用了一个足够大的整理数据集(9843只大鼠口服和2323只静脉注射LD50值)来构建急性毒性的回归模型。在测试集验证期间,使用2D RDKit描述符和Cat Boost方法开发的性能最好的QSAR模型在适用性域(AD)内的数据覆盖率至少为77%时实现Q2测试= 0.66。所有模型都根据OECD QSAR原则进行了严格验证,具有明确定义的终点,明确的算法和良好表征的AD。最好的QSAR模型被整合到ToxAI_assistant网络平台(https://tox-ai-assistant.streamlit.app/),其中包括根据世界卫生组织(WHO)考虑AD的毒性水平预测。我们还通过识别关键的毒性基团-统计上与高毒性相关的亚结构特征-从而提供结构解释,从而提供机制见解。总而言之,这些要素(庞大而多样的数据、回归模型、基于世卫组织的分类、详细的片段分析和AD评估)共同弥补了早期研究的空白,构成了我们方法的核心新颖性。
{"title":"ToxAI_assistant: a web platform for a comprehensive study of the acute toxicity of xenobiotics following oral and intravenous administration in rats.","authors":"O V Tinkov, V Y Grigorev","doi":"10.1080/1062936X.2025.2535606","DOIUrl":"10.1080/1062936X.2025.2535606","url":null,"abstract":"<p><p>The existing QSAR approaches for mammalian acute toxicity have been limited in scope, often relying on small or narrowly focused datasets and on classification endpoints. In contrast, our work leverages a sufficiently large curated dataset (9843 rat oral and 2323 intravenous LD<sub>50</sub> values) to build regression models of acute toxicity. The best-performing QSAR models developed using 2D RDKit descriptors and the Cat Boost method achieve <i>Q</i><sup>2</sup> <sub>test</sub> = 0.66 at a data coverage of at least 77% within the applicability domain (AD) during validation of test sets. All models were rigorously validated according to OECD QSAR principles with clearly defined endpoints, explicit algorithms, and a well-characterized AD. The best QSAR models are integrated into the ToxAI_assistant web platform (https://tox-ai-assistant.streamlit.app/), which includes toxicity-level prediction with allowance for AD in terms of World Health Organization (WHO). We also provide mechanistic insight by identifying key toxicophores - substructural features statistically associated with high toxicity - thereby offering a structural interpretation. In sum, these elements (large and diverse data, regression modelling, WHO-based categorization, detailed fragment analysis, and AD assessment) together address the gaps of earlier studies and constitute the core novelty of our approach.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"555-582"},"PeriodicalIF":2.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144785180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Targeting MAO-B selectivity: computational screening, docking, and molecular dynamics insights. 靶向MAO-B选择性:计算筛选,对接和分子动力学见解。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-08-12 DOI: 10.1080/1062936X.2025.2537248
K-M Thai, D-T Pham, T-M Ngo, H-T Nguyen, P-V Nguyen, T-Q Pham, D-N Nguyen, Q-T Nguyen, M-T Le

Monoamine oxidase B (MAO-B) is a key target in Parkinson's disease treatment due to its role in dopamine metabolism. This study applied a multi-stage in silico workflow - combining 3D-pharmacophore modelling, 2D-QSAR, ADMET filtering, docking, molecular dynamics (MD), and MM/PBSA analysis - to identify selective MAO-B inhibitors. From four datasets including ZINC, DrugBank, TCM, and UNPD, 22 top candidates were selected based on docking scores and predicted selectivity over MAO-A. MD simulations (200 ns) and binding free energy calculations identified four promising compounds - ZINC21285023, ZINC79651118, ZINC58283019, and UNPD89644 (crotafuran E)- that exhibited stable binding and favourable interactions with key residues such as Cys172 and Tyr435. These compounds demonstrated performance comparable to or better than safinamide and are strong candidates for further experimental validation as selective MAO-B inhibitors.

单胺氧化酶B (MAO-B)因其在多巴胺代谢中的作用而成为帕金森病治疗的关键靶点。本研究采用了多阶段的硅片工作流程——结合3d药效团建模、2D-QSAR、ADMET滤波、对接、分子动力学(MD)和MM/PBSA分析——来鉴定选择性MAO-B抑制剂。从ZINC、DrugBank、TCM和UNPD 4个数据集中,根据对接分数和对MAO-A的预测选择性,选出22个最佳候选数据。MD模拟(200 ns)和结合自由能计算确定了四种有前景的化合物ZINC21285023、ZINC79651118、ZINC58283019和UNPD89644 (crotafuran E),它们与Cys172和Tyr435等关键残基具有稳定的结合和良好的相互作用。这些化合物表现出与沙芬酰胺相当或更好的性能,是进一步实验验证选择性MAO-B抑制剂的有力候选者。
{"title":"Targeting MAO-B selectivity: computational screening, docking, and molecular dynamics insights.","authors":"K-M Thai, D-T Pham, T-M Ngo, H-T Nguyen, P-V Nguyen, T-Q Pham, D-N Nguyen, Q-T Nguyen, M-T Le","doi":"10.1080/1062936X.2025.2537248","DOIUrl":"10.1080/1062936X.2025.2537248","url":null,"abstract":"<p><p>Monoamine oxidase B (MAO-B) is a key target in Parkinson's disease treatment due to its role in dopamine metabolism. This study applied a multi-stage in silico workflow - combining 3D-pharmacophore modelling, 2D-QSAR, ADMET filtering, docking, molecular dynamics (MD), and MM/PBSA analysis - to identify selective MAO-B inhibitors. From four datasets including ZINC, DrugBank, TCM, and UNPD, 22 top candidates were selected based on docking scores and predicted selectivity over MAO-A. MD simulations (200 ns) and binding free energy calculations identified four promising compounds - ZINC21285023, ZINC79651118, ZINC58283019, and UNPD89644 (crotafuran E)- that exhibited stable binding and favourable interactions with key residues such as Cys172 and Tyr435. These compounds demonstrated performance comparable to or better than safinamide and are strong candidates for further experimental validation as selective MAO-B inhibitors.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"583-619"},"PeriodicalIF":2.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144837575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning-based q-RASAR modelling for the in silico identification of novel α7nAChR agonists for anti-Alzheimer's drug discovery. 基于机器学习的q-RASAR模型用于新型α7nAChR激动剂的计算机识别,用于抗阿尔茨海默病药物的发现。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-08-15 DOI: 10.1080/1062936X.2025.2540820
V Kumar, K Roy

In this study, we employed a quantitative Read-Across Structure-Activity Relationship (q-RASAR) approach to develop a statistically robust machine learning (ML)-based q-RASAR model aimed at predicting the agonistic activity of compounds targeting the α7-nicotinic acetylcholine (α7nACh) receptor, a key therapeutic target in Alzheimer's disease (AD) due to its involvement in cognitive processes and neuroprotection. We developed a rigorously validated univariate q-RASAR linear regression (LR) model using an extensive dataset of 1,727 structurally diverse heterocyclic and aromatic hydrocarbon compounds sourced from the publicly available Binding Database (www.bindingdb.org). Additionally, we explored various other ML-based q-RASAR models to further enhance predictive performance. The established LR q-RASAR model was subsequently applied to predict the Mcule database (https://mcule.com/database/), containing 1,91,94,405 chemical compounds, to identify structurally relevant candidates with potential α7nACh receptor agonistic activity. Additionally, molecular docking analysis and molecular dynamics (MD) simulations for 100 ns were conducted to investigate interactions between the target protein and ligands. Overall, this investigation highlights the critical influence of hydrophobicity, electronic effects, ionization degree, and steric factors as key determinants in the design of potential anti-Alzheimer's agents.

在这项研究中,我们采用了定量的Read-Across Structure-Activity Relationship (q-RASAR)方法,建立了一个基于统计稳健的机器学习(ML)的q-RASAR模型,旨在预测针对α7-烟碱乙酰胆碱(α7nACh)受体的化合物的激动活性,α7-烟碱乙酰胆碱(α7nACh)受体是阿尔茨海默病(AD)的关键治疗靶点,因为它参与认知过程和神经保护。我们开发了一个经过严格验证的单变量q-RASAR线性回归(LR)模型,该模型使用了来自公开可用的Binding Database (www.bindingdb.org)的1,727个结构多样的杂环和芳烃化合物的广泛数据集。此外,我们还探索了各种其他基于ml的q-RASAR模型,以进一步提高预测性能。建立的LR q-RASAR模型随后应用于预测含有1,91,94,405种化合物的Mcule数据库(https://mcule.com/database/),以识别具有潜在α7nACh受体激动活性的结构相关候选化合物。此外,还进行了100 ns的分子对接分析和分子动力学(MD)模拟,以研究靶蛋白与配体之间的相互作用。总的来说,这项研究强调了疏水性、电子效应、电离度和空间因素是设计潜在抗阿尔茨海默病药物的关键决定因素。
{"title":"Machine learning-based q-RASAR modelling for the in silico identification of novel α7nAChR agonists for anti-Alzheimer's drug discovery.","authors":"V Kumar, K Roy","doi":"10.1080/1062936X.2025.2540820","DOIUrl":"10.1080/1062936X.2025.2540820","url":null,"abstract":"<p><p>In this study, we employed a quantitative Read-Across Structure-Activity Relationship (q-RASAR) approach to develop a statistically robust machine learning (ML)-based q-RASAR model aimed at predicting the agonistic activity of compounds targeting the α7-nicotinic acetylcholine (α7nACh) receptor, a key therapeutic target in Alzheimer's disease (AD) due to its involvement in cognitive processes and neuroprotection. We developed a rigorously validated univariate q-RASAR linear regression (LR) model using an extensive dataset of 1,727 structurally diverse heterocyclic and aromatic hydrocarbon compounds sourced from the publicly available Binding Database (www.bindingdb.org). Additionally, we explored various other ML-based q-RASAR models to further enhance predictive performance. The established LR q-RASAR model was subsequently applied to predict the Mcule database (https://mcule.com/database/), containing 1,91,94,405 chemical compounds, to identify structurally relevant candidates with potential α7nACh receptor agonistic activity. Additionally, molecular docking analysis and molecular dynamics (MD) simulations for 100 ns were conducted to investigate interactions between the target protein and ligands. Overall, this investigation highlights the critical influence of hydrophobicity, electronic effects, ionization degree, and steric factors as key determinants in the design of potential anti-Alzheimer's agents.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"621-649"},"PeriodicalIF":2.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144856187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling the biophysical basis of DYRK kinase family isoform selectivity mechanism of Abemaciclib using computational approaches. 利用计算方法揭示Abemaciclib的DYRK激酶家族亚型选择性机制的生物物理基础。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-09-04 DOI: 10.1080/1062936X.2025.2552133
K D Ursal, P Kar

Dual-specificity tyrosine phosphorylation-regulated kinases (DYRKs) play crucial roles in regulating cell growth and brain development. Dysregulation of these kinases is linked to disorders like Down syndrome and cancers. The selective inhibition of DYRK1A over other isoforms remains a significant challenge due to their high structural similarity. This study investigates the selectivity of Abemaciclib, an FDA-approved CDK4/6 inhibitor known to target DYRK1A, against other DYRK family isoforms. We employed molecular docking and molecular dynamics simulations, coupled with the Molecular Mechanics Poisson-Boltzmann Surface Area method, to evaluate the selectivity profile of Abemaciclib. Results showed that it binds strongest to DYRK1B, followed by DYRK1A, DYRK4, DYRK3 and DYRK2, which is validated with the statistical analysis. Enhanced selectivity for DYRK1B arises from stronger van der Waals and electrostatic interactions. Hydrophobic contacts and hydrogen bonds, especially within the kinase's hinge region, help stabilize the complex. Notably, Leu241 in DYRK1A and its identical residues in other isoforms play a pivotal role in these stabilizing interactions. Key residue differences, like Phe170, Glu239 and His285 in DYRK1A, contribute to specific interactions that underpin the molecular binding pattern. By identifying conserved and isoform-specific interactions, our study provides valuable insights for the rational design of potent and selective DYRK inhibitors.

双特异性酪氨酸磷酸化调节激酶(DYRKs)在调节细胞生长和大脑发育中发挥重要作用。这些激酶的失调与唐氏综合症和癌症等疾病有关。DYRK1A对其他亚型的选择性抑制仍然是一个重大挑战,因为它们的结构高度相似。本研究探讨了Abemaciclib(一种fda批准的靶向DYRK1A的CDK4/6抑制剂)对其他DYRK家族亚型的选择性。采用分子对接和分子动力学模拟,结合分子力学泊松-玻尔兹曼表面积方法,对Abemaciclib的选择性进行了评价。结果表明,其与DYRK1B结合最强,其次为DYRK1A、DYRK4、DYRK3和DYRK2,经统计学分析验证。DYRK1B的选择性增强是由于更强的范德华和静电相互作用。疏水接触和氢键,特别是在激酶的铰链区域,有助于稳定复合物。值得注意的是,DYRK1A中的Leu241及其在其他同工异构体中的相同残基在这些稳定相互作用中起关键作用。关键残基差异,如DYRK1A中的Phe170、Glu239和His285,有助于支持分子结合模式的特定相互作用。通过确定保守的和同工型特异性的相互作用,我们的研究为合理设计有效的和选择性的DYRK抑制剂提供了有价值的见解。
{"title":"Unveiling the biophysical basis of DYRK kinase family isoform selectivity mechanism of Abemaciclib using computational approaches.","authors":"K D Ursal, P Kar","doi":"10.1080/1062936X.2025.2552133","DOIUrl":"https://doi.org/10.1080/1062936X.2025.2552133","url":null,"abstract":"<p><p>Dual-specificity tyrosine phosphorylation-regulated kinases (DYRKs) play crucial roles in regulating cell growth and brain development. Dysregulation of these kinases is linked to disorders like Down syndrome and cancers. The selective inhibition of DYRK1A over other isoforms remains a significant challenge due to their high structural similarity. This study investigates the selectivity of Abemaciclib, an FDA-approved CDK4/6 inhibitor known to target DYRK1A, against other DYRK family isoforms. We employed molecular docking and molecular dynamics simulations, coupled with the Molecular Mechanics Poisson-Boltzmann Surface Area method, to evaluate the selectivity profile of Abemaciclib. Results showed that it binds strongest to DYRK1B, followed by DYRK1A, DYRK4, DYRK3 and DYRK2, which is validated with the statistical analysis. Enhanced selectivity for DYRK1B arises from stronger van der Waals and electrostatic interactions. Hydrophobic contacts and hydrogen bonds, especially within the kinase's hinge region, help stabilize the complex. Notably, Leu241 in DYRK1A and its identical residues in other isoforms play a pivotal role in these stabilizing interactions. Key residue differences, like Phe170, Glu239 and His285 in DYRK1A, contribute to specific interactions that underpin the molecular binding pattern. By identifying conserved and isoform-specific interactions, our study provides valuable insights for the rational design of potent and selective DYRK inhibitors.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 7","pages":"651-671"},"PeriodicalIF":2.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144993457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An effective machine learning model for rat acute oral toxicity prediction of emerging chemicals: multi-domain applications and structure-activity relationships. 新兴化学物质大鼠急性口服毒性预测的有效机器学习模型:多领域应用和构效关系。
IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-06-01 Epub Date: 2025-07-31 DOI: 10.1080/1062936X.2025.2531172
J Yan, Z Shen

Given the widespread presence of emerging contaminants in the environment, assessing and ensuring their biosafety is urgent. Under the Globally Harmonized System (GHS), the LD50 parameter of acute oral toxicity (AOT) is crucial for chemical safety classification. Animal testing limitations have highlighted the need for alternative methods, and machine learning offers a new approach to predict LD50 through quantitative structure-activity relationship (QSAR) models. This study developed and optimized a machine learning model for LD50 classification of emerging contaminants based on data from more than 6000 known AOT. Using molecular descriptors and fingerprints, the model achieves an accuracy above 0.86 and a recall score over 0.84, outperforming previous models. The model's robustness was confirmed across various types of emerging contaminants. Shapley additive explanations (SHAP) identified key descriptors like BCUTp_1h, ATSC1pe, and SLogP_VSA4, while the information gain (IG) method highlighted alert substructures [P-O, P-S]. These findings suggest that compounds with high polarizability, mean electronegativity and significant surface area may adversely affect rats. This model enhances understanding of acute toxicity mechanisms and serves as a tool for early screening of safer compounds, promoting the design of greener chemicals.

鉴于新出现的污染物在环境中广泛存在,评估和确保其生物安全性迫在眉睫。在全球统一制度(GHS)下,急性口服毒性(AOT)的LD50参数是化学品安全分类的关键参数。动物实验的局限性突出了对替代方法的需求,机器学习提供了一种通过定量结构-活性关系(QSAR)模型预测LD50的新方法。本研究基于6000多个已知AOT的数据,开发并优化了一个用于新兴污染物LD50分类的机器学习模型。使用分子描述符和指纹,该模型的准确率超过0.86,召回率超过0.84,优于之前的模型。该模型的稳健性在各种类型的新兴污染物中得到了证实。Shapley加性解释(SHAP)识别了关键描述符,如BCUTp_1h、ATSC1pe和SLogP_VSA4,而信息增益(IG)方法突出了警报子结构[P-O, P-S]。这些发现表明,具有高极化率、平均电负性和显著表面积的化合物可能对大鼠产生不利影响。该模型增强了对急性毒性机制的理解,并作为早期筛选更安全化合物的工具,促进了绿色化学品的设计。
{"title":"An effective machine learning model for rat acute oral toxicity prediction of emerging chemicals: multi-domain applications and structure-activity relationships.","authors":"J Yan, Z Shen","doi":"10.1080/1062936X.2025.2531172","DOIUrl":"https://doi.org/10.1080/1062936X.2025.2531172","url":null,"abstract":"<p><p>Given the widespread presence of emerging contaminants in the environment, assessing and ensuring their biosafety is urgent. Under the Globally Harmonized System (GHS), the LD<sub>50</sub> parameter of acute oral toxicity (AOT) is crucial for chemical safety classification. Animal testing limitations have highlighted the need for alternative methods, and machine learning offers a new approach to predict LD<sub>50</sub> through quantitative structure-activity relationship (QSAR) models. This study developed and optimized a machine learning model for LD<sub>50</sub> classification of emerging contaminants based on data from more than 6000 known AOT. Using molecular descriptors and fingerprints, the model achieves an accuracy above 0.86 and a recall score over 0.84, outperforming previous models. The model's robustness was confirmed across various types of emerging contaminants. Shapley additive explanations (SHAP) identified key descriptors like BCUTp_1h, ATSC1pe, and SLogP_VSA4, while the information gain (IG) method highlighted alert substructures [P-O, P-S]. These findings suggest that compounds with high polarizability, mean electronegativity and significant surface area may adversely affect rats. This model enhances understanding of acute toxicity mechanisms and serves as a tool for early screening of safer compounds, promoting the design of greener chemicals.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 6","pages":"537-554"},"PeriodicalIF":2.3,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SAR and QSAR in Environmental Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1