We present machine learning studies devoted to the creation of predictive models for toxicity evaluation of imidazolium- and pyridinium-containing ionic liquids. New created predictive models were developed using the OCHEM. The predictive ability of the models was tested by cross-validation, giving a coefficient of determination q2 = 0.77–0.82. The models were applied to screen a virtual chemical library to the toxicity of ILs in Danio rerio and Daphnia magna bioassays. Models were used to predict toxicity for 25 ILs, which were then synthesized and tested in vivo. The in vivo toxicity studies found that D. magna is a more sensitive aquatic test organism than D. rerio – 67 % of the studied ILs are classified as extremely toxic with an LC50 range from 0.005 to 0.01 mg/l. At the same time, only one IL 1-dodecylpyridinium bromide with an LC50 of 0.08 mg/l is classified as extremely toxic, and 76 % are classified as slightly and moderately toxic compounds using D. rerio as a test organism. The most toxic ILs 5 and 19 were docked into the human AChE active center and demonstrated calculated binding energy values −9.5 and −9.3 kcal/mol that is comparable with the complexation of the human AChE inhibitor Donepezil, which provides insight into the potential molecular mechanisms of ILs toxicity. The created QSTR models are a successful tool for the toxicity analysis of new promising ILs. QSTR models demonstrated not only high predictive indicators but also a high percentage of correctly predicted toxicity values in vivo studies.
我们介绍了专门用于创建含咪唑和吡啶离子液体毒性评估预测模型的机器学习研究。我们使用 OCHEM 开发了新的预测模型。通过交叉验证测试了模型的预测能力,结果表明决定系数 q2 = 0.77-0.82。这些模型被应用于筛选虚拟化学库,以确定惰性惰性物质在真鲷和大型蚤生物测定中的毒性。利用模型预测了 25 种 IL 的毒性,然后合成了这些 IL 并进行了体内测试。体内毒性研究发现,大型蚤是一种比红腹锦蛇更敏感的水生试验生物--67%的所研究的ILs被归类为毒性极强,半数致死浓度范围为0.005至0.01毫克/升。同时,只有一种 LC50 值为 0.08 mg/l 的 1-dodecylpyridinium bromide 被归类为剧毒,而以 D. rerio 为测试生物的 76% 被归类为轻微和中等毒性化合物。将毒性最强的 ILs 5 和 19 与人类 AChE 活性中心对接,计算出的结合能值分别为-9.5 和-9.3 kcal/mol,与人类 AChE 抑制剂多奈哌齐的络合能值相当,这有助于深入了解 ILs 毒性的潜在分子机制。所创建的 QSTR 模型是一种成功的工具,可用于分析有潜力的新型 ILs 的毒性。QSTR 模型不仅具有很高的预测指标,而且在体内研究中正确预测毒性值的比例也很高。
{"title":"New QSTR models to evaluation of imidazolium- and pyridinium-contained ionic liquids toxicity","authors":"Ivan Semenyuta, Vasyl Kovalishyn, Diana Hodyna, Yuliia Startseva, Sergiy Rogalsky, Larysa Metelytsia","doi":"10.1016/j.comtox.2024.100309","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100309","url":null,"abstract":"<div><p>We present machine learning studies devoted to the creation of predictive models for toxicity evaluation of imidazolium- and pyridinium-containing ionic liquids. New created predictive models were developed using the OCHEM. The predictive ability of the models was tested by cross-validation, giving a coefficient of determination q<sup>2</sup> = 0.77–0.82. The models were applied to screen a virtual chemical library to the toxicity of ILs in Danio rerio and Daphnia magna bioassays. Models were used to predict toxicity for 25 ILs, which were then synthesized and tested in vivo. The in vivo toxicity studies found that D. magna is a more sensitive aquatic test organism than D. rerio – 67 % of the studied ILs are classified as extremely toxic with an LC<sub>50</sub> range from 0.005 to 0.01 mg/l. At the same time, only one IL 1-dodecylpyridinium bromide with an LC<sub>50</sub> of 0.08 mg/l is classified as extremely toxic, and 76 % are classified as slightly and moderately toxic compounds using D. rerio as a test organism. The most toxic ILs 5 and 19 were docked into the human AChE active center and demonstrated calculated binding energy values −9.5 and −9.3 kcal/mol that is comparable with the complexation of the human AChE inhibitor Donepezil, which provides insight into the potential molecular mechanisms of ILs toxicity. The created QSTR models are a successful tool for the toxicity analysis of new promising ILs. QSTR models demonstrated not only high predictive indicators but also a high percentage of correctly predicted toxicity values in vivo studies.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100309"},"PeriodicalIF":0.0,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140195734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Adverse Outcome Pathways (AOPs) provide a basis for non-animal testing, by outlining the cascade of molecular and cellular events initiated upon stressor exposure, leading to adverse effects. In recent years, the scientific community has shown interest in developing AOPs through crowdsourcing, with the results archived in the AOP-Wiki: a centralized repository coordinated by the OECD, hosting nearly 512 AOPs (April, 2023). However, the AOP-Wiki platform currently lacks a versatile querying system, which hinders developers' exploration of the AOP network and impedes its practical use in risk assessment. This work proposes to unleash the full potential of the AOP-Wiki archive by adapting its data into a Labelled Property Graph (LPG) schema. Additionally, the tool offers a visual network query interface for both database-specific and natural language queries, facilitating the retrieval and analysis of graph data. The multi-query interface allows non-technical users to construct flexible queries, thereby enhancing the potential for AOP exploration. By reducing the time and technical requirements, the present query engine enhances the practical utilization of the valuable data within AOP-Wiki. To evaluate the platform, a case study is presented with three levels of use-case scenarios (simple, moderate, and complex queries). AOPWIKI-EXPLORER is freely available on GitHub (https://github.com/Crispae/AOPWiki_Explorer) for wider community reach and further enhancement.
{"title":"AOPWIKI-EXPLORER: An interactive graph-based query engine leveraging large language models","authors":"Saurav Kumar , Deepika Deepika , Karin Slater , Vikas Kumar","doi":"10.1016/j.comtox.2024.100308","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100308","url":null,"abstract":"<div><p>Adverse Outcome Pathways (AOPs) provide a basis for non-animal testing, by outlining the cascade of molecular and cellular events initiated upon stressor exposure, leading to adverse effects. In recent years, the scientific community has shown interest in developing AOPs through crowdsourcing, with the results archived in the AOP-Wiki: a centralized repository coordinated by the OECD, hosting nearly 512 AOPs (April, 2023). However, the AOP-Wiki platform currently lacks a versatile querying system, which hinders developers' exploration of the AOP network and impedes its practical use in risk assessment. This work proposes to unleash the full potential of the AOP-Wiki archive by adapting its data into a Labelled Property Graph (LPG) schema. Additionally, the tool offers a visual network query interface for both database-specific and natural language queries, facilitating the retrieval and analysis of graph data. The multi-query interface allows non-technical users to construct flexible queries, thereby enhancing the potential for AOP exploration. By reducing the time and technical requirements, the present query engine enhances the practical utilization of the valuable data within AOP-Wiki. To evaluate the platform, a case study is presented with three levels of use-case scenarios (simple, moderate, and complex queries). AOPWIKI-EXPLORER is freely available on GitHub (https://github.com/Crispae/AOPWiki_Explorer) for wider community reach and further enhancement.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100308"},"PeriodicalIF":0.0,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468111324000100/pdfft?md5=542059b7f2c1ba3e8e43c9fa101d3325&pid=1-s2.0-S2468111324000100-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140309180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-19DOI: 10.1016/j.comtox.2024.100307
A. Rasim Barutcu
Sequencing depth and biological replication represent key experimental design considerations in toxicogenomics and risk assessment. However, their relative impacts on differential gene expression analysis remain unclear. Using an 8-dose chemical (Prochloraz) perturbation RNA-seq dataset in A549 cells, we systematically subsampled sequencing depth (5–100 %) and replicates (2–4) to evaluate effects on number of differentially expressed genes. While dose was the primary variance driver, replication had a greater influence than depth for optimizing detection power. With only 2 replicates, over 80% of the ∼2000 differential genes were unique to specific depths, indicating high variability. Increasing to 4 replicates substantially improved reproducibility, with over 550 genes consistently identified across most depths, representing 30% of the total differential genes. Higher replicates also increased the rate of overlap of benchmark dose pathways and precision of median benchmark dose estimates. However, key gene ontology pathways related to DNA replication, cell cycle, and division were consistently captured even at lower replicates. Thus, replication enhanced confidence but did not fundamentally expand biological findings. Our study delineates key trade-offs between sequencing depth and replication for toxicogenomic experimental design. While additional replicates fundamentally improve reproducibility, gains from depth exhibit diminishing returns. Prioritizing biological replication over depth provides a cost-effective approach to enhance interpretation without sacrificing detection of core gene expression patterns. Altogether, this study provides important insights into the experimental design of toxicogenomics experiments.
{"title":"Evaluation of Replicate Number and Sequencing Depth in Toxicology Dose-Response RNA-seq","authors":"A. Rasim Barutcu","doi":"10.1016/j.comtox.2024.100307","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100307","url":null,"abstract":"<div><p>Sequencing depth and biological replication represent key experimental design considerations in toxicogenomics and risk assessment. However, their relative impacts on differential gene expression analysis remain unclear. Using an 8-dose chemical (Prochloraz) perturbation RNA-seq dataset in A549 cells, we systematically subsampled sequencing depth (5–100 %) and replicates (2–4) to evaluate effects on number of differentially expressed genes. While dose was the primary variance driver, replication had a greater influence than depth for optimizing detection power. With only 2 replicates, over 80% of the ∼2000 differential genes were unique to specific depths, indicating high variability. Increasing to 4 replicates substantially improved reproducibility, with over 550 genes consistently identified across most depths, representing 30% of the total differential genes. Higher replicates also increased the rate of overlap of benchmark dose pathways and precision of median benchmark dose estimates. However, key gene ontology pathways related to DNA replication, cell cycle, and division were consistently captured even at lower replicates. Thus, replication enhanced confidence but did not fundamentally expand biological findings. Our study delineates key trade-offs between sequencing depth and replication for toxicogenomic experimental design. While additional replicates fundamentally improve reproducibility, gains from depth exhibit diminishing returns. Prioritizing biological replication over depth provides a cost-effective approach to enhance interpretation without sacrificing detection of core gene expression patterns. Altogether, this study provides important insights into the experimental design of toxicogenomics experiments.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100307"},"PeriodicalIF":0.0,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140180706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-16DOI: 10.1016/j.comtox.2024.100305
Christopher Barber, Crina Heghes, Laura Johnston
Advances in the development and application of in silico models in toxicology has been recognised by two OECD guidance documents (69: Guidance Document On The Validation Of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models and 386: (Q)SAR Assessment Framework: Guidance for the regulatory assessment of (Q)SAR models, predictions, and results based on multiple predictions) published in 2007 and 2023 respectively. The former outlines criteria for appropriate model validation, whilst the latter provides guidance around assessing predictions derived from them. The concepts and criteria described within these guidelines have been used to establish a framework to support both model builders and those applying them to support regulatory decisions. Herein we demonstrate how to meet those criteria and propose where further guidance is essential for ensuring the consistent, confident, and safe application of in silico models in support of regulatory decisions.
{"title":"A framework to support the application of the OECD guidance documents on (Q)SAR model validation and prediction assessment for regulatory decisions","authors":"Christopher Barber, Crina Heghes, Laura Johnston","doi":"10.1016/j.comtox.2024.100305","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100305","url":null,"abstract":"<div><p>Advances in the development and application of in silico models in toxicology has been recognised by two OECD guidance documents (69: Guidance Document On The Validation Of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models and 386: (Q)SAR Assessment Framework: Guidance for the regulatory assessment of (Q)SAR models, predictions, and results based on multiple predictions) published in 2007 and 2023 respectively. The former outlines criteria for appropriate model validation, whilst the latter provides guidance around assessing predictions derived from them. The concepts and criteria described within these guidelines have been used to establish a framework to support both model builders and those applying them to support regulatory decisions. Herein we demonstrate how to meet those criteria and propose where further guidance is essential for ensuring the consistent, confident, and safe application of in silico models in support of regulatory decisions.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100305"},"PeriodicalIF":0.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140190686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-15DOI: 10.1016/j.comtox.2024.100306
Alejandro Gómez, Andrés Alarcón, Wilson Acosta, Andrés Malagón
Glyphosate is a widely used herbicide known for its effectiveness in weed control; and it is an inhibitor of the plant enzyme 5-enolpyruvylshikimate-3-phosphate synthase. Currently, it is one of the most extensively used non-specific herbicides in agroindustry. However, toxic effects of glyphosate have recently been reported, including endocrine disruption, metabolic alterations, teratogenic, tumorigenic, and hepatorenal effects. Additionally, there are environmental concerns related to possible interactions with proteins from microorganisms, aquatic organisms, and mammals.
Research on the description of these interactions has gained interest, primarily with the aim of generating recommendations in terms of its use and possible regulations. On the other hand, computational methods have emerged to identify potential targets or unintended targets among numerous possible receptors. Several programs, online services, and databases are available for use in these methods.
In this study, we employed a set of online tools for computational target fishing to identify receptors of glyphosate. A set of thirteen targets were selected using six fishing tools. Furthermore, docking procedures were performed to investigate the expected interactions and binding energies. Certain associations with diseases are also reported.
{"title":"Identification of potential human targets of glyphosate using in silico target fishing","authors":"Alejandro Gómez, Andrés Alarcón, Wilson Acosta, Andrés Malagón","doi":"10.1016/j.comtox.2024.100306","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100306","url":null,"abstract":"<div><p>Glyphosate is a widely used herbicide known for its effectiveness in weed control; and it is an inhibitor of the plant enzyme 5-enolpyruvylshikimate-3-phosphate synthase. Currently, it is one of the most extensively used non-specific herbicides in agroindustry. However, toxic effects of glyphosate have recently been reported, including endocrine disruption, metabolic alterations, teratogenic, tumorigenic, and hepatorenal effects. Additionally, there are environmental concerns related to possible interactions with proteins from microorganisms, aquatic organisms, and mammals.</p><p>Research on the description of these interactions has gained interest, primarily with the aim of generating recommendations in terms of its use and possible regulations. On the other hand, computational methods have emerged to identify potential targets or unintended targets among numerous possible receptors. Several programs, online services, and databases are available for use in these methods.</p><p>In this study, we employed a set of online tools for computational target fishing to identify receptors of glyphosate. A set of thirteen targets were selected using six fishing tools. Furthermore, docking procedures were performed to investigate the expected interactions and binding energies. Certain associations with diseases are also reported.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100306"},"PeriodicalIF":0.0,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140145179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-28DOI: 10.1016/j.comtox.2024.100304
G. Patlewicz , P. Karamertzanis , K. Paul Friedman , M. Sannicola , I. Shah
Read-across is a well-established data-gap filling technique used within analogue or category approaches. Acceptance remains an issue, mainly due to the difficulties of addressing residual uncertainties associated with a read-across prediction and because assessments are expert-driven. Frameworks to develop, assess and document read-across may help reduce variability in read-across results. Data-driven read-across approaches such as Generalised Read-Across (GenRA) include quantification of uncertainties and performance. GenRA also offers opportunities on how New Approach Method (NAM) data can be systematically incorporated to support the read-across hypothesis. Herein, a systematic investigation of differences in expert-driven read-across with data-driven approaches was pursued in terms of building scientific confidence in the use of read-across. A dataset of expert-driven read-across assessments that made use of registration data as disseminated in the public International Uniform Chemical Information Database (IUCLID) (version 6) of Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) Study Results were compiled. A dataset of ∼5000 read-across cases pertaining to repeated dose and developmental toxicity was extracted and mapped to content within EPA’s Distributed Structure Searchable Toxicity database (DSSTox) to retrieve chemical name and structural identification information. Content could be mapped to ∼3600 cases which when filtered for unique cases with curated quantitative structure–activity relationship-ready SMILES resulted in 389 target-source analogue pairs. The similarity between target and the source analogues on the basis of different contexts – from structural similarity using chemical fingerprints to metabolic similarity using predicted metabolic information was evaluated. An attempt was also made to quantify the relative contribution each similarity context played relative to the target-source analogue pairs by deriving a model which predicted known analogue pairs. Finally, point of departure values (PODs) were predicted using the GenRA approach underpinned by data extracted from the EPA’s Toxicity Values Database (ToxValDB). The GenRA predicted PODs were compared with those reported within the REACH dossiers themselves. This study offers generalisable insights on how read-across is already applied for regulatory submissions and expectations on the levels of similarity necessary to make decisions.
{"title":"A systematic analysis of read-across within REACH registration dossiers","authors":"G. Patlewicz , P. Karamertzanis , K. Paul Friedman , M. Sannicola , I. Shah","doi":"10.1016/j.comtox.2024.100304","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100304","url":null,"abstract":"<div><p>Read-across is a well-established data-gap filling technique used within analogue or category approaches. Acceptance remains an issue, mainly due to the difficulties of addressing residual uncertainties associated with a read-across prediction and because assessments are expert-driven. Frameworks to develop, assess and document read-across may help reduce variability in read-across results. Data-driven read-across approaches such as Generalised Read-Across (GenRA) include quantification of uncertainties and performance. GenRA also offers opportunities on how New Approach Method (NAM) data can be systematically incorporated to support the read-across hypothesis. Herein, a systematic investigation of differences in expert-driven read-across with data-driven approaches was pursued in terms of building scientific confidence in the use of read-across. A dataset of expert-driven read-across assessments that made use of registration data as disseminated in the public International Uniform Chemical Information Database (IUCLID) (version 6) of Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) Study Results were compiled. A dataset of ∼5000 read-across cases pertaining to repeated dose and developmental toxicity was extracted and mapped to content within EPA’s Distributed Structure Searchable Toxicity database (DSSTox) to retrieve chemical name and structural identification information. Content could be mapped to ∼3600 cases which when filtered for unique cases with curated quantitative structure–activity relationship-ready SMILES resulted in 389 target-source analogue pairs. The similarity between target and the source analogues on the basis of different contexts – from structural similarity using chemical fingerprints to metabolic similarity using predicted metabolic information was evaluated. An attempt was also made to quantify the relative contribution each similarity context played relative to the target-source analogue pairs by deriving a model which predicted known analogue pairs. Finally, point of departure values (PODs) were predicted using the GenRA approach underpinned by data extracted from the EPA’s Toxicity Values Database (ToxValDB). The GenRA predicted PODs were compared with those reported within the REACH dossiers themselves. This study offers generalisable insights on how read-across is already applied for regulatory submissions and expectations on the levels of similarity necessary to make decisions.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"30 ","pages":"Article 100304"},"PeriodicalIF":0.0,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140063323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-13DOI: 10.1016/j.comtox.2024.100302
Shweta Singh Chauhan , E. Azra Thaseen , Ramakrishnan Parthasarathi
Bacterial infections caused by resistant strains, especially those conferring multi-drug resistance (MDR), have become a severe health problem worldwide. Novobiocin (NB) is a widely used antibiotic that inhibits the action of DNA gyrase in Escherichia coli (E. coli). The drug's efficiency is hindered by its strong binding with the resistance causing efflux pump AcrAB-TolC on recurrent exposure. Consequently, the discovery of alternate/substitute analogue compounds for the parent drug with higher selectivity could counter drug resistance. In this work, we identified potent analogues of drug NB against the gyrase B enzyme by performing high throughput virtual screening of forty analogues that includes drug-likeness properties, pharmacokinetic parameters analysis, molecular docking, and molecular dynamics (MD) simulations. Our comprehensive pharmacological profiling with intrinsic analysis of selectivity and safety resulted in the identification of four potential compounds, C4 (ZINC218812366), C6 (ZINC221968665), C8 (ZINC49783724) and C10 (ZINC49783727), have better inhibitory and binding capacity against the primary target gyrase B subunit and reduced interaction with the counterpart of resistant target AcrB. These findings provide proof of concept for developing lead compounds targeting gyrase B and help in combatting AcrB-mediated drug resistance.
耐药菌株,尤其是具有多重耐药性(MDR)的耐药菌株引起的细菌感染已成为全球严重的健康问题。新生物素(NB)是一种广泛使用的抗生素,可抑制大肠杆菌(E. coli)中 DNA 回旋酶的作用。该药物在反复接触时会与导致耐药性的外排泵 AcrAB-TolC 发生强结合,从而影响其疗效。因此,发现具有更高选择性的母药替代/替代类似化合物可以对抗耐药性。在这项工作中,我们通过对 40 种类似物进行高通量虚拟筛选,包括药物相似性、药动学参数分析、分子对接和分子动力学(MD)模拟,确定了 NB 药物对回旋酶 B 的强效类似物。我们通过对选择性和安全性的内在分析进行了全面的药理学分析,最终确定了 C4(ZINC218812366)、C6(ZINC221968665)、C8(ZINC49783724)和 C10(ZINC49783727)这四种潜在化合物,它们对主要靶标回旋酶 B 亚基具有更好的抑制和结合能力,并减少了与抗性靶标 AcrB 的相互作用。这些发现为开发靶向回旋酶 B 的先导化合物提供了概念证明,有助于对抗 AcrB 介导的耐药性。
{"title":"Computational discovery of potent Escherichia coli DNA gyrase inhibitor: Selective and safer novobiocin analogues","authors":"Shweta Singh Chauhan , E. Azra Thaseen , Ramakrishnan Parthasarathi","doi":"10.1016/j.comtox.2024.100302","DOIUrl":"10.1016/j.comtox.2024.100302","url":null,"abstract":"<div><p>Bacterial infections caused by resistant strains, especially those conferring multi-drug resistance (MDR), have become a severe health problem worldwide. Novobiocin (NB) is a widely used antibiotic that inhibits the action of DNA gyrase in <em>Escherichia coli (E. coli).</em> The drug's efficiency is hindered by its strong binding with the resistance causing efflux pump AcrAB-TolC on recurrent exposure. Consequently, the discovery of alternate/substitute analogue compounds for the parent drug with higher selectivity could counter drug resistance. In this work, we identified potent analogues of drug NB against the gyrase B enzyme by performing high throughput virtual screening of forty analogues that includes drug-likeness properties, pharmacokinetic parameters analysis, molecular docking, and molecular dynamics (MD) simulations. Our comprehensive pharmacological profiling with intrinsic analysis of selectivity and safety resulted in the identification of four potential compounds, C4 (ZINC218812366), C6 (ZINC221968665), C8 (ZINC49783724) and C10 (ZINC49783727), have better inhibitory and binding capacity against the primary target gyrase B subunit and reduced interaction with the counterpart of resistant target AcrB. These findings provide proof of concept for developing lead compounds targeting gyrase B and help in combatting AcrB-mediated drug resistance.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"29 ","pages":"Article 100302"},"PeriodicalIF":0.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139892597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-09DOI: 10.1016/j.comtox.2024.100301
Tia Tate, Grace Patlewicz, Imran Shah
Animal toxicity testing is time and resource intensive, making it difficult to keep pace with the number of substances requiring assessment. Machine learning (ML) models that use chemical structure information and high-throughput experimental data can be helpful in predicting potential toxicity. However, much of the toxicity data used to train ML models is biased with an unequal balance of positives and negatives primarily since substances selected for in vivo testing are expected to elicit some toxicity effect. To investigate the impact this bias had on predictive performance, various sampling approaches were used to balance in vivo toxicity data as part of a supervised ML workflow to predict hepatotoxicity outcomes from chemical structure and/or targeted transcriptomic data. From the chronic, subchronic, developmental, multigenerational reproductive, and subacute repeat-dose testing toxicity outcomes with a minimum of 50 positive and 50 negative substances, 18 different study-toxicity outcome combinations were evaluated in up to 7 ML models. These included Artificial Neural Networks, Random Forests, Bernouilli Naïve Bayes, Gradient Boosting, and Support Vector classification algorithms which were compared with a local approach, Generalised Read-Across (GenRA), a similarity-weighted k-Nearest Neighbour (k-NN) method. The mean CV F1 performance for unbalanced data across all classifiers and descriptors for chronic liver effects was 0.735 (0.0395 SD). Mean CV F1 performance dropped to 0.639 (0.073 SD) with over-sampling approaches though the poorer performance of KNN approaches in some cases contributed to the observed decrease (mean CV F1 performance excluding KNN was 0.697 (0.072 SD)). With under-sampling approaches, the mean CV F1 was 0.523 (0.083 SD). For developmental liver effects, the mean CV F1 performance was much lower with 0.089 (0.111 SD) for unbalanced approaches and 0.149 (0.084 SD) for under-sampling. Over-sampling approaches led to an increase in mean CV F1 performance (0.234, (0.107 SD)) for developmental liver toxicity. Model performance was found to be dependent on dataset, model type, balancing approach and feature selection. Accordingly tailoring ML workflows for predicting toxicity should consider class imbalance and rely on simple classifiers first.
动物毒性测试需要大量时间和资源,因此很难跟上需要评估的物质数量。使用化学结构信息和高通量实验数据的机器学习(ML)模型有助于预测潜在毒性。然而,用于训练 ML 模型的大部分毒性数据都存在偏差,阳性和阴性数据不平衡,这主要是因为被选中进行体内测试的物质预计会引起一些毒性效应。为了研究这种偏差对预测性能的影响,我们采用了各种取样方法来平衡体内毒性数据,作为监督式 ML 工作流程的一部分,以便从化学结构和/或靶向转录组数据中预测肝毒性结果。从至少 50 种阳性物质和 50 种阴性物质的慢性、亚慢性、发育、多代生殖和亚急性重复剂量测试毒性结果中,在多达 7 个 ML 模型中评估了 18 种不同的研究-毒性结果组合。这些模型包括人工神经网络、随机森林、Bernouilli Naïve Bayes、梯度提升和支持向量分类算法,并与一种本地方法--广义读数交叉(GenRA)--相似性加权 k-近邻(k-NN)方法进行了比较。在所有分类器和描述符的非平衡数据中,慢性肝脏效应的平均 CV F1 性能为 0.735(0.0395 SD)。过度取样方法的平均 CV F1 性能降至 0.639(0.073 标差),尽管在某些情况下 KNN 方法的性能较差也导致了观察到的性能下降(不包括 KNN 的平均 CV F1 性能为 0.697(0.072 标差))。采用取样不足法时,平均 CV F1 为 0.523(0.083 标差)。在发育肝效应方面,不平衡方法的平均 CV F1 性能更低,为 0.089(0.111 标差),而采样不足方法的平均 CV F1 性能为 0.149(0.084 标差)。过度取样方法导致发育期肝脏毒性的平均 CV F1 性能提高(0.234,(0.107 标差))。研究发现,模型性能取决于数据集、模型类型、平衡方法和特征选择。因此,在定制预测毒性的 ML 工作流程时应考虑类的不平衡性,并首先依赖简单的分类器。
{"title":"A comparison of machine learning approaches for predicting hepatotoxicity potential using chemical structure and targeted transcriptomic data","authors":"Tia Tate, Grace Patlewicz, Imran Shah","doi":"10.1016/j.comtox.2024.100301","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100301","url":null,"abstract":"<div><p>Animal toxicity testing is time and resource intensive, making it difficult to keep pace with the number of substances requiring assessment. Machine learning (ML) models that use chemical structure information and high-throughput experimental data can be helpful in predicting potential toxicity. However, much of the toxicity data used to train ML models is biased with an unequal balance of positives and negatives primarily since substances selected for <em>in vivo</em> testing are expected to elicit some toxicity effect. To investigate the impact this bias had on predictive performance, various sampling approaches were used to balance <em>in vivo</em> toxicity data as part of a supervised ML workflow to predict hepatotoxicity outcomes from chemical structure and/or targeted transcriptomic data. From the chronic, subchronic, developmental, multigenerational reproductive, and subacute repeat-dose testing toxicity outcomes with a minimum of 50 positive and 50 negative substances, 18 different study-toxicity outcome combinations were evaluated in up to 7 ML models. These included Artificial Neural Networks, Random Forests, Bernouilli Naïve Bayes, Gradient Boosting, and Support Vector classification algorithms which were compared with a local approach, Generalised Read-Across (GenRA), a similarity-weighted k-Nearest Neighbour (k-NN) method. The mean CV F1 performance for unbalanced data across all classifiers and descriptors for chronic liver effects was 0.735 (0.0395 SD). Mean CV F1 performance dropped to 0.639 (0.073 SD) with over-sampling approaches though the poorer performance of KNN approaches in some cases contributed to the observed decrease (mean CV F1 performance excluding KNN was 0.697 (0.072 SD)). With under-sampling approaches, the mean CV F1 was 0.523 (0.083 SD). For developmental liver effects, the mean CV F1 performance was much lower with 0.089 (0.111 SD) for unbalanced approaches and 0.149 (0.084 SD) for under-sampling. Over-sampling approaches led to an increase in mean CV F1 performance (0.234, (0.107 SD)) for developmental liver toxicity. Model performance was found to be dependent on dataset, model type, balancing approach and feature selection. Accordingly tailoring ML workflows for predicting toxicity should consider class imbalance and rely on simple classifiers first.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"29 ","pages":"Article 100301"},"PeriodicalIF":0.0,"publicationDate":"2024-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139743839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-29DOI: 10.1016/j.comtox.2024.100300
Steven Kane, Dan Newman, David J. Ponting, Edward Rosser, Robert Thomas, Jonathan D. Vessey, Samuel J. Webb, William H.J. Wood
To reach conclusions during chemical safety assessments, risk assessors need to ensure sufficient information is present to satisfy the decision criteria. This often requires data to be generated and, in some cases, insufficient knowledge is present, or it is not feasible to generate new data through experiments. Read-across is a powerful technique to fill such data gaps, however the expert-driven process can be time intensive and subjective in nature resulting in variation of approach. To overcome these barriers a prototype software application has been developed by Lhasa Limited to support decision making about the toxicity and potency of chemicals using a read-across approach. The application supports a workflow which allows the user to gather data and knowledge about a chemical of interest and possible read-across candidates. Relevant information is then presented that enables the user to decide if read-across can be performed and, if so, which analogue or category can be considered the most appropriate. Data and knowledge about the toxicity of a compound and potential analogues include assay and metabolism data, toxicophore identification and its local similarity, physico-chemical and pharmacokinetic properties and observed and predicted metabolic profile. The utility of the approach is demonstrated with case studies using N-nitrosamine compounds, where the conclusions from using the workflow supported by the software are concordant with the evidence base. The components of the workflow have been further validated by demonstrating that conclusions are significantly better than would be expect from the distribution of data in test sets. The approach taken demonstrates how software implementing intuitive workflows that guide experts during read-across can support decisions and how validation of the methods can increase confidence in the overall approach.
{"title":"Developing and validating read-across workflows that enable decision making for toxicity and potency: Case studies with N-nitrosamines","authors":"Steven Kane, Dan Newman, David J. Ponting, Edward Rosser, Robert Thomas, Jonathan D. Vessey, Samuel J. Webb, William H.J. Wood","doi":"10.1016/j.comtox.2024.100300","DOIUrl":"https://doi.org/10.1016/j.comtox.2024.100300","url":null,"abstract":"<div><p>To reach conclusions during chemical safety assessments, risk assessors need to ensure sufficient information is present to satisfy the decision criteria. This often requires data to be generated and, in some cases, insufficient knowledge is present, or it is not feasible to generate new data through experiments. Read-across is a powerful technique to fill such data gaps, however the expert-driven process can be time intensive and subjective in nature resulting in variation of approach. To overcome these barriers a prototype software application has been developed by Lhasa Limited to support decision making about the toxicity and potency of chemicals using a read-across approach. The application supports a workflow which allows the user to gather data and knowledge about a chemical of interest and possible read-across candidates. Relevant information is then presented that enables the user to decide if read-across can be performed and, if so, which analogue or category can be considered the most appropriate. Data and knowledge about the toxicity of a compound and potential analogues include assay and metabolism data, toxicophore identification and its local similarity, physico-chemical and pharmacokinetic properties and observed and predicted metabolic profile. The utility of the approach is demonstrated with case studies using <em>N</em>-nitrosamine compounds, where the conclusions from using the workflow supported by the software are concordant with the evidence base. The components of the workflow have been further validated by demonstrating that conclusions are significantly better than would be expect from the distribution of data in test sets. The approach taken demonstrates how software implementing intuitive workflows that guide experts during read-across can support decisions and how validation of the methods can increase confidence in the overall approach.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"29 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139731861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-23DOI: 10.1016/j.comtox.2024.100299
L.M. Bilinsky
This paper presents an ordinary differential equation (ODE) model of endogenous H2O2 metabolism in hepatocytes that is unique, at the time of writing, in its ability to accurately compute intracellular H2O2 concentration during incidents of oxidative stress and in its usefulness for constructing PBPK/PD models for ROS-generating xenobiotics. Versions of the model are presented for rat hepatocytes in vitro and mouse liver in vivo. A generic method is given for using the model to create PBPK/PD models which predict intracellular H2O2 concentration and oxidative-stress-induced hepatocyte death; these are identifiable from in vitro data sets reporting cell mortality following xenobiotic exposure at various levels. The procedure is demonstrated for the trivalent arsenical dimethylarsinous acid (DMAIII), which is produced in liver as part of the arsenic elimination pathway. This is the first model of H2O2 metabolism in hepatocytes to feature values for the endogenous rates of H2O2 production by mitochondria and other organelles which are inferred from the physiology literature, and to feature a detailed, realistic treatment of GSH metabolism; the latter is achieved by incorporating a minimal version of Reed and coworkers’ pioneering model of GSH metabolism in liver. Model simulations indicate that critical GSH depletion is the immediate trigger for intracellular H2O2 rising to concentrations associated with apoptosis (), that this may only occur hours after the xenobiotic concentration peaks (“delay effect”), that when critical GSH depletion does occur, H2O2 concentration rises rapidly in a sequence of two boundary layers, characterized by the kinetics of glutathione peroxidase (first boundary layer) and catalase (second boundary layer), and that intracellular H2O2 concentration implies critical GSH depletion. There has been speculation that ROS levels in the range associated with apoptosis simply indicate, rather than cause, an apoptotic milieu. Model simulations are consistent with this view. In a result of interest to the wider physiology community, the delay effect is shown to provide a GSH-based mechanism by which cells can distinguish transient elevations in H2O2 concentration, of use in intracellular signaling, from persistent ones indicative of either pathology or the presence of toxins, the second state of affairs eventually triggering apoptosis.
{"title":"A computational model of endogenous hydrogen peroxide metabolism in hepatocytes, featuring a critical role for GSH","authors":"L.M. Bilinsky","doi":"10.1016/j.comtox.2024.100299","DOIUrl":"10.1016/j.comtox.2024.100299","url":null,"abstract":"<div><p>This paper presents an ordinary differential equation (ODE) model of endogenous H<sub>2</sub>O<sub>2</sub> <!-->metabolism in hepatocytes that is unique, at the time of writing, in its ability to accurately compute intracellular H<sub>2</sub>O<sub>2</sub> <!-->concentration during incidents of oxidative stress and in its usefulness for constructing PBPK/PD models for ROS-generating xenobiotics. Versions of the model are presented for rat hepatocytes <em>in vitro</em> and mouse liver <em>in vivo</em>. A generic method is given for using the model to create PBPK/PD models which predict intracellular H<sub>2</sub>O<sub>2</sub> <!-->concentration and oxidative-stress-induced hepatocyte death; these are identifiable from <em>in vitro</em> data sets reporting cell mortality following xenobiotic exposure at various levels. The procedure is demonstrated for the trivalent arsenical dimethylarsinous acid (DMA<sup><em>III</em></sup>), which is produced in liver as part of the arsenic elimination pathway. This is the first model of H<sub>2</sub>O<sub>2</sub> <!-->metabolism in hepatocytes to feature values for the endogenous rates of H<sub>2</sub>O<sub>2</sub> <!-->production by mitochondria and other organelles which are inferred from the physiology literature, and to feature a detailed, realistic treatment of GSH metabolism; the latter is achieved by incorporating a minimal version of Reed and coworkers’ pioneering model of GSH metabolism in liver. Model simulations indicate that critical GSH depletion is the immediate trigger for intracellular H<sub>2</sub>O<sub>2</sub> <!-->rising to concentrations associated with apoptosis (<span><math><mrow><mo>></mo><mn>1</mn><mi>μ</mi><mi>M</mi></mrow></math></span>), that this may only occur hours after the xenobiotic concentration peaks (“delay effect”), that when critical GSH depletion does occur, H<sub>2</sub>O<sub>2</sub> <!-->concentration rises rapidly in a sequence of two boundary layers, characterized by the kinetics of glutathione peroxidase (first boundary layer) and catalase (second boundary layer), and that intracellular H<sub>2</sub>O<sub>2</sub> <!-->concentration <span><math><mrow><mo>></mo><mn>1</mn><mi>μ</mi><mi>M</mi></mrow></math></span> implies critical GSH depletion. There has been speculation that ROS levels in the range associated with apoptosis simply indicate, rather than cause, an apoptotic milieu. Model simulations are consistent with this view. In a result of interest to the wider physiology community, the delay effect is shown to provide a GSH-based mechanism by which cells can distinguish transient elevations in H<sub>2</sub>O<sub>2</sub> <!-->concentration, of use in intracellular signaling, from persistent ones indicative of either pathology or the presence of toxins, the second state of affairs eventually triggering apoptosis.</p></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"29 ","pages":"Article 100299"},"PeriodicalIF":0.0,"publicationDate":"2024-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}