Brent Wyatt, A. P. Davis, Thomas C. Wiegers, Jolene Wiegers, Sakib Abrar, D. Sciaky, Fern Barkalow, Melissa Strong, C. Mattingly
{"title":"将比较毒物基因组学数据库中的环境健康数据集转化为和弦图,使分子机制可视化","authors":"Brent Wyatt, A. P. Davis, Thomas C. Wiegers, Jolene Wiegers, Sakib Abrar, D. Sciaky, Fern Barkalow, Melissa Strong, C. Mattingly","doi":"10.3389/ftox.2024.1437884","DOIUrl":null,"url":null,"abstract":"In environmental health, the specific molecular mechanisms connecting a chemical exposure to an adverse endpoint are often unknown, reflecting knowledge gaps. At the public Comparative Toxicogenomics Database (CTD; https://ctdbase.org/), we integrate manually curated, literature-based interactions from CTD to compute four-unit blocks of information organized as a potential step-wise molecular mechanism, known as “CGPD-tetramers,” wherein a chemical interacts with a gene product to trigger a phenotype which can be linked to a disease. These computationally derived datasets can be used to fill the gaps and offer testable mechanistic information. Users can generate CGPD-tetramers for any combination of chemical, gene, phenotype, and/or disease of interest at CTD; however, such queries typically result in the generation of thousands of CGPD-tetramers. Here, we describe a novel approach to transform these large datasets into user-friendly chord diagrams using R. This visualization process is straightforward, simple to implement, and accessible to inexperienced users that have never used R before. Combining CGPD-tetramers into a single chord diagram helps identify potential key chemicals, genes, phenotypes, and diseases. This visualization allows users to more readily analyze computational datasets that can fill the exposure knowledge gaps in the environmental health continuum.","PeriodicalId":73111,"journal":{"name":"Frontiers in toxicology","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transforming environmental health datasets from the comparative toxicogenomics database into chord diagrams to visualize molecular mechanisms\",\"authors\":\"Brent Wyatt, A. P. Davis, Thomas C. Wiegers, Jolene Wiegers, Sakib Abrar, D. Sciaky, Fern Barkalow, Melissa Strong, C. Mattingly\",\"doi\":\"10.3389/ftox.2024.1437884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In environmental health, the specific molecular mechanisms connecting a chemical exposure to an adverse endpoint are often unknown, reflecting knowledge gaps. At the public Comparative Toxicogenomics Database (CTD; https://ctdbase.org/), we integrate manually curated, literature-based interactions from CTD to compute four-unit blocks of information organized as a potential step-wise molecular mechanism, known as “CGPD-tetramers,” wherein a chemical interacts with a gene product to trigger a phenotype which can be linked to a disease. These computationally derived datasets can be used to fill the gaps and offer testable mechanistic information. Users can generate CGPD-tetramers for any combination of chemical, gene, phenotype, and/or disease of interest at CTD; however, such queries typically result in the generation of thousands of CGPD-tetramers. Here, we describe a novel approach to transform these large datasets into user-friendly chord diagrams using R. This visualization process is straightforward, simple to implement, and accessible to inexperienced users that have never used R before. Combining CGPD-tetramers into a single chord diagram helps identify potential key chemicals, genes, phenotypes, and diseases. This visualization allows users to more readily analyze computational datasets that can fill the exposure knowledge gaps in the environmental health continuum.\",\"PeriodicalId\":73111,\"journal\":{\"name\":\"Frontiers in toxicology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in toxicology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/ftox.2024.1437884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TOXICOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in toxicology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/ftox.2024.1437884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
在环境健康领域,化学品暴露与不良终点之间的具体分子机制往往是未知的,这反映了知识上的差距。在公共比较毒物基因组学数据库(CTD; https://ctdbase.org/)中,我们整合了 CTD 中人工编辑的、基于文献的相互作用,计算出四个单位的信息块,作为潜在的逐步分子机制,即 "CGPD-tetramers",其中一种化学品与一种基因产物相互作用,引发一种表型,而这种表型可以与疾病联系起来。这些计算得出的数据集可以用来填补空白,提供可检验的机理信息。用户可以在 CTD 上为感兴趣的化学物质、基因、表型和/或疾病的任何组合生成 CGPD-四聚体;但是,这种查询通常会生成数千个 CGPD-四聚体。在这里,我们介绍了一种使用 R 将这些大型数据集转化为用户友好和弦图的新方法。这种可视化过程简单明了、易于实现,而且从未使用过 R 的无经验用户也可以使用。将 CGPD 四聚体组合成单个弦图有助于识别潜在的关键化学物质、基因、表型和疾病。这种可视化方法可以让用户更轻松地分析计算数据集,从而填补环境健康连续体中暴露知识的空白。
Transforming environmental health datasets from the comparative toxicogenomics database into chord diagrams to visualize molecular mechanisms
In environmental health, the specific molecular mechanisms connecting a chemical exposure to an adverse endpoint are often unknown, reflecting knowledge gaps. At the public Comparative Toxicogenomics Database (CTD; https://ctdbase.org/), we integrate manually curated, literature-based interactions from CTD to compute four-unit blocks of information organized as a potential step-wise molecular mechanism, known as “CGPD-tetramers,” wherein a chemical interacts with a gene product to trigger a phenotype which can be linked to a disease. These computationally derived datasets can be used to fill the gaps and offer testable mechanistic information. Users can generate CGPD-tetramers for any combination of chemical, gene, phenotype, and/or disease of interest at CTD; however, such queries typically result in the generation of thousands of CGPD-tetramers. Here, we describe a novel approach to transform these large datasets into user-friendly chord diagrams using R. This visualization process is straightforward, simple to implement, and accessible to inexperienced users that have never used R before. Combining CGPD-tetramers into a single chord diagram helps identify potential key chemicals, genes, phenotypes, and diseases. This visualization allows users to more readily analyze computational datasets that can fill the exposure knowledge gaps in the environmental health continuum.