Bowen Yang, Tan Meng, Xinrui Wang, Jun Li, Shuang Zhao, Yingheng Wang, Shu Yi, Yi Zhou, Yi Zhang, Liang Li, Li Guo
{"title":"CAT Bridge:从多组学数据中进行基因-代谢物关联挖掘的高效工具包。","authors":"Bowen Yang, Tan Meng, Xinrui Wang, Jun Li, Shuang Zhao, Yingheng Wang, Shu Yi, Yi Zhou, Yi Zhang, Liang Li, Li Guo","doi":"10.1093/gigascience/giae083","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>With advancements in sequencing and mass spectrometry technologies, multiomics data can now be easily acquired for understanding complex biological systems. Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the nonlinear and multifactorial interactions within cellular networks. The complexity arises from the interplay of multiple genes and metabolites, often involving feedback loops and time-dependent regulatory mechanisms that are not easily captured by traditional analysis methods.</p><p><strong>Findings: </strong>Here, we introduce Compounds And Transcripts Bridge (abbreviated as CAT Bridge, available at https://catbridge.work), a free user-friendly platform for longitudinal multiomics analysis to efficiently identify transcripts associated with metabolites using time-series omics data. To evaluate the association of gene-metabolite pairs, CAT Bridge is a pioneering work benchmarking a set of statistical methods spanning causality estimation and correlation coefficient calculation for multiomics analysis. Additionally, CAT Bridge features an artificial intelligence agent to assist users interpreting the association results.</p><p><strong>Conclusions: </strong>We applied CAT Bridge to experimentally obtained Capsicum chinense (chili pepper) and public human and Escherichia coli time-series transcriptome and metabolome datasets. CAT Bridge successfully identified genes involved in the biosynthesis of capsaicin in C. chinense. Furthermore, case study results showed that the convergent cross-mapping method outperforms traditional approaches in longitudinal multiomics analyses. CAT Bridge simplifies access to various established methods for longitudinal multiomics analysis and enables researchers to swiftly identify associated gene-metabolite pairs for further validation.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"13 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11548955/pdf/","citationCount":"0","resultStr":"{\"title\":\"CAT Bridge: an efficient toolkit for gene-metabolite association mining from multiomics data.\",\"authors\":\"Bowen Yang, Tan Meng, Xinrui Wang, Jun Li, Shuang Zhao, Yingheng Wang, Shu Yi, Yi Zhou, Yi Zhang, Liang Li, Li Guo\",\"doi\":\"10.1093/gigascience/giae083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>With advancements in sequencing and mass spectrometry technologies, multiomics data can now be easily acquired for understanding complex biological systems. Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the nonlinear and multifactorial interactions within cellular networks. The complexity arises from the interplay of multiple genes and metabolites, often involving feedback loops and time-dependent regulatory mechanisms that are not easily captured by traditional analysis methods.</p><p><strong>Findings: </strong>Here, we introduce Compounds And Transcripts Bridge (abbreviated as CAT Bridge, available at https://catbridge.work), a free user-friendly platform for longitudinal multiomics analysis to efficiently identify transcripts associated with metabolites using time-series omics data. To evaluate the association of gene-metabolite pairs, CAT Bridge is a pioneering work benchmarking a set of statistical methods spanning causality estimation and correlation coefficient calculation for multiomics analysis. Additionally, CAT Bridge features an artificial intelligence agent to assist users interpreting the association results.</p><p><strong>Conclusions: </strong>We applied CAT Bridge to experimentally obtained Capsicum chinense (chili pepper) and public human and Escherichia coli time-series transcriptome and metabolome datasets. CAT Bridge successfully identified genes involved in the biosynthesis of capsaicin in C. chinense. Furthermore, case study results showed that the convergent cross-mapping method outperforms traditional approaches in longitudinal multiomics analyses. CAT Bridge simplifies access to various established methods for longitudinal multiomics analysis and enables researchers to swiftly identify associated gene-metabolite pairs for further validation.</p>\",\"PeriodicalId\":12581,\"journal\":{\"name\":\"GigaScience\",\"volume\":\"13 \",\"pages\":\"\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2024-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11548955/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GigaScience\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/gigascience/giae083\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giae083","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
CAT Bridge: an efficient toolkit for gene-metabolite association mining from multiomics data.
Background: With advancements in sequencing and mass spectrometry technologies, multiomics data can now be easily acquired for understanding complex biological systems. Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the nonlinear and multifactorial interactions within cellular networks. The complexity arises from the interplay of multiple genes and metabolites, often involving feedback loops and time-dependent regulatory mechanisms that are not easily captured by traditional analysis methods.
Findings: Here, we introduce Compounds And Transcripts Bridge (abbreviated as CAT Bridge, available at https://catbridge.work), a free user-friendly platform for longitudinal multiomics analysis to efficiently identify transcripts associated with metabolites using time-series omics data. To evaluate the association of gene-metabolite pairs, CAT Bridge is a pioneering work benchmarking a set of statistical methods spanning causality estimation and correlation coefficient calculation for multiomics analysis. Additionally, CAT Bridge features an artificial intelligence agent to assist users interpreting the association results.
Conclusions: We applied CAT Bridge to experimentally obtained Capsicum chinense (chili pepper) and public human and Escherichia coli time-series transcriptome and metabolome datasets. CAT Bridge successfully identified genes involved in the biosynthesis of capsaicin in C. chinense. Furthermore, case study results showed that the convergent cross-mapping method outperforms traditional approaches in longitudinal multiomics analyses. CAT Bridge simplifies access to various established methods for longitudinal multiomics analysis and enables researchers to swiftly identify associated gene-metabolite pairs for further validation.
期刊介绍:
GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.