首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database. MIPD:分子,图像和临床表型集成数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf029
Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng

Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.

由于肿瘤的异质性,一部分患者不能从目前的治疗策略中获益。然而,对影像学特征、遗传分子和临床表型的综合分析可以表征肿瘤的异质性,从而开发出更加个性化的治疗方法。尽管具有潜力,但跨模式数据库仍未得到充分开发。为了弥补这一空白,我们建立了一个包含9965个基因、5449个蛋白质、1121个代谢物、283个通路、854个成像特征和73个结直肠癌患者临床因素的综合数据库。该数据库识别与临床表型相关的显著不同的分子和成像特征,并提供基于这些特征的生存分析。此外,它还提供遗传分子注释,肿瘤和正常组织之间的比较表达水平,与遗传分子相关的成像特征,以及用于预测基因表达水平的基于成像的模型。此外,该数据库突出了遗传分子、临床因素和影像学特征之间的相关性。总之,我们提出了MIPD(分子、成像和临床表型相关数据库),这是一个用户友好的、互动的、专门的平台,可在http://corgenerf.com上访问。MIPD通过提供查询、浏览、搜索、可视化和下载功能,促进了跨模式数据的可解释性,从而为推进结直肠癌的精准医学提供了宝贵的资源。数据库地址:http://corgenerf。
{"title":"MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database.","authors":"Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng","doi":"10.1093/database/baaf029","DOIUrl":"https://doi.org/10.1093/database/baaf029","url":null,"abstract":"<p><p>Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Localizatome: a database for stress-dependent subcellular localization changes in proteins. Localizatome:一个蛋白质中应力依赖性亚细胞定位变化的数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf028
Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara

Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.

了解蛋白质亚细胞定位及其动态变化对于阐明细胞功能和疾病机制至关重要,特别是在应激条件下,蛋白质定位变化可以调节细胞反应。目前可用的数据库提供了在稳态条件下蛋白质定位的见解;然而,与应力相关的动态局部化变化仍然知之甚少。在这里,我们介绍了Localizatome,一个捕获活细胞中应力诱导的蛋白质定位动态的综合数据库。利用原始的高通量显微镜系统和机器学习算法,我们分析了氧化应激前后HeLa细胞中10 287种荧光蛋白融合的人蛋白的定位模式。我们的分析显示,1910蛋白表现出氧化应激依赖的定位变化,特别是形成不同的焦点。其中包括胁迫颗粒组装因子和自噬相关蛋白,以及各种信号通路的组分。随后的表征确定了一些特定的氨基酸基序和与应力诱导的蛋白质再分配相关的内在紊乱区域。Localizatome通过基于网络的界面提供对这些数据的开放访问,支持对细胞应激反应和疾病机制的广泛研究。数据库URL https://localizatome.embrys.jp/。
{"title":"Localizatome: a database for stress-dependent subcellular localization changes in proteins.","authors":"Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara","doi":"10.1093/database/baaf028","DOIUrl":"https://doi.org/10.1093/database/baaf028","url":null,"abstract":"<p><p>Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Localizatome: a database for stress-dependent subcellular localization changes in proteins. Localizatome:一个蛋白质中应力依赖性亚细胞定位变化的数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf028
Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara

Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.

了解蛋白质亚细胞定位及其动态变化对于阐明细胞功能和疾病机制至关重要,特别是在应激条件下,蛋白质定位变化可以调节细胞反应。目前可用的数据库提供了在稳态条件下蛋白质定位的见解;然而,与应力相关的动态局部化变化仍然知之甚少。在这里,我们介绍了Localizatome,一个捕获活细胞中应力诱导的蛋白质定位动态的综合数据库。利用原始的高通量显微镜系统和机器学习算法,我们分析了氧化应激前后HeLa细胞中10 287种荧光蛋白融合的人蛋白的定位模式。我们的分析显示,1910蛋白表现出氧化应激依赖的定位变化,特别是形成不同的焦点。其中包括胁迫颗粒组装因子和自噬相关蛋白,以及各种信号通路的组分。随后的表征确定了一些特定的氨基酸基序和与应力诱导的蛋白质再分配相关的内在紊乱区域。Localizatome通过基于网络的界面提供对这些数据的开放访问,支持对细胞应激反应和疾病机制的广泛研究。数据库URL https://localizatome.embrys.jp/。
{"title":"Localizatome: a database for stress-dependent subcellular localization changes in proteins.","authors":"Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara","doi":"10.1093/database/baaf028","DOIUrl":"10.1093/database/baaf028","url":null,"abstract":"<p><p>Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010962/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143984325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database. MIPD:分子,图像和临床表型集成数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf029
Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng

Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.

由于肿瘤的异质性,一部分患者不能从目前的治疗策略中获益。然而,对影像学特征、遗传分子和临床表型的综合分析可以表征肿瘤的异质性,从而开发出更加个性化的治疗方法。尽管具有潜力,但跨模式数据库仍未得到充分开发。为了弥补这一空白,我们建立了一个包含9965个基因、5449个蛋白质、1121个代谢物、283个通路、854个成像特征和73个结直肠癌患者临床因素的综合数据库。该数据库识别与临床表型相关的显著不同的分子和成像特征,并提供基于这些特征的生存分析。此外,它还提供遗传分子注释,肿瘤和正常组织之间的比较表达水平,与遗传分子相关的成像特征,以及用于预测基因表达水平的基于成像的模型。此外,该数据库突出了遗传分子、临床因素和影像学特征之间的相关性。总之,我们提出了MIPD(分子、成像和临床表型相关数据库),这是一个用户友好的、互动的、专门的平台,可在http://corgenerf.com上访问。MIPD通过提供查询、浏览、搜索、可视化和下载功能,促进了跨模式数据的可解释性,从而为推进结直肠癌的精准医学提供了宝贵的资源。数据库地址:http://corgenerf。
{"title":"MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database.","authors":"Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng","doi":"10.1093/database/baaf029","DOIUrl":"10.1093/database/baaf029","url":null,"abstract":"<p><p>Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010968/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFED, a comprehensive resource for Aspergillus flavus gene expression profiling. AFED,黄曲霉基因表达谱的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-18 DOI: 10.1093/database/baaf033
Brian M Mack, Matthew D Lebar

The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.

黄曲霉表达数据库(Aspergillus flavus expression database, AFED)是一个致力于探索黄曲霉基因表达的综合性资源,黄曲霉是一种重要的真菌病原体,通过黄曲霉毒素污染农作物,威胁粮食安全。鉴于黄曲霉毒素生物合成的复杂调控和缺乏集中的表达数据资源,一个整合多种实验条件的数据库对于了解其生物学和制定控制策略至关重要。利用公开RNA测序数据对52个实验604份黄曲霉样品的基因表达丰度进行定量分析。利用丰度数据,我们创建了一个可通过网络界面访问的AFED,该界面允许在不同的生长条件和生命周期阶段方便地检查基因的表达谱。表达谱可以通过单个基因查询的交互式条形图或多个基因查询的热图来可视化。基于包含至少1000万个映射reads的样本的基因共表达网络也可用,它允许用户识别与单个基因或一组基因共表达的基因,并显示共表达基因之间的功能富集。数据库地址:https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app。
{"title":"AFED, a comprehensive resource for Aspergillus flavus gene expression profiling.","authors":"Brian M Mack, Matthew D Lebar","doi":"10.1093/database/baaf033","DOIUrl":"https://doi.org/10.1093/database/baaf033","url":null,"abstract":"<p><p>The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFED, a comprehensive resource for Aspergillus flavus gene expression profiling. AFED,黄曲霉基因表达谱的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-18 DOI: 10.1093/database/baaf033
Brian M Mack, Matthew D Lebar

The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.

黄曲霉表达数据库(Aspergillus flavus expression database, AFED)是一个致力于探索黄曲霉基因表达的综合性资源,黄曲霉是一种重要的真菌病原体,通过黄曲霉毒素污染农作物,威胁粮食安全。鉴于黄曲霉毒素生物合成的复杂调控和缺乏集中的表达数据资源,一个整合多种实验条件的数据库对于了解其生物学和制定控制策略至关重要。利用公开RNA测序数据对52个实验604份黄曲霉样品的基因表达丰度进行定量分析。利用丰度数据,我们创建了一个可通过网络界面访问的AFED,该界面允许在不同的生长条件和生命周期阶段方便地检查基因的表达谱。表达谱可以通过单个基因查询的交互式条形图或多个基因查询的热图来可视化。基于包含至少1000万个映射reads的样本的基因共表达网络也可用,它允许用户识别与单个基因或一组基因共表达的基因,并显示共表达基因之间的功能富集。数据库地址:https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app。
{"title":"AFED, a comprehensive resource for Aspergillus flavus gene expression profiling.","authors":"Brian M Mack, Matthew D Lebar","doi":"10.1093/database/baaf033","DOIUrl":"10.1093/database/baaf033","url":null,"abstract":"<p><p>The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12007493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143984209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"https://doi.org/10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11986821/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11971479/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143787788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"https://doi.org/10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1