首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
Localizatome: a database for stress-dependent subcellular localization changes in proteins. Localizatome:一个蛋白质中应力依赖性亚细胞定位变化的数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf028
Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara

Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.

了解蛋白质亚细胞定位及其动态变化对于阐明细胞功能和疾病机制至关重要,特别是在应激条件下,蛋白质定位变化可以调节细胞反应。目前可用的数据库提供了在稳态条件下蛋白质定位的见解;然而,与应力相关的动态局部化变化仍然知之甚少。在这里,我们介绍了Localizatome,一个捕获活细胞中应力诱导的蛋白质定位动态的综合数据库。利用原始的高通量显微镜系统和机器学习算法,我们分析了氧化应激前后HeLa细胞中10 287种荧光蛋白融合的人蛋白的定位模式。我们的分析显示,1910蛋白表现出氧化应激依赖的定位变化,特别是形成不同的焦点。其中包括胁迫颗粒组装因子和自噬相关蛋白,以及各种信号通路的组分。随后的表征确定了一些特定的氨基酸基序和与应力诱导的蛋白质再分配相关的内在紊乱区域。Localizatome通过基于网络的界面提供对这些数据的开放访问,支持对细胞应激反应和疾病机制的广泛研究。数据库URL https://localizatome.embrys.jp/。
{"title":"Localizatome: a database for stress-dependent subcellular localization changes in proteins.","authors":"Takahide Matsushima, Yuki Naito, Tomoki Chiba, Ryota Kurimoto, Keiko Itano, Koji Ochiai, Koichi Takahashi, Naoki Goshima, Hiroshi Asahara","doi":"10.1093/database/baaf028","DOIUrl":"10.1093/database/baaf028","url":null,"abstract":"<p><p>Understanding protein subcellular localization and its dynamic changes is crucial for elucidating cellular function and disease mechanisms, particularly under stress conditions, where protein localization changes can modulate cellular responses. Currently available databases provide insights into protein localization under steady-state conditions; however, stress-related dynamic localization changes remain poorly understood. Here, we present the Localizatome, a comprehensive database that captures stress-induced protein localization dynamics in living cells. Using an original high-throughput microscopy system and machine learning algorithms, we analysed the localization patterns of 10 287 fluorescent protein-fused human proteins in HeLa cells before and after exposure to oxidative stress. Our analysis revealed that 1910 proteins exhibited oxidative stress-dependent localization changes, particularly forming distinct foci. Among them, there were stress granule assembly factors and autophagy-related proteins, as well as components of various signalling pathways. Subsequent characterization identified some specific amino acid motifs and intrinsically disordered regions associated with stress-induced protein redistribution. The Localizatome provides open access to these data through a web-based interface, supporting a wide range of studies on cellular stress response and disease mechanisms. Database URL https://localizatome.embrys.jp/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010962/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143984325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database. MIPD:分子,图像和临床表型集成数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-21 DOI: 10.1093/database/baaf029
Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng

Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.

由于肿瘤的异质性,一部分患者不能从目前的治疗策略中获益。然而,对影像学特征、遗传分子和临床表型的综合分析可以表征肿瘤的异质性,从而开发出更加个性化的治疗方法。尽管具有潜力,但跨模式数据库仍未得到充分开发。为了弥补这一空白,我们建立了一个包含9965个基因、5449个蛋白质、1121个代谢物、283个通路、854个成像特征和73个结直肠癌患者临床因素的综合数据库。该数据库识别与临床表型相关的显著不同的分子和成像特征,并提供基于这些特征的生存分析。此外,它还提供遗传分子注释,肿瘤和正常组织之间的比较表达水平,与遗传分子相关的成像特征,以及用于预测基因表达水平的基于成像的模型。此外,该数据库突出了遗传分子、临床因素和影像学特征之间的相关性。总之,我们提出了MIPD(分子、成像和临床表型相关数据库),这是一个用户友好的、互动的、专门的平台,可在http://corgenerf.com上访问。MIPD通过提供查询、浏览、搜索、可视化和下载功能,促进了跨模式数据的可解释性,从而为推进结直肠癌的精准医学提供了宝贵的资源。数据库地址:http://corgenerf。
{"title":"MIPD: Molecules, Imagings, and Clinical Phenotype Integrated Database.","authors":"Jiaojiao Zhao, Min Wu, Meihua Wan, Xue Li, Jie Li, Qin Liu, Minghao Xiong, Mengjie Tu, Jun Zhou, Shilin Li, Jie Zhang, Jiangping Fu, Yin Zhang, Chungang Zhao, Litong Qin, Xue Yang, Hong Zhao, Yan Zhang, Fanxin Zeng","doi":"10.1093/database/baaf029","DOIUrl":"10.1093/database/baaf029","url":null,"abstract":"<p><p>Due to tumor heterogeneity, a subset of patients fails to benefit from current treatment strategies. However, an integrated analysis of imaging features, genetic molecules, and clinical phenotypes can characterize tumor heterogeneity, enabling the development of more personalized treatment approaches. Despite its potential, cross-modal databases remain underexplored. To address this gap, we established a comprehensive database encompassing 9965 genes, 5449 proteins, 1121 metabolites, 283 pathways, 854 imaging features, and 73 clinical factors from colorectal cancer patients. This database identifies significantly distinct molecules and imaging features associated with clinical phenotypes and provides survival analysis based on these features. Additionally, it offers genetic molecule annotations, comparative expression levels between tumor and normal tissues, imaging features linked to genetic molecules, and imaging-based models for predicting gene expression levels. Furthermore, the database highlights correlations between genetic molecules, clinical factors, and imaging features. In summary, we present MIPD (Molecules, Imaging, and Clinical Phenotype Correlation Database), a user-friendly, interactive, and specialized platform accessible at http://corgenerf.com. MIPD facilitates the interpretability of cross-modal data by providing query, browse, search, visualization, and download functionalities, thereby offering a valuable resource for advancing precision medicine in colorectal cancer. Database URL: http://corgenerf.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12010968/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFED, a comprehensive resource for Aspergillus flavus gene expression profiling. AFED,黄曲霉基因表达谱的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-18 DOI: 10.1093/database/baaf033
Brian M Mack, Matthew D Lebar

The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.

黄曲霉表达数据库(Aspergillus flavus expression database, AFED)是一个致力于探索黄曲霉基因表达的综合性资源,黄曲霉是一种重要的真菌病原体,通过黄曲霉毒素污染农作物,威胁粮食安全。鉴于黄曲霉毒素生物合成的复杂调控和缺乏集中的表达数据资源,一个整合多种实验条件的数据库对于了解其生物学和制定控制策略至关重要。利用公开RNA测序数据对52个实验604份黄曲霉样品的基因表达丰度进行定量分析。利用丰度数据,我们创建了一个可通过网络界面访问的AFED,该界面允许在不同的生长条件和生命周期阶段方便地检查基因的表达谱。表达谱可以通过单个基因查询的交互式条形图或多个基因查询的热图来可视化。基于包含至少1000万个映射reads的样本的基因共表达网络也可用,它允许用户识别与单个基因或一组基因共表达的基因,并显示共表达基因之间的功能富集。数据库地址:https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app。
{"title":"AFED, a comprehensive resource for Aspergillus flavus gene expression profiling.","authors":"Brian M Mack, Matthew D Lebar","doi":"10.1093/database/baaf033","DOIUrl":"https://doi.org/10.1093/database/baaf033","url":null,"abstract":"<p><p>The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFED, a comprehensive resource for Aspergillus flavus gene expression profiling. AFED,黄曲霉基因表达谱的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-18 DOI: 10.1093/database/baaf033
Brian M Mack, Matthew D Lebar

The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.

黄曲霉表达数据库(Aspergillus flavus expression database, AFED)是一个致力于探索黄曲霉基因表达的综合性资源,黄曲霉是一种重要的真菌病原体,通过黄曲霉毒素污染农作物,威胁粮食安全。鉴于黄曲霉毒素生物合成的复杂调控和缺乏集中的表达数据资源,一个整合多种实验条件的数据库对于了解其生物学和制定控制策略至关重要。利用公开RNA测序数据对52个实验604份黄曲霉样品的基因表达丰度进行定量分析。利用丰度数据,我们创建了一个可通过网络界面访问的AFED,该界面允许在不同的生长条件和生命周期阶段方便地检查基因的表达谱。表达谱可以通过单个基因查询的交互式条形图或多个基因查询的热图来可视化。基于包含至少1000万个映射reads的样本的基因共表达网络也可用,它允许用户识别与单个基因或一组基因共表达的基因,并显示共表达基因之间的功能富集。数据库地址:https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app。
{"title":"AFED, a comprehensive resource for Aspergillus flavus gene expression profiling.","authors":"Brian M Mack, Matthew D Lebar","doi":"10.1093/database/baaf033","DOIUrl":"10.1093/database/baaf033","url":null,"abstract":"<p><p>The Aspergillus flavus expression database (AFED) is a comprehensive resource dedicated to exploring gene expression in A. flavus, a significant fungal pathogen that threatens food security by contaminating crops with aflatoxin. Given the complex regulation of aflatoxin biosynthesis and the lack of centralized expression data resources for this important pathogen, a database integrating diverse experimental conditions is essential for understanding its biology and developing control strategies. Public RNA sequencing data were used to quantify gene expression abundance for 604 A. flavus samples from 52 experiments. Using abundance data, we created an AFED accessible through a web-based interface that allows for the expression profiles of genes to be conveniently examined across different growth conditions and life cycle stages. Expression profiles can be visualized through either an interactive bar plot for single gene queries or a heatmap for multiple gene queries. A gene co-expression network based on samples containing at least 10 million mapped reads is also available, which allows users to identify genes that are co-expressed with an individual gene or set of genes and displays the functional enrichment among the co-expressed genes. Database URL: https://a-flavus-expression-db-jyqnpeuvta-uc.a.run.app.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12007493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143984209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"https://doi.org/10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11986821/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11971479/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143787788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"https://doi.org/10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-composing ontology terms for efficient phenotyping in plant breeding. 植物育种中高效表型的后组合本体术语。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-21 DOI: 10.1093/database/baaf020
Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller

Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.

本体广泛用于数据库中,用于标准化数据、提高数据质量、集成和易于比较。在针对不同用例定制的本体中,用户定义术语的后组合一方面要协调标准化需求,另一方面要协调灵活性需求。在许多情况下,为基因组选择而设计的植物育种数字生态系统Breedbase的目标是使用高度策划和严格的作物本体捕获表型数据,同时适应植物育种者快速有效地记录数据的特定要求。例如,后期组合使用户能够定制本体术语,以适应特定的和细粒度的用例,例如对不同植物部位的重复测量和特殊的样品制备技术。为了实现这一点,我们实现了一个基于正交本体的后期组合工具,为用户提供了根据独特的实验设计引入额外的表型粒度级别的能力。后组合的术语被设计为可以被Breedbase实例中的所有育种程序重用,但不会导出到作物引用本体。Breedbase用户可以跨各种类别(如植物解剖、处理、时间事件和育种周期)对术语进行后期组合,从而生成高度特定的术语,以获得更准确的表型。
{"title":"Post-composing ontology terms for efficient phenotyping in plant breeding.","authors":"Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller","doi":"10.1093/database/baaf020","DOIUrl":"https://doi.org/10.1093/database/baaf020","url":null,"abstract":"<p><p>Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-composing ontology terms for efficient phenotyping in plant breeding. 后期合成本体术语,实现植物育种中的高效表型。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-21 DOI: 10.1093/database/baaf020
Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller

Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.

本体论被广泛应用于数据库中,以实现数据标准化,提高数据质量、集成度和比较便利性。在为不同用例量身定制的本体中,后期合成用户定义的术语可以兼顾标准化需求和灵活性需求。Breedbase 是一个为基因组选育而设计的植物育种数字生态系统,在许多情况下,其目标是使用经过高度整理和严格定义的作物本体来捕获表型数据,同时适应植物育种者快速高效记录数据的具体要求。例如,通过后期合成,用户可以定制本体术语,以适应特定的细粒度使用情况,如对不同植物部位的重复测量和特殊的样品制备技术。为此,我们在正交本体论的基础上开发了一种后期合成工具,使用户能够根据独特的实验设计引入额外的表型粒度。后期合成的术语可被Breedbase实例中的所有育种计划重复使用,但不会导出到作物参考本体中。Breedbase 用户可在植物解剖学、处理、时间事件和育种周期等不同类别中后组合术语,从而生成高度具体的术语,实现更准确的表型分析。
{"title":"Post-composing ontology terms for efficient phenotyping in plant breeding.","authors":"Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller","doi":"10.1093/database/baaf020","DOIUrl":"10.1093/database/baaf020","url":null,"abstract":"<p><p>Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927528/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143673564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1