首页 > 最新文献

Database: The Journal of Biological Databases and Curation最新文献

英文 中文
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"https://doi.org/10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data. GrameneOryza:水稻基因组、遗传变异和功能数据的综合资源。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-04 DOI: 10.1093/database/baaf021
Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware

Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.

水稻是一种重要的主要作物,养活了全球一半以上的人口,是基因研究的关键模型。为了支持对全面和可访问的水稻基因组数据日益增长的需求,GrameneOryza (https://oryza.gramene.org)被开发为遵循FAIR(可查找、可访问、可互操作和可重用)数据管理原则的在线资源。它以全面的多物种研究为重点,涵盖了各种各样的Oryza基因组和相关物种,并与FAIR原则相结合,以确保数据的可访问性和可用性。它提供了一个社区策划的高质量稻基因组,遗传变异,基因功能和性状数据的选择。最新发布的版本8包括28个水稻基因组,涵盖野生稻和驯化品种。这些基因组,连同狐猴和7个额外的外群物种,构成了38k蛋白编码基因家谱的基础,对于识别同源物、相似物和发展泛基因集至关重要。GrameneOryza的遗传变异数据包含6600万个单核苷酸变异(snv),这些变异锚定在Os-Nipponbare-Reference-IRGSP-1.0基因组上,这些变异来自包括水稻基因组3k (RG3K)项目在内的各种研究。RG3K序列还被映射到另外7个铂质亚洲水稻基因组,每个基因组有1900万个snv,大大扩大了Nipponbare参考基因的遗传变异覆盖范围。在IRGSP-1.0上的6600万snv中,2700万从欧洲变异档案版本v5中获得了标准化参考SNP集群标识符(rsid)。此外,1200种不同的表型提供了数量性状位点(QTL)特征的全面概述。新推出的Oryza CLIMtools门户网站提供了对环境对基因组适应的影响的见解。该平台的集成搜索界面,以及BLAST服务器和管理工具,方便用户访问基因组,系统发育,基因功能和QTL数据,支持广泛的研究应用。数据库地址:https://oryza.gramene.org。
{"title":"GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.","authors":"Sharon Wei, Kapeel Chougule, Andrew Olson, Zhenyuan Lu, Marcela K Tello-Ruiz, Vivek Kumar, Sunita Kumari, Lifang Zhang, Audra Olson, Catherine Kim, Nick Gladman, Doreen Ware","doi":"10.1093/database/baaf021","DOIUrl":"10.1093/database/baaf021","url":null,"abstract":"<p><p>Rice is a vital staple crop, sustaining over half of the global population, and is a key model for genetic research. To support the growing need for comprehensive and accessible rice genomic data, GrameneOryza (https://oryza.gramene.org) was developed as an online resource adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. It distinguishes itself through its comprehensive multispecies focus, encompassing a wide variety of Oryza genomes and related species, and its integration with FAIR principles to ensure data accessibility and usability. It offers a community curated selection of high-quality Oryza genomes, genetic variation, gene function, and trait data. The latest release, version 8, includes 28 Oryza genomes, covering wild rice and domesticated cultivars. These genomes, along with Leersia perrieri and seven additional outgroup species, form the basis for 38 K protein-coding gene family trees, essential for identifying orthologs, paralogs, and developing pan-gene sets. GrameneOryza's genetic variation data features 66 million single-nucleotide variants (SNVs) anchored to the Os-Nipponbare-Reference-IRGSP-1.0 genome, derived from various studies, including the Rice Genome 3 K (RG3K) project. The RG3K sequence reads were also mapped to seven additional platinum-quality Asian rice genomes, resulting in 19 million SNVs for each genome, significantly expanding the coverage of genetic variation beyond the Nipponbare reference. Of the 66 million SNVs on IRGSP-1.0, 27 million acquired standardized reference SNP cluster identifiers (rsIDs) from the European Variation Archive release v5. Additionally, 1200 distinct phenotypes provide a comprehensive overview of quantitative trait loci (QTL) features. The newly introduced Oryza CLIMtools portal offers insights into environmental impacts on genome adaptation. The platform's integrated search interface, along with a BLAST server and curation tools, facilitates user access to genomic, phylogenetic, gene function, and QTL data, supporting broad research applications. Database URL: https://oryza.gramene.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11986821/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143968793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11971479/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143787788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments. mirTarCLASH:基于嵌合读实验的综合miRNA靶点数据库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-03 DOI: 10.1093/database/baaf023
Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee

MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.

MicroRNAs (miRNAs)可以靶向信使rna来控制其降解或翻译抑制作用。因此,确定不同miRNA的靶点和结合位点对于了解miRNA的功能至关重要。为了研究这些相互作用,研究人员采用了杂交体的交联、连接和测序(collision -seq)以及类似的类碰撞方法来生成由mirna及其靶向片段形成的嵌合读段。这些嵌合读取允许直接提取mirna靶基因对及其相应的结合位点。然而,这些研究缺乏用户友好的平台供研究人员有效地研究这些相互作用,从而阻碍了科学家探索miRNA功能的能力。为了解决这一空白,我们开发了mirTarCLASH,这是一个综合数据库,包含了来自人类/小鼠/蠕虫miRNA嵌合读取实验的502 061/322 707/224 452个独特的杂交读取。在mirTarCLASH中,我们采用了嵌合体分析算法ChiRA和两种不同的结合位点推断工具RNAup和miRanda,以方便探索来自于类clash实验的mirna -靶对。与现有的类似存储库相比,mirTarCLASH进一步为提取的结果提供了多个具有可视化功能的置信度评估过滤器。结果可以根据miRNA靶向位点的关键特性(包括读取深度、支持算法的数量和交联诱导突变)进一步完善,以提高置信度。此外,这些mirna结合位点通过整合的转录图谱直观地表示出来。最后,我们通过在秀丽隐杆线虫中细胞-let-7-5p和lin-41之间的典型相互作用证明了mirTarCLASH的生物学适用性,展示了mirTarCLASH为后续实验研究设计提供新见解的潜力。构建的mirTarCLASH数据库可以在https://cosbi.ee.ncku.edu.tw/MirTarClash上免费获得。数据库地址:https://cosbi.ee.ncku.edu.tw/MirTarClash。
{"title":"mirTarCLASH: a comprehensive miRNA target database based on chimeric read-based experiments.","authors":"Tzu-Hsien Yang, Xiang-Wei Li, Yuan-Han Lee, Shang-Yi Lu, Wei-Sheng Wu, Heng-Chi Lee","doi":"10.1093/database/baaf023","DOIUrl":"https://doi.org/10.1093/database/baaf023","url":null,"abstract":"<p><p>MicroRNAs (miRNAs) can target messenger RNAs to control their degradation or translation repression effects. Therefore, identifying the target and binding sites of different miRNAs is essential for understanding miRNA functions. To investigate these interactions, researchers have employed the cross-linking, ligation, and sequencing of hybrids (CLASH-seq) and similar CLASH-like approaches to generate chimeric reads formed by miRNAs and their targeting segments. These chimeric reads allow for the direct extraction of both the miRNA-target gene pairs and their corresponding binding sites. Nevertheless, these studies lack user-friendly platforms for researchers to investigate these interactions efficiently, thus hindering scientists' ability to explore miRNA functions. To address this gap, we developed mirTarCLASH, a comprehensive database that deposits 502 061/322 707/224 452 unique hybrid reads from human/mouse/worm miRNA chimeric read-based experiments. In mirTarCLASH, the chimera analysis algorithm ChiRA and two distinct binding site inference tools, RNAup and miRanda, were adopted to facilitate the exploration of miRNA-target pairs derived from CLASH-like experiments. Compared with existing similar repositories, mirTarCLASH further enables several confidence evaluation filters with visualization functions for the extracted results. The results can be further refined based on the key properties of the miRNA targeting sites, including read depths, numbers of supporting algorithms, and cross-linking-induced mutations, to enhance confidence levels. In addition, these miRNA-binding sites are visually represented through an integrated transcript atlas. Finally, we demonstrated the biological applicability of mirTarCLASH via the well-characterized example interaction between cel-let-7-5p and lin-41 in Caenorhabditis elegans, showcasing the potential of mirTarCLASH to provide novel insights for subsequent experimental research designs. The constructed mirTarCLASH database is freely available at https://cosbi.ee.ncku.edu.tw/MirTarClash. Database URL: https://cosbi.ee.ncku.edu.tw/MirTarClash.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-composing ontology terms for efficient phenotyping in plant breeding. 植物育种中高效表型的后组合本体术语。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-21 DOI: 10.1093/database/baaf020
Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller

Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.

本体广泛用于数据库中,用于标准化数据、提高数据质量、集成和易于比较。在针对不同用例定制的本体中,用户定义术语的后组合一方面要协调标准化需求,另一方面要协调灵活性需求。在许多情况下,为基因组选择而设计的植物育种数字生态系统Breedbase的目标是使用高度策划和严格的作物本体捕获表型数据,同时适应植物育种者快速有效地记录数据的特定要求。例如,后期组合使用户能够定制本体术语,以适应特定的和细粒度的用例,例如对不同植物部位的重复测量和特殊的样品制备技术。为了实现这一点,我们实现了一个基于正交本体的后期组合工具,为用户提供了根据独特的实验设计引入额外的表型粒度级别的能力。后组合的术语被设计为可以被Breedbase实例中的所有育种程序重用,但不会导出到作物引用本体。Breedbase用户可以跨各种类别(如植物解剖、处理、时间事件和育种周期)对术语进行后期组合,从而生成高度特定的术语,以获得更准确的表型。
{"title":"Post-composing ontology terms for efficient phenotyping in plant breeding.","authors":"Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller","doi":"10.1093/database/baaf020","DOIUrl":"https://doi.org/10.1093/database/baaf020","url":null,"abstract":"<p><p>Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-composing ontology terms for efficient phenotyping in plant breeding. 后期合成本体术语,实现植物育种中的高效表型。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-21 DOI: 10.1093/database/baaf020
Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller

Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.

本体论被广泛应用于数据库中,以实现数据标准化,提高数据质量、集成度和比较便利性。在为不同用例量身定制的本体中,后期合成用户定义的术语可以兼顾标准化需求和灵活性需求。Breedbase 是一个为基因组选育而设计的植物育种数字生态系统,在许多情况下,其目标是使用经过高度整理和严格定义的作物本体来捕获表型数据,同时适应植物育种者快速高效记录数据的具体要求。例如,通过后期合成,用户可以定制本体术语,以适应特定的细粒度使用情况,如对不同植物部位的重复测量和特殊的样品制备技术。为此,我们在正交本体论的基础上开发了一种后期合成工具,使用户能够根据独特的实验设计引入额外的表型粒度。后期合成的术语可被Breedbase实例中的所有育种计划重复使用,但不会导出到作物参考本体中。Breedbase 用户可在植物解剖学、处理、时间事件和育种周期等不同类别中后组合术语,从而生成高度具体的术语,实现更准确的表型分析。
{"title":"Post-composing ontology terms for efficient phenotyping in plant breeding.","authors":"Naama Menda, Bryan J Ellerbrock, Christiano C Simoes, Srikanth Kumar Karaikal, Christine Nyaga, Mirella Flores-Gonzalez, Isaak Y Tecle, David Lyon, Afolabi Agbona, Paterne A Agre, Prasad Peteti, Violet Akech, Amos Asiimwe, Eglantine Fauvelle, Karima Meghar, Thierry Tran, Dominique Dufour, Laurel Cooper, Marie-Angélique Laporte, Elizabeth Arnaud, Lukas A Mueller","doi":"10.1093/database/baaf020","DOIUrl":"10.1093/database/baaf020","url":null,"abstract":"<p><p>Ontologies are widely used in databases to standardize data, improving data quality, integration, and ease of comparison. Within ontologies tailored to diverse use cases, post-composing user-defined terms reconciles the demands for standardization on the one hand and flexibility on the other. In many instances of Breedbase, a digital ecosystem for plant breeding designed for genomic selection, the goal is to capture phenotypic data using highly curated and rigorous crop ontologies, while adapting to the specific requirements of plant breeders to record data quickly and efficiently. For example, post-composing enables users to tailor ontology terms to suit specific and granular use cases such as repeated measurements on different plant parts and special sample preparation techniques. To achieve this, we have implemented a post-composing tool based on orthogonal ontologies providing users with the ability to introduce additional levels of phenotyping granularity tailored to unique experimental designs. Post-composed terms are designed to be reused by all breeding programs within a Breedbase instance but are not exported to the crop reference ontologies. Breedbase users can post-compose terms across various categories, such as plant anatomy, treatments, temporal events, and breeding cycles, and, as a result, generate highly specific terms for more accurate phenotyping.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11927528/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143673564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive experimental comparison between federated and centralized learning. 联合学习与集中学习的综合实验比较。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-19 DOI: 10.1093/database/baaf016
Swier Garst, Julian Dekker, Marcel Reinders

Federated learning is an upcoming machine learning paradigm which allows data from multiple sources to be used for training of classifiers without the data leaving the source it originally resides. This can be highly valuable for use cases such as medical research, where gathering data at a central location can be quite complicated due to privacy and legal concerns of the data. In such cases, federated learning has the potential to vastly speed up the research cycle. Although federated and central learning have been compared from a theoretical perspective, an extensive experimental comparison of performances and learning behavior still lacks. We have performed a comprehensive experimental comparison between federated and centralized learning. We evaluated various classifiers on various datasets exploring influences of different sample distributions as well as different class distributions across the clients. The results show similar performances under a wide variety of settings between the federated and central learning strategies. Federated learning is able to deal with various imbalances in the data distributions. It is sensitive to batch effects between different datasets when they coincide with location, similar to central learning, but this setting might go unobserved more easily. Federated learning seems to be robust to various challenges such as skewed data distributions, high data dimensionality, multiclass problems, and complex models. Taken together, the insights from our comparison gives much promise for applying federated learning as an alternative to sharing data. Code for reproducing the results in this work can be found at: https://github.com/swiergarst/FLComparison.

联盟学习是一种即将出现的机器学习范式,它允许将多个来源的数据用于训练分类器,而无需离开数据的原始来源。这对于医学研究等用例非常有价值,因为在医学研究中,由于数据的隐私和法律问题,在中央位置收集数据可能会相当复杂。在这种情况下,联合学习有可能大大加快研究周期。虽然联合学习和集中学习已经从理论角度进行了比较,但仍然缺乏对性能和学习行为的广泛实验比较。我们对联合学习和集中学习进行了全面的实验比较。我们对各种数据集上的分类器进行了评估,探讨了不同样本分布以及不同客户机上不同类别分布的影响。结果表明,联合学习和集中学习策略在各种设置下的性能相似。联合学习能够处理数据分布中的各种不平衡。当不同数据集的位置重合时,它对不同数据集之间的批次效应很敏感,这一点与集中学习类似,但这种情况可能更容易被忽略。联盟学习似乎对各种挑战都很稳健,例如偏斜数据分布、高数据维度、多类问题和复杂模型。综合来看,我们的比较结果为联合学习作为数据共享的替代方案提供了广阔的应用前景。转载本研究成果的代码请访问:https://github.com/swiergarst/FLComparison。
{"title":"A comprehensive experimental comparison between federated and centralized learning.","authors":"Swier Garst, Julian Dekker, Marcel Reinders","doi":"10.1093/database/baaf016","DOIUrl":"10.1093/database/baaf016","url":null,"abstract":"<p><p>Federated learning is an upcoming machine learning paradigm which allows data from multiple sources to be used for training of classifiers without the data leaving the source it originally resides. This can be highly valuable for use cases such as medical research, where gathering data at a central location can be quite complicated due to privacy and legal concerns of the data. In such cases, federated learning has the potential to vastly speed up the research cycle. Although federated and central learning have been compared from a theoretical perspective, an extensive experimental comparison of performances and learning behavior still lacks. We have performed a comprehensive experimental comparison between federated and centralized learning. We evaluated various classifiers on various datasets exploring influences of different sample distributions as well as different class distributions across the clients. The results show similar performances under a wide variety of settings between the federated and central learning strategies. Federated learning is able to deal with various imbalances in the data distributions. It is sensitive to batch effects between different datasets when they coincide with location, similar to central learning, but this setting might go unobserved more easily. Federated learning seems to be robust to various challenges such as skewed data distributions, high data dimensionality, multiclass problems, and complex models. Taken together, the insights from our comparison gives much promise for applying federated learning as an alternative to sharing data. Code for reproducing the results in this work can be found at: https://github.com/swiergarst/FLComparison.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11928227/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143673562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive experimental comparison between federated and centralized learning. 联合学习与集中学习的综合实验比较。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-19 DOI: 10.1093/database/baaf016
Swier Garst, Julian Dekker, Marcel Reinders

Federated learning is an upcoming machine learning paradigm which allows data from multiple sources to be used for training of classifiers without the data leaving the source it originally resides. This can be highly valuable for use cases such as medical research, where gathering data at a central location can be quite complicated due to privacy and legal concerns of the data. In such cases, federated learning has the potential to vastly speed up the research cycle. Although federated and central learning have been compared from a theoretical perspective, an extensive experimental comparison of performances and learning behavior still lacks. We have performed a comprehensive experimental comparison between federated and centralized learning. We evaluated various classifiers on various datasets exploring influences of different sample distributions as well as different class distributions across the clients. The results show similar performances under a wide variety of settings between the federated and central learning strategies. Federated learning is able to deal with various imbalances in the data distributions. It is sensitive to batch effects between different datasets when they coincide with location, similar to central learning, but this setting might go unobserved more easily. Federated learning seems to be robust to various challenges such as skewed data distributions, high data dimensionality, multiclass problems, and complex models. Taken together, the insights from our comparison gives much promise for applying federated learning as an alternative to sharing data. Code for reproducing the results in this work can be found at: https://github.com/swiergarst/FLComparison.

联邦学习是一种即将到来的机器学习范式,它允许使用来自多个数据源的数据来训练分类器,而无需数据离开其原始驻留的源。这对于医学研究等用例非常有价值,因为在这些用例中,由于数据的隐私和法律问题,在中心位置收集数据可能非常复杂。在这种情况下,联合学习有可能大大加快研究周期。虽然已经从理论角度对联邦学习和中央学习进行了比较,但还缺乏广泛的性能和学习行为的实验比较。我们对联邦学习和集中式学习进行了全面的实验比较。我们在不同的数据集上评估了不同的分类器,探索了不同样本分布以及客户端不同类别分布的影响。结果表明,在各种设置下,联邦学习策略和中央学习策略的性能相似。联邦学习能够处理数据分布中的各种不平衡。当不同的数据集与位置重合时,它对批处理效果很敏感,类似于中心学习,但这种设置可能更容易被观察到。联邦学习似乎对各种挑战都很健壮,比如倾斜的数据分布、高数据维度、多类问题和复杂模型。总的来说,从我们的比较中得出的见解为应用联邦学习作为共享数据的替代方案提供了很大的希望。在此工作中复制结果的代码可以在:https://github.com/swiergarst/FLComparison上找到。
{"title":"A comprehensive experimental comparison between federated and centralized learning.","authors":"Swier Garst, Julian Dekker, Marcel Reinders","doi":"10.1093/database/baaf016","DOIUrl":"https://doi.org/10.1093/database/baaf016","url":null,"abstract":"<p><p>Federated learning is an upcoming machine learning paradigm which allows data from multiple sources to be used for training of classifiers without the data leaving the source it originally resides. This can be highly valuable for use cases such as medical research, where gathering data at a central location can be quite complicated due to privacy and legal concerns of the data. In such cases, federated learning has the potential to vastly speed up the research cycle. Although federated and central learning have been compared from a theoretical perspective, an extensive experimental comparison of performances and learning behavior still lacks. We have performed a comprehensive experimental comparison between federated and centralized learning. We evaluated various classifiers on various datasets exploring influences of different sample distributions as well as different class distributions across the clients. The results show similar performances under a wide variety of settings between the federated and central learning strategies. Federated learning is able to deal with various imbalances in the data distributions. It is sensitive to batch effects between different datasets when they coincide with location, similar to central learning, but this setting might go unobserved more easily. Federated learning seems to be robust to various challenges such as skewed data distributions, high data dimensionality, multiclass problems, and complex models. Taken together, the insights from our comparison gives much promise for applying federated learning as an alternative to sharing data. Code for reproducing the results in this work can be found at: https://github.com/swiergarst/FLComparison.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VarGuideAtlas: a repository of variant interpretation guidelines. VarGuideAtlas:变体解释指南资料库。
IF 3.6 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-11 DOI: 10.1093/database/baaf017
Mireia Costa, Alberto García S, Oscar Pastor

Variant interpretation guidelines guide the process of determining the role of DNA variants in patients' health. Currently, hundreds of guidelines exist, each applicable to a particular clinical domain. However, they are scattered across multiple resources and scientific literature. To address this issue, we present VarGuideAtlas, a comprehensive repository of variant interpretation guidelines that compiles information from ClinGen, ClinVar, and PubMed. Our repository offers a user-friendly web interface with advanced search capabilities, enabling clinicians and researchers to efficiently find relevant guidelines tailored to specific genes, diseases, or variant types. We employ ontologies to characterize each guideline, ensuring consistency and improving interoperability with bioinformatics tools. VarGuideAtlas represents a significant advance toward standardizing variant interpretation practices, facilitating more informed decision-making, improved clinical outcomes, and more precise genomic research. VarGuideAtlas is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3016/).

变异解释指南指导确定DNA变异在患者健康中的作用的过程。目前,存在着数百种指南,每一种都适用于特定的临床领域。然而,它们分散在多个资源和科学文献中。为了解决这个问题,我们提出了VarGuideAtlas,这是一个综合的变体解释指南库,汇集了来自ClinGen、ClinVar和PubMed的信息。我们的知识库提供了一个用户友好的网络界面,具有先进的搜索功能,使临床医生和研究人员能够有效地找到针对特定基因、疾病或变异类型的相关指南。我们使用本体来描述每个指南,确保一致性并提高与生物信息学工具的互操作性。VarGuideAtlas在标准化变异解释实践、促进更明智的决策、改善临床结果和更精确的基因组研究方面取得了重大进展。VarGuideAtlas可通过网络平台(https://genomics-hub.pros.dsic.upv.es:3016/)公开访问。
{"title":"VarGuideAtlas: a repository of variant interpretation guidelines.","authors":"Mireia Costa, Alberto García S, Oscar Pastor","doi":"10.1093/database/baaf017","DOIUrl":"10.1093/database/baaf017","url":null,"abstract":"<p><p>Variant interpretation guidelines guide the process of determining the role of DNA variants in patients' health. Currently, hundreds of guidelines exist, each applicable to a particular clinical domain. However, they are scattered across multiple resources and scientific literature. To address this issue, we present VarGuideAtlas, a comprehensive repository of variant interpretation guidelines that compiles information from ClinGen, ClinVar, and PubMed. Our repository offers a user-friendly web interface with advanced search capabilities, enabling clinicians and researchers to efficiently find relevant guidelines tailored to specific genes, diseases, or variant types. We employ ontologies to characterize each guideline, ensuring consistency and improving interoperability with bioinformatics tools. VarGuideAtlas represents a significant advance toward standardizing variant interpretation practices, facilitating more informed decision-making, improved clinical outcomes, and more precise genomic research. VarGuideAtlas is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3016/).</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.6,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11895764/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143604068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VarGuideAtlas: a repository of variant interpretation guidelines. VarGuideAtlas:一个变体解释指南的存储库。
IF 3.4 4区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-11 DOI: 10.1093/database/baaf017
Mireia Costa, Alberto García S, Oscar Pastor

Variant interpretation guidelines guide the process of determining the role of DNA variants in patients' health. Currently, hundreds of guidelines exist, each applicable to a particular clinical domain. However, they are scattered across multiple resources and scientific literature. To address this issue, we present VarGuideAtlas, a comprehensive repository of variant interpretation guidelines that compiles information from ClinGen, ClinVar, and PubMed. Our repository offers a user-friendly web interface with advanced search capabilities, enabling clinicians and researchers to efficiently find relevant guidelines tailored to specific genes, diseases, or variant types. We employ ontologies to characterize each guideline, ensuring consistency and improving interoperability with bioinformatics tools. VarGuideAtlas represents a significant advance toward standardizing variant interpretation practices, facilitating more informed decision-making, improved clinical outcomes, and more precise genomic research. VarGuideAtlas is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3016/).

变异解释指南指导确定DNA变异在患者健康中的作用的过程。目前,存在着数百种指南,每一种都适用于特定的临床领域。然而,它们分散在多个资源和科学文献中。为了解决这个问题,我们提出了VarGuideAtlas,这是一个综合的变体解释指南库,汇集了来自ClinGen、ClinVar和PubMed的信息。我们的知识库提供了一个用户友好的网络界面,具有先进的搜索功能,使临床医生和研究人员能够有效地找到针对特定基因、疾病或变异类型的相关指南。我们使用本体来描述每个指南,确保一致性并提高与生物信息学工具的互操作性。VarGuideAtlas在标准化变异解释实践、促进更明智的决策、改善临床结果和更精确的基因组研究方面取得了重大进展。VarGuideAtlas可通过网络平台(https://genomics-hub.pros.dsic.upv.es:3016/)公开访问。
{"title":"VarGuideAtlas: a repository of variant interpretation guidelines.","authors":"Mireia Costa, Alberto García S, Oscar Pastor","doi":"10.1093/database/baaf017","DOIUrl":"https://doi.org/10.1093/database/baaf017","url":null,"abstract":"<p><p>Variant interpretation guidelines guide the process of determining the role of DNA variants in patients' health. Currently, hundreds of guidelines exist, each applicable to a particular clinical domain. However, they are scattered across multiple resources and scientific literature. To address this issue, we present VarGuideAtlas, a comprehensive repository of variant interpretation guidelines that compiles information from ClinGen, ClinVar, and PubMed. Our repository offers a user-friendly web interface with advanced search capabilities, enabling clinicians and researchers to efficiently find relevant guidelines tailored to specific genes, diseases, or variant types. We employ ontologies to characterize each guideline, ensuring consistency and improving interoperability with bioinformatics tools. VarGuideAtlas represents a significant advance toward standardizing variant interpretation practices, facilitating more informed decision-making, improved clinical outcomes, and more precise genomic research. VarGuideAtlas is publicly accessible via a web-based platform (https://genomics-hub.pros.dsic.upv.es:3016/).</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":"2025 ","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144126926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Database: The Journal of Biological Databases and Curation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1