首页 > 最新文献

Genomics, Proteomics & Bioinformatics最新文献

英文 中文
Review and Evaluate the Bioinformatics Analysis Strategies of ATAC-seq and CUT&Tag Data. 回顾和评估 ATAC-seq 和 CUT&Tag 数据的生物信息学分析策略。
IF 9.5 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-09-10 DOI: 10.1093/gpbjnl/qzae054
Siyuan Cheng,Benpeng Miao,Tiandao Li,Guoyan Zhao,Bo Zhang
Efficient and reliable profiling methods are essential to study epigenetics. Tn5, one of the first identified prokaryotic transposases with high DNA-binding and tagmentation efficiency, is widely adopted in different genomic and epigenomic protocols for high-throughputly exploring the genome and epigenome. Based on Tn5, the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) and the Cleavage Under Targets and Tagmentation (CUT&Tag) were developed to measure chromatin accessibility and detect DNA-protein interactions. These methodologies can be applied to large amounts of biological samples with low-input levels, such as rare tissues, embryos, and sorted single cells. However, fast and proper processing of these epigenomic data has become a bottleneck because massive data production continues to increase quickly. Furthermore, inappropriate data analysis can generate biased or misleading conclusions. Therefore, it is essential to evaluate the performance of Tn5-based ATAC-seq and CUT&Tag data processing bioinformatics tools, many of which were developed mostly for analyzing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data. Here, we conducted a comprehensive benchmarking analysis to evaluate the performance of eight popular software for processing ATAC-seq and CUT&Tag data. We compared the sensitivity, specificity, and peak width distribution for both narrow-type and broad-type peak calling. We also tested the influence of the availability of control IgG input in CUT&Tag data analysis. Finally, we evaluated the differential analysis strategies commonly used for analyzing the CUT&Tag data. Our study provided comprehensive guidance for selecting bioinformatics tools and recommended analysis strategies, which were implemented into Docker/Singularity images for streamlined data analysis.
高效可靠的分析方法对研究表观遗传学至关重要。Tn5是最早发现的原核生物转座酶之一,具有很高的DNA结合和标记效率,在不同的基因组学和表观基因组学方案中被广泛采用,用于高通量探索基因组和表观基因组。在 Tn5 的基础上,开发了利用测序的转座酶染色质可及性分析(ATAC-seq)和目标下裂解和标记(CUT&Tag),以测量染色质可及性和检测 DNA 蛋白相互作用。这些方法可用于大量低输入水平的生物样本,如稀有组织、胚胎和分选的单细胞。然而,由于海量数据的产生持续快速增长,快速、正确地处理这些表观基因组数据已成为一个瓶颈。此外,不恰当的数据分析会产生有偏见或误导性的结论。因此,评估基于 Tn5 的 ATAC-seq 和 CUT&Tag 数据处理生物信息学工具的性能非常重要,其中许多工具主要是为分析染色质免疫沉淀后测序(ChIP-seq)数据而开发的。在这里,我们进行了一项全面的基准分析,评估了八种常用软件在处理 ATAC-seq 和 CUT&Tag 数据方面的性能。我们比较了窄型和宽型峰调用的灵敏度、特异性和峰宽分布。我们还测试了对照 IgG 输入对 CUT&Tag 数据分析的影响。最后,我们评估了常用于分析 CUT&Tag 数据的差异分析策略。我们的研究为选择生物信息学工具和推荐的分析策略提供了全面的指导,这些工具和策略已实施到 Docker/Singularity 映像中,以简化数据分析。
{"title":"Review and Evaluate the Bioinformatics Analysis Strategies of ATAC-seq and CUT&Tag Data.","authors":"Siyuan Cheng,Benpeng Miao,Tiandao Li,Guoyan Zhao,Bo Zhang","doi":"10.1093/gpbjnl/qzae054","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae054","url":null,"abstract":"Efficient and reliable profiling methods are essential to study epigenetics. Tn5, one of the first identified prokaryotic transposases with high DNA-binding and tagmentation efficiency, is widely adopted in different genomic and epigenomic protocols for high-throughputly exploring the genome and epigenome. Based on Tn5, the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) and the Cleavage Under Targets and Tagmentation (CUT&Tag) were developed to measure chromatin accessibility and detect DNA-protein interactions. These methodologies can be applied to large amounts of biological samples with low-input levels, such as rare tissues, embryos, and sorted single cells. However, fast and proper processing of these epigenomic data has become a bottleneck because massive data production continues to increase quickly. Furthermore, inappropriate data analysis can generate biased or misleading conclusions. Therefore, it is essential to evaluate the performance of Tn5-based ATAC-seq and CUT&Tag data processing bioinformatics tools, many of which were developed mostly for analyzing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data. Here, we conducted a comprehensive benchmarking analysis to evaluate the performance of eight popular software for processing ATAC-seq and CUT&Tag data. We compared the sensitivity, specificity, and peak width distribution for both narrow-type and broad-type peak calling. We also tested the influence of the availability of control IgG input in CUT&Tag data analysis. Finally, we evaluated the differential analysis strategies commonly used for analyzing the CUT&Tag data. Our study provided comprehensive guidance for selecting bioinformatics tools and recommended analysis strategies, which were implemented into Docker/Singularity images for streamlined data analysis.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of highly repetitive barley enhancers with long-range regulation potential via STARR-seq 通过 STARR-seq 鉴定具有长程调控潜力的高度重复大麦增强子
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-02-21 DOI: 10.1093/gpbjnl/qzae012
Wanlin Zhou, Haoran Shi, Zhiqiang Wang, Yuxin Huang, Lin Ni, Xudong Chen, Yan Liu, Haojie Li, Caixia Li, Yaxi Liu
Abstract Enhancers are DNA sequences that can strengthen transcription initiation. However, the global identification of plant enhancers is complicated due to uncertainty in the distance and orientation of enhancers, especially in species with large genomes. In this study, we performed self-transcribing active regulatory region sequencing (STARR-seq) for the first time to identify enhancers across the barley genome. A total of 7323 enhancers were successfully identified, and among 45 randomly selected enhancers, over 75% were effective as validated by a dual-luciferase reporter assay system in the lower epidermis of tobacco leaves. Interestingly, up to 53.5% of the barley enhancers were repetitive sequences, especially transposable elements (TEs), thus reinforcing the vital role of repetitive enhancers in gene expression. Both the common active transcription mark H3K4me3 and repressive histone mark H3K27me3 were abundant among the barley STARR-seq enhancers. In addition, the functional range of barley STARR-seq enhancers seemed much broader than that of rice or maize and extended to ± 100 kb of the gene body, and this finding was consistent with the high expression levels of genes in the genome. This work specifically depicts the unique features of barley enhancers and provides available barley enhancers for further utilization.
摘要 增强子是能够加强转录启动的 DNA 序列。然而,由于增强子的距离和方向不确定,植物增强子的全球鉴定非常复杂,尤其是在基因组较大的物种中。在这项研究中,我们首次进行了自转录活性调控区测序(STARR-seq),以鉴定大麦基因组中的增强子。共成功鉴定出7323个增强子,在随机选择的45个增强子中,超过75%的增强子通过烟草叶片下表皮的双荧光素酶报告检测系统验证是有效的。有趣的是,高达53.5%的大麦增强子是重复序列,尤其是转座元件(TE),从而加强了重复增强子在基因表达中的重要作用。在大麦 STARR-seq 增强子中,常见的活性转录标记 H3K4me3 和抑制性组蛋白标记 H3K27me3 都很丰富。此外,与水稻和玉米相比,大麦 STARR-seq 增强子的功能范围似乎更广,可延伸至基因体的±100 kb,这一发现与基因组中基因的高表达水平相一致。这项工作具体描述了大麦增强子的独特特征,并提供了可供进一步利用的大麦增强子。
{"title":"Identification of highly repetitive barley enhancers with long-range regulation potential via STARR-seq","authors":"Wanlin Zhou, Haoran Shi, Zhiqiang Wang, Yuxin Huang, Lin Ni, Xudong Chen, Yan Liu, Haojie Li, Caixia Li, Yaxi Liu","doi":"10.1093/gpbjnl/qzae012","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae012","url":null,"abstract":"<jats:title>Abstract</jats:title> Enhancers are DNA sequences that can strengthen transcription initiation. However, the global identification of plant enhancers is complicated due to uncertainty in the distance and orientation of enhancers, especially in species with large genomes. In this study, we performed self-transcribing active regulatory region sequencing (STARR-seq) for the first time to identify enhancers across the barley genome. A total of 7323 enhancers were successfully identified, and among 45 randomly selected enhancers, over 75% were effective as validated by a dual-luciferase reporter assay system in the lower epidermis of tobacco leaves. Interestingly, up to 53.5% of the barley enhancers were repetitive sequences, especially transposable elements (TEs), thus reinforcing the vital role of repetitive enhancers in gene expression. Both the common active transcription mark H3K4me3 and repressive histone mark H3K27me3 were abundant among the barley STARR-seq enhancers. In addition, the functional range of barley STARR-seq enhancers seemed much broader than that of rice or maize and extended to ± 100 kb of the gene body, and this finding was consistent with the high expression levels of genes in the genome. This work specifically depicts the unique features of barley enhancers and provides available barley enhancers for further utilization.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CpG island definition and methylation mapping of the T2T-YAO genome T2T-YAO 基因组的 CpG 岛定义和甲基化图谱
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-02-02 DOI: 10.1093/gpbjnl/qzae009
Ming Xiao, Rui Wei, Jun Yu, Chujie Gao, Fengyi Yang, Le Zhang
Abstract Precisely defining and mapping all cytosine positions and their clusters, known as CpG islands (CGIs), as well as their methylation status are pivotal for genome-wide epigenetic studies, especially when population-centric reference genomes are ready for timely application. Here we first align the two high-quality reference genomes, T2T-YAO and T2T-CHM13, from different ethnic backgrounds in a base-by-base fashion and compute their genome-wide density-defined and position-defined CGIs. Second, mapping some representative genome-wide methylation data from selected organs onto the two genomes, we find that there are about 4.7–5.8% sequence divergency of variable categories depending on quality cutoffs. Genes among the divergent sequences are mostly associated with neurological functions. Moreover, CGIs associated with the divergent sequences are significantly different with respect to CpG density and observed CpG/expected CpG (O/E) ratio between the two genomes. Finally, we find that the T2T-YAO genome not only has a greater CpG site coverage than that of the T2T-CHM13 genome when whole-genome bisulfite sequencing (WGBS) data from the European and American populations are mapped to each reference, but also show more hyper-methylated CpG sites as compared to the T2T-CHM13 genome. Our study suggests that future genome-wide epigenetic studies of the Chinese populations rely on both acquisition of high-quality methylation data and subsequent precision CGI mapping based on the Chinese T2T reference.
摘要 精确定义和绘制所有胞嘧啶位置及其簇(称为 CpG 岛(CGIs))以及它们的甲基化状态对于全基因组表观遗传学研究至关重要,尤其是当以人群为中心的参考基因组已经准备就绪可以及时应用时。在这里,我们首先对来自不同种族背景的两个高质量参考基因组 T2T-YAO 和 T2T-CHM13 进行逐碱基对齐,并计算它们的全基因组密度定义和位置定义的 CGI。其次,我们将一些来自选定器官的代表性全基因组甲基化数据映射到这两个基因组上,发现根据质量截断值的不同,变量类别的序列差异约为 4.7-5.8%。差异序列中的基因大多与神经功能有关。此外,与差异序列相关的 CGIs 在 CpG 密度和观察到的 CpG/预期 CpG(O/E)比率方面在两个基因组之间存在显著差异。最后,我们发现,与 T2T-CHM13 基因组相比,将欧美人群的全基因组亚硫酸氢盐测序(WGBS)数据映射到每个参考系时,T2T-YAO 基因组不仅比 T2T-CHM13 基因组有更大的 CpG 位点覆盖率,而且还显示出更多的高甲基化 CpG 位点。我们的研究表明,未来对中国人群的全基因组表观遗传学研究有赖于获取高质量的甲基化数据以及随后基于中国 T2T 参考文献的精确 CGI 图谱绘制。
{"title":"CpG island definition and methylation mapping of the T2T-YAO genome","authors":"Ming Xiao, Rui Wei, Jun Yu, Chujie Gao, Fengyi Yang, Le Zhang","doi":"10.1093/gpbjnl/qzae009","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae009","url":null,"abstract":"<jats:title>Abstract</jats:title> Precisely defining and mapping all cytosine positions and their clusters, known as CpG islands (CGIs), as well as their methylation status are pivotal for genome-wide epigenetic studies, especially when population-centric reference genomes are ready for timely application. Here we first align the two high-quality reference genomes, T2T-YAO and T2T-CHM13, from different ethnic backgrounds in a base-by-base fashion and compute their genome-wide density-defined and position-defined CGIs. Second, mapping some representative genome-wide methylation data from selected organs onto the two genomes, we find that there are about 4.7–5.8% sequence divergency of variable categories depending on quality cutoffs. Genes among the divergent sequences are mostly associated with neurological functions. Moreover, CGIs associated with the divergent sequences are significantly different with respect to CpG density and observed CpG/expected CpG (O/E) ratio between the two genomes. Finally, we find that the T2T-YAO genome not only has a greater CpG site coverage than that of the T2T-CHM13 genome when whole-genome bisulfite sequencing (WGBS) data from the European and American populations are mapped to each reference, but also show more hyper-methylated CpG sites as compared to the T2T-CHM13 genome. Our study suggests that future genome-wide epigenetic studies of the Chinese populations rely on both acquisition of high-quality methylation data and subsequent precision CGI mapping based on the Chinese T2T reference.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pindel-TD: a tandem duplication detector based on a pattern growth approach Pindel-TD:基于模式增长方法的串联重复检测器
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-23 DOI: 10.1093/gpbjnl/qzae008
Xiaofei Yang, Gaoyang Zheng, Peng Jia, Songbo Wang, Kai Ye
Abstract Tandem duplication (TD) is a major type of structural variation (SV) that plays an important role in novel gene formation and human diseases. However, TDs are often missed or incorrectly classified as insertions by most modern SV detection methods due to the lack of specialized operation on TD-related mutational signals. Herein, we developed a TD detection module for the Pindel tool, referred to as Pindel-TD, based on a TD-specific pattern growth approach. Pindel-TD is capable of detecting TDs with a wide size range at single nucleotide resolution. Using simulated and real read data from HG002, we demonstrated that Pindel-TD outperforms other leading methods in terms of precision, recall, F1-score, and robustness. Furthermore, by applying Pindel-TD to data generated from the K562 cancer cell line, we identified a TD located at the seventh exon of SAGE1, providing an explanation for its high expression. Pindel-TD is available for non-commercial use at https://github.com/xjtu-omics/pindel.
摘要 串联重复(TD)是结构变异(SV)的一种主要类型,在新基因形成和人类疾病中发挥着重要作用。然而,由于缺乏对 TD 相关突变信号的专门操作,大多数现代 SV 检测方法经常会遗漏 TD 或将其错误地归类为插入。在此,我们为 Pindel 工具开发了一个 TD 检测模块,称为 Pindel-TD,它基于一种 TD 特异性模式生长方法。Pindel-TD 能够以单核苷酸分辨率检测出大范围的 TD。我们使用 HG002 的模拟和真实读数数据证明,Pindel-TD 在精确度、召回率、F1-分数和稳健性方面都优于其他领先方法。此外,通过将 Pindel-TD 应用于 K562 癌细胞系产生的数据,我们发现了位于 SAGE1 第七外显子的 TD,为其高表达提供了解释。Pindel-TD 可在 https://github.com/xjtu-omics/pindel 网站上供非商业使用。
{"title":"Pindel-TD: a tandem duplication detector based on a pattern growth approach","authors":"Xiaofei Yang, Gaoyang Zheng, Peng Jia, Songbo Wang, Kai Ye","doi":"10.1093/gpbjnl/qzae008","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae008","url":null,"abstract":"<jats:title>Abstract</jats:title> Tandem duplication (TD) is a major type of structural variation (SV) that plays an important role in novel gene formation and human diseases. However, TDs are often missed or incorrectly classified as insertions by most modern SV detection methods due to the lack of specialized operation on TD-related mutational signals. Herein, we developed a TD detection module for the Pindel tool, referred to as Pindel-TD, based on a TD-specific pattern growth approach. Pindel-TD is capable of detecting TDs with a wide size range at single nucleotide resolution. Using simulated and real read data from HG002, we demonstrated that Pindel-TD outperforms other leading methods in terms of precision, recall, F1-score, and robustness. Furthermore, by applying Pindel-TD to data generated from the K562 cancer cell line, we identified a TD located at the seventh exon of SAGE1, providing an explanation for its high expression. Pindel-TD is available for non-commercial use at https://github.com/xjtu-omics/pindel.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated Single-cell Multiomic Analysis of HIV Latency Reversal Reveals Novel Regulators of Viral Reactivation 艾滋病毒潜伏期逆转的单细胞多组学综合分析揭示了病毒再激活的新型调控因子
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-11 DOI: 10.1093/gpbjnl/qzae003
Manickam Ashokkumar, Wenwen Mei, Jackson J Peterson, Yuriko Harigaya, David M Murdoch, David M Margolis, Caleb Kornfein, Alex Oesterling, Zhicheng Guo, Cynthia D Rudin, Yuchao Jiang, Edward P Browne
Abstract Despite the success of antiretroviral therapy, human immunodeficiency virus (HIV) cannot be cured because of a reservoir of latently infected cells that evades therapy. To understand the mechanisms of HIV latency, we employed an integrated single-cell RNA sequencing (RNA-seq) and single-cell assay for transposase-accessible chromatin with sequencing (ATAC-seq) approach to simultaneously profile the transcriptomic and epigenomic characteristics of ∼ 125,000 latently infected primary CD4 cells after reactivation using three different latency reversing agents. Differentially expressed genes and differentially accessible motifs were used to examine transcriptional pathways and transcription factor (TF) activities across the cell population. We identified cellular transcripts and TFs whose expression/activity was correlated with viral reactivation and demonstrated that a machine learning model trained on these data was 75%–79% accurate at predicting viral reactivation. Finally, we validated the role of two candidate HIV-regulating factors, FOXP1 and GATA3, in viral transcription. These data demonstrate the power of integrated multimodal single-cell analysis to uncover novel relationships between host cell factors and HIV latency.
摘要 尽管抗逆转录病毒疗法取得了成功,但人类免疫缺陷病毒(HIV)仍无法治愈,因为潜伏感染的细胞库逃避了治疗。为了了解艾滋病毒潜伏的机制,我们采用了单细胞 RNA 测序(RNA-seq)和单细胞转座酶可访问染色质测序(ATAC-seq)的综合方法,同时分析了使用三种不同的潜伏逆转剂重新激活后 125,000 个潜伏感染的初级 CD4 细胞的转录组和表观组特征。我们利用差异表达基因和差异可及基序来研究整个细胞群的转录途径和转录因子(TF)活性。我们确定了其表达/活性与病毒再活化相关的细胞转录本和转录因子,并证明根据这些数据训练的机器学习模型在预测病毒再活化方面的准确率为 75%-79%。最后,我们验证了两个候选 HIV 调节因子 FOXP1 和 GATA3 在病毒转录中的作用。这些数据证明了综合多模态单细胞分析在揭示宿主细胞因子与 HIV 潜伏期之间的新型关系方面的强大功能。
{"title":"Integrated Single-cell Multiomic Analysis of HIV Latency Reversal Reveals Novel Regulators of Viral Reactivation","authors":"Manickam Ashokkumar, Wenwen Mei, Jackson J Peterson, Yuriko Harigaya, David M Murdoch, David M Margolis, Caleb Kornfein, Alex Oesterling, Zhicheng Guo, Cynthia D Rudin, Yuchao Jiang, Edward P Browne","doi":"10.1093/gpbjnl/qzae003","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae003","url":null,"abstract":"<jats:title>Abstract</jats:title> Despite the success of antiretroviral therapy, human immunodeficiency virus (HIV) cannot be cured because of a reservoir of latently infected cells that evades therapy. To understand the mechanisms of HIV latency, we employed an integrated single-cell RNA sequencing (RNA-seq) and single-cell assay for transposase-accessible chromatin with sequencing (ATAC-seq) approach to simultaneously profile the transcriptomic and epigenomic characteristics of ∼ 125,000 latently infected primary CD4 cells after reactivation using three different latency reversing agents. Differentially expressed genes and differentially accessible motifs were used to examine transcriptional pathways and transcription factor (TF) activities across the cell population. We identified cellular transcripts and TFs whose expression/activity was correlated with viral reactivation and demonstrated that a machine learning model trained on these data was 75%–79% accurate at predicting viral reactivation. Finally, we validated the role of two candidate HIV-regulating factors, FOXP1 and GATA3, in viral transcription. These data demonstrate the power of integrated multimodal single-cell analysis to uncover novel relationships between host cell factors and HIV latency.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SMARTdb: An Integrated Database for Exploring Single-cell Multi-omics Data of Reproductive Medicine SMARTdb:用于探索生殖医学单细胞多组学数据的集成数据库
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-11 DOI: 10.1093/gpbjnl/qzae005
Zekai Liu, Zhen Yuan, Yunlei Guo, Ruilin Wang, Yusheng Guan, Zhanglian Wang, Yunan Chen, Tianlu Wang, Meining Jiang, Shuhui Bian
Abstract Single-cell multi-omics sequencing has greatly accelerated reproductive research in recent years, and the data are continually growing. However, utilizing these data resources is challenging for wet-lab researchers. A comprehensive platform for exploring single-cell multi-omics data related to reproduction is urgently needed. Here we introduce the single-cell multi-omics atlas of reproduction (SMARTdb), which is an integrative and user-friendly platform for exploring molecular dynamics of reproductive development, aging, and disease, covering multi-omics, multi-species, and multi-stage data. We have curated and analyzed single-cell transcriptome and epigenome data of over 2.0 million cells from 6 species across whole lifespan. A series of powerful functionalities are provided, such as “Query gene expression”, “DIY expression plot”, “DNA methylation plot”, and “Epigenome browser”. With SMARTdb, we found that the male germ-cell-specific expression pattern of RPL39L and RPL10L is conserved between human and other model animals. Moreover, DNA hypomethylation and open chromatin may regulate the specific expression pattern of RPL39L collectively in both male and female germ cells. In summary, SMARTdb is a powerful platform for convenient data mining and gaining novel insights into reproductive development, aging, and disease. SMARTdb is publicly available at https://smart-db.cn.
摘要 近年来,单细胞多组学测序技术大大加快了生殖研究的步伐,数据量也在持续增长。然而,对于湿实验室研究人员来说,利用这些数据资源是一项挑战。目前迫切需要一个探索生殖相关单细胞多组学数据的综合平台。我们在此介绍生殖单细胞多组学图谱(SMARTdb),它是一个用于探索生殖发育、衰老和疾病分子动态的综合性、用户友好型平台,涵盖了多组学、多物种和多阶段数据。我们已经整理并分析了来自 6 个物种、跨越整个生命周期的 200 多万个细胞的单细胞转录组和表观基因组数据。它提供了一系列强大的功能,如 "查询基因表达"、"DIY表达图谱"、"DNA甲基化图谱 "和 "表观基因组浏览器"。通过SMARTdb,我们发现RPL39L和RPL10L的雄性生殖细胞特异性表达模式在人类和其他模式动物之间是保守的。此外,DNA 低甲基化和开放染色质可能共同调控 RPL39L 在雄性和雌性生殖细胞中的特异性表达模式。总之,SMARTdb 是一个功能强大的平台,可以方便地进行数据挖掘,并获得有关生殖发育、衰老和疾病的新见解。SMARTdb 可在 https://smart-db.cn 公开获取。
{"title":"SMARTdb: An Integrated Database for Exploring Single-cell Multi-omics Data of Reproductive Medicine","authors":"Zekai Liu, Zhen Yuan, Yunlei Guo, Ruilin Wang, Yusheng Guan, Zhanglian Wang, Yunan Chen, Tianlu Wang, Meining Jiang, Shuhui Bian","doi":"10.1093/gpbjnl/qzae005","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae005","url":null,"abstract":"<jats:title>Abstract</jats:title> Single-cell multi-omics sequencing has greatly accelerated reproductive research in recent years, and the data are continually growing. However, utilizing these data resources is challenging for wet-lab researchers. A comprehensive platform for exploring single-cell multi-omics data related to reproduction is urgently needed. Here we introduce the single-cell multi-omics atlas of reproduction (SMARTdb), which is an integrative and user-friendly platform for exploring molecular dynamics of reproductive development, aging, and disease, covering multi-omics, multi-species, and multi-stage data. We have curated and analyzed single-cell transcriptome and epigenome data of over 2.0 million cells from 6 species across whole lifespan. A series of powerful functionalities are provided, such as “Query gene expression”, “DIY expression plot”, “DNA methylation plot”, and “Epigenome browser”. With SMARTdb, we found that the male germ-cell-specific expression pattern of RPL39L and RPL10L is conserved between human and other model animals. Moreover, DNA hypomethylation and open chromatin may regulate the specific expression pattern of RPL39L collectively in both male and female germ cells. In summary, SMARTdb is a powerful platform for convenient data mining and gaining novel insights into reproductive development, aging, and disease. SMARTdb is publicly available at https://smart-db.cn.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSIsensor-RNA: Microsatellite Instability Detection for Bulk and Single-cell Gene Expression Data MSIsensor-RNA:批量和单细胞基因表达数据的微卫星不稳定性检测
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-11 DOI: 10.1093/gpbjnl/qzae004
Peng Jia, Xuanhao Yang, Xiaofei Yang, Tingjie Wang, Yu Xu, Kai Ye
Abstract Microsatellite instability (MSI) is an indispensable biomarker in cancer immunotherapy. Currently, MSI scoring methods by high-throughput omics methods have gained popularity and demonstrated better performance than the gold standard method for MSI detection. However, the MSI detection method on expression data, especially single-cell expression data, is still lacking, limiting the scope of clinical application and prohibiting the investigation of MSI at a single-cell level. Herein, we developed MSIsensor-RNA, an accurate, robust, adaptable, and standalone software to detect MSI status based on expression values of MSI-associated genes. We demonstrated the favorable performance and promise of MSIsensor-RNA in both bulk and single-cell gene expression data in multiplatform technologies including RNA sequencing (RNA-seq), microarray, and single-cell RNA-seq. MSIsensor-RNA is a versatile, efficient, and robust method for MSI status detection from both bulk and single-cell gene expression data in clinical studies and applications. MSIsensor-RNA is available at https://github.com/xjtu-omics/msisensor-rna.
摘要 微卫星不稳定性(MSI)是癌症免疫疗法中不可或缺的生物标志物。目前,通过高通量组学方法进行MSI评分的方法已得到普及,并显示出优于MSI检测金标准方法的性能。然而,表达数据尤其是单细胞表达数据的MSI检测方法仍然缺乏,限制了临床应用的范围,也阻碍了单细胞水平的MSI研究。在此,我们开发了MSIsensor-RNA,这是一种基于MSI相关基因表达值检测MSI状态的准确、稳健、适应性强的独立软件。我们展示了MSIsensor-RNA在RNA测序(RNA-seq)、微阵列和单细胞RNA-seq等多平台技术中批量和单细胞基因表达数据方面的良好性能和前景。MSIsensor-RNA 是一种多功能、高效、稳健的方法,可在临床研究和应用中从大量和单细胞基因表达数据中检测 MSI 状态。MSIsensor-RNA可在https://github.com/xjtu-omics/msisensor-rna。
{"title":"MSIsensor-RNA: Microsatellite Instability Detection for Bulk and Single-cell Gene Expression Data","authors":"Peng Jia, Xuanhao Yang, Xiaofei Yang, Tingjie Wang, Yu Xu, Kai Ye","doi":"10.1093/gpbjnl/qzae004","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae004","url":null,"abstract":"<jats:title>Abstract</jats:title> Microsatellite instability (MSI) is an indispensable biomarker in cancer immunotherapy. Currently, MSI scoring methods by high-throughput omics methods have gained popularity and demonstrated better performance than the gold standard method for MSI detection. However, the MSI detection method on expression data, especially single-cell expression data, is still lacking, limiting the scope of clinical application and prohibiting the investigation of MSI at a single-cell level. Herein, we developed MSIsensor-RNA, an accurate, robust, adaptable, and standalone software to detect MSI status based on expression values of MSI-associated genes. We demonstrated the favorable performance and promise of MSIsensor-RNA in both bulk and single-cell gene expression data in multiplatform technologies including RNA sequencing (RNA-seq), microarray, and single-cell RNA-seq. MSIsensor-RNA is a versatile, efficient, and robust method for MSI status detection from both bulk and single-cell gene expression data in clinical studies and applications. MSIsensor-RNA is available at https://github.com/xjtu-omics/msisensor-rna.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transcriptome Dynamics and Cell Dialogs Between Oocytes and Granulosa Cells in Mouse Follicle Development 小鼠卵泡发育过程中卵母细胞和颗粒细胞之间的转录组动态和细胞对话
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-10 DOI: 10.1093/gpbjnl/qzad001
Wenju Liu, Xinyu Cui, Yuhan Zhang, Liang Gu, Yuanlin He, Jing Li, Shaorong Gao, Rui Gao, Cizhong Jiang
Abstract The development and maturation of follicles is a sophisticated and multistage process. The dynamic gene expression of oocytes and the surrounding somatic cells and the dialogs between these cells are critical to this process. We accurately classified the follicle development into nine stages and profiled the gene expression of mouse oocytes, the companion granulosa cells, and cumulus cells. The clustering of the transcriptomes showed the trajectory to the two distinct development courses of oocytes and the surrounding somatic cells. Gene expression changes precipitously increased at Type 4 stage and drastically droped afterwards within both oocytes and granulosa cells. Moreover, the number of differentially expressed genes between oocytes and granulosa cells dramatically increased at Type 4 stage, most of which persistently passed on to the later stages. Strikingly, cell communications within and between oocytes and granulosa cells became active from Type 4 onwards. Cell dialogs connected oocytes and granulosa cells in both unidirectional and bidirectional manners. TGFB2/3, TGFBR2/3, INHBA/B, and ACVR1/1B/2B of TGF-β signaling pathway functioned in the follicle development. NOTCH signaling pathway regulated the development of granulosa cells. Additionally, many maternally DNA methylation- or H3K27me3-imprinted genes remained active in granulosa cells but silent in oocytes during oogenesis. Collectively, Type 4 is the key turning point when significant transcription changes diverge the fate of oocytes and granulosa cells, and the cell dialogs become active to assure follicle development. These findings shed new insights into transcriptomic dynamics and cell dialogs facilitating the development and maturation of oocytes and follicles.
摘要 卵泡的发育和成熟是一个复杂的多阶段过程。卵母细胞和周围体细胞的动态基因表达以及这些细胞之间的对话对这一过程至关重要。我们准确地将卵泡发育划分为九个阶段,并对小鼠卵母细胞、伴随的颗粒细胞和积层细胞的基因表达进行了分析。转录组的聚类显示了卵母细胞和周围体细胞两个不同发育过程的轨迹。在卵母细胞和颗粒细胞中,基因表达的变化在第4型阶段急剧增加,之后又急剧下降。此外,卵母细胞和颗粒细胞之间差异表达基因的数量在第 4 型阶段急剧增加,其中大部分持续到后期阶段。引人注目的是,从第四型开始,卵母细胞和颗粒细胞内部和之间的细胞通讯开始活跃起来。细胞对话以单向和双向方式连接卵母细胞和颗粒细胞。TGF-β信号通路中的TGFB2/3、TGFBR2/3、INHBA/B和ACVR1/1B/2B在卵泡发育中起作用。NOTCH信号通路调控颗粒细胞的发育。此外,在卵子发生过程中,许多母源DNA甲基化或H3K27me3印迹基因在颗粒细胞中保持活性,但在卵母细胞中却沉默不语。总之,第4型是一个关键的转折点,在这一时期,卵母细胞和颗粒细胞的命运发生了重大转录变化,细胞对话变得活跃,从而确保了卵泡的发育。这些发现揭示了促进卵母细胞和卵泡发育和成熟的转录组动态和细胞对话的新见解。
{"title":"Transcriptome Dynamics and Cell Dialogs Between Oocytes and Granulosa Cells in Mouse Follicle Development","authors":"Wenju Liu, Xinyu Cui, Yuhan Zhang, Liang Gu, Yuanlin He, Jing Li, Shaorong Gao, Rui Gao, Cizhong Jiang","doi":"10.1093/gpbjnl/qzad001","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzad001","url":null,"abstract":"<jats:title>Abstract</jats:title> The development and maturation of follicles is a sophisticated and multistage process. The dynamic gene expression of oocytes and the surrounding somatic cells and the dialogs between these cells are critical to this process. We accurately classified the follicle development into nine stages and profiled the gene expression of mouse oocytes, the companion granulosa cells, and cumulus cells. The clustering of the transcriptomes showed the trajectory to the two distinct development courses of oocytes and the surrounding somatic cells. Gene expression changes precipitously increased at Type 4 stage and drastically droped afterwards within both oocytes and granulosa cells. Moreover, the number of differentially expressed genes between oocytes and granulosa cells dramatically increased at Type 4 stage, most of which persistently passed on to the later stages. Strikingly, cell communications within and between oocytes and granulosa cells became active from Type 4 onwards. Cell dialogs connected oocytes and granulosa cells in both unidirectional and bidirectional manners. TGFB2/3, TGFBR2/3, INHBA/B, and ACVR1/1B/2B of TGF-β signaling pathway functioned in the follicle development. NOTCH signaling pathway regulated the development of granulosa cells. Additionally, many maternally DNA methylation- or H3K27me3-imprinted genes remained active in granulosa cells but silent in oocytes during oogenesis. Collectively, Type 4 is the key turning point when significant transcription changes diverge the fate of oocytes and granulosa cells, and the cell dialogs become active to assure follicle development. These findings shed new insights into transcriptomic dynamics and cell dialogs facilitating the development and maturation of oocytes and follicles.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DVsc: An Automated Framework for Efficiently Detecting Viral Infection from Single-Cell Transcriptomics Data DVsc:从单细胞转录组学数据中高效检测病毒感染的自动化框架
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-10 DOI: 10.1093/gpbjnl/qzad007
Fei Leng, Song Mei, Xiaolin Zhou, Xuanshi Liu, Yefeng Yuan, Wenjian Xu, Chongyi Hao, Ruolan Guo, Chanjuan Hao, Wei Li, Peng Zhang
Abstract Single-cell RNA sequencing (scRNA-seq) has emerged as a valuable tool for studying cellular heterogeneity in various fields, particularly in virological research. By studying the viral and cellular transcriptomes, the dynamics of viral infection can be investigated at a single-cell resolution. However, limited studies have been conducted to investigate whether RNA transcripts from clinical samples contain substantial amounts of viral RNAs, and a specific computational framework for efficiently detecting viral reads based on scRNA-seq data has not been developed. Hence, we introduce DVsc, an open-source framework for precise quantitative analysis of viral infection from single-cell transcriptomics data. When applied to approximately 200 diverse clinical samples that were infected by more than 10 different viruses, DVsc demonstrated high accuracy in systematically detecting viral infection across a wide array of cell types. This innovative bioinformatics pipeline could be crucial for addressing the potential effects of surreptitiously invading viruses on certain illnesses, as well as for designing novel medicines to target viruses in specific host cell subsets and evaluating the efficacy of treatment. DVsc supports the FASTQ format as an input and is compatible with multiple single-cell sequencing platforms. Moreover, it could also be applied to sequences from bulk RNA-sequencing data. DVsc is available at http://62.234.32.33:5000/DVsc.
摘要 单细胞 RNA 测序(scRNA-seq)已成为各领域,特别是病毒学研究中研究细胞异质性的重要工具。通过研究病毒和细胞转录组,可以在单细胞分辨率下研究病毒感染的动态变化。然而,目前对临床样本中的 RNA 转录本是否含有大量病毒 RNA 的研究还很有限,基于 scRNA-seq 数据有效检测病毒读数的特定计算框架也尚未开发出来。因此,我们引入了 DVsc,这是一个开源框架,用于从单细胞转录组学数据中对病毒感染进行精确的定量分析。DVsc 应用于约 200 个不同的临床样本,这些样本受到 10 多种不同病毒的感染,DVsc 在系统检测各种细胞类型的病毒感染方面表现出很高的准确性。这一创新的生物信息学管道对于研究偷偷入侵的病毒对某些疾病的潜在影响、设计针对特定宿主细胞亚群病毒的新型药物以及评估治疗效果至关重要。DVsc 支持将 FASTQ 格式作为输入,并与多种单细胞测序平台兼容。此外,它还可应用于大容量 RNA 测序数据的序列。DVsc可在http://62.234.32.33:5000/DVsc。
{"title":"DVsc: An Automated Framework for Efficiently Detecting Viral Infection from Single-Cell Transcriptomics Data","authors":"Fei Leng, Song Mei, Xiaolin Zhou, Xuanshi Liu, Yefeng Yuan, Wenjian Xu, Chongyi Hao, Ruolan Guo, Chanjuan Hao, Wei Li, Peng Zhang","doi":"10.1093/gpbjnl/qzad007","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzad007","url":null,"abstract":"<jats:title>Abstract</jats:title> Single-cell RNA sequencing (scRNA-seq) has emerged as a valuable tool for studying cellular heterogeneity in various fields, particularly in virological research. By studying the viral and cellular transcriptomes, the dynamics of viral infection can be investigated at a single-cell resolution. However, limited studies have been conducted to investigate whether RNA transcripts from clinical samples contain substantial amounts of viral RNAs, and a specific computational framework for efficiently detecting viral reads based on scRNA-seq data has not been developed. Hence, we introduce DVsc, an open-source framework for precise quantitative analysis of viral infection from single-cell transcriptomics data. When applied to approximately 200 diverse clinical samples that were infected by more than 10 different viruses, DVsc demonstrated high accuracy in systematically detecting viral infection across a wide array of cell types. This innovative bioinformatics pipeline could be crucial for addressing the potential effects of surreptitiously invading viruses on certain illnesses, as well as for designing novel medicines to target viruses in specific host cell subsets and evaluating the efficacy of treatment. DVsc supports the FASTQ format as an input and is compatible with multiple single-cell sequencing platforms. Moreover, it could also be applied to sequences from bulk RNA-sequencing data. DVsc is available at http://62.234.32.33:5000/DVsc.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139921998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GametesOmics: A Comprehensive Multi-omics Database for Exploring the Gametogenesis in Humans and Mice GametesOmics:探索人类和小鼠配子发生的综合多组学数据库
IF 9.5 2区 生物学 Q1 Mathematics Pub Date : 2024-01-10 DOI: 10.1093/gpbjnl/qzad004
Jianting An, Jing Wang, Siming Kong, Shi Song, Wei Chen, Peng Yuan, Qilong He, Yidong Chen, Ye Li, Yi Yang, Wei Wang, Rong Li, Liying Yan, Zhiqiang Yan, Jie Qiao
Abstract Gametogenesis plays an important role in the reproduction and evolution of species. The transcriptomic and epigenetic alterations in this process can influence the reproductive capacity, fertilization, and embryonic development. The rapidly increasing single-cell studies have provided valuable multi-omics resources. However, data from different layers and sequencing platforms have not been uniformed and integrated, which greatly limits their use for exploring the molecular mechanisms that underlie oogenesis and spermatogenesis. Here, we developed GametesOmics, a comprehensive database that integrated the data of gene expression, DNA methylation, and chromatin accessibility during oogenesis and spermatogenesis in humans and mice. GametesOmics provides a user-friendly website and various tools, including Search and Advanced Search for querying the expression and epigenetic modification of each gene; Tools with Differentially expressed genes (DEGs) analysis for identifying DEGs, Correlation analysis for demonstrating the genetic and epigenetic changes, Visualization for displaying single-cell cluster and screening marker genes as well as master transcription factors (TFs), and MethylView for studying the genomic distribution of epigenetic modifications. GametesOmics also provides Genome Browser and Orthologs for tracking and comparing gene expression, DNA methylations, as well as chromatin accessibilities between humans and mice. GametesOmics offers a comprehensive resource for biologists and clinicians to decipher the cell fate transition in germ cell development, and can be accessed at http://gametesomics.cn/.
摘要 配子发生在物种繁衍和进化过程中发挥着重要作用。这一过程中的转录组和表观遗传学改变会影响生殖能力、受精和胚胎发育。迅速增加的单细胞研究提供了宝贵的多组学资源。然而,来自不同层和测序平台的数据尚未统一整合,这极大地限制了它们在探索卵子和精子发生的分子机制方面的应用。在此,我们开发了GametesOmics,一个整合了人类和小鼠卵子和精子发生过程中基因表达、DNA甲基化和染色质可及性数据的综合数据库。GametesOmics 提供了一个用户友好型网站和多种工具,包括用于查询每个基因的表达和表观遗传修饰的搜索和高级搜索;用于识别 DEGs 的差异表达基因(DEGs)分析工具;用于展示遗传和表观遗传变化的相关性分析;用于显示单细胞集群和筛选标记基因以及主转录因子(TFs)的可视化工具;以及用于研究表观遗传修饰的基因组分布的 MethylView 工具。GametesOmics 还提供基因组浏览器(Genome Browser)和同源物(Orthologs),用于跟踪和比较人类与小鼠之间的基因表达、DNA 甲基化以及染色质可及性。GametesOmics 为生物学家和临床医生提供了一个全面的资源,用于破译生殖细胞发育过程中的细胞命运转换,可通过 http://gametesomics.cn/ 访问。
{"title":"GametesOmics: A Comprehensive Multi-omics Database for Exploring the Gametogenesis in Humans and Mice","authors":"Jianting An, Jing Wang, Siming Kong, Shi Song, Wei Chen, Peng Yuan, Qilong He, Yidong Chen, Ye Li, Yi Yang, Wei Wang, Rong Li, Liying Yan, Zhiqiang Yan, Jie Qiao","doi":"10.1093/gpbjnl/qzad004","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzad004","url":null,"abstract":"<jats:title>Abstract</jats:title> Gametogenesis plays an important role in the reproduction and evolution of species. The transcriptomic and epigenetic alterations in this process can influence the reproductive capacity, fertilization, and embryonic development. The rapidly increasing single-cell studies have provided valuable multi-omics resources. However, data from different layers and sequencing platforms have not been uniformed and integrated, which greatly limits their use for exploring the molecular mechanisms that underlie oogenesis and spermatogenesis. Here, we developed GametesOmics, a comprehensive database that integrated the data of gene expression, DNA methylation, and chromatin accessibility during oogenesis and spermatogenesis in humans and mice. GametesOmics provides a user-friendly website and various tools, including Search and Advanced Search for querying the expression and epigenetic modification of each gene; Tools with Differentially expressed genes (DEGs) analysis for identifying DEGs, Correlation analysis for demonstrating the genetic and epigenetic changes, Visualization for displaying single-cell cluster and screening marker genes as well as master transcription factors (TFs), and MethylView for studying the genomic distribution of epigenetic modifications. GametesOmics also provides Genome Browser and Orthologs for tracking and comparing gene expression, DNA methylations, as well as chromatin accessibilities between humans and mice. GametesOmics offers a comprehensive resource for biologists and clinicians to decipher the cell fate transition in germ cell development, and can be accessed at http://gametesomics.cn/.","PeriodicalId":12528,"journal":{"name":"Genomics, Proteomics & Bioinformatics","volume":null,"pages":null},"PeriodicalIF":9.5,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139922081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genomics, Proteomics & Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1