Xi Wang, Qiwei Lian, Haoyu Dong, Shuo Xu, Yaru Su, Xiaohui Wu
Gene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, which helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers as if in single-cell RNA-seq (scRNA-seq). However, there are few GSS tools specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. Results show that dropout imputation can significantly promote the performance of almost all GSS tools, while the impact of gene activity conversion methods or gene set collections on GSS performance is more dependent on GSS tools or datasets. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
{"title":"Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data.","authors":"Xi Wang, Qiwei Lian, Haoyu Dong, Shuo Xu, Yaru Su, Xiaohui Wu","doi":"10.1093/gpbjnl/qzae014","DOIUrl":"10.1093/gpbjnl/qzae014","url":null,"abstract":"<p><p>Gene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, which helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers as if in single-cell RNA-seq (scRNA-seq). However, there are few GSS tools specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. Results show that dropout imputation can significantly promote the performance of almost all GSS tools, while the impact of gene activity conversion methods or gene set collections on GSS performance is more dependent on GSS tools or datasets. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":"22 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11423854/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weidong Liu, Yuhua Wang, Shuxin Yao, Guoqiang Han, Jin Hu, Rong Yin, Fuling Zhou, Ying Cheng, Haojian Zhang
Hematopoietic homeostasis is maintained by hematopoietic stem cells (HSCs), and it is tightly controlled at multiple levels to sustain the self-renewal capacity and differentiation potential of HSCs. Dysregulation of self-renewal and differentiation of HSCs leads to the development of hematologic diseases, including acute myeloid leukemia (AML). Thus, understanding the underlying mechanisms of HSC maintenance and the development of hematologic malignancies is one of the fundamental scientific endeavors in stem cell biology. N 6-methyladenosine (m6A) is a common modification in mammalian messenger RNAs (mRNAs) and plays important roles in various biological processes. In this study, we performed a comparative analysis of the dynamics of the RNA m6A methylome of hematopoietic stem and progenitor cells (HSPCs) and leukemia-initiating cells (LICs) in AML. We found that RNA m6A modification regulates the transformation of long-term HSCs into short-term HSCs and determines the lineage commitment of HSCs. Interestingly, m6A modification leads to reprogramming that promotes cellular transformation during AML development, and LIC-specific m6A targets are recognized by different m6A readers. Moreover, the very long chain fatty acid transporter ATP-binding cassette subfamily D member 2 (ABCD2) is a key factor that promotes AML development, and deletion of ABCD2 damages clonogenic ability, inhibits proliferation, and promotes apoptosis of human leukemia cells. This study provides a comprehensive understanding of the role of m6A in regulating cell state transition in normal hematopoiesis and leukemogenesis, and identifies ABCD2 as a key factor in AML development.
造血稳态由造血干细胞(HSCs)维持,它受到多层次的严格控制,以维持造血干细胞的自我更新能力和分化潜能。造血干细胞自我更新和分化失调会导致血液病的发生,包括急性髓性白血病(AML)。因此,了解造血干细胞维持和血液恶性肿瘤发展的内在机制是干细胞生物学的基础科学研究之一。N 6-甲基腺苷(m6A)是哺乳动物信使核糖核酸(mRNA)中的一种常见修饰,在各种生物过程中发挥着重要作用。在这项研究中,我们对急性髓细胞性白血病中造血干细胞和祖细胞(HSPCs)以及白血病启动细胞(LICs)的RNA m6A甲基组的动态进行了比较分析。我们发现,RNA m6A修饰调控长期造血干细胞向短期造血干细胞的转化,并决定造血干细胞的系承。有趣的是,m6A修饰会导致重编程,从而促进急性髓细胞性白血病发育过程中的细胞转化,而LIC特异性m6A靶点会被不同的m6A阅读器识别。此外,超长链脂肪酸转运体 ATP 结合盒 D 亚家族成员 2(ABCD2)是促进急性髓细胞性白血病发展的关键因素,缺失 ABCD2 会损害人类白血病细胞的克隆生成能力、抑制增殖并促进凋亡。这项研究全面了解了 m6A 在正常造血和白血病发生过程中调控细胞状态转变的作用,并发现 ABCD2 是急性髓细胞性白血病发生的关键因素。
{"title":"Reprogramming of RNA m6A Modification Is Required for Acute Myeloid Leukemia Development.","authors":"Weidong Liu, Yuhua Wang, Shuxin Yao, Guoqiang Han, Jin Hu, Rong Yin, Fuling Zhou, Ying Cheng, Haojian Zhang","doi":"10.1093/gpbjnl/qzae049","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae049","url":null,"abstract":"<p><p>Hematopoietic homeostasis is maintained by hematopoietic stem cells (HSCs), and it is tightly controlled at multiple levels to sustain the self-renewal capacity and differentiation potential of HSCs. Dysregulation of self-renewal and differentiation of HSCs leads to the development of hematologic diseases, including acute myeloid leukemia (AML). Thus, understanding the underlying mechanisms of HSC maintenance and the development of hematologic malignancies is one of the fundamental scientific endeavors in stem cell biology. N 6-methyladenosine (m6A) is a common modification in mammalian messenger RNAs (mRNAs) and plays important roles in various biological processes. In this study, we performed a comparative analysis of the dynamics of the RNA m6A methylome of hematopoietic stem and progenitor cells (HSPCs) and leukemia-initiating cells (LICs) in AML. We found that RNA m6A modification regulates the transformation of long-term HSCs into short-term HSCs and determines the lineage commitment of HSCs. Interestingly, m6A modification leads to reprogramming that promotes cellular transformation during AML development, and LIC-specific m6A targets are recognized by different m6A readers. Moreover, the very long chain fatty acid transporter ATP-binding cassette subfamily D member 2 (ABCD2) is a key factor that promotes AML development, and deletion of ABCD2 damages clonogenic ability, inhibits proliferation, and promotes apoptosis of human leukemia cells. This study provides a comprehensive understanding of the role of m6A in regulating cell state transition in normal hematopoiesis and leukemogenesis, and identifies ABCD2 as a key factor in AML development.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141447874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jiuxin Qu, Wanfei Liu, Shuyan Chen, Chi Wu, Wenjie Lai, Rui Qin, Feidi Ye, Yuanchun Li, Liang Fu, Guofang Deng, Lei Liu, Qiang Lin, Peng Cui
The commonly-used drug susceptibility testing (DST) relies on bacterial culture and faces shortcomings such as long turnaround time and clone/subclone selection. We developed a targeted deep amplification sequencing (DAS) method directly applied to clinical specimens. In this DAS panel, we examined 941 drug-resistant mutations associated with 20 anti-tuberculosis drugs with an initial amount of 4 pg DNA and reduced clinical testing time from 20 days to two days. A prospective study was conducted using 115 clinical specimens mainly with Xpert® Mycobacterium tuberculosis/rifampicin (Xpert MTB/RIF) assay positive to evaluate drug-resistant mutation detection. DAS was performed on culture-free specimens, while culture-dependent isolates were used for phenotypic DST, DAS, and whole-genome sequencing (WGS). For in silico molecular DST, our result based on DAS panel revealed the similar accuracy to three published reports based on WGS. For 82 isolates, application of DAS showed better sensitivity (93.03% vs. 92.16%), specificity (96.10% vs. 95.02%), and accuracy (91.33% vs. 90.62%) than Mykrobe software using WGS. Compared to culture-dependent WGS, culture-free DAS provides a full picture of sequence variation at population level, exhibiting in detail the gain-and-loss variants caused by bacterial culture. Our study performs a systematic verification of the advantages of DAS in clinical applications and comprehensively illustrates the discrepancy in Mycobacterium tuberculosis before and after culture.
常用的药敏试验(DST)依赖于细菌培养,面临着周转时间长和克隆/亚克隆选择等缺点。我们开发了一种直接应用于临床标本的靶向深度扩增测序(DAS)方法。在这一 DAS 面板中,我们检测了与 20 种抗结核药物相关的 941 个耐药突变,初始 DNA 量为 4 pg,并将临床检测时间从 20 天缩短至两天。一项前瞻性研究使用了 115 份临床标本,主要是 Xpert® 结核分枝杆菌/利福平(Xpert MTB/RIF)检测呈阳性的标本,以评估耐药突变的检测情况。DAS在无培养标本上进行,而依赖培养的分离株则用于表型DST、DAS和全基因组测序(WGS)。在硅分子 DST 方面,我们基于 DAS 面板得出的结果与已发表的三篇基于 WGS 的报告具有相似的准确性。对于 82 个分离物,应用 DAS 的灵敏度(93.03% 对 92.16%)、特异性(96.10% 对 95.02%)和准确性(91.33% 对 90.62%)均优于使用 WGS 的 Mykrobe 软件。与依赖培养基的 WGS 相比,无培养基 DAS 能提供群体水平上序列变异的全貌,详细展示细菌培养引起的增减变异。我们的研究系统地验证了 DAS 在临床应用中的优势,并全面说明了培养前后结核分枝杆菌的差异。
{"title":"Deep Amplicon Sequencing Reveals Culture Selection of Mycobacterium Tuberculosis by Clinical Samples.","authors":"Jiuxin Qu, Wanfei Liu, Shuyan Chen, Chi Wu, Wenjie Lai, Rui Qin, Feidi Ye, Yuanchun Li, Liang Fu, Guofang Deng, Lei Liu, Qiang Lin, Peng Cui","doi":"10.1093/gpbjnl/qzae046","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae046","url":null,"abstract":"<p><p>The commonly-used drug susceptibility testing (DST) relies on bacterial culture and faces shortcomings such as long turnaround time and clone/subclone selection. We developed a targeted deep amplification sequencing (DAS) method directly applied to clinical specimens. In this DAS panel, we examined 941 drug-resistant mutations associated with 20 anti-tuberculosis drugs with an initial amount of 4 pg DNA and reduced clinical testing time from 20 days to two days. A prospective study was conducted using 115 clinical specimens mainly with Xpert® Mycobacterium tuberculosis/rifampicin (Xpert MTB/RIF) assay positive to evaluate drug-resistant mutation detection. DAS was performed on culture-free specimens, while culture-dependent isolates were used for phenotypic DST, DAS, and whole-genome sequencing (WGS). For in silico molecular DST, our result based on DAS panel revealed the similar accuracy to three published reports based on WGS. For 82 isolates, application of DAS showed better sensitivity (93.03% vs. 92.16%), specificity (96.10% vs. 95.02%), and accuracy (91.33% vs. 90.62%) than Mykrobe software using WGS. Compared to culture-dependent WGS, culture-free DAS provides a full picture of sequence variation at population level, exhibiting in detail the gain-and-loss variants caused by bacterial culture. Our study performs a systematic verification of the advantages of DAS in clinical applications and comprehensively illustrates the discrepancy in Mycobacterium tuberculosis before and after culture.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141319301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ribonuclease P (RNase P) was first described in the 1970's as an endoribonuclease acting in the maturation of precursor transfer RNAs (tRNAs). More recent studies, however, have uncovered non-canonical roles for RNase P and its components. Here, we review the recent progress of its involvement in chromatin assembly, DNA damage response, and maintenance of genome stability with implications in tumorigenesis. The possibility of RNase P as a therapeutic target in cancer is also discussed.
核糖核酸酶 P(RNase P)最早于 20 世纪 70 年代被描述为一种内切核糖核酸酶,在前体转运核糖核酸(tRNA)的成熟过程中发挥作用。然而,最近的研究发现了 RNase P 及其成分的非典型作用。在此,我们回顾了 RNase P 参与染色质组装、DNA 损伤反应和维持基因组稳定性的最新进展,以及对肿瘤发生的影响。我们还讨论了将 RNase P 作为癌症治疗靶点的可能性。
{"title":"RNase P: Beyond Precursor tRNA Processing.","authors":"Peipei Wang, Juntao Lin, Xiangyang Zheng, Xingzhi Xu","doi":"10.1093/gpbjnl/qzae016","DOIUrl":"10.1093/gpbjnl/qzae016","url":null,"abstract":"<p><p>Ribonuclease P (RNase P) was first described in the 1970's as an endoribonuclease acting in the maturation of precursor transfer RNAs (tRNAs). More recent studies, however, have uncovered non-canonical roles for RNase P and its components. Here, we review the recent progress of its involvement in chromatin assembly, DNA damage response, and maintenance of genome stability with implications in tumorigenesis. The possibility of RNase P as a therapeutic target in cancer is also discussed.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gunhwan Ko, Jae Ho Lee, Young Mi Sim, Wangho Song, Byung-Ha Yoon, Iksu Byeon, Bang Hyuck Lee, Sang-Ok Kim, Jinhyuk Choi, Insoo Jang, Hyerin Kim, Jin Ok Yang, Kiwon Jang, Sora Kim, Jong-Hwan Kim, Jongbum Jeon, Jaeeun Jung, Seungwoo Hwang, Ji-Hwan Park, Pan-Gyu Kim, Seon-Young Kim, Byungwook Lee
During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.
{"title":"KoNA: Korean Nucleotide Archive as A New Data Repository for Nucleotide Sequence Data.","authors":"Gunhwan Ko, Jae Ho Lee, Young Mi Sim, Wangho Song, Byung-Ha Yoon, Iksu Byeon, Bang Hyuck Lee, Sang-Ok Kim, Jinhyuk Choi, Insoo Jang, Hyerin Kim, Jin Ok Yang, Kiwon Jang, Sora Kim, Jong-Hwan Kim, Jongbum Jeon, Jaeeun Jung, Seungwoo Hwang, Ji-Hwan Park, Pan-Gyu Kim, Seon-Young Kim, Byungwook Lee","doi":"10.1093/gpbjnl/qzae017","DOIUrl":"10.1093/gpbjnl/qzae017","url":null,"abstract":"<p><p>During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction to: dbDEMC 3.0: Functional Exploration of Differentially Expressed miRNAs in Cancers of Human and Model Organisms.","authors":"","doi":"10.1093/gpbjnl/qzae037","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae037","url":null,"abstract":"","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Erythropoiesis is a finely regulated and complex process that involves multiple transformations from hematopoietic stem cells to mature red blood cells at hematopoietic sites from the embryonic to the adult stages. Investigations into its molecular mechanisms have generated a wealth of expression data, including bulk and single-cell RNA sequencing data. A comprehensively integrated and well-curated erythropoiesis-specific database will greatly facilitate the mining of gene expression data and enable large-scale research of erythropoiesis and erythroid-related diseases. Here, we present EryDB, an open-access and comprehensive database dedicated to the collection, integration, analysis, and visualization of transcriptomic data for erythropoiesis and erythroid-related diseases. Currently, the database includes expertly curated quality-assured data of 3803 samples and 1,187,119 single cells derived from 107 public studies of three species (Homo sapiens, Mus musculus, and Danio rerio), nine tissue types, and five diseases. EryDB provides users with the ability to not only browse the molecular features of erythropoiesis between tissues and species, but also perform computational analyses of single-cell and bulk RNA sequencing data, thus serving as a convenient platform for customized queries and analyses. EryDB v1.0 is freely accessible at https://ngdc.cncb.ac.cn/EryDB/home.
{"title":"EryDB: A Transcriptomic Profile Database for Erythropoiesis and Erythroid-related Diseases.","authors":"Guangmin Zheng, Song Wu, Zhaojun Zhang, Zijuan Xin, Lijuan Zhang, Siqi Zhao, Jing Wu, Yanxia Liu, Meng Li, Xiuyan Ruan, Nan Qiao, Yiming Bao, Hongzhu Qu, Xiangdong Fang","doi":"10.1093/gpbjnl/qzae029","DOIUrl":"https://doi.org/10.1093/gpbjnl/qzae029","url":null,"abstract":"<p><p>Erythropoiesis is a finely regulated and complex process that involves multiple transformations from hematopoietic stem cells to mature red blood cells at hematopoietic sites from the embryonic to the adult stages. Investigations into its molecular mechanisms have generated a wealth of expression data, including bulk and single-cell RNA sequencing data. A comprehensively integrated and well-curated erythropoiesis-specific database will greatly facilitate the mining of gene expression data and enable large-scale research of erythropoiesis and erythroid-related diseases. Here, we present EryDB, an open-access and comprehensive database dedicated to the collection, integration, analysis, and visualization of transcriptomic data for erythropoiesis and erythroid-related diseases. Currently, the database includes expertly curated quality-assured data of 3803 samples and 1,187,119 single cells derived from 107 public studies of three species (Homo sapiens, Mus musculus, and Danio rerio), nine tissue types, and five diseases. EryDB provides users with the ability to not only browse the molecular features of erythropoiesis between tissues and species, but also perform computational analyses of single-cell and bulk RNA sequencing data, thus serving as a convenient platform for customized queries and analyses. EryDB v1.0 is freely accessible at https://ngdc.cncb.ac.cn/EryDB/home.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142484013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-01Epub Date: 2023-09-22DOI: 10.1016/j.gpb.2023.09.001
Xue Zhang
{"title":"T2T-YAO Reference Genome of Han Chinese - New Step in Advancing Precision Medicine in China.","authors":"Xue Zhang","doi":"10.1016/j.gpb.2023.09.001","DOIUrl":"10.1016/j.gpb.2023.09.001","url":null,"abstract":"","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":"1083-1084"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11082255/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41109240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}