Pub Date : 2022-09-09DOI: 10.1186/s13059-022-02755-2
Klara Kuret, Aram Gustav Amalietti, D Marc Jones, Charlotte Capitanchik, Jernej Ule
Background: Crosslinking and immunoprecipitation (CLIP) is a method used to identify in vivo RNA-protein binding sites on a transcriptome-wide scale. With the increasing amounts of available data for RNA-binding proteins (RBPs), it is important to understand to what degree the enriched motifs specify the RNA-binding profiles of RBPs in cells.
Results: We develop positionally enriched k-mer analysis (PEKA), a computational tool for efficient analysis of enriched motifs from individual CLIP datasets, which minimizes the impact of technical and regional genomic biases by internal data normalization. We cross-validate PEKA with mCross and show that the use of input control for background correction is not required to yield high specificity of enriched motifs. We identify motif classes with common enrichment patterns across eCLIP datasets and across RNA regions, while also observing variations in the specificity and the extent of motif enrichment across eCLIP datasets, between variant CLIP protocols, and between CLIP and in vitro binding data. Thereby, we gain insights into the contributions of technical and regional genomic biases to the enriched motifs, and find how motif enrichment features relate to the domain composition and low-complexity regions of the studied proteins.
Conclusions: Our study provides insights into the overall contributions of regional binding preferences, protein domains, and low-complexity regions to the specificity of protein-RNA interactions, and shows the value of cross-motif and cross-RBP comparison for data interpretation. Our results are presented for exploratory analysis via an online platform in an RBP-centric and motif-centric manner ( https://imaps.goodwright.com/apps/peka/ ).
{"title":"Positional motif analysis reveals the extent of specificity of protein-RNA interactions observed by CLIP.","authors":"Klara Kuret, Aram Gustav Amalietti, D Marc Jones, Charlotte Capitanchik, Jernej Ule","doi":"10.1186/s13059-022-02755-2","DOIUrl":"10.1186/s13059-022-02755-2","url":null,"abstract":"<p><strong>Background: </strong>Crosslinking and immunoprecipitation (CLIP) is a method used to identify in vivo RNA-protein binding sites on a transcriptome-wide scale. With the increasing amounts of available data for RNA-binding proteins (RBPs), it is important to understand to what degree the enriched motifs specify the RNA-binding profiles of RBPs in cells.</p><p><strong>Results: </strong>We develop positionally enriched k-mer analysis (PEKA), a computational tool for efficient analysis of enriched motifs from individual CLIP datasets, which minimizes the impact of technical and regional genomic biases by internal data normalization. We cross-validate PEKA with mCross and show that the use of input control for background correction is not required to yield high specificity of enriched motifs. We identify motif classes with common enrichment patterns across eCLIP datasets and across RNA regions, while also observing variations in the specificity and the extent of motif enrichment across eCLIP datasets, between variant CLIP protocols, and between CLIP and in vitro binding data. Thereby, we gain insights into the contributions of technical and regional genomic biases to the enriched motifs, and find how motif enrichment features relate to the domain composition and low-complexity regions of the studied proteins.</p><p><strong>Conclusions: </strong>Our study provides insights into the overall contributions of regional binding preferences, protein domains, and low-complexity regions to the specificity of protein-RNA interactions, and shows the value of cross-motif and cross-RBP comparison for data interpretation. Our results are presented for exploratory analysis via an online platform in an RBP-centric and motif-centric manner ( https://imaps.goodwright.com/apps/peka/ ).</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"191"},"PeriodicalIF":12.3,"publicationDate":"2022-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33460725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-09-08DOI: 10.1186/s13059-022-02743-6
Jamshed Khan, Marek Kokot, Sebastian Deorowicz, Rob Patro
The de Bruijn graph is a key data structure in modern computational genomics, and construction of its compacted variant resides upstream of many genomic analyses. As the quantity of genomic data grows rapidly, this often forms a computational bottleneck. We present Cuttlefish 2, significantly advancing the state-of-the-art for this problem. On a commodity server, it reduces the graph construction time for 661K bacterial genomes, of size 2.58Tbp, from 4.5 days to 17-23 h; and it constructs the graph for 1.52Tbp white spruce reads in approximately 10 h, while the closest competitor requires 54-58 h, using considerably more memory.
{"title":"Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2.","authors":"Jamshed Khan, Marek Kokot, Sebastian Deorowicz, Rob Patro","doi":"10.1186/s13059-022-02743-6","DOIUrl":"10.1186/s13059-022-02743-6","url":null,"abstract":"<p><p>The de Bruijn graph is a key data structure in modern computational genomics, and construction of its compacted variant resides upstream of many genomic analyses. As the quantity of genomic data grows rapidly, this often forms a computational bottleneck. We present Cuttlefish 2, significantly advancing the state-of-the-art for this problem. On a commodity server, it reduces the graph construction time for 661K bacterial genomes, of size 2.58Tbp, from 4.5 days to 17-23 h; and it constructs the graph for 1.52Tbp white spruce reads in approximately 10 h, while the closest competitor requires 54-58 h, using considerably more memory.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"190"},"PeriodicalIF":12.3,"publicationDate":"2022-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454175/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33453631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-09-07DOI: 10.1186/s13059-022-02756-1
Ningyang Li, Xueyu Zhang, Xiudong Sun, Siyuan Zhu, Yi Cheng, Meng Liu, Song Gao, Jiangjiang Zhang, Yanzhou Wang, Xiai Yang, Jianrong Chen, Fu Li, Qiaoyun He, Zheng Zeng, Xiaoge Yuan, Zhiman Zhou, Longchuan Ma, Taotao Wang, Xiang Li, Hanqiang Liu, Yupeng Pan, Mengyan Zhou, Chunsheng Gao, Gang Zhou, Zhenlin Han, Shiqi Liu, Jianguang Su, Zhihui Cheng, Shilin Tian, Touming Liu
Background: Garlic is an entirely sterile crop with important value as a vegetable, condiment, and medicine. However, the evolutionary history of garlic remains largely unknown.
Results: Here we report a comprehensive map of garlic genomic variation, consisting of amazingly 129.4 million variations. Evolutionary analysis indicates that the garlic population diverged at least 100,000 years ago, and the two groups cultivated in China were domesticated from two independent routes. Consequently, 15.0 and 17.5% of genes underwent an expression change in two cultivated groups, causing a reshaping of their transcriptomic architecture. Furthermore, we find independent domestication leads to few overlaps of deleterious substitutions in these two groups due to separate accumulation and selection-based removal. By analysis of selective sweeps, genome-wide trait associations and associated transcriptomic analysis, we uncover differential selections for the bulb traits in these two garlic groups during their domestication.
Conclusions: This study provides valuable resources for garlic genomics-based breeding, and comprehensive insights into the evolutionary history of this clonal-propagated crop.
{"title":"Genomic insights into the evolutionary history and diversification of bulb traits in garlic.","authors":"Ningyang Li, Xueyu Zhang, Xiudong Sun, Siyuan Zhu, Yi Cheng, Meng Liu, Song Gao, Jiangjiang Zhang, Yanzhou Wang, Xiai Yang, Jianrong Chen, Fu Li, Qiaoyun He, Zheng Zeng, Xiaoge Yuan, Zhiman Zhou, Longchuan Ma, Taotao Wang, Xiang Li, Hanqiang Liu, Yupeng Pan, Mengyan Zhou, Chunsheng Gao, Gang Zhou, Zhenlin Han, Shiqi Liu, Jianguang Su, Zhihui Cheng, Shilin Tian, Touming Liu","doi":"10.1186/s13059-022-02756-1","DOIUrl":"https://doi.org/10.1186/s13059-022-02756-1","url":null,"abstract":"<p><strong>Background: </strong>Garlic is an entirely sterile crop with important value as a vegetable, condiment, and medicine. However, the evolutionary history of garlic remains largely unknown.</p><p><strong>Results: </strong>Here we report a comprehensive map of garlic genomic variation, consisting of amazingly 129.4 million variations. Evolutionary analysis indicates that the garlic population diverged at least 100,000 years ago, and the two groups cultivated in China were domesticated from two independent routes. Consequently, 15.0 and 17.5% of genes underwent an expression change in two cultivated groups, causing a reshaping of their transcriptomic architecture. Furthermore, we find independent domestication leads to few overlaps of deleterious substitutions in these two groups due to separate accumulation and selection-based removal. By analysis of selective sweeps, genome-wide trait associations and associated transcriptomic analysis, we uncover differential selections for the bulb traits in these two garlic groups during their domestication.</p><p><strong>Conclusions: </strong>This study provides valuable resources for garlic genomics-based breeding, and comprehensive insights into the evolutionary history of this clonal-propagated crop.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"188"},"PeriodicalIF":12.3,"publicationDate":"2022-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9450234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33448119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-08-29DOI: 10.1186/s13059-022-02750-7
Camille Fonouni-Farde, Aurélie Christ, Thomas Blein, María Florencia Legascue, Lucía Ferrero, Michaël Moison, Leandro Lucero, Juan Sebastián Ramírez-Prado, David Latrasse, Daniel Gonzalez, Moussa Benhamed, Leandro Quadrana, Martin Crespi, Federico Ariel
Background: RNA-DNA hybrid (R-loop)-associated long noncoding RNAs (lncRNAs), including the Arabidopsis lncRNA AUXIN-REGULATED PROMOTER LOOP (APOLO), are emerging as important regulators of three-dimensional chromatin conformation and gene transcriptional activity.
Results: Here, we show that in addition to the PRC1-component LIKE HETEROCHROMATIN PROTEIN 1 (LHP1), APOLO interacts with the methylcytosine-binding protein VARIANT IN METHYLATION 1 (VIM1), a conserved homolog of the mammalian DNA methylation regulator UBIQUITIN-LIKE CONTAINING PHD AND RING FINGER DOMAINS 1 (UHRF1). The APOLO-VIM1-LHP1 complex directly regulates the transcription of the auxin biosynthesis gene YUCCA2 by dynamically determining DNA methylation and H3K27me3 deposition over its promoter during the plant thermomorphogenic response. Strikingly, we demonstrate that the lncRNA UHRF1 Protein Associated Transcript (UPAT), a direct interactor of UHRF1 in humans, can be recognized by VIM1 and LHP1 in plant cells, despite the lack of sequence homology between UPAT and APOLO. In addition, we show that increased levels of APOLO or UPAT hamper VIM1 and LHP1 binding to YUCCA2 promoter and globally alter the Arabidopsis transcriptome in a similar manner.
Conclusions: Collectively, our results uncover a new mechanism in which a plant lncRNA coordinates Polycomb action and DNA methylation through the interaction with VIM1, and indicates that evolutionary unrelated lncRNAs with potentially conserved structures may exert similar functions by interacting with homolog partners.
背景:RNA-DNA杂交(R-loop)相关的长链非编码rna (lncRNAs),包括拟南芥lncRNA生长素调控的启动子LOOP (APOLO),正在成为三维染色质构象和基因转录活性的重要调节因子。结果:在这里,我们发现除了prc1组分LIKE异染色质蛋白1 (LHP1)外,APOLO还与甲基胞嘧啶结合蛋白VARIANT in METHYLATION 1 (VIM1)相互作用,VIM1是哺乳动物DNA甲基化调节因子UBIQUITIN-LIKE CONTAINING PHD AND RING FINGER DOMAINS 1 (UHRF1)的保守同源物。APOLO-VIM1-LHP1复合体在植物热形态响应过程中,通过动态决定DNA甲基化和H3K27me3在启动子上的沉积,直接调节生长素生物合成基因YUCCA2的转录。引人注目的是,我们证明了lncRNA UHRF1蛋白相关转录本(UPAT)是人类UHRF1的直接相互作用物,可以被植物细胞中的VIM1和LHP1识别,尽管UPAT和APOLO之间缺乏序列同源性。此外,我们发现APOLO或UPAT水平的增加阻碍了VIM1和LHP1与YUCCA2启动子的结合,并以类似的方式改变了拟南芥转录组。总之,我们的研究结果揭示了植物lncRNA通过与VIM1的相互作用协调Polycomb作用和DNA甲基化的新机制,并表明具有潜在保守结构的进化无关的lncRNA可能通过与同源伴侣的相互作用发挥类似的功能。
{"title":"The Arabidopsis APOLO and human UPAT sequence-unrelated long noncoding RNAs can modulate DNA and histone methylation machineries in plants.","authors":"Camille Fonouni-Farde, Aurélie Christ, Thomas Blein, María Florencia Legascue, Lucía Ferrero, Michaël Moison, Leandro Lucero, Juan Sebastián Ramírez-Prado, David Latrasse, Daniel Gonzalez, Moussa Benhamed, Leandro Quadrana, Martin Crespi, Federico Ariel","doi":"10.1186/s13059-022-02750-7","DOIUrl":"https://doi.org/10.1186/s13059-022-02750-7","url":null,"abstract":"<p><strong>Background: </strong>RNA-DNA hybrid (R-loop)-associated long noncoding RNAs (lncRNAs), including the Arabidopsis lncRNA AUXIN-REGULATED PROMOTER LOOP (APOLO), are emerging as important regulators of three-dimensional chromatin conformation and gene transcriptional activity.</p><p><strong>Results: </strong>Here, we show that in addition to the PRC1-component LIKE HETEROCHROMATIN PROTEIN 1 (LHP1), APOLO interacts with the methylcytosine-binding protein VARIANT IN METHYLATION 1 (VIM1), a conserved homolog of the mammalian DNA methylation regulator UBIQUITIN-LIKE CONTAINING PHD AND RING FINGER DOMAINS 1 (UHRF1). The APOLO-VIM1-LHP1 complex directly regulates the transcription of the auxin biosynthesis gene YUCCA2 by dynamically determining DNA methylation and H3K27me3 deposition over its promoter during the plant thermomorphogenic response. Strikingly, we demonstrate that the lncRNA UHRF1 Protein Associated Transcript (UPAT), a direct interactor of UHRF1 in humans, can be recognized by VIM1 and LHP1 in plant cells, despite the lack of sequence homology between UPAT and APOLO. In addition, we show that increased levels of APOLO or UPAT hamper VIM1 and LHP1 binding to YUCCA2 promoter and globally alter the Arabidopsis transcriptome in a similar manner.</p><p><strong>Conclusions: </strong>Collectively, our results uncover a new mechanism in which a plant lncRNA coordinates Polycomb action and DNA methylation through the interaction with VIM1, and indicates that evolutionary unrelated lncRNAs with potentially conserved structures may exert similar functions by interacting with homolog partners.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"181"},"PeriodicalIF":12.3,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9422110/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33446687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-08-29DOI: 10.1186/s13059-022-02735-6
Bryce Kille, Advait Balaji, Fritz J Sedlazeck, Michael Nute, Todd J Treangen
With the arrival of telomere-to-telomere (T2T) assemblies of the human genome comes the computational challenge of efficiently and accurately constructing multiple genome alignments at an unprecedented scale. By identifying nucleotides across genomes which share a common ancestor, multiple genome alignments commonly serve as the bedrock for comparative genomics studies. In this review, we provide an overview of the algorithmic template that most multiple genome alignment methods follow. We also discuss prospective areas of improvement of multiple genome alignment for keeping up with continuously arriving high-quality T2T assembled genomes and for unlocking clinically-relevant insights.
{"title":"Multiple genome alignment in the telomere-to-telomere assembly era.","authors":"Bryce Kille, Advait Balaji, Fritz J Sedlazeck, Michael Nute, Todd J Treangen","doi":"10.1186/s13059-022-02735-6","DOIUrl":"https://doi.org/10.1186/s13059-022-02735-6","url":null,"abstract":"<p><p>With the arrival of telomere-to-telomere (T2T) assemblies of the human genome comes the computational challenge of efficiently and accurately constructing multiple genome alignments at an unprecedented scale. By identifying nucleotides across genomes which share a common ancestor, multiple genome alignments commonly serve as the bedrock for comparative genomics studies. In this review, we provide an overview of the algorithmic template that most multiple genome alignment methods follow. We also discuss prospective areas of improvement of multiple genome alignment for keeping up with continuously arriving high-quality T2T assembled genomes and for unlocking clinically-relevant insights.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"182"},"PeriodicalIF":12.3,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421119/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33447714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-05-18DOI: 10.1186/s13059-022-02684-0
Peiyao Zhao, Jiaqiang Zhu, Ying Ma, Xiang Zhou
Background: Spatial transcriptomics are a set of new technologies that profile gene expression on tissues with spatial localization information. With technological advances, recent spatial transcriptomics data are often in the form of sparse counts with an excessive amount of zero values.
Results: We perform a comprehensive analysis on 20 spatial transcriptomics datasets collected from 11 distinct technologies to characterize the distributional properties of the expression count data and understand the statistical nature of the zero values. Across datasets, we show that a substantial fraction of genes displays overdispersion and/or zero inflation that cannot be accounted for by a Poisson model, with genes displaying overdispersion substantially overlapped with genes displaying zero inflation. In addition, we find that either the Poisson or the negative binomial model is sufficient for modeling the majority of genes across most spatial transcriptomics technologies. We further show major sources of overdispersion and zero inflation in spatial transcriptomics including gene expression heterogeneity across tissue locations and spatial distribution of cell types. In particular, when we focus on a relatively homogeneous set of tissue locations or control for cell type compositions, the number of detected overdispersed and/or zero-inflated genes is substantially reduced, and a simple Poisson model is often sufficient to fit the gene expression data there.
Conclusions: Our study provides the first comprehensive evidence that excessive zeros in spatial transcriptomics are not due to zero inflation, supporting the use of count models without a zero inflation component for modeling spatial transcriptomics.
{"title":"Modeling zero inflation is not necessary for spatial transcriptomics.","authors":"Peiyao Zhao, Jiaqiang Zhu, Ying Ma, Xiang Zhou","doi":"10.1186/s13059-022-02684-0","DOIUrl":"https://doi.org/10.1186/s13059-022-02684-0","url":null,"abstract":"<p><strong>Background: </strong>Spatial transcriptomics are a set of new technologies that profile gene expression on tissues with spatial localization information. With technological advances, recent spatial transcriptomics data are often in the form of sparse counts with an excessive amount of zero values.</p><p><strong>Results: </strong>We perform a comprehensive analysis on 20 spatial transcriptomics datasets collected from 11 distinct technologies to characterize the distributional properties of the expression count data and understand the statistical nature of the zero values. Across datasets, we show that a substantial fraction of genes displays overdispersion and/or zero inflation that cannot be accounted for by a Poisson model, with genes displaying overdispersion substantially overlapped with genes displaying zero inflation. In addition, we find that either the Poisson or the negative binomial model is sufficient for modeling the majority of genes across most spatial transcriptomics technologies. We further show major sources of overdispersion and zero inflation in spatial transcriptomics including gene expression heterogeneity across tissue locations and spatial distribution of cell types. In particular, when we focus on a relatively homogeneous set of tissue locations or control for cell type compositions, the number of detected overdispersed and/or zero-inflated genes is substantially reduced, and a simple Poisson model is often sufficient to fit the gene expression data there.</p><p><strong>Conclusions: </strong>Our study provides the first comprehensive evidence that excessive zeros in spatial transcriptomics are not due to zero inflation, supporting the use of count models without a zero inflation component for modeling spatial transcriptomics.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"118"},"PeriodicalIF":12.3,"publicationDate":"2022-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116027/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142074282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-04-05DOI: 10.1186/s13059-022-02654-6
Connie L Jiang, Yogesh Goyal, Naveen Jain, Qiaohong Wang, Rachel E Truitt, Allison J Coté, Benjamin Emert, Ian A Mellis, Karun Kiani, Wenli Yang, Rajan Jain, Arjun Raj
Background: Cardiac differentiation of human-induced pluripotent stem (hiPS) cells consistently produces a mixed population of cardiomyocytes and non-cardiac cell types, even when using well-characterized protocols. We sought to determine whether different cell types might result from intrinsic differences in hiPS cells prior to the onset of differentiation.
Results: By associating individual differentiated cells that share a common hiPS cell precursor, we tested whether expression variability is predetermined from the hiPS cell state. In a single experiment, cells that shared a progenitor were more transcriptionally similar to each other than to other cells in the differentiated population. However, when the same hiPS cells were differentiated in parallel, we did not observe high transcriptional similarity across differentiations. Additionally, we found that substantial cell death occurs during differentiation in a manner that suggested all cells were equally likely to survive or die, suggesting that there is no intrinsic selection bias for cells descended from particular hiPS cell progenitors. We thus wondered how cells grow spatially during differentiation, so we labeled cells by expression of marker genes and found that cells expressing the same marker tended to occur in patches. Our results suggest that cell type determination across multiple cell types, once initiated, is maintained in a cell-autonomous manner for multiple divisions.
Conclusions: Altogether, our results show that while substantial heterogeneity exists in the initial hiPS cell population, it is not responsible for the variability observed in differentiated outcomes; instead, factors specifying the various cell types likely act during a window that begins shortly after the seeding of hiPS cells for differentiation.
{"title":"Cell type determination for cardiac differentiation occurs soon after seeding of human-induced pluripotent stem cells.","authors":"Connie L Jiang, Yogesh Goyal, Naveen Jain, Qiaohong Wang, Rachel E Truitt, Allison J Coté, Benjamin Emert, Ian A Mellis, Karun Kiani, Wenli Yang, Rajan Jain, Arjun Raj","doi":"10.1186/s13059-022-02654-6","DOIUrl":"10.1186/s13059-022-02654-6","url":null,"abstract":"<p><strong>Background: </strong>Cardiac differentiation of human-induced pluripotent stem (hiPS) cells consistently produces a mixed population of cardiomyocytes and non-cardiac cell types, even when using well-characterized protocols. We sought to determine whether different cell types might result from intrinsic differences in hiPS cells prior to the onset of differentiation.</p><p><strong>Results: </strong>By associating individual differentiated cells that share a common hiPS cell precursor, we tested whether expression variability is predetermined from the hiPS cell state. In a single experiment, cells that shared a progenitor were more transcriptionally similar to each other than to other cells in the differentiated population. However, when the same hiPS cells were differentiated in parallel, we did not observe high transcriptional similarity across differentiations. Additionally, we found that substantial cell death occurs during differentiation in a manner that suggested all cells were equally likely to survive or die, suggesting that there is no intrinsic selection bias for cells descended from particular hiPS cell progenitors. We thus wondered how cells grow spatially during differentiation, so we labeled cells by expression of marker genes and found that cells expressing the same marker tended to occur in patches. Our results suggest that cell type determination across multiple cell types, once initiated, is maintained in a cell-autonomous manner for multiple divisions.</p><p><strong>Conclusions: </strong>Altogether, our results show that while substantial heterogeneity exists in the initial hiPS cell population, it is not responsible for the variability observed in differentiated outcomes; instead, factors specifying the various cell types likely act during a window that begins shortly after the seeding of hiPS cells for differentiation.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"90"},"PeriodicalIF":12.3,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8985385/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140912393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-02-28DOI: 10.1186/s13059-022-02638-6
Kaiwen Bao, Qi Zhang, Shuai Liu, Nan Song, Qiushi Guo, Ling Liu, Shanshan Tian, Jihui Hao, Yi Zhu, Kai Zhang, Ding Ai, Jie Yang, Zhi Yao, Roland Foisner, Lei Shi
Background: Single-stranded DNA (ssDNA) coated with replication protein A (RPA) acts as a key platform for the recruitment and exchange of genome maintenance factors in DNA damage response. Yet, how the formation of the ssDNA-RPA intermediate is regulated remains elusive.
Results: Here, we report that the lamin-associated protein LAP2α is physically associated with RPA, and LAP2α preferentially facilitates RPA deposition on damaged chromatin via physical contacts between LAP2α and RPA1. Importantly, LAP2α-promoted RPA binding to ssDNA plays a critical role in protection of replication forks, activation of ATR, and repair of damaged DNA. We further demonstrate that the preference of LAP2α-promoted RPA loading on damaged chromatin depends on poly ADP-ribose polymerase PARP1, but not poly(ADP-ribosyl)ation.
Conclusions: Our study provides mechanistic insight into RPA deposition in response to DNA damage and reveals a genome protection role of LAP2α.
背景:复制蛋白 A(RPA)包被的单链 DNA(ssDNA)是 DNA 损伤反应中基因组维持因子招募和交换的关键平台。然而,ssDNA-RPA中间体的形成是如何调控的仍不清楚:结果:我们在此报告了片层相关蛋白LAP2α与RPA的物理关联,LAP2α通过LAP2α和RPA1之间的物理接触优先促进RPA在受损染色质上的沉积。重要的是,LAP2α 促进 RPA 与 ssDNA 结合,在保护复制叉、激活 ATR 和修复受损 DNA 方面发挥了关键作用。我们进一步证明,LAP2α促进的RPA负载在受损染色质上的偏好取决于多聚ADP-核糖聚合酶PARP1,而不是多聚(ADP-核糖)结合:我们的研究从机理上揭示了DNA损伤时RPA的沉积,并揭示了LAP2α的基因组保护作用。
{"title":"LAP2α preserves genome integrity through assisting RPA deposition on damaged chromatin.","authors":"Kaiwen Bao, Qi Zhang, Shuai Liu, Nan Song, Qiushi Guo, Ling Liu, Shanshan Tian, Jihui Hao, Yi Zhu, Kai Zhang, Ding Ai, Jie Yang, Zhi Yao, Roland Foisner, Lei Shi","doi":"10.1186/s13059-022-02638-6","DOIUrl":"10.1186/s13059-022-02638-6","url":null,"abstract":"<p><strong>Background: </strong>Single-stranded DNA (ssDNA) coated with replication protein A (RPA) acts as a key platform for the recruitment and exchange of genome maintenance factors in DNA damage response. Yet, how the formation of the ssDNA-RPA intermediate is regulated remains elusive.</p><p><strong>Results: </strong>Here, we report that the lamin-associated protein LAP2α is physically associated with RPA, and LAP2α preferentially facilitates RPA deposition on damaged chromatin via physical contacts between LAP2α and RPA1. Importantly, LAP2α-promoted RPA binding to ssDNA plays a critical role in protection of replication forks, activation of ATR, and repair of damaged DNA. We further demonstrate that the preference of LAP2α-promoted RPA loading on damaged chromatin depends on poly ADP-ribose polymerase PARP1, but not poly(ADP-ribosyl)ation.</p><p><strong>Conclusions: </strong>Our study provides mechanistic insight into RPA deposition in response to DNA damage and reveals a genome protection role of LAP2α.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"64"},"PeriodicalIF":12.3,"publicationDate":"2022-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8883701/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-04-13DOI: 10.1186/s13059-021-02323-0
Gareth B Gillard, Lars Grønvold, Line L Røsæg, Matilde Mengkrog Holen, Øystein Monsen, Ben F Koop, Eric B Rondeau, Manu Kumar Gundappa, John Mendoza, Daniel J Macqueen, Rori V Rohlfs, Simen R Sandve, Torgeir R Hvidsten
Background: Whole genome duplication (WGD) events have played a major role in eukaryotic genome evolution, but the consequence of these extreme events in adaptive genome evolution is still not well understood. To address this knowledge gap, we used a comparative phylogenetic model and transcriptomic data from seven species to infer selection on gene expression in duplicated genes (ohnologs) following the salmonid WGD 80-100 million years ago.
Results: We find rare cases of tissue-specific expression evolution but pervasive expression evolution affecting many tissues, reflecting strong selection on maintenance of genome stability following genome doubling. Ohnolog expression levels have evolved mostly asymmetrically, by diverting one ohnolog copy down a path towards lower expression and possible pseudogenization. Loss of expression in one ohnolog is significantly associated with transposable element insertions in promoters and likely driven by selection on gene dosage including selection on stoichiometric balance. We also find symmetric expression shifts, and these are associated with genes under strong evolutionary constraints such as ribosome subunit genes. This possibly reflects selection operating to achieve a gene dose reduction while avoiding accumulation of "toxic mutations". Mechanistically, ohnolog regulatory divergence is dictated by the number of bound transcription factors in promoters, with transposable elements being one likely source of novel binding sites driving tissue-specific gains in expression.
Conclusions: Our results imply pervasive adaptive expression evolution following WGD to overcome the immediate challenges posed by genome doubling and to exploit the long-term genetic opportunities for novel phenotype evolution.
{"title":"Comparative regulomics supports pervasive selection on gene dosage following whole genome duplication.","authors":"Gareth B Gillard, Lars Grønvold, Line L Røsæg, Matilde Mengkrog Holen, Øystein Monsen, Ben F Koop, Eric B Rondeau, Manu Kumar Gundappa, John Mendoza, Daniel J Macqueen, Rori V Rohlfs, Simen R Sandve, Torgeir R Hvidsten","doi":"10.1186/s13059-021-02323-0","DOIUrl":"10.1186/s13059-021-02323-0","url":null,"abstract":"<p><strong>Background: </strong>Whole genome duplication (WGD) events have played a major role in eukaryotic genome evolution, but the consequence of these extreme events in adaptive genome evolution is still not well understood. To address this knowledge gap, we used a comparative phylogenetic model and transcriptomic data from seven species to infer selection on gene expression in duplicated genes (ohnologs) following the salmonid WGD 80-100 million years ago.</p><p><strong>Results: </strong>We find rare cases of tissue-specific expression evolution but pervasive expression evolution affecting many tissues, reflecting strong selection on maintenance of genome stability following genome doubling. Ohnolog expression levels have evolved mostly asymmetrically, by diverting one ohnolog copy down a path towards lower expression and possible pseudogenization. Loss of expression in one ohnolog is significantly associated with transposable element insertions in promoters and likely driven by selection on gene dosage including selection on stoichiometric balance. We also find symmetric expression shifts, and these are associated with genes under strong evolutionary constraints such as ribosome subunit genes. This possibly reflects selection operating to achieve a gene dose reduction while avoiding accumulation of \"toxic mutations\". Mechanistically, ohnolog regulatory divergence is dictated by the number of bound transcription factors in promoters, with transposable elements being one likely source of novel binding sites driving tissue-specific gains in expression.</p><p><strong>Conclusions: </strong>Our results imply pervasive adaptive expression evolution following WGD to overcome the immediate challenges posed by genome doubling and to exploit the long-term genetic opportunities for novel phenotype evolution.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"22 1","pages":"103"},"PeriodicalIF":12.3,"publicationDate":"2021-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25585826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-04-13DOI: 10.1186/s13059-021-02331-0
Shuang Qu, Zichen Jiao, Geng Lu, Bing Yao, Ting Wang, Weiwei Rong, Jiahan Xu, Ting Fan, Xinlei Sun, Rong Yang, Jun Wang, Yongzhong Yao, Guifang Xu, Xin Yan, Tao Wang, Hongwei Liang, Ke Zen
Background: Although using a blockade of programmed death-ligand 1 (PD-L1) to enhance T cell immune responses shows great promise in tumor immunotherapy, the immune-checkpoint inhibition strategy is limited for patients with solid tumors. The mechanism and efficacy of such immune-checkpoint inhibition strategies in solid tumors remains unclear.
Results: Employing qRT-PCR, Sanger sequencing, and RNA BaseScope analysis, we show that human lung adenocarcinoma (LUAD) all produce a long non-coding RNA isoform of PD-L1 (PD-L1-lnc) by alternative splicing, regardless if the tumor is positive or negative for the protein PD-L1. Similar to PD-L1 mRNA, PD-L1-lnc in various lung adenocarcinoma cells is significantly upregulated by IFNγ. Both in vitro and in vivo studies demonstrate that PD-L1-lnc increases proliferation and invasion but decreases apoptosis of lung adenocarcinoma cells. Mechanistically, PD-L1-lnc promotes lung adenocarcinoma progression through directly binding to c-Myc and enhancing c-Myc transcriptional activity.
Conclusions: In summary, the PD-L1 gene can generate a long non-coding RNA through alternative splicing to promote lung adenocarcinoma progression by enhancing c-Myc activity. Our results argue in favor of investigating PD-L1-lnc depletion in combination with PD-L1 blockade in lung cancer therapy.
{"title":"PD-L1 lncRNA splice isoform promotes lung adenocarcinoma progression via enhancing c-Myc activity.","authors":"Shuang Qu, Zichen Jiao, Geng Lu, Bing Yao, Ting Wang, Weiwei Rong, Jiahan Xu, Ting Fan, Xinlei Sun, Rong Yang, Jun Wang, Yongzhong Yao, Guifang Xu, Xin Yan, Tao Wang, Hongwei Liang, Ke Zen","doi":"10.1186/s13059-021-02331-0","DOIUrl":"10.1186/s13059-021-02331-0","url":null,"abstract":"<p><strong>Background: </strong>Although using a blockade of programmed death-ligand 1 (PD-L1) to enhance T cell immune responses shows great promise in tumor immunotherapy, the immune-checkpoint inhibition strategy is limited for patients with solid tumors. The mechanism and efficacy of such immune-checkpoint inhibition strategies in solid tumors remains unclear.</p><p><strong>Results: </strong>Employing qRT-PCR, Sanger sequencing, and RNA BaseScope analysis, we show that human lung adenocarcinoma (LUAD) all produce a long non-coding RNA isoform of PD-L1 (PD-L1-lnc) by alternative splicing, regardless if the tumor is positive or negative for the protein PD-L1. Similar to PD-L1 mRNA, PD-L1-lnc in various lung adenocarcinoma cells is significantly upregulated by IFNγ. Both in vitro and in vivo studies demonstrate that PD-L1-lnc increases proliferation and invasion but decreases apoptosis of lung adenocarcinoma cells. Mechanistically, PD-L1-lnc promotes lung adenocarcinoma progression through directly binding to c-Myc and enhancing c-Myc transcriptional activity.</p><p><strong>Conclusions: </strong>In summary, the PD-L1 gene can generate a long non-coding RNA through alternative splicing to promote lung adenocarcinoma progression by enhancing c-Myc activity. Our results argue in favor of investigating PD-L1-lnc depletion in combination with PD-L1 blockade in lung cancer therapy.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"22 1","pages":"104"},"PeriodicalIF":12.3,"publicationDate":"2021-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042710/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25588666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}