Background: Histone lactylation, a metabolic stress-related histone modification, plays an important role in the regulation of gene expression during M1 macrophage polarization. However, the role of histone lactylation in tumorigenesis remains unclear.
Results: Here, we show histone lactylation is elevated in tumors and is associated with poor prognosis of ocular melanoma. Target correction of aberrant histone lactylation triggers therapeutic efficacy both in vitro and in vivo. Mechanistically, histone lactylation contributes to tumorigenesis by facilitating YTHDF2 expression. Moreover, YTHDF2 recognizes the m6A modified PER1 and TP53 mRNAs and promotes their degradation, which accelerates tumorigenesis of ocular melanoma.
Conclusion: We reveal the oncogenic role of histone lactylation, thereby providing novel therapeutic targets for ocular melanoma therapy. We also bridge histone modifications with RNA modifications, which provides novel understanding of epigenetic regulation in tumorigenesis.
{"title":"Histone lactylation drives oncogenesis by facilitating m<sup>6</sup>A reader protein YTHDF2 expression in ocular melanoma.","authors":"Jie Yu, Peiwei Chai, Minyue Xie, Shengfang Ge, Jing Ruan, Xianqun Fan, Renbing Jia","doi":"10.1186/s13059-021-02308-z","DOIUrl":"https://doi.org/10.1186/s13059-021-02308-z","url":null,"abstract":"<p><strong>Background: </strong>Histone lactylation, a metabolic stress-related histone modification, plays an important role in the regulation of gene expression during M1 macrophage polarization. However, the role of histone lactylation in tumorigenesis remains unclear.</p><p><strong>Results: </strong>Here, we show histone lactylation is elevated in tumors and is associated with poor prognosis of ocular melanoma. Target correction of aberrant histone lactylation triggers therapeutic efficacy both in vitro and in vivo. Mechanistically, histone lactylation contributes to tumorigenesis by facilitating YTHDF2 expression. Moreover, YTHDF2 recognizes the m6A modified PER1 and TP53 mRNAs and promotes their degradation, which accelerates tumorigenesis of ocular melanoma.</p><p><strong>Conclusion: </strong>We reveal the oncogenic role of histone lactylation, thereby providing novel therapeutic targets for ocular melanoma therapy. We also bridge histone modifications with RNA modifications, which provides novel understanding of epigenetic regulation in tumorigenesis.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13059-021-02308-z","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25484292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-16DOI: 10.1186/s13059-021-02304-3
Pan Gao, Qing Lyu, Amr R Ghanam, Cicera R Lazzarotto, Gregory A Newby, Wei Zhang, Mihyun Choi, Orazio J Slivano, Kevin Holden, John A Walker, Anastasia P Kadina, Rob J Munroe, Christian M Abratte, John C Schimenti, David R Liu, Shengdar Q Tsai, Xiaochun Long, Joseph M Miano
Background: Most single nucleotide variants (SNVs) occur in noncoding sequence where millions of transcription factor binding sites (TFBS) reside. Here, a comparative analysis of CRISPR-mediated homology-directed repair (HDR) versus the recently reported prime editing 2 (PE2) system was carried out in mice over a TFBS called a CArG box in the Tspan2 promoter.
Results: Quantitative RT-PCR showed loss of Tspan2 mRNA in aorta and bladder, but not heart or brain, of mice homozygous for an HDR-mediated three base pair substitution in the Tspan2 CArG box. Using the same protospacer, mice homozygous for a PE2-mediated single-base substitution in the Tspan2 CArG box displayed similar cell-specific loss of Tspan2 mRNA; expression of an overlapping long noncoding RNA was also nearly abolished in aorta and bladder. Immuno-RNA fluorescence in situ hybridization validated loss of Tspan2 in vascular smooth muscle cells of HDR and PE2 CArG box mutant mice. Targeted sequencing demonstrated variable frequencies of on-target editing in all PE2 and HDR founders. However, whereas no on-target indels were detected in any of the PE2 founders, all HDR founders showed varying levels of on-target indels. Off-target analysis by targeted sequencing revealed mutations in many HDR founders, but none in PE2 founders.
Conclusions: PE2 directs high-fidelity editing of a single base in a TFBS leading to cell-specific loss in expression of an mRNA/long noncoding RNA gene pair. The PE2 platform expands the genome editing toolbox for modeling and correcting relevant noncoding SNVs in the mouse.
{"title":"Prime editing in mice reveals the essentiality of a single base in driving tissue-specific gene expression.","authors":"Pan Gao, Qing Lyu, Amr R Ghanam, Cicera R Lazzarotto, Gregory A Newby, Wei Zhang, Mihyun Choi, Orazio J Slivano, Kevin Holden, John A Walker, Anastasia P Kadina, Rob J Munroe, Christian M Abratte, John C Schimenti, David R Liu, Shengdar Q Tsai, Xiaochun Long, Joseph M Miano","doi":"10.1186/s13059-021-02304-3","DOIUrl":"10.1186/s13059-021-02304-3","url":null,"abstract":"<p><strong>Background: </strong>Most single nucleotide variants (SNVs) occur in noncoding sequence where millions of transcription factor binding sites (TFBS) reside. Here, a comparative analysis of CRISPR-mediated homology-directed repair (HDR) versus the recently reported prime editing 2 (PE2) system was carried out in mice over a TFBS called a CArG box in the Tspan2 promoter.</p><p><strong>Results: </strong>Quantitative RT-PCR showed loss of Tspan2 mRNA in aorta and bladder, but not heart or brain, of mice homozygous for an HDR-mediated three base pair substitution in the Tspan2 CArG box. Using the same protospacer, mice homozygous for a PE2-mediated single-base substitution in the Tspan2 CArG box displayed similar cell-specific loss of Tspan2 mRNA; expression of an overlapping long noncoding RNA was also nearly abolished in aorta and bladder. Immuno-RNA fluorescence in situ hybridization validated loss of Tspan2 in vascular smooth muscle cells of HDR and PE2 CArG box mutant mice. Targeted sequencing demonstrated variable frequencies of on-target editing in all PE2 and HDR founders. However, whereas no on-target indels were detected in any of the PE2 founders, all HDR founders showed varying levels of on-target indels. Off-target analysis by targeted sequencing revealed mutations in many HDR founders, but none in PE2 founders.</p><p><strong>Conclusions: </strong>PE2 directs high-fidelity editing of a single base in a TFBS leading to cell-specific loss in expression of an mRNA/long noncoding RNA gene pair. The PE2 platform expands the genome editing toolbox for modeling and correcting relevant noncoding SNVs in the mouse.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962346/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25480710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-16DOI: 10.1186/s13059-021-02303-4
Huiguang Yi, Yanling Lin, Chengqi Lin, Wenfei Jin
Here, we develop k -mer substring space decomposition (Kssd), a sketching technique which is significantly faster and more accurate than current sketching methods. We show that it is the only method that can be used for large-scale dataset comparisons at population resolution on simulated and real data. Using Kssd, we prioritize references for all 1,019,179 bacteria whole genome sequencing (WGS) runs from NCBI Sequence Read Archive and find misidentification or contamination in 6164 of these. Additionally, we analyze WGS and exome runs of samples from the 1000 Genomes Project.
{"title":"Kssd: sequence dimensionality reduction by k-mer substring space sampling enables real-time large-scale datasets analysis.","authors":"Huiguang Yi, Yanling Lin, Chengqi Lin, Wenfei Jin","doi":"10.1186/s13059-021-02303-4","DOIUrl":"10.1186/s13059-021-02303-4","url":null,"abstract":"<p><p>Here, we develop k -mer substring space decomposition (Kssd), a sketching technique which is significantly faster and more accurate than current sketching methods. We show that it is the only method that can be used for large-scale dataset comparisons at population resolution on simulated and real data. Using Kssd, we prioritize references for all 1,019,179 bacteria whole genome sequencing (WGS) runs from NCBI Sequence Read Archive and find misidentification or contamination in 6164 of these. Additionally, we analyze WGS and exome runs of samples from the 1000 Genomes Project.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962209/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25484289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-11DOI: 10.1186/s13059-021-02298-y
Oliver Daniel Schwich, Nicole Blümel, Mario Keller, Marius Wegener, Samarth Thonta Setty, Melinda Elaine Brunstein, Ina Poser, Igor Ruiz De Los Mozos, Beatrix Suess, Christian Münch, François McNicoll, Kathi Zarnack, Michaela Müller-McNicoll
Background: Alternative polyadenylation (APA) refers to the regulated selection of polyadenylation sites (PASs) in transcripts, which determines the length of their 3' untranslated regions (3'UTRs). We have recently shown that SRSF3 and SRSF7, two closely related SR proteins, connect APA with mRNA export. The mechanism underlying APA regulation by SRSF3 and SRSF7 remained unknown.
Results: Here we combine iCLIP and 3'-end sequencing and find that SRSF3 and SRSF7 bind upstream of proximal PASs (pPASs), but they exert opposite effects on 3'UTR length. SRSF7 enhances pPAS usage in a concentration-dependent but splicing-independent manner by recruiting the cleavage factor FIP1, generating short 3'UTRs. Protein domains unique to SRSF7, which are absent from SRSF3, contribute to FIP1 recruitment. In contrast, SRSF3 promotes distal PAS (dPAS) usage and hence long 3'UTRs directly by counteracting SRSF7, but also indirectly by maintaining high levels of cleavage factor Im (CFIm) via alternative splicing. Upon SRSF3 depletion, CFIm levels decrease and 3'UTRs are shortened. The indirect SRSF3 targets are particularly sensitive to low CFIm levels, because here CFIm serves a dual function; it enhances dPAS and inhibits pPAS usage by binding immediately downstream and assembling unproductive cleavage complexes, which together promotes long 3'UTRs.
Conclusions: We demonstrate that SRSF3 and SRSF7 are direct modulators of pPAS usage and show how small differences in the domain architecture of SR proteins can confer opposite effects on pPAS regulation.
{"title":"SRSF3 and SRSF7 modulate 3'UTR length through suppression or activation of proximal polyadenylation sites and regulation of CFIm levels.","authors":"Oliver Daniel Schwich, Nicole Blümel, Mario Keller, Marius Wegener, Samarth Thonta Setty, Melinda Elaine Brunstein, Ina Poser, Igor Ruiz De Los Mozos, Beatrix Suess, Christian Münch, François McNicoll, Kathi Zarnack, Michaela Müller-McNicoll","doi":"10.1186/s13059-021-02298-y","DOIUrl":"https://doi.org/10.1186/s13059-021-02298-y","url":null,"abstract":"<p><strong>Background: </strong>Alternative polyadenylation (APA) refers to the regulated selection of polyadenylation sites (PASs) in transcripts, which determines the length of their 3' untranslated regions (3'UTRs). We have recently shown that SRSF3 and SRSF7, two closely related SR proteins, connect APA with mRNA export. The mechanism underlying APA regulation by SRSF3 and SRSF7 remained unknown.</p><p><strong>Results: </strong>Here we combine iCLIP and 3'-end sequencing and find that SRSF3 and SRSF7 bind upstream of proximal PASs (pPASs), but they exert opposite effects on 3'UTR length. SRSF7 enhances pPAS usage in a concentration-dependent but splicing-independent manner by recruiting the cleavage factor FIP1, generating short 3'UTRs. Protein domains unique to SRSF7, which are absent from SRSF3, contribute to FIP1 recruitment. In contrast, SRSF3 promotes distal PAS (dPAS) usage and hence long 3'UTRs directly by counteracting SRSF7, but also indirectly by maintaining high levels of cleavage factor Im (CFIm) via alternative splicing. Upon SRSF3 depletion, CFIm levels decrease and 3'UTRs are shortened. The indirect SRSF3 targets are particularly sensitive to low CFIm levels, because here CFIm serves a dual function; it enhances dPAS and inhibits pPAS usage by binding immediately downstream and assembling unproductive cleavage complexes, which together promotes long 3'UTRs.</p><p><strong>Conclusions: </strong>We demonstrate that SRSF3 and SRSF7 are direct modulators of pPAS usage and show how small differences in the domain architecture of SR proteins can confer opposite effects on pPAS regulation.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13059-021-02298-y","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25466202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-10DOI: 10.1186/s13059-021-02295-1
Johannes Zimmermann, Christoph Kaleta, Silvio Waschina
Genome-scale metabolic models of microorganisms are powerful frameworks to predict phenotypes from an organism's genotype. While manual reconstructions are laborious, automated reconstructions often fail to recapitulate known metabolic processes. Here we present gapseq ( https://github.com/jotech/gapseq ), a new tool to predict metabolic pathways and automatically reconstruct microbial metabolic models using a curated reaction database and a novel gap-filling algorithm. On the basis of scientific literature and experimental data for 14,931 bacterial phenotypes, we demonstrate that gapseq outperforms state-of-the-art tools in predicting enzyme activity, carbon source utilisation, fermentation products, and metabolic interactions within microbial communities.
{"title":"gapseq: informed prediction of bacterial metabolic pathways and reconstruction of accurate metabolic models.","authors":"Johannes Zimmermann, Christoph Kaleta, Silvio Waschina","doi":"10.1186/s13059-021-02295-1","DOIUrl":"10.1186/s13059-021-02295-1","url":null,"abstract":"<p><p>Genome-scale metabolic models of microorganisms are powerful frameworks to predict phenotypes from an organism's genotype. While manual reconstructions are laborious, automated reconstructions often fail to recapitulate known metabolic processes. Here we present gapseq ( https://github.com/jotech/gapseq ), a new tool to predict metabolic pathways and automatically reconstruct microbial metabolic models using a curated reaction database and a novel gap-filling algorithm. On the basis of scientific literature and experimental data for 14,931 bacterial phenotypes, we demonstrate that gapseq outperforms state-of-the-art tools in predicting enzyme activity, carbon source utilisation, fermentation products, and metabolic interactions within microbial communities.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7949252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25454281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-10DOI: 10.1186/s13059-021-02305-2
Changcai Huang, Guangyu Li, Jiayu Wu, Junbo Liang, Xiaoyue Wang
Background: Millions of nucleotide variants are identified through cancer genome sequencing and it is clinically important to identify the pathogenic variants among them. By introducing base substitutions at guide RNA target regions in the genome, CRISPR-Cas9-based base editors provide the possibility for evaluating a large number of variants in their genomic context. However, the variability in editing efficiency and the complexity of outcome mapping are two existing problems for assigning guide RNA effects to variants in base editing screens.
Results: To improve the identification of pathogenic variants, we develop a framework to combine base editing screens with sgRNA efficiency and outcome mapping. We apply the method to evaluate more than 9000 variants across all the exons of BRCA1 and BRCA2 genes. Our efficiency-corrected scoring model identifies 910 loss-of-function variants for BRCA1/2, including 151 variants in the noncoding part of the genes such as the 5' untranslated regions. Many of them are identified in cancer patients and are reported as "benign/likely benign" or "variants of uncertain significance" by clinicians. Our data suggest a need to re-evaluate their clinical significance, which may be helpful for risk assessment and treatment of breast and ovarian cancer.
Conclusions: Our results suggest that base editing screens with efficiency correction is a powerful strategy to identify pathogenic variants in a high-throughput manner. Applying this strategy to assess variants in both coding and noncoding regions of the genome could have a direct impact on the interpretation of cancer variants.
{"title":"Identification of pathogenic variants in cancer genes using base editing screens with editing efficiency correction.","authors":"Changcai Huang, Guangyu Li, Jiayu Wu, Junbo Liang, Xiaoyue Wang","doi":"10.1186/s13059-021-02305-2","DOIUrl":"10.1186/s13059-021-02305-2","url":null,"abstract":"<p><strong>Background: </strong>Millions of nucleotide variants are identified through cancer genome sequencing and it is clinically important to identify the pathogenic variants among them. By introducing base substitutions at guide RNA target regions in the genome, CRISPR-Cas9-based base editors provide the possibility for evaluating a large number of variants in their genomic context. However, the variability in editing efficiency and the complexity of outcome mapping are two existing problems for assigning guide RNA effects to variants in base editing screens.</p><p><strong>Results: </strong>To improve the identification of pathogenic variants, we develop a framework to combine base editing screens with sgRNA efficiency and outcome mapping. We apply the method to evaluate more than 9000 variants across all the exons of BRCA1 and BRCA2 genes. Our efficiency-corrected scoring model identifies 910 loss-of-function variants for BRCA1/2, including 151 variants in the noncoding part of the genes such as the 5' untranslated regions. Many of them are identified in cancer patients and are reported as \"benign/likely benign\" or \"variants of uncertain significance\" by clinicians. Our data suggest a need to re-evaluate their clinical significance, which may be helpful for risk assessment and treatment of breast and ovarian cancer.</p><p><strong>Conclusions: </strong>Our results suggest that base editing screens with efficiency correction is a powerful strategy to identify pathogenic variants in a high-throughput manner. Applying this strategy to assess variants in both coding and noncoding regions of the genome could have a direct impact on the interpretation of cancer variants.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7945310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25454348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-09DOI: 10.1186/s13059-021-02287-1
Eddie Park, Yan Jiang, Lili Hao, Jingyi Hui, Yi Xing
Background: A-to-I RNA editing diversifies the transcriptome and has multiple downstream functional effects. Genetic variation contributes to RNA editing variability between individuals and has the potential to impact phenotypic variability.
Results: We analyze matched genetic and transcriptomic data in 49 tissues across 437 individuals to identify RNA editing events that are associated with genetic variation. Using an RNA editing quantitative trait loci (edQTL) mapping approach, we identify 3117 unique RNA editing events associated with a cis genetic polymorphism. Fourteen percent of these edQTL events are also associated with genetic variation in their gene expression. A subset of these events are associated with genome-wide association study signals of complex traits or diseases. We determine that tissue-specific levels of ADAR and ADARB1 are able to explain a subset of tissue-specific edQTL events. We find that certain microRNAs are able to differentiate between the edited and unedited isoforms of their targets. Furthermore, microRNAs can generate an expression quantitative trait loci (eQTL) signal from an edQTL locus by microRNA-mediated transcript degradation in an editing-specific manner. By integrative analyses of edQTL, eQTL, and microRNA expression profiles, we computationally discover and experimentally validate edQTL-microRNA pairs for which the microRNA may generate an eQTL signal from an edQTL locus in a tissue-specific manner.
Conclusions: Our work suggests a mechanism in which RNA editing variability can influence the phenotypes of complex traits and diseases by altering the stability and steady-state level of critical RNA molecules.
{"title":"Genetic variation and microRNA targeting of A-to-I RNA editing fine tune human tissue transcriptomes.","authors":"Eddie Park, Yan Jiang, Lili Hao, Jingyi Hui, Yi Xing","doi":"10.1186/s13059-021-02287-1","DOIUrl":"10.1186/s13059-021-02287-1","url":null,"abstract":"<p><strong>Background: </strong>A-to-I RNA editing diversifies the transcriptome and has multiple downstream functional effects. Genetic variation contributes to RNA editing variability between individuals and has the potential to impact phenotypic variability.</p><p><strong>Results: </strong>We analyze matched genetic and transcriptomic data in 49 tissues across 437 individuals to identify RNA editing events that are associated with genetic variation. Using an RNA editing quantitative trait loci (edQTL) mapping approach, we identify 3117 unique RNA editing events associated with a cis genetic polymorphism. Fourteen percent of these edQTL events are also associated with genetic variation in their gene expression. A subset of these events are associated with genome-wide association study signals of complex traits or diseases. We determine that tissue-specific levels of ADAR and ADARB1 are able to explain a subset of tissue-specific edQTL events. We find that certain microRNAs are able to differentiate between the edited and unedited isoforms of their targets. Furthermore, microRNAs can generate an expression quantitative trait loci (eQTL) signal from an edQTL locus by microRNA-mediated transcript degradation in an editing-specific manner. By integrative analyses of edQTL, eQTL, and microRNA expression profiles, we computationally discover and experimentally validate edQTL-microRNA pairs for which the microRNA may generate an eQTL signal from an edQTL locus in a tissue-specific manner.</p><p><strong>Conclusions: </strong>Our work suggests a mechanism in which RNA editing variability can influence the phenotypes of complex traits and diseases by altering the stability and steady-state level of critical RNA molecules.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7942016/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25464351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-08DOI: 10.1186/s13059-021-02300-7
Huihui Li, Mingzhe Xie, Yan Wang, Ludong Yang, Zhi Xie, Hongwei Wang
riboCIRC is a translatome data-oriented circRNA database specifically designed for hosting, exploring, analyzing, and visualizing translatable circRNAs from multi-species. The database provides a comprehensive repository of computationally predicted ribosome-associated circRNAs; a manually curated collection of experimentally verified translated circRNAs; an evaluation of cross-species conservation of translatable circRNAs; a systematic de novo annotation of putative circRNA-encoded peptides, including sequence, structure, and function; and a genome browser to visualize the context-specific occupant footprints of circRNAs. It represents a valuable resource for the circRNA research community and is publicly available at http://www.ribocirc.com .
{"title":"riboCIRC: a comprehensive database of translatable circRNAs.","authors":"Huihui Li, Mingzhe Xie, Yan Wang, Ludong Yang, Zhi Xie, Hongwei Wang","doi":"10.1186/s13059-021-02300-7","DOIUrl":"10.1186/s13059-021-02300-7","url":null,"abstract":"<p><p>riboCIRC is a translatome data-oriented circRNA database specifically designed for hosting, exploring, analyzing, and visualizing translatable circRNAs from multi-species. The database provides a comprehensive repository of computationally predicted ribosome-associated circRNAs; a manually curated collection of experimentally verified translated circRNAs; an evaluation of cross-species conservation of translatable circRNAs; a systematic de novo annotation of putative circRNA-encoded peptides, including sequence, structure, and function; and a genome browser to visualize the context-specific occupant footprints of circRNAs. It represents a valuable resource for the circRNA research community and is publicly available at http://www.ribocirc.com .</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7938571/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25464357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-08DOI: 10.1186/s13059-021-02286-2
Ruben Dries, Qian Zhu, Rui Dong, Chee-Huat Linus Eng, Huipeng Li, Kan Liu, Yuntian Fu, Tianxiao Zhao, Arpan Sarkar, Feng Bao, Rani E George, Nico Pierson, Long Cai, Guo-Cheng Yuan
Spatial transcriptomic and proteomic technologies have provided new opportunities to investigate cells in their native microenvironment. Here we present Giotto, a comprehensive and open-source toolbox for spatial data analysis and visualization. The analysis module provides end-to-end analysis by implementing a wide range of algorithms for characterizing tissue composition, spatial expression patterns, and cellular interactions. Furthermore, single-cell RNAseq data can be integrated for spatial cell-type enrichment analysis. The visualization module allows users to interactively visualize analysis outputs and imaging features. To demonstrate its general applicability, we apply Giotto to a wide range of datasets encompassing diverse technologies and platforms.
{"title":"Giotto: a toolbox for integrative analysis and visualization of spatial expression data.","authors":"Ruben Dries, Qian Zhu, Rui Dong, Chee-Huat Linus Eng, Huipeng Li, Kan Liu, Yuntian Fu, Tianxiao Zhao, Arpan Sarkar, Feng Bao, Rani E George, Nico Pierson, Long Cai, Guo-Cheng Yuan","doi":"10.1186/s13059-021-02286-2","DOIUrl":"https://doi.org/10.1186/s13059-021-02286-2","url":null,"abstract":"<p><p>Spatial transcriptomic and proteomic technologies have provided new opportunities to investigate cells in their native microenvironment. Here we present Giotto, a comprehensive and open-source toolbox for spatial data analysis and visualization. The analysis module provides end-to-end analysis by implementing a wide range of algorithms for characterizing tissue composition, spatial expression patterns, and cellular interactions. Furthermore, single-cell RNAseq data can be integrated for spatial cell-type enrichment analysis. The visualization module allows users to interactively visualize analysis outputs and imaging features. To demonstrate its general applicability, we apply Giotto to a wide range of datasets encompassing diverse technologies and platforms.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13059-021-02286-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25464355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-03-05DOI: 10.1186/s13059-021-02294-2
Steffen Albrecht, Maximilian Sprang, Miguel A Andrade-Navarro, Jean-Fred Fontaine
Controlling quality of next-generation sequencing (NGS) data files is a necessary but complex task. To address this problem, we statistically characterize common NGS quality features and develop a novel quality control procedure involving tree-based and deep learning classification algorithms. Predictive models, validated on internal and external functional genomics datasets, are to some extent generalizable to data from unseen species. The derived statistical guidelines and predictive models represent a valuable resource for users of NGS data to better understand quality issues and perform automatic quality control. Our guidelines and software are available at https://github.com/salbrec/seqQscorer .
{"title":"seqQscorer: automated quality control of next-generation sequencing data using machine learning.","authors":"Steffen Albrecht, Maximilian Sprang, Miguel A Andrade-Navarro, Jean-Fred Fontaine","doi":"10.1186/s13059-021-02294-2","DOIUrl":"https://doi.org/10.1186/s13059-021-02294-2","url":null,"abstract":"<p><p>Controlling quality of next-generation sequencing (NGS) data files is a necessary but complex task. To address this problem, we statistically characterize common NGS quality features and develop a novel quality control procedure involving tree-based and deep learning classification algorithms. Predictive models, validated on internal and external functional genomics datasets, are to some extent generalizable to data from unseen species. The derived statistical guidelines and predictive models represent a valuable resource for users of NGS data to better understand quality issues and perform automatic quality control. Our guidelines and software are available at https://github.com/salbrec/seqQscorer .</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13059-021-02294-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25447787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}