首页 > 最新文献

Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )最新文献

英文 中文
Genomewide Identification and Expression Analyses of PAL Genes in Different Color Radish 不同颜色萝卜PAL基因的全基因组鉴定及表达分析
Biao Lai, Chun-Yu Chen, Zehui Tang, Jing Xiao, Qi Wang, Fabo Chen, Lina Du
{"title":"Genomewide Identification and Expression Analyses of PAL Genes in Different Color Radish","authors":"Biao Lai, Chun-Yu Chen, Zehui Tang, Jing Xiao, Qi Wang, Fabo Chen, Lina Du","doi":"10.5376/cmb.2022.12.0001","DOIUrl":"https://doi.org/10.5376/cmb.2022.12.0001","url":null,"abstract":"","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79230435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Cloning and Expression Analysis of Phytoene Desaturase (PDS) Gene from Mango (Mangifera indica L) 芒果(Mangifera indica L)植物烯去饱和酶基因的克隆与表达分析
Kaili Yu, Zhichang Zhao, A. Gao, R. Luo, Jianfeng Huang, Mengyun Zhang, Maofu Li
Phytoene desaturase (PDS) affects the synthesis of carotenoids. It is a key gene in the carotenoids biosynthesis pathway. In order to study the function of PDS gene from mango fruits, the phytoene dehydrogenase ( PDS ) gene of ‘ Guifei ’ mango fruit was obtained with RACE methods. The full-length cDNA sequence of the gene is 1 820 bp, open reading frame is 1 650 bp, encoding 549 amino acids, the molecular weight is 61.34 KD, and the isoelectric point is 6.78. It was cluster analysis found that the mango PDS protein had a close relationship with grapefruit, cantaloupe, and papaya. Its amino acid composition is mainly alanine (ALa), leucine (Leu), valine (Val), and so on. Expression of PDS gene in different varieties by PCR showed: the high expression of the red ‘ Guifei ’ varieties and the expression of green mango varieties with low volume. Prediction of the domain, tertiary structure, and interaction protein of its protein, found to contain phytoene-desat, PLNO2487 superfamily, and other domains. The use of the STRING database found PDS interacted with proteins such as PSY, ZDS, and CRTISO. This research will be valuable for understanding the molecular mechanism of gene regulation in carotenoid biosynthesis and can be served as the basis for the metabolic engineering of mango.
植物烯去饱和酶(PDS)影响类胡萝卜素的合成。它是类胡萝卜素生物合成途径的关键基因。为了研究芒果果实PDS基因的功能,采用RACE方法获得了“贵飞”芒果果实的植物烯脱氢酶(phytoene dehydrogenase, PDS)基因。该基因全长cDNA序列为1 820 bp,开放阅读框为1 650 bp,编码549个氨基酸,分子量为61.34 KD,等电点为6.78。聚类分析发现芒果的PDS蛋白与柚子、哈密瓜、木瓜有密切的亲缘关系。其氨基酸组成主要有丙氨酸(ALa)、亮氨酸(Leu)、缬氨酸(Val)等。不同品种PDS基因的PCR表达结果显示:红色‘贵飞’品种表达量高,青芒果品种表达量低。预测其蛋白的结构域、三级结构和相互作用蛋白,发现含有植物烯-desat、PLNO2487超家族等结构域。使用STRING数据库发现PDS与PSY、ZDS和CRTISO等蛋白相互作用。该研究对了解类胡萝卜素生物合成过程中基因调控的分子机制具有重要意义,可为芒果代谢工程的研究提供基础。
{"title":"Cloning and Expression Analysis of Phytoene Desaturase (PDS) Gene from Mango (Mangifera indica L)","authors":"Kaili Yu, Zhichang Zhao, A. Gao, R. Luo, Jianfeng Huang, Mengyun Zhang, Maofu Li","doi":"10.5376/cmb.2022.12.0004","DOIUrl":"https://doi.org/10.5376/cmb.2022.12.0004","url":null,"abstract":"Phytoene desaturase (PDS) affects the synthesis of carotenoids. It is a key gene in the carotenoids biosynthesis pathway. In order to study the function of PDS gene from mango fruits, the phytoene dehydrogenase ( PDS ) gene of ‘ Guifei ’ mango fruit was obtained with RACE methods. The full-length cDNA sequence of the gene is 1 820 bp, open reading frame is 1 650 bp, encoding 549 amino acids, the molecular weight is 61.34 KD, and the isoelectric point is 6.78. It was cluster analysis found that the mango PDS protein had a close relationship with grapefruit, cantaloupe, and papaya. Its amino acid composition is mainly alanine (ALa), leucine (Leu), valine (Val), and so on. Expression of PDS gene in different varieties by PCR showed: the high expression of the red ‘ Guifei ’ varieties and the expression of green mango varieties with low volume. Prediction of the domain, tertiary structure, and interaction protein of its protein, found to contain phytoene-desat, PLNO2487 superfamily, and other domains. The use of the STRING database found PDS interacted with proteins such as PSY, ZDS, and CRTISO. This research will be valuable for understanding the molecular mechanism of gene regulation in carotenoid biosynthesis and can be served as the basis for the metabolic engineering of mango.","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"110 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76104665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-Preserving Genotype Imputation in a Trusted Execution Environment. 可信执行环境中的隐私保护基因型推算。
Natnatee Dokmai, Can Kockan, Kaiyuan Zhu, XiaoFeng Wang, S Cenk Sahinalp, Hyunghoon Cho
{"title":"Privacy-Preserving Genotype Imputation in a Trusted Execution Environment.","authors":"Natnatee Dokmai, Can Kockan, Kaiyuan Zhu, XiaoFeng Wang, S Cenk Sahinalp, Hyunghoon Cho","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"12 10","pages":"983-993.e7"},"PeriodicalIF":0.0,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8635452/pdf/nihms-1713574.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39940700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular Evolution of Genes Involved in Quinic Acid Utilization in Fungi 真菌利用奎宁酸相关基因的分子进化
D. Asch, Jannet Ziegler, X. Min
Fungi use diverse organic compounds for their growth and development.  Neurospora crassa  can use quinic acid as its sole carbon source for its growth because of presence of a quinic acid utilization (QUT) cluster of genes in its genome. Using bioinformatics methods we examined a total of 285 completely sequenced fungal genomes comprised of 282 unique species and found there were 117 fungal species having all 7 QUT genes in their genomes. Most species in the classes of Dothideomycetes, Eurotiomycetes, Leotiomycetes and Sordariomycetes have QUT genes, however, among 53 species in Saccharomycetes only 3 species have all 7 QUT genes. There were linage specific losses of QUT genes, such as species in Eurotiomycetes class Onygenales order lacked most of QA utilization genes. Our survey revealed that species in Agaricomycetes, Basidiomycota, Chytridiomycetes, Exobasidiomycetes, Malasseziomycetes, Microsporidia, Schizosacharomycetes, and Tremellomycetes did not have QA utilization genes. Using concatenated protein sequences encoded by 7 QUT genes, a robust phylogenetic tree to infer the evolution of the QUT cluster genes was constructed. In addition, we also found QUT genes from recently sequenced genome of cork oak ( Quercus suber ), however, our analysis suggests that these QUT sequences are likely from a contaminated fungal species.
真菌利用各种有机化合物来生长和发育。粗神经孢子虫可以利用奎宁酸作为其生长的唯一碳源,因为其基因组中存在奎宁酸利用(QUT)基因簇。利用生物信息学方法对282个独特物种的285个完全测序的真菌基因组进行了分析,发现有117个真菌物种的基因组中包含了全部7个QUT基因。Dothideomycetes、Eurotiomycetes、Leotiomycetes和Sordariomycetes中的大多数物种都有QUT基因,而在酵母菌的53个物种中,只有3个物种拥有全部7个QUT基因。QUT基因存在谱系特异性缺失,如欧线菌纲中大部分QA利用基因缺失。调查结果显示,木链菌、担子菌、壶菌、外担子菌、马拉色菌、小孢子菌、裂孢菌和银耳菌中没有QA利用基因。利用7个QUT基因编码的串联蛋白序列,构建了一个健壮的系统发育树来推断QUT簇基因的进化。此外,我们还从最近测序的栓皮栎(Quercus suber)基因组中发现了QUT基因,但我们的分析表明,这些QUT序列可能来自受污染的真菌物种。
{"title":"Molecular Evolution of Genes Involved in Quinic Acid Utilization in Fungi","authors":"D. Asch, Jannet Ziegler, X. Min","doi":"10.5376/cmb.2021.11.0005","DOIUrl":"https://doi.org/10.5376/cmb.2021.11.0005","url":null,"abstract":"Fungi use diverse organic compounds for their growth and development.  Neurospora crassa  can use quinic acid as its sole carbon source for its growth because of presence of a quinic acid utilization (QUT) cluster of genes in its genome. Using bioinformatics methods we examined a total of 285 completely sequenced fungal genomes comprised of 282 unique species and found there were 117 fungal species having all 7 QUT genes in their genomes. Most species in the classes of Dothideomycetes, Eurotiomycetes, Leotiomycetes and Sordariomycetes have QUT genes, however, among 53 species in Saccharomycetes only 3 species have all 7 QUT genes. There were linage specific losses of QUT genes, such as species in Eurotiomycetes class Onygenales order lacked most of QA utilization genes. Our survey revealed that species in Agaricomycetes, Basidiomycota, Chytridiomycetes, Exobasidiomycetes, Malasseziomycetes, Microsporidia, Schizosacharomycetes, and Tremellomycetes did not have QA utilization genes. Using concatenated protein sequences encoded by 7 QUT genes, a robust phylogenetic tree to infer the evolution of the QUT cluster genes was constructed. In addition, we also found QUT genes from recently sequenced genome of cork oak ( Quercus suber ), however, our analysis suggests that these QUT sequences are likely from a contaminated fungal species.","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"25 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79709371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Advances in Transcriptome Analyses Using RNA Sequencing Technology in Soybean Plants [Glycine max] 基于RNA测序技术的大豆转录组分析进展[Glycine max]
J. Min, Mykaela Wagner, Theoni Kasamias
Soybean [Glycine max] is an important oil and food plant for both humans and animals. Recent development in RNA sequencing (RNA-seq) technology provides a cost effective approach to analyzing transcriptomes of plants at different developmental stages and in responses to different biotic and abiotic challenges. Currently there are over 5 000 RNA-seq datasets in soybean plants publicly available at SRA database in the National Center for Biotechnology Information (NCBI). Such a large number of RNA-seq datasets provide soybean researchers an opportunity as well as a challenge for fully exploring the data to understand soybean biology. A number of research articles have been published on applications of RNA-seq in transcriptome analysis of soybean plants, covering a wide range of topics including growth and development, plant mineral nutrients, responses to environmental stresses, pathogens and pests. In this work we compile and review recent advances of RNA-seq transcriptome analyses including profiling of differential gene expression, gene alternative splicing, and gene regulatory networks in soybean plants, with key findings excerpted from each individual published article.
大豆(Glycine max)是一种重要的人畜油脂和食用植物。RNA测序(RNA-seq)技术的最新发展为分析植物在不同发育阶段和应对不同生物和非生物挑战的转录组提供了一种经济有效的方法。目前,在美国国家生物技术信息中心(NCBI)的SRA数据库中有5000多个公开的大豆植物RNA-seq数据集。如此大量的RNA-seq数据集为大豆研究人员提供了充分挖掘数据以了解大豆生物学的机会和挑战。RNA-seq在大豆转录组分析中的应用已经发表了大量的研究文章,涵盖了大豆生长发育、植物矿质营养、对环境胁迫的反应、病原体和害虫等广泛的主题。在这项工作中,我们汇编和回顾了RNA-seq转录组分析的最新进展,包括大豆植物的差异基因表达谱,基因选择性剪接和基因调控网络,并摘录了每篇发表的文章的主要发现。
{"title":"Advances in Transcriptome Analyses Using RNA Sequencing Technology in Soybean Plants [Glycine max]","authors":"J. Min, Mykaela Wagner, Theoni Kasamias","doi":"10.5376/CMB.2021.11.0001","DOIUrl":"https://doi.org/10.5376/CMB.2021.11.0001","url":null,"abstract":"Soybean [Glycine max] is an important oil and food plant for both humans and animals. Recent development in RNA sequencing (RNA-seq) technology provides a cost effective approach to analyzing transcriptomes of plants at different developmental stages and in responses to different biotic and abiotic challenges. Currently there are over 5 000 RNA-seq datasets in soybean plants publicly available at SRA database in the National Center for Biotechnology Information (NCBI). Such a large number of RNA-seq datasets provide soybean researchers an opportunity as well as a challenge for fully exploring the data to understand soybean biology. A number of research articles have been published on applications of RNA-seq in transcriptome analysis of soybean plants, covering a wide range of topics including growth and development, plant mineral nutrients, responses to environmental stresses, pathogens and pests. In this work we compile and review recent advances of RNA-seq transcriptome analyses including profiling of differential gene expression, gene alternative splicing, and gene regulatory networks in soybean plants, with key findings excerpted from each individual published article.","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89387922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Randomized Parallel Algorithm for Efficiently Finding Near-Optimal Universal Hitting Sets. 高效寻找近优通用命中集的随机并行算法
Barış Ekim, Bonnie Berger, Yaron Orenstein

As the volume of next generation sequencing data increases, an urgent need for algorithms to efficiently process the data arises. Universal hitting sets (UHS) were recently introduced as an alternative to the central idea of minimizers in sequence analysis with the hopes that they could more efficiently address common tasks such as computing hash functions for read overlap, sparse suffix arrays, and Bloom filters. A UHS is a set of k-mers that hit every sequence of length L, and can thus serve as indices to L-long sequences. Unfortunately, methods for computing small UHSs are not yet practical for real-world sequencing instances due to their serial and deterministic nature, which leads to long runtimes and high memory demands when handling typical values of k (e.g. k>13). To address this bottleneck, we present two algorithmic innovations to significantly decrease runtime while keeping memory usage low: (i) we leverage advanced theoretical and architectural techniques to parallelize and decrease memory usage in calculating k-mer hitting numbers; and (ii) we build upon techniques from randomized Set Cover to select universal k-mers much faster. We implemented these innovations in PASHA, the first randomized parallel algorithm for generating nearoptimal UHSs, which newly handles k>13. We demonstrate empirically that PASHA produces sets only slightly larger than those of serial deterministic algorithms; moreover, the set size is provably guaranteed to be within a small constant factor of the optimal size. PASHA's runtime and memory-usage improvements are orders of magnitude faster than the current best algorithms. We expect our newly-practical construction of UHSs to be adopted in many high-throughput sequence analysis pipelines.

随着下一代测序数据量的增加,迫切需要高效处理数据的算法。最近,通用命中集(UHS)作为序列分析中最小化的核心思想的替代方案被提出,希望它们能更有效地解决常见任务,如计算读取重叠的哈希函数、稀疏后缀数组和布鲁姆过滤器。UHS 是一组 k-mers 的集合,可以命中每个长度为 L 的序列,因此可以作为 L 长序列的索引。遗憾的是,计算小型 UHS 的方法在现实世界的测序实例中并不实用,因为它们具有串行和确定性的特点,在处理典型的 k 值(如 k>13)时,会导致较长的运行时间和较高的内存需求。为了解决这一瓶颈问题,我们提出了两项算法创新,以大幅缩短运行时间,同时保持较低的内存使用率:(i) 我们利用先进的理论和架构技术来并行化和降低计算 k-mer命中数时的内存使用率;(ii) 我们利用随机集合覆盖技术来更快地选择通用 k-mer。我们在 PASHA 中实现了这些创新,PASHA 是首个用于生成近优 UHS 的随机并行算法,新算法可处理 k>13。我们通过实证证明,PASHA 生成的集合仅比串行确定性算法的集合稍大一些;此外,可以证明集合大小保证在最优大小的一个小常数因子之内。PASHA 在运行时间和内存使用方面的改进比目前最好的算法快了几个数量级。我们期待我们新近构建的 UHS 能被许多高通量序列分析流水线所采用。
{"title":"A Randomized Parallel Algorithm for Efficiently Finding Near-Optimal Universal Hitting Sets.","authors":"Barış Ekim, Bonnie Berger, Yaron Orenstein","doi":"10.1007/978-3-030-45257-5_3","DOIUrl":"10.1007/978-3-030-45257-5_3","url":null,"abstract":"<p><p>As the volume of next generation sequencing data increases, an urgent need for algorithms to efficiently process the data arises. <i>Universal hitting sets</i> (UHS) were recently introduced as an alternative to the central idea of minimizers in sequence analysis with the hopes that they could more efficiently address common tasks such as computing hash functions for read overlap, sparse suffix arrays, and Bloom filters. A UHS is a set of <math><mi>k</mi></math>-mers that hit every sequence of length <math><mi>L</mi></math>, and can thus serve as indices to <math><mi>L</mi></math>-long sequences. Unfortunately, methods for computing small UHSs are not yet practical for real-world sequencing instances due to their serial and deterministic nature, which leads to long runtimes and high memory demands when handling typical values of <math><mi>k</mi></math> (e.g. <math><mrow><mi>k</mi><mo>></mo><mn>13</mn></mrow></math>). To address this bottleneck, we present two algorithmic innovations to significantly decrease runtime while keeping memory usage low: (i) we leverage advanced theoretical and architectural techniques to parallelize and decrease memory usage in calculating <math><mi>k</mi></math>-mer hitting numbers; and (ii) we build upon techniques from randomized Set Cover to select universal <math><mi>k</mi></math>-mers much faster. We implemented these innovations in PASHA, the first randomized parallel algorithm for generating nearoptimal UHSs, which newly handles <math><mrow><mi>k</mi><mo>></mo><mn>13</mn></mrow></math>. We demonstrate empirically that PASHA produces sets only slightly larger than those of serial deterministic algorithms; moreover, the set size is provably guaranteed to be within a small constant factor of the optimal size. PASHA's runtime and memory-usage improvements are orders of magnitude faster than the current best algorithms. We expect our newly-practical construction of UHSs to be adopted in many high-throughput sequence analysis pipelines.</p>","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"12074 ","pages":"37-53"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11148856/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141249251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RoboCOP: Multivariate State Space Model Integrating Epigenomic Accessibility Data to Elucidate Genome-Wide Chromatin Occupancy. RoboCOP:整合表观基因组可及性数据的多变量状态空间模型,用于阐明全基因组染色质占据。
Sneha Mitra, Jianling Zhong, David M MacAlpine, Alexander J Hartemink

Chromatin is the tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and level of occupancy of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. On the other hand, epigenomic accessibility data like ATAC-seq, DNase-seq, and MNase-seq provide insight into the chromatin landscape of all factors bound along the genome, but with minimal insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin information from epigenomic accessibility data with nucleotide sequence to compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors at once. RoboCOP can be applied to any epigenomic dataset that provides quantitative insight into chromatin accessibility in any organism, but here we apply it to MNase-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome. Using available protein-binding datasets from the literature, we show that our model more accurately predicts the binding of these factors genome-wide.

染色质是细胞核内 DNA 和蛋白质的紧密包装结构。不同蛋白质复合物沿 DNA 的排列会调节基因表达,并受基因表达的调节。因此,测量不同转录因子(TFs)和核糖体的结合位置和占据水平对于了解基因调控至关重要。基于抗体的染色质占有率检测方法能够确定特定 DNA 结合因子的结合位点,但每次只能确定一个因子。另一方面,ATAC-seq、DNase-seq和MNase-seq等表观基因组可及性数据能让人深入了解沿基因组结合的所有因子的染色质景观,但对这些因子的身份却知之甚少。在这里,我们介绍一种多变量状态空间模型 RoboCOP,它整合了来自表观基因组可及性数据和核苷酸序列的染色质信息,可同时计算数百个不同因子的核小体和 TF 占位的全基因组概率分数。RoboCOP 可以应用于任何表观基因组数据集,定量分析任何生物体的染色质可及性,但在这里我们将其应用于 MNase-seq 数据,以阐明整个酵母基因组中核小体和 150 个 TF 的蛋白质结合情况。利用文献中现有的蛋白质结合数据集,我们发现我们的模型能更准确地预测这些因子在全基因组的结合情况。
{"title":"RoboCOP: Multivariate State Space Model Integrating Epigenomic Accessibility Data to Elucidate Genome-Wide Chromatin Occupancy.","authors":"Sneha Mitra, Jianling Zhong, David M MacAlpine, Alexander J Hartemink","doi":"10.1007/978-3-030-45257-5_9","DOIUrl":"10.1007/978-3-030-45257-5_9","url":null,"abstract":"<p><p>Chromatin is the tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and level of occupancy of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. On the other hand, epigenomic accessibility data like ATAC-seq, DNase-seq, and MNase-seq provide insight into the chromatin landscape of all factors bound along the genome, but with minimal insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin information from epigenomic accessibility data with nucleotide sequence to compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors at once. RoboCOP can be applied to any epigenomic dataset that provides quantitative insight into chromatin accessibility in any organism, but here we apply it to MNase-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome. Using available protein-binding datasets from the literature, we show that our model more accurately predicts the binding of these factors genome-wide.</p>","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"12074 ","pages":"136-151"},"PeriodicalIF":0.0,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356533/pdf/nihms-1727027.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39306371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research in Computational Molecular Biology: 24th Annual International Conference, RECOMB 2020, Padua, Italy, May 10–13, 2020, Proceedings 计算分子生物学研究:第24届国际会议,RECOMB 2020,意大利帕多瓦,2020年5月10-13日,论文集
{"title":"Research in Computational Molecular Biology: 24th Annual International Conference, RECOMB 2020, Padua, Italy, May 10–13, 2020, Proceedings","authors":"","doi":"10.1007/978-3-030-45257-5","DOIUrl":"https://doi.org/10.1007/978-3-030-45257-5","url":null,"abstract":"","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78117909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context-Specific Nested Effects Models. 特定于上下文的嵌套效果模型。
Yuriy Sverchkov, Yi-Hsuan Ho, Audrey Gasch, Mark Craven

Advances in systems biology have made clear the importance of network models for capturing knowledge about complex relationships in gene regulation, metabolism, and cellular signaling. A common approach to uncovering biological networks involves performing perturbations on elements of the network, such as gene knockdown experiments, and measuring how the perturbation affects some reporter of the process under study. In this paper, we develop context-specific nested effects models (CSNEMs), an approach to inferring such networks that generalizes nested effect models (NEMs). The main contribution of this work is that CSNEMs explicitly model the participation of a gene in multiple contexts, meaning that a gene can appear in multiple places in the network. Biologically, the representation of regulators in multiple contexts may indicate that these regulators have distinct roles in different cellular compartments or cell cycle phases. We present an evaluation of the method on simulated data as well as on data from a study of the sodium chloride stress response in Saccharomyces cerevisiae.

系统生物学的进步已经明确了网络模型对于获取基因调控、代谢和细胞信号传导中复杂关系的知识的重要性。揭示生物网络的一种常见方法包括对网络元素进行扰动,例如基因敲低实验,并测量扰动如何影响正在研究的过程的某些报告者。在本文中,我们开发了上下文特定的嵌套效应模型(csnem),这是一种推断这种网络的方法,可以推广嵌套效应模型(nem)。这项工作的主要贡献是csnem明确地模拟了一个基因在多种环境中的参与,这意味着一个基因可以出现在网络中的多个位置。生物学上,调节因子在多种情况下的表现可能表明这些调节因子在不同的细胞室或细胞周期阶段具有不同的作用。我们在模拟数据上以及在酿酒酵母的氯化钠胁迫反应研究数据上对该方法进行了评估。
{"title":"Context-Specific Nested Effects Models.","authors":"Yuriy Sverchkov,&nbsp;Yi-Hsuan Ho,&nbsp;Audrey Gasch,&nbsp;Mark Craven","doi":"10.1007/978-3-319-89929-9_13","DOIUrl":"https://doi.org/10.1007/978-3-319-89929-9_13","url":null,"abstract":"<p><p>Advances in systems biology have made clear the importance of network models for capturing knowledge about complex relationships in gene regulation, metabolism, and cellular signaling. A common approach to uncovering biological networks involves performing perturbations on elements of the network, such as gene knockdown experiments, and measuring how the perturbation affects some reporter of the process under study. In this paper, we develop context-specific nested effects models (CSNEMs), an approach to inferring such networks that generalizes nested effect models (NEMs). The main contribution of this work is that CSNEMs explicitly model the participation of a gene in multiple <i>contexts</i>, meaning that a gene can appear in multiple places in the network. Biologically, the representation of regulators in multiple contexts may indicate that these regulators have distinct roles in different cellular compartments or cell cycle phases. We present an evaluation of the method on simulated data as well as on data from a study of the sodium chloride stress response in <i>Saccharomyces cerevisiae</i>.</p>","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"10812 ","pages":"194-210"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342457/pdf/nihms-1001013.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36895231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Generalizable visualization of mega-scale single-cell data. 超大规模单细胞数据的通用可视化。
Hyunghoon Cho, Bonnie Berger, Jian Peng
{"title":"Generalizable visualization of mega-scale single-cell data.","authors":"Hyunghoon Cho,&nbsp;Bonnie Berger,&nbsp;Jian Peng","doi":"","DOIUrl":"","url":null,"abstract":"","PeriodicalId":74675,"journal":{"name":"Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )","volume":"10812 ","pages":"251-253"},"PeriodicalIF":0.0,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5917073/pdf/nihms959802.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36068026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Research in computational molecular biology : ... Annual International Conference, RECOMB ... : proceedings. RECOMB (Conference : 2005- )
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1