首页 > 最新文献

Statistical Applications in Genetics and Molecular Biology最新文献

英文 中文
GMEPS: a fast and efficient likelihood approach for genome-wide mediation analysis under extreme phenotype sequencing GMEPS:一种在极端表型测序下进行全基因组介导分析的快速有效的可能性方法
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2022-01-01 DOI: 10.1515/sagmb-2021-0071
J. Liyanage, J. Estepp, K. Srivastava, Yun Li, Motomi Mori, G. Kang
Abstract Due to many advantages such as higher statistical power of detecting the association of genetic variants in human disorders and cost saving, extreme phenotype sequencing (EPS) is a rapidly emerging study design in epidemiological and clinical studies investigating how genetic variations associate with complex phenotypes. However, the investigation of the mediation effect of genetic variants on phenotypes is strictly restrictive under the EPS design because existing methods cannot well accommodate the non-random extreme tails sampling process incurred by the EPS design. In this paper, we propose a likelihood approach for testing the mediation effect of genetic variants through continuous and binary mediators on a continuous phenotype under the EPS design (GMEPS). Besides implementing in EPS design, it can also be utilized as a general mediation analysis procedure. Extensive simulations and two real data applications of a genome-wide association study of benign ethnic neutropenia under EPS design and a candidate-gene study of neurocognitive performance in patients with sickle cell disease under random sampling design demonstrate the superiority of GMEPS under the EPS design over widely used mediation analysis procedures, while demonstrating compatible capabilities under the general random sampling framework.
极端表型测序(extreme phenotype sequencing, EPS)由于具有检测人类疾病中遗传变异关联的较高统计能力和节省成本等诸多优势,在流行病学和临床研究中,研究遗传变异与复杂表型之间的关系是一种迅速兴起的研究设计。然而,由于现有方法不能很好地适应EPS设计带来的非随机极端尾抽样过程,因此在EPS设计下,遗传变异对表型的中介效应的研究受到严格限制。在本文中,我们提出了一种可能性方法来测试遗传变异在EPS设计(GMEPS)下通过连续和二元介质对连续表型的中介效应。除了在EPS设计中实现外,还可以作为通用的中介分析程序使用。一项基于EPS设计的良性少数民族中性粒细胞减少的全基因组关联研究和一项基于随机抽样设计的镰状细胞病患者神经认知表现的候选基因研究的广泛模拟和两个实际数据应用表明,EPS设计下的GMEPS优于广泛使用的中介分析程序,同时显示了在一般随机抽样框架下的兼容能力。
{"title":"GMEPS: a fast and efficient likelihood approach for genome-wide mediation analysis under extreme phenotype sequencing","authors":"J. Liyanage, J. Estepp, K. Srivastava, Yun Li, Motomi Mori, G. Kang","doi":"10.1515/sagmb-2021-0071","DOIUrl":"https://doi.org/10.1515/sagmb-2021-0071","url":null,"abstract":"Abstract Due to many advantages such as higher statistical power of detecting the association of genetic variants in human disorders and cost saving, extreme phenotype sequencing (EPS) is a rapidly emerging study design in epidemiological and clinical studies investigating how genetic variations associate with complex phenotypes. However, the investigation of the mediation effect of genetic variants on phenotypes is strictly restrictive under the EPS design because existing methods cannot well accommodate the non-random extreme tails sampling process incurred by the EPS design. In this paper, we propose a likelihood approach for testing the mediation effect of genetic variants through continuous and binary mediators on a continuous phenotype under the EPS design (GMEPS). Besides implementing in EPS design, it can also be utilized as a general mediation analysis procedure. Extensive simulations and two real data applications of a genome-wide association study of benign ethnic neutropenia under EPS design and a candidate-gene study of neurocognitive performance in patients with sickle cell disease under random sampling design demonstrate the superiority of GMEPS under the EPS design over widely used mediation analysis procedures, while demonstrating compatible capabilities under the general random sampling framework.","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":" ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46601961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Inference of genetic regulatory networks with regulatory hubs using vector autoregressions and automatic relevance determination with model selections. 利用向量自回归和模型选择的自动相关性确定来推断具有调控枢纽的遗传调控网络。
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-12-28 DOI: 10.1515/sagmb-2020-0054
Chi-Kan Chen

The inference of genetic regulatory networks (GRNs) reveals how genes interact with each other. A few genes can regulate many genes as targets to control cell functions. We present new methods based on the order-1 vector autoregression (VAR1) for inferring GRNs from gene expression time series. The methods use the automatic relevance determination (ARD) to incorporate the regulatory hub structure into the estimation of VAR1 in a Bayesian framework. Several sparse approximation schemes are applied to the estimated regression weights or VAR1 model to generate the sparse weighted adjacency matrices representing the inferred GRNs. We apply the proposed and several widespread reference methods to infer GRNs with up to 100 genes using simulated, DREAM4 in silico and experimental E. coli gene expression time series. We show that the proposed methods are efficient on simulated hub GRNs and scale-free GRNs using short time series simulated by VAR1s and outperform reference methods on small-scale DREAM4 in silico GRNs and E. coli GRNs. They can utilize the known major regulatory hubs to improve the performance on larger DREAM4 in silico GRNs and E. coli GRNs. The impact of nonlinear time series data on the performance of proposed methods is discussed.

遗传调控网络(grn)的推断揭示了基因之间如何相互作用。少数基因可以调控许多基因作为靶标来控制细胞功能。我们提出了基于order-1向量自回归(VAR1)的新方法,用于从基因表达时间序列推断grn。该方法使用自动相关性确定(ARD)将监管枢纽结构纳入贝叶斯框架中VAR1的估计。将几种稀疏逼近方案应用于估计的回归权值或VAR1模型,生成表示推断grn的稀疏加权邻接矩阵。我们利用模拟的DREAM4和实验的大肠杆菌基因表达时间序列,应用所提出的方法和几种广泛的参考方法来推断多达100个基因的grn。研究表明,该方法在模拟轮毂grn和使用var1模拟的短时间序列的无标度grn上是有效的,并且在小型DREAM4硅grn和大肠杆菌grn上优于参考方法。他们可以利用已知的主要调控中心来提高更大的DREAM4硅grn和大肠杆菌grn的性能。讨论了非线性时间序列数据对所提方法性能的影响。
{"title":"Inference of genetic regulatory networks with regulatory hubs using vector autoregressions and automatic relevance determination with model selections.","authors":"Chi-Kan Chen","doi":"10.1515/sagmb-2020-0054","DOIUrl":"https://doi.org/10.1515/sagmb-2020-0054","url":null,"abstract":"<p><p>The inference of genetic regulatory networks (GRNs) reveals how genes interact with each other. A few genes can regulate many genes as targets to control cell functions. We present new methods based on the order-1 vector autoregression (VAR1) for inferring GRNs from gene expression time series. The methods use the automatic relevance determination (ARD) to incorporate the regulatory hub structure into the estimation of VAR1 in a Bayesian framework. Several sparse approximation schemes are applied to the estimated regression weights or VAR1 model to generate the sparse weighted adjacency matrices representing the inferred GRNs. We apply the proposed and several widespread reference methods to infer GRNs with up to 100 genes using simulated, DREAM4 in silico and experimental <i>E. coli</i> gene expression time series. We show that the proposed methods are efficient on simulated hub GRNs and scale-free GRNs using short time series simulated by VAR1s and outperform reference methods on small-scale DREAM4 in silico GRNs and <i>E. coli</i> GRNs. They can utilize the known major regulatory hubs to improve the performance on larger DREAM4 in silico GRNs and <i>E. coli</i> GRNs. The impact of nonlinear time series data on the performance of proposed methods is discussed.</p>","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":"20 4-6","pages":"121-143"},"PeriodicalIF":0.9,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39646200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Batch effect reduction of microarray data with dependent samples using an empirical Bayes approach (BRIDGE). 使用经验贝叶斯方法(BRIDGE)减少具有依赖性样本的微阵列数据的批次效应。
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-12-14 DOI: 10.1515/sagmb-2021-0020
Qing Xia, Jeffrey A Thompson, Devin C Koestler

Batch-effects present challenges in the analysis of high-throughput molecular data and are particularly problematic in longitudinal studies when interest lies in identifying genes/features whose expression changes over time, but time is confounded with batch. While many methods to correct for batch-effects exist, most assume independence across samples; an assumption that is unlikely to hold in longitudinal microarray studies. We propose Batch effect Reduction of mIcroarray data with Dependent samples usinGEmpirical Bayes (BRIDGE), a three-step parametric empirical Bayes approach that leverages technical replicate samples profiled at multiple timepoints/batches, so-called "bridge samples", to inform batch-effect reduction/attenuation in longitudinal microarray studies. Extensive simulation studies and an analysis of a real biological data set were conducted to benchmark the performance of BRIDGE against both ComBat and longitudinalComBat. Our results demonstrate that while all methods perform well in facilitating accurate estimates of time effects, BRIDGE outperforms both ComBat and longitudinal ComBat in the removal of batch-effects in data sets with bridging samples, and perhaps as a result, was observed to have improved statistical power for detecting genes with a time effect. BRIDGE demonstrated competitive performance in batch effect reduction of confounded longitudinal microarray studies, both in simulated and a real data sets, and may serve as a useful preprocessing method for researchers conducting longitudinal microarray studies that include bridging samples.

批次效应给高通量分子数据分析带来了挑战,尤其是在纵向研究中,当研究兴趣在于识别表达随时间变化的基因/特征,但时间与批次混淆时,批次效应更是问题重重。虽然有很多方法可以校正批次效应,但大多数方法都假设不同样本之间是独立的,而这一假设在纵向微阵列研究中不太可能成立。我们提出了使用经验贝叶斯降低依赖样本的微阵列数据批次效应(BRIDGE),这是一种三步参数经验贝叶斯方法,它利用在多个时间点/批次剖析的技术复制样本(即所谓的 "桥样本"),为纵向微阵列研究中批次效应的降低/减弱提供信息。我们进行了广泛的模拟研究和对真实生物数据集的分析,以对照 ComBat 和 longitudinalComBat 对 BRIDGE 的性能进行基准测试。我们的结果表明,虽然所有方法都能很好地促进时间效应的准确估计,但 BRIDGE 在消除具有桥接样本的数据集中的批次效应方面优于 ComBat 和纵向 ComBat,因此,在检测具有时间效应的基因方面,BRIDGE 的统计能力也得到了提高。无论是在模拟数据集还是真实数据集中,BRIDGE 在减少纵向微阵列研究中的批次效应方面都表现出了很强的竞争力,可以作为研究人员进行包含桥接样本的纵向微阵列研究的一种有用的预处理方法。
{"title":"Batch effect reduction of microarray data with dependent samples using an empirical Bayes approach (BRIDGE).","authors":"Qing Xia, Jeffrey A Thompson, Devin C Koestler","doi":"10.1515/sagmb-2021-0020","DOIUrl":"10.1515/sagmb-2021-0020","url":null,"abstract":"<p><p>Batch-effects present challenges in the analysis of high-throughput molecular data and are particularly problematic in longitudinal studies when interest lies in identifying genes/features whose expression changes over time, but time is confounded with batch. While many methods to correct for batch-effects exist, most assume independence across samples; an assumption that is unlikely to hold in longitudinal microarray studies. We propose <u>B</u>atch effect <u>R</u>eduction of m<u>I</u>croarray data with <u>D</u>ependent samples usin<u>G</u><u>E</u>mpirical Bayes (<i>BRIDGE</i>), a three-step parametric empirical Bayes approach that leverages technical replicate samples profiled at multiple timepoints/batches, so-called \"bridge samples\", to inform batch-effect reduction/attenuation in longitudinal microarray studies. Extensive simulation studies and an analysis of a real biological data set were conducted to benchmark the performance of <i>BRIDGE</i> against both <i>ComBat</i> and <i>longitudinal</i><i>ComBat</i>. Our results demonstrate that while all methods perform well in facilitating accurate estimates of time effects, <i>BRIDGE</i> outperforms both <i>ComBat</i> and <i>longitudinal ComBat</i> in the removal of batch-effects in data sets with bridging samples, and perhaps as a result, was observed to have improved statistical power for detecting genes with a time effect. <i>BRIDGE</i> demonstrated competitive performance in batch effect reduction of confounded longitudinal microarray studies, both in simulated and a real data sets, and may serve as a useful preprocessing method for researchers conducting longitudinal microarray studies that include bridging samples.</p>","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":"20 4-6","pages":"101-119"},"PeriodicalIF":0.9,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9617207/pdf/nihms-1843789.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39586240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frontmatter
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-12-01 DOI: 10.1515/sagmb-2021-frontmatter4-6
{"title":"Frontmatter","authors":"","doi":"10.1515/sagmb-2021-frontmatter4-6","DOIUrl":"https://doi.org/10.1515/sagmb-2021-frontmatter4-6","url":null,"abstract":"","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":" ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43944170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix. 基于拓扑重叠矩阵多线程计算的优化加权基因共表达网络分析。
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-11-09 DOI: 10.1515/sagmb-2021-0025
Min Shuai, Dongmei He, Xin Chen

Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.

生物分子网络通常被认为是无标度的分层网络。加权基因共表达网络分析(WGCNA)将基因共表达网络视为无向无标度分层加权网络。WGCNA R软件包使用邻接矩阵来存储网络,然后计算拓扑重叠矩阵(TOM),然后识别模块(子网络),其中每个模块被认为与特定的生物功能相关联。WGCNA中最耗时的一步是从单线程的邻接矩阵中计算TOM。本文将TOM的单线程算法改为多线程算法(参数为WGCNA的默认值)。在多线程算法中,使用Rcpp让R调用一个c++函数,然后c++使用OpenMP启动多个线程从邻接矩阵中计算TOM。在共享内存多处理器系统上,计算时间随着CPU核数的增加而减少。本文的算法可以促进WGCNA在大数据集上的应用,并有助于其他研究领域对无向无标度分层加权网络中的子网络进行识别。源代码和用法可从https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA获得。
{"title":"Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix.","authors":"Min Shuai,&nbsp;Dongmei He,&nbsp;Xin Chen","doi":"10.1515/sagmb-2021-0025","DOIUrl":"https://doi.org/10.1515/sagmb-2021-0025","url":null,"abstract":"<p><p>Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.</p>","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":"20 4-6","pages":"145-153"},"PeriodicalIF":0.9,"publicationDate":"2021-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39696432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A hierarchical Bayesian approach for detecting global microbiome associations. 检测全球微生物组关联的分层贝叶斯方法。
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-11-01 DOI: 10.1515/sagmb-2021-0047
Farhad Hatami, Emma Beamish, Albert Davies, Rachael Rigby, Frank Dondelinger

The human gut microbiome has been shown to be associated with a variety of human diseases, including cancer, metabolic conditions and inflammatory bowel disease. Current approaches for detecting microbiome associations are limited by relying on specific measures of ecological distance, or only allowing for the detection of associations with individual bacterial species, rather than the whole microbiome. In this work, we develop a novel hierarchical Bayesian model for detecting global microbiome associations. Our method is not dependent on a choice of distance measure, and is able to incorporate phylogenetic information about microbial species. We perform extensive simulation studies and show that our method allows for consistent estimation of global microbiome effects. Additionally, we investigate the performance of the model on two real-world microbiome studies: a study of microbiome-metabolome associations in inflammatory bowel disease, and a study of associations between diet and the gut microbiome in mice. We show that we can use the method to reliably detect associations in real-world datasets with varying numbers of samples and covariates.

人类肠道微生物组已被证明与多种人类疾病有关,包括癌症、代谢疾病和炎症性肠病。目前检测微生物组关联的方法受到限制,因为它们依赖于特定的生态距离测量,或者只能检测与单个细菌物种而非整个微生物组的关联。在这项工作中,我们开发了一种新型分层贝叶斯模型,用于检测全球微生物组关联。我们的方法不依赖于距离度量的选择,并且能够纳入微生物物种的系统发育信息。我们进行了大量的模拟研究,结果表明我们的方法可以对全球微生物组效应进行一致的估计。此外,我们还调查了该模型在两项实际微生物组研究中的表现:一项是炎症性肠病中微生物组-代谢组关联研究,另一项是小鼠饮食与肠道微生物组关联研究。我们的研究表明,我们可以用这种方法在样本数量和协变量各不相同的真实世界数据集中可靠地检测出相关性。
{"title":"A hierarchical Bayesian approach for detecting global microbiome associations.","authors":"Farhad Hatami, Emma Beamish, Albert Davies, Rachael Rigby, Frank Dondelinger","doi":"10.1515/sagmb-2021-0047","DOIUrl":"10.1515/sagmb-2021-0047","url":null,"abstract":"<p><p>The human gut microbiome has been shown to be associated with a variety of human diseases, including cancer, metabolic conditions and inflammatory bowel disease. Current approaches for detecting microbiome associations are limited by relying on specific measures of ecological distance, or only allowing for the detection of associations with individual bacterial species, rather than the whole microbiome. In this work, we develop a novel hierarchical Bayesian model for detecting global microbiome associations. Our method is not dependent on a choice of distance measure, and is able to incorporate phylogenetic information about microbial species. We perform extensive simulation studies and show that our method allows for consistent estimation of global microbiome effects. Additionally, we investigate the performance of the model on two real-world microbiome studies: a study of microbiome-metabolome associations in inflammatory bowel disease, and a study of associations between diet and the gut microbiome in mice. We show that we can use the method to reliably detect associations in real-world datasets with varying numbers of samples and covariates.</p>","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":"20 3","pages":"85-100"},"PeriodicalIF":0.9,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9125803/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39574241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frontmatter
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-10-01 DOI: 10.1515/sagmb-2021-frontmatter3
{"title":"Frontmatter","authors":"","doi":"10.1515/sagmb-2021-frontmatter3","DOIUrl":"https://doi.org/10.1515/sagmb-2021-frontmatter3","url":null,"abstract":"","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":" ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43757619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frontmatter Frontmatter
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2021-02-01 DOI: 10.1515/sagmb-2021-frontmatter1
{"title":"Frontmatter","authors":"","doi":"10.1515/sagmb-2021-frontmatter1","DOIUrl":"https://doi.org/10.1515/sagmb-2021-frontmatter1","url":null,"abstract":"","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":"1 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2021-frontmatter1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42140367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring evolutionary cancer dynamics from genome sequencing, one patient at a time 通过基因组测序测量癌症的进化动态,每次一名患者
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2020-12-01 DOI: 10.1515/sagmb-2020-0075
G. Caravagna
Abstract Cancers progress through the accumulation of somatic mutations which accrue during tumour evolution, allowing some cells to proliferate in an uncontrolled fashion. This growth process is intimately related to latent evolutionary forces moulding the genetic and epigenetic composition of tumour subpopulations. Understanding cancer requires therefore the understanding of these selective pressures. The adoption of widespread next-generation sequencing technologies opens up for the possibility of measuring molecular profiles of cancers at multiple resolutions, across one or multiple patients. In this review we discuss how cancer genome sequencing data from a single tumour can be used to understand these evolutionary forces, overviewing mathematical models and inferential methods adopted in field of Cancer Evolution.
摘要癌症通过在肿瘤进化过程中积累的体细胞突变来发展,使一些细胞以不受控制的方式增殖。这种生长过程与塑造肿瘤亚群遗传和表观遗传组成的潜在进化力密切相关。因此,理解癌症需要理解这些选择性压力。广泛采用的下一代测序技术为以多种分辨率测量一名或多名患者的癌症分子谱开辟了可能性。在这篇综述中,我们讨论了如何使用单个肿瘤的癌症基因组测序数据来理解这些进化力,概述了癌症进化领域采用的数学模型和推理方法。
{"title":"Measuring evolutionary cancer dynamics from genome sequencing, one patient at a time","authors":"G. Caravagna","doi":"10.1515/sagmb-2020-0075","DOIUrl":"https://doi.org/10.1515/sagmb-2020-0075","url":null,"abstract":"Abstract Cancers progress through the accumulation of somatic mutations which accrue during tumour evolution, allowing some cells to proliferate in an uncontrolled fashion. This growth process is intimately related to latent evolutionary forces moulding the genetic and epigenetic composition of tumour subpopulations. Understanding cancer requires therefore the understanding of these selective pressures. The adoption of widespread next-generation sequencing technologies opens up for the possibility of measuring molecular profiles of cancers at multiple resolutions, across one or multiple patients. In this review we discuss how cancer genome sequencing data from a single tumour can be used to understand these evolutionary forces, overviewing mathematical models and inferential methods adopted in field of Cancer Evolution.","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":" ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2020-0075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45496562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Inferring dynamic gene regulatory networks with low-order conditional independencies – an evaluation of the method 推断具有低阶条件独立性的动态基因调控网络-对该方法的评价
IF 0.9 4区 数学 Q3 Mathematics Pub Date : 2020-12-01 DOI: 10.1515/sagmb-2020-0051
Hamda Ajmal, M. G. Madden
Abstract Over a decade ago, Lèbre (2009) proposed an inference method, G1DBN, to learn the structure of gene regulatory networks (GRNs) from high dimensional, sparse time-series gene expression data. Their approach is based on concept of low-order conditional independence graphs that they extend to dynamic Bayesian networks (DBNs). They present results to demonstrate that their method yields better structural accuracy compared to the related Lasso and Shrinkage methods, particularly where the data is sparse, that is, the number of time measurements n is much smaller than the number of genes p. This paper challenges these claims using a careful experimental analysis, to show that the GRNs reverse engineered from time-series data using the G1DBN approach are less accurate than claimed by Lèbre (2009). We also show that the Lasso method yields higher structural accuracy for graphs learned from the simulated data, compared to the G1DBN method, particularly when the data is sparse ( n < < p $n{< }{< }p$ ). The Lasso method is also better than G1DBN at identifying the transcription factors (TFs) involved in the cell cycle of Saccharomyces cerevisiae.
摘要十多年前,Lèbre(2009)提出了一种推理方法G1DBN,用于从高维、稀疏的时间序列基因表达数据中学习基因调控网络(GRN)的结构。他们的方法基于低阶条件独立图的概念,并将其扩展到动态贝叶斯网络(DBN)。他们提出的结果表明,与相关的拉索和收缩方法相比,他们的方法产生了更好的结构精度,特别是在数据稀疏的情况下,即时间测量的数量n远小于基因的数量p。本文通过仔细的实验分析对这些说法提出了质疑,以表明使用G1DBN方法从时间序列数据中反向工程的GRN不如Lèbre(2009)所声称的准确。我们还表明,与G1DBN方法相比,Lasso方法对从模拟数据中学习的图产生了更高的结构精度,特别是当数据稀疏时(n<
{"title":"Inferring dynamic gene regulatory networks with low-order conditional independencies – an evaluation of the method","authors":"Hamda Ajmal, M. G. Madden","doi":"10.1515/sagmb-2020-0051","DOIUrl":"https://doi.org/10.1515/sagmb-2020-0051","url":null,"abstract":"Abstract Over a decade ago, Lèbre (2009) proposed an inference method, G1DBN, to learn the structure of gene regulatory networks (GRNs) from high dimensional, sparse time-series gene expression data. Their approach is based on concept of low-order conditional independence graphs that they extend to dynamic Bayesian networks (DBNs). They present results to demonstrate that their method yields better structural accuracy compared to the related Lasso and Shrinkage methods, particularly where the data is sparse, that is, the number of time measurements n is much smaller than the number of genes p. This paper challenges these claims using a careful experimental analysis, to show that the GRNs reverse engineered from time-series data using the G1DBN approach are less accurate than claimed by Lèbre (2009). We also show that the Lasso method yields higher structural accuracy for graphs learned from the simulated data, compared to the G1DBN method, particularly when the data is sparse ( n < < p $n{< }{< }p$ ). The Lasso method is also better than G1DBN at identifying the transcription factors (TFs) involved in the cell cycle of Saccharomyces cerevisiae.","PeriodicalId":49477,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":" ","pages":""},"PeriodicalIF":0.9,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2020-0051","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46568594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Statistical Applications in Genetics and Molecular Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1