模块代表提炼基因共表达模块。

IF 2 4区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY Physical biology Pub Date : 2023-05-04 DOI:10.1088/1478-3975/acce8d
Nathan Mankovich, Helene Andrews-Polymenis, David Threadgill, Michael Kirby
{"title":"模块代表提炼基因共表达模块。","authors":"Nathan Mankovich,&nbsp;Helene Andrews-Polymenis,&nbsp;David Threadgill,&nbsp;Michael Kirby","doi":"10.1088/1478-3975/acce8d","DOIUrl":null,"url":null,"abstract":"<p><p>This paper concerns the identification of gene co-expression modules in transcriptomics data, i.e. collections of genes which are highly co-expressed and potentially linked to a biological mechanism. Weighted gene co-expression network analysis (WGCNA) is a widely used method for module detection based on the computation of eigengenes, the weights of the first principal component for the module gene expression matrix. This eigengene has been used as a centroid in a<i>k</i>-means algorithm to improve module memberships. In this paper, we present four new module representatives: the eigengene subspace, flag mean, flag median and module expression vector. The eigengene subspace, flag mean and flag median are subspace module representatives which capture more variance of the gene expression within a module. The module expression vector is a weighted centroid of the module which leverages the structure of the module gene co-expression network. We use these module representatives in Linde-Buzo-Gray clustering algorithms to refine WGCNA module membership. We evaluate these methodologies on two transcriptomics data sets. We find that most of our module refinement techniques improve upon the WGCNA modules by two statistics: (1) module classification between phenotype and (2) module biological significance according to Gene Ontology terms.</p>","PeriodicalId":20207,"journal":{"name":"Physical biology","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Module representatives for refining gene co-expression modules.\",\"authors\":\"Nathan Mankovich,&nbsp;Helene Andrews-Polymenis,&nbsp;David Threadgill,&nbsp;Michael Kirby\",\"doi\":\"10.1088/1478-3975/acce8d\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This paper concerns the identification of gene co-expression modules in transcriptomics data, i.e. collections of genes which are highly co-expressed and potentially linked to a biological mechanism. Weighted gene co-expression network analysis (WGCNA) is a widely used method for module detection based on the computation of eigengenes, the weights of the first principal component for the module gene expression matrix. This eigengene has been used as a centroid in a<i>k</i>-means algorithm to improve module memberships. In this paper, we present four new module representatives: the eigengene subspace, flag mean, flag median and module expression vector. The eigengene subspace, flag mean and flag median are subspace module representatives which capture more variance of the gene expression within a module. The module expression vector is a weighted centroid of the module which leverages the structure of the module gene co-expression network. We use these module representatives in Linde-Buzo-Gray clustering algorithms to refine WGCNA module membership. We evaluate these methodologies on two transcriptomics data sets. We find that most of our module refinement techniques improve upon the WGCNA modules by two statistics: (1) module classification between phenotype and (2) module biological significance according to Gene Ontology terms.</p>\",\"PeriodicalId\":20207,\"journal\":{\"name\":\"Physical biology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2023-05-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1088/1478-3975/acce8d\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1088/1478-3975/acce8d","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

本文关注转录组学数据中基因共表达模块的鉴定,即高度共表达并可能与生物机制相关的基因集合。加权基因共表达网络分析(Weighted gene co-expression network analysis, WGCNA)是一种广泛应用于模块检测的方法,它基于特征基因的计算,即模块基因表达矩阵的第一主成分的权重。该特征基因被用作ak-means算法的质心,以提高模块的隶属度。本文提出了四种新的模表示:特征子空间、标志均值、标志中值和模表达向量。特征基因子空间、标志均值和标志中位数是子空间模块表示,它们捕获了一个模块内基因表达的更多方差。模块表达载体是利用模块基因共表达网络结构的模块的加权质心。我们在Linde-Buzo-Gray聚类算法中使用这些模块代表来优化WGCNA模块的隶属关系。我们在两个转录组学数据集上评估了这些方法。我们发现我们的大多数模块优化技术通过两个统计来改进WGCNA模块:(1)表型之间的模块分类;(2)根据基因本体术语的模块生物学意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Module representatives for refining gene co-expression modules.

This paper concerns the identification of gene co-expression modules in transcriptomics data, i.e. collections of genes which are highly co-expressed and potentially linked to a biological mechanism. Weighted gene co-expression network analysis (WGCNA) is a widely used method for module detection based on the computation of eigengenes, the weights of the first principal component for the module gene expression matrix. This eigengene has been used as a centroid in ak-means algorithm to improve module memberships. In this paper, we present four new module representatives: the eigengene subspace, flag mean, flag median and module expression vector. The eigengene subspace, flag mean and flag median are subspace module representatives which capture more variance of the gene expression within a module. The module expression vector is a weighted centroid of the module which leverages the structure of the module gene co-expression network. We use these module representatives in Linde-Buzo-Gray clustering algorithms to refine WGCNA module membership. We evaluate these methodologies on two transcriptomics data sets. We find that most of our module refinement techniques improve upon the WGCNA modules by two statistics: (1) module classification between phenotype and (2) module biological significance according to Gene Ontology terms.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Physical biology
Physical biology 生物-生物物理
CiteScore
4.20
自引率
0.00%
发文量
50
审稿时长
3 months
期刊介绍: Physical Biology publishes articles in the broad interdisciplinary field bridging biology with the physical sciences and engineering. This journal focuses on research in which quantitative approaches – experimental, theoretical and modeling – lead to new insights into biological systems at all scales of space and time, and all levels of organizational complexity. Physical Biology accepts contributions from a wide range of biological sub-fields, including topics such as: molecular biophysics, including single molecule studies, protein-protein and protein-DNA interactions subcellular structures, organelle dynamics, membranes, protein assemblies, chromosome structure intracellular processes, e.g. cytoskeleton dynamics, cellular transport, cell division systems biology, e.g. signaling, gene regulation and metabolic networks cells and their microenvironment, e.g. cell mechanics and motility, chemotaxis, extracellular matrix, biofilms cell-material interactions, e.g. biointerfaces, electrical stimulation and sensing, endocytosis cell-cell interactions, cell aggregates, organoids, tissues and organs developmental dynamics, including pattern formation and morphogenesis physical and evolutionary aspects of disease, e.g. cancer progression, amyloid formation neuronal systems, including information processing by networks, memory and learning population dynamics, ecology, and evolution collective action and emergence of collective phenomena.
期刊最新文献
A role of fear on diseased food web model with multiple functional response. Two fitness inference schemes compared using allele frequencies from 1,068,391 sequences sampled in the UK during the COVID-19 pandemic. Unraveling the role of exercise in cancer suppression: insights from a mathematical model. An exactly solvable model for RNA polymerase during the elongation stage. A theoretical framework for predicting the heterogeneous stiffness map of brain white matter tissue.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1