Application of Granger causality to gene regulatory network discovery

G. Tam, Chunqi Chang, Y. Hung
{"title":"Application of Granger causality to gene regulatory network discovery","authors":"G. Tam, Chunqi Chang, Y. Hung","doi":"10.1109/ISB.2012.6314142","DOIUrl":null,"url":null,"abstract":"Granger causality (GC) has been applied to gene regulatory network discovery using DNA microarray time-series data. Since the number of genes is much larger than the data length, a full model cannot be applied in a straightforward manner, hence GC is often applied to genes pairwisely. In this paper, firstly we investigate with synthetic data and point out how spurious causalities (false discoveries) may emerge in pairwise GC detection. In addition, spurious causalities may also arise if the order of the vector autoregressive model is not high enough. Therefore, besides using a suitable model order, we recommend a full model over pairwise GC. This is possible if pairwise GC is first used to identify a network of interactions among only a few genes, and then all these interactions are validated with a full model again. If a full model is not possible, we recommend using model validation techniques to remove spurious discoveries. Secondly, we apply pairwise GC with model validation to a real dataset (HeLa). To estimate the model order, the Akaike information criterion is found to be more suitable than the Bayesian information criterion. Degree distribution and network hubs are obtained and compared with previous publications. The hubs tend to act as sources of interactions rather than receivers of interactions.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 6th International Conference on Systems Biology (ISB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISB.2012.6314142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Granger causality (GC) has been applied to gene regulatory network discovery using DNA microarray time-series data. Since the number of genes is much larger than the data length, a full model cannot be applied in a straightforward manner, hence GC is often applied to genes pairwisely. In this paper, firstly we investigate with synthetic data and point out how spurious causalities (false discoveries) may emerge in pairwise GC detection. In addition, spurious causalities may also arise if the order of the vector autoregressive model is not high enough. Therefore, besides using a suitable model order, we recommend a full model over pairwise GC. This is possible if pairwise GC is first used to identify a network of interactions among only a few genes, and then all these interactions are validated with a full model again. If a full model is not possible, we recommend using model validation techniques to remove spurious discoveries. Secondly, we apply pairwise GC with model validation to a real dataset (HeLa). To estimate the model order, the Akaike information criterion is found to be more suitable than the Bayesian information criterion. Degree distribution and network hubs are obtained and compared with previous publications. The hubs tend to act as sources of interactions rather than receivers of interactions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
格兰杰因果关系在基因调控网络发现中的应用
格兰杰因果关系(GC)已被应用于利用DNA微阵列时间序列数据发现基因调控网络。由于基因的数量远远大于数据长度,因此不能直接应用完整的模型,因此GC通常应用于成对的基因。在本文中,我们首先用合成数据进行研究,并指出在成对GC检测中可能出现虚假因果关系(错误发现)。此外,如果向量自回归模型的阶数不够高,也可能产生虚假的因果关系。因此,除了使用合适的模型顺序外,我们建议使用完整模型而不是成对GC。这是可能的,如果配对GC首先用于识别仅在少数基因之间的相互作用网络,然后再次用完整的模型验证所有这些相互作用。如果一个完整的模型是不可能的,我们建议使用模型验证技术来删除虚假的发现。其次,我们将配对GC与模型验证应用于真实数据集(HeLa)。对于模型阶数的估计,发现赤池信息准则比贝叶斯信息准则更合适。获得了学位分布和网络枢纽,并与以前的出版物进行了比较。集线器倾向于充当交互的来源,而不是交互的接收者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A fixed-point blind source extraction algorithm and its application to ECG data analysis Comparing two models based on the transcriptional regulation by KaiC of cyanobacteria rhythm Predicting protein complexes via the integration of multiple biological information Effective clustering of microRNA sequences by N-grams and feature weighting RNA-seq coverage effects on biological pathways and GO tag clouds
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1