An Entropy-based Statistical Workflow Provides Noise-Minimizing Biological Annotation for Muscular Aging

Theodoros Koutsandreas, I. Valavanis, E. Pilalis, A. Chatziioannou
{"title":"An Entropy-based Statistical Workflow Provides Noise-Minimizing Biological Annotation for","authors":"Muscular Aging, Theodoros Koutsandreas, I. Valavanis, E. Pilalis, A. Chatziioannou","doi":"10.1109/ISB.2014.6990749","DOIUrl":null,"url":null,"abstract":"This study aims to expand the efficiency of the interpretation concerning the aging process, by exploring a broad gene set, derived from the analysis of an integrative transcriptomic microarray dataset. The dataset comprises human skeletal muscle samples, obtained from healthy males and females, that were used to derive a gene signature of a high informative content, with respect to its functional association with the aging phenotype. Towards this end, a multilayered computational workflow integrating advanced statistical methodologies for the derivation of reliable confidence measures, distribution-based entropy calculations to examine the informational content of the dataset, enrichment analysis, graph-theoretic methods and intuitive visualization was applied. Specifically, statistical testing revealed differentially expressed genes, while an uncertainty calculation algorithm, exploiting Gene Ontology (GO) terms annotations, extended the list of significant genes from 254 to 2791, namely p-value threshold was increased from 0.0005 to 0.103, while keeping simultaneously noise measurements legitimately low. This rich gene set associated functionally the macroscopic phenotype of muscular aging with highly informative, stably correlated with each other, molecular annotations in the GO database. Finally, a set of 57 reliable genes was identified that comprise a gender-independent aging signature, after incorporating crucial information about genes pivotal regulatory role as inferred by the GO tree. 
The biological interpretation was highly assisted by the illustration of the functional mappings between genes, cellular location and biological processes through circle packing graphs.","PeriodicalId":249103,"journal":{"name":"2014 8th International Conference on Systems Biology (ISB)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 8th International Conference on Systems Biology (ISB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISB.2014.6990749","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

This study aims to improve the interpretation of the aging process by exploring a broad gene set derived from the analysis of an integrative transcriptomic microarray dataset. The dataset comprises human skeletal muscle samples obtained from healthy males and females, which were used to derive a gene signature with high informational content regarding its functional association with the aging phenotype. To this end, a multilayered computational workflow was applied that integrates advanced statistical methods for deriving reliable confidence measures, distribution-based entropy calculations for examining the informational content of the dataset, enrichment analysis, graph-theoretic methods, and intuitive visualization. Specifically, statistical testing revealed differentially expressed genes, while an uncertainty calculation algorithm exploiting Gene Ontology (GO) term annotations extended the list of significant genes from 254 to 2791; that is, the p-value threshold was raised from 0.0005 to 0.103 while noise measurements were kept acceptably low. This rich gene set functionally associated the macroscopic phenotype of muscular aging with highly informative, stably intercorrelated molecular annotations in the GO database. Finally, a set of 57 reliable genes comprising a gender-independent aging signature was identified, after incorporating information about the genes' pivotal regulatory roles as inferred from the GO tree. The biological interpretation was greatly assisted by circle packing graphs illustrating the functional mappings between genes, cellular locations, and biological processes.
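
The abstract does not say which entropy measure the workflow uses or how noise is quantified, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes Shannon entropy over GO-term counts and compares the gene list selected at the two p-value thresholds quoted above (0.0005 and 0.103). The gene names, term labels, p-values, and the helper names `shannon_entropy` and `go_term_entropy` are all hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(counts):
    """Shannon entropy in bits of a distribution given as non-negative counts."""
    counts = [c for c in counts if c > 0]
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts)


def go_term_entropy(p_values, annotations, threshold):
    """Select genes with p-value <= threshold and return
    (number of selected genes, entropy of their GO-term usage).

    Assumption: entropy is taken over how often each term annotates
    the selected genes; the paper may define it differently.
    """
    selected = [gene for gene, p in p_values.items() if p <= threshold]
    term_counts = Counter(
        term for gene in selected for term in annotations.get(gene, ())
    )
    return len(selected), shannon_entropy(term_counts.values())


# Hypothetical toy data: four genes with made-up p-values and term labels.
p_values = {"geneA": 0.0001, "geneB": 0.0004, "geneC": 0.05, "geneD": 0.2}
annotations = {
    "geneA": ["term1"],
    "geneB": ["term1"],
    "geneC": ["term2", "term3"],
    "geneD": ["term4"],
}

strict = go_term_entropy(p_values, annotations, threshold=0.0005)
relaxed = go_term_entropy(p_values, annotations, threshold=0.103)
print("strict threshold:", strict)
print("relaxed threshold:", relaxed)
```

In this toy setting, relaxing the threshold admits one more gene and spreads the annotations over more terms, so the entropy rises. A real workflow would need a criterion for how much of that rise reflects added information rather than noise, which the abstract does not detail.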