{"title":"Networks and models for the integrated analysis of multi omics data","authors":"Sun Kim","doi":"10.1109/BIBM.2016.7822479","DOIUrl":null,"url":null,"abstract":"These days, genome-wide measurements of genetic and epigenetics events, a.k.a omics data, are routinely produced; epigenetics is control mechanisms of genetics events as epi-means ‘on’ or ‘upon’. As a result, a huge amount of omics data measured from different genetic and epigenetic events are available. For example, the amount of data at The Cancer Genome Atlas(TCGA) alone exceeds 2.5 peta byte as of October 2016. Unfortunately, the dimensions of omics data is huge, typically tens to hundreds or even millions of thousands while the number of samples are limited typically a few to thousands. Thus mining genetic and epigenetic data measured in different phenotype conditions is a very challenging problem, that is, small data sets on extremely high dimensions. Furthermore, all genetic and epigenetic events are inter-related. Thus it is necessary to perform integrated analysis of omics data sets of different types, which is even more challenging. To address these technical challenges, the bioinformatics community has used virtually all known network based analysis techniques, including recently developed deep neural networks. My group has been trying the network based integrated analysis of omics data at three different levels. First, we have been investigating on computational methods for associating different genetic and epigenetic events, which can be viewed as methods for defining edges in the network. Second, we have been developing mining subnetworks on the phenotype and time dimensions. Third, we have recently begun to investigate on the use of deep learning techniques for the integrated analysis of omics data. An important goal of our research is to combine network analysis and deep learning techniques to construct models or draw maps of cancer cells at multiple levels such as genomic mutations, gene activation/suppressions, epigenetic events including DNA methylation, histone modifications, and miRNA interference, biological pathways, and finally at the whole cell level including tumor heterogeneity and clonal evolution.","PeriodicalId":73283,"journal":{"name":"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine","volume":"38 1","pages":"6"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Conference on Bioinformatics and Biomedicine workshops. IEEE International Conference on Bioinformatics and Biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2016.7822479","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

These days, genome-wide measurements of genetic and epigenetics events, a.k.a omics data, are routinely produced; epigenetics is control mechanisms of genetics events as epi-means ‘on’ or ‘upon’. As a result, a huge amount of omics data measured from different genetic and epigenetic events are available. For example, the amount of data at The Cancer Genome Atlas(TCGA) alone exceeds 2.5 peta byte as of October 2016. Unfortunately, the dimensions of omics data is huge, typically tens to hundreds or even millions of thousands while the number of samples are limited typically a few to thousands. Thus mining genetic and epigenetic data measured in different phenotype conditions is a very challenging problem, that is, small data sets on extremely high dimensions. Furthermore, all genetic and epigenetic events are inter-related. Thus it is necessary to perform integrated analysis of omics data sets of different types, which is even more challenging. To address these technical challenges, the bioinformatics community has used virtually all known network based analysis techniques, including recently developed deep neural networks. My group has been trying the network based integrated analysis of omics data at three different levels. First, we have been investigating on computational methods for associating different genetic and epigenetic events, which can be viewed as methods for defining edges in the network. Second, we have been developing mining subnetworks on the phenotype and time dimensions. Third, we have recently begun to investigate on the use of deep learning techniques for the integrated analysis of omics data. An important goal of our research is to combine network analysis and deep learning techniques to construct models or draw maps of cancer cells at multiple levels such as genomic mutations, gene activation/suppressions, epigenetic events including DNA methylation, histone modifications, and miRNA interference, biological pathways, and finally at the whole cell level including tumor heterogeneity and clonal evolution.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多组学数据集成分析的网络和模型
如今,基因和表观遗传学事件的全基因组测量,也就是组学数据,已经成为常规;表观遗传学是遗传学事件的控制机制,epi的意思是“上”或“上”。因此,从不同的遗传和表观遗传事件中测量的大量组学数据是可用的。例如,截至2016年10月,仅癌症基因组图谱(TCGA)的数据量就超过了2.5 peta字节。不幸的是,组学数据的维度是巨大的,通常是几十到几百甚至几百万,而样本的数量通常是有限的,通常是几到几千。因此,挖掘在不同表型条件下测量的遗传和表观遗传数据是一个非常具有挑战性的问题,即在极高维度上的小数据集。此外,所有遗传和表观遗传事件都是相互关联的。因此,需要对不同类型的组学数据集进行综合分析,这更具挑战性。为了应对这些技术挑战,生物信息学社区几乎使用了所有已知的基于网络的分析技术,包括最近开发的深度神经网络。我的团队一直在尝试基于网络的三个不同层次的组学数据综合分析。首先,我们研究了关联不同遗传和表观遗传事件的计算方法,这些方法可以看作是定义网络中边缘的方法。其次,我们一直在开发表现型和时间维度的挖掘子网。第三,我们最近开始研究将深度学习技术用于组学数据的综合分析。我们研究的一个重要目标是将网络分析和深度学习技术结合起来,在基因组突变、基因激活/抑制、表观遗传事件(包括DNA甲基化、组蛋白修饰和miRNA干扰)、生物学途径等多个层面构建模型或绘制癌细胞图谱,最终在全细胞水平(包括肿瘤异质性和克隆进化)构建模型或绘制癌细胞图谱。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Diurnal Pain Classification in Critically Ill Patients using Machine Learning on Accelerometry and Analgesic Data. Transmission cluster characteristics of global, regional, and lineage-specific SARS-CoV-2 phylogenies. Document-level DDI relation extraction with document-entity embedding The Network Pharmacological Mechanism of Yizhiningshen Oral Liquid in the Treatment of Tic Disorders Study on the Medication Law of Traditional Chinese medicine treating Lumbago based on TCM electronic medical record
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1