Large memory high performance computing enables comparison across human gut microbiome of patients with autoimmune diseases and healthy subjects

Sitao Wu, Weizhong Li, L. Smarr, K. Nelson, Shibu Yooseph, M. Torralba
{"title":"Large memory high performance computing enables comparison across human gut microbiome of patients with autoimmune diseases and healthy subjects","authors":"Sitao Wu, Weizhong Li, L. Smarr, K. Nelson, Shibu Yooseph, M. Torralba","doi":"10.1145/2484762.2484828","DOIUrl":null,"url":null,"abstract":"Microbial communities that live on the outside and inside of the human body dramatically influence human health and diseases. In recent years, major progress has been made in understanding the human microbiome communities through projects such as the Human Microbiome Project (http://commonfund.nih.gov/hmp/), using next generation sequencing technologies and metagenomic approaches. In this paper, we describe a comparative computational analysis of 183 human gut microbiome sequence datasets, drawn from healthy individuals as well as those with autoimmune diseases. About 2.4 TB of Illumina deep sequencing metagenomic data were analyzed using computational workflows we developed, which run multiple steps of data- and computing-intensive analyses such as mapping, sequence assembly, gene identification, clustering and functional annotations. The analyses were carried out on the Gordon supercomputer at the San Diego Supercomputer Center (SDSC), using ~180,000 core hours and tens of TB storage space. Our analysis reveals the detailed microbial composition, dynamics, and functional profiles of the samples and provides new insight into how to correlate microbial profiles with human health and disease states.","PeriodicalId":426819,"journal":{"name":"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery","volume":"271 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2484762.2484828","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

Microbial communities that live on the outside and inside of the human body dramatically influence human health and diseases. In recent years, major progress has been made in understanding the human microbiome communities through projects such as the Human Microbiome Project (http://commonfund.nih.gov/hmp/), using next generation sequencing technologies and metagenomic approaches. In this paper, we describe a comparative computational analysis of 183 human gut microbiome sequence datasets, drawn from healthy individuals as well as those with autoimmune diseases. About 2.4 TB of Illumina deep sequencing metagenomic data were analyzed using computational workflows we developed, which run multiple steps of data- and computing-intensive analyses such as mapping, sequence assembly, gene identification, clustering and functional annotations. The analyses were carried out on the Gordon supercomputer at the San Diego Supercomputer Center (SDSC), using ~180,000 core hours and tens of TB storage space. Our analysis reveals the detailed microbial composition, dynamics, and functional profiles of the samples and provides new insight into how to correlate microbial profiles with human health and disease states.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
大内存高性能计算实现了自身免疫性疾病患者和健康人肠道微生物组的比较
生活在人体内外的微生物群落极大地影响着人类的健康和疾病。近年来,通过人类微生物组计划(http://commonfund.nih.gov/hmp/)等项目,利用下一代测序技术和宏基因组方法,在了解人类微生物组群落方面取得了重大进展。在本文中,我们描述了183个人类肠道微生物组序列数据集的比较计算分析,这些数据集来自健康个体和自身免疫性疾病患者。使用我们开发的计算工作流程分析了约2.4 TB的Illumina深度测序宏基因组数据,该工作流程运行了多个数据和计算密集型分析步骤,如作图、序列组装、基因鉴定、聚类和功能注释。分析是在圣地亚哥超级计算机中心(SDSC)的戈登超级计算机上进行的,使用了约180,000核小时和数十TB的存储空间。我们的分析揭示了样品的详细微生物组成,动力学和功能概况,并为如何将微生物概况与人类健康和疾病状态相关联提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Optimizing utilization across XSEDE platforms Adaptive latency-aware parallel resource mapping: task graph scheduling onto heterogeneous network topology Optimizing the PCIT algorithm on stampede's Xeon and Xeon Phi processors for faster discovery of biological networks Training, education, and outreach: raising the bar Preliminary experiences with the uintah framework on Intel Xeon Phi and stampede
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1