GAiN: An integrative tool utilizing generative adversarial neural networks for augmented gene expression analysis

IF 6.7 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Patterns Pub Date : 2024-01-08 DOI:10.1016/j.patter.2023.100910
Michael R. Waters, Matthew Inkman, Kay Jayachandran, Roman M. Kowalchuk, Clifford Robinson, Julie K. Schwarz, S. Joshua Swamidass, Obi L. Griffith, Jeffrey J. Szymanski, Jin Zhang
{"title":"GAiN: An integrative tool utilizing generative adversarial neural networks for augmented gene expression analysis","authors":"Michael R. Waters, Matthew Inkman, Kay Jayachandran, Roman M. Kowalchuk, Clifford Robinson, Julie K. Schwarz, S. Joshua Swamidass, Obi L. Griffith, Jeffrey J. Szymanski, Jin Zhang","doi":"10.1016/j.patter.2023.100910","DOIUrl":null,"url":null,"abstract":"<p>Big genomic data and artificial intelligence (AI) are ushering in an era of precision medicine, providing opportunities to study previously under-represented subtypes and rare diseases rather than categorize them as variances. However, clinical researchers face challenges in accessing such novel technologies as well as reliable methods to study small datasets or subcohorts with unique phenotypes. To address this need, we developed an integrative approach, GAiN, to capture patterns of gene expression from small datasets on the basis of an ensemble of generative adversarial networks (GANs) while leveraging big population data. Where conventional biostatistical methods fail, GAiN reliably discovers differentially expressed genes (DEGs) and enriched pathways between two cohorts with limited numbers of samples (n = 10) when benchmarked against a gold standard. GAiN is freely available at GitHub. Thus, GAiN may serve as a crucial tool for gene expression analysis in scenarios with limited samples, as in the context of rare diseases, under-represented populations, or limited investigator resources.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":null,"pages":null},"PeriodicalIF":6.7000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Patterns","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.patter.2023.100910","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Big genomic data and artificial intelligence (AI) are ushering in an era of precision medicine, providing opportunities to study previously under-represented subtypes and rare diseases rather than categorize them as variances. However, clinical researchers face challenges in accessing such novel technologies as well as reliable methods to study small datasets or subcohorts with unique phenotypes. To address this need, we developed an integrative approach, GAiN, to capture patterns of gene expression from small datasets on the basis of an ensemble of generative adversarial networks (GANs) while leveraging big population data. Where conventional biostatistical methods fail, GAiN reliably discovers differentially expressed genes (DEGs) and enriched pathways between two cohorts with limited numbers of samples (n = 10) when benchmarked against a gold standard. GAiN is freely available at GitHub. Thus, GAiN may serve as a crucial tool for gene expression analysis in scenarios with limited samples, as in the context of rare diseases, under-represented populations, or limited investigator resources.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GAiN:利用生成对抗神经网络进行增强基因表达分析的综合工具
大基因组数据和人工智能(AI)正在开创一个精准医疗时代,为研究以前代表性不足的亚型和罕见疾病提供了机会,而不是将其归类为变异。然而,临床研究人员在获取此类新技术以及研究具有独特表型的小数据集或亚群的可靠方法方面面临挑战。为了满足这一需求,我们开发了一种综合方法--GAiN,在生成对抗网络(GAN)集合的基础上捕捉小数据集中的基因表达模式,同时利用大群体数据。在传统生物统计方法失效的情况下,GAiN 以黄金标准为基准,可靠地发现了样本数量有限(n = 10)的两个队列之间的差异表达基因(DEGs)和富集通路。GAiN 可在 GitHub 上免费获取。因此,在样本有限的情况下,如罕见疾病、代表性不足的人群或研究者资源有限的情况下,GAiN 可作为基因表达分析的重要工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Patterns
Patterns Decision Sciences-Decision Sciences (all)
CiteScore
10.60
自引率
4.60%
发文量
153
审稿时长
19 weeks
期刊介绍:
期刊最新文献
AnnoMate: Exploring and annotating integrated molecular data through custom interactive visualizations Balancing innovation and integrity in peer review The stacking cell puzzle To democratize research with sensitive data, we should make synthetic data more accessible FAIM: Fairness-aware interpretable modeling for trustworthy machine learning in healthcare
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1