Coalitional Game Theory Facilitates Identification of Non-Coding Variants Associated With Autism.

Min Woo Sun, Anika Gupta, Maya Varma, Kelley M Paskov, Jae-Yoon Jung, Nate T Stockham, Dennis P Wall
{"title":"Coalitional Game Theory Facilitates Identification of Non-Coding Variants Associated With Autism.","authors":"Min Woo Sun,&nbsp;Anika Gupta,&nbsp;Maya Varma,&nbsp;Kelley M Paskov,&nbsp;Jae-Yoon Jung,&nbsp;Nate T Stockham,&nbsp;Dennis P Wall","doi":"10.1177/1178222619832859","DOIUrl":null,"url":null,"abstract":"<p><p>Studies on autism spectrum disorder (ASD) have amassed substantial evidence for the role of genetics in the disease's phenotypic manifestation. A large number of coding and non-coding variants with low penetrance likely act in a combinatorial manner to explain the variable forms of ASD. However, many of these combined interactions, both additive and epistatic, remain undefined. Coalitional game theory (CGT) is an approach that seeks to identify players (individual genetic variants or genes) who tend to improve the performance-association to a disease phenotype of interest-of any coalition (subset of co-occurring genetic variants) they join. This method has been previously applied to boost biologically informative signal from gene expression data and exome sequencing data but remains to be explored in the context of cooperativity among non-coding genomic regions. We describe our extension of previous work, highlighting non-coding chromosomal regions relevant to ASD using CGT on alteration data of 4595 fully sequenced genomes from 756 multiplex families. Genomes were encoded into binary matrices for three types of non-coding regions previously implicated in ASD and separated into ASD (case) and unaffected (control) samples. A player metric, the Shapley value, enabled determination of individual variant contributions in both sets of cohorts. A total of 30 non-coding positions were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Cross-study analyses revealed that a subset of mutated non-coding regions (all of which are in human accelerated regions (HARs)) and related genes are involved in biological pathways or behavioral outcomes known to be affected in autism, suggesting the importance of single nucleotide polymorphisms (SNPs) within HARs in ASD. These findings support the use of CGT in identifying hidden yet influential non-coding players from large-scale genomic data, to better understand the precise underpinnings of complex neurodevelopmental disorders such as autism.</p>","PeriodicalId":88397,"journal":{"name":"Biomedical informatics insights","volume":"11 ","pages":"1178222619832859"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1178222619832859","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical informatics insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/1178222619832859","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Studies on autism spectrum disorder (ASD) have amassed substantial evidence for the role of genetics in the disease's phenotypic manifestation. A large number of coding and non-coding variants with low penetrance likely act in a combinatorial manner to explain the variable forms of ASD. However, many of these combined interactions, both additive and epistatic, remain undefined. Coalitional game theory (CGT) is an approach that seeks to identify players (individual genetic variants or genes) who tend to improve the performance-association to a disease phenotype of interest-of any coalition (subset of co-occurring genetic variants) they join. This method has been previously applied to boost biologically informative signal from gene expression data and exome sequencing data but remains to be explored in the context of cooperativity among non-coding genomic regions. We describe our extension of previous work, highlighting non-coding chromosomal regions relevant to ASD using CGT on alteration data of 4595 fully sequenced genomes from 756 multiplex families. Genomes were encoded into binary matrices for three types of non-coding regions previously implicated in ASD and separated into ASD (case) and unaffected (control) samples. A player metric, the Shapley value, enabled determination of individual variant contributions in both sets of cohorts. A total of 30 non-coding positions were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Cross-study analyses revealed that a subset of mutated non-coding regions (all of which are in human accelerated regions (HARs)) and related genes are involved in biological pathways or behavioral outcomes known to be affected in autism, suggesting the importance of single nucleotide polymorphisms (SNPs) within HARs in ASD. These findings support the use of CGT in identifying hidden yet influential non-coding players from large-scale genomic data, to better understand the precise underpinnings of complex neurodevelopmental disorders such as autism.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
联合博弈论有助于识别与自闭症相关的非编码变体。
对自闭症谱系障碍(ASD)的研究为遗传学在该疾病表型表现中的作用积累了大量证据。大量外显率低的编码和非编码变体可能以组合的方式解释ASD的可变形式。然而,这些组合相互作用中的许多,包括加性和上位性,仍然没有定义。联盟博弈论(CGT)是一种试图识别参与者(个体遗传变异或基因)的方法,他们倾向于改善与他们加入的任何联盟(共同发生的遗传变异的子集)感兴趣的疾病表型的表现关联。该方法先前已被应用于增强来自基因表达数据和外显子组测序数据的生物信息信号,但在非编码基因组区域之间的协同性方面仍有待探索。我们描述了我们对先前工作的扩展,使用CGT对来自756个多重家族的4595个全测序基因组的改变数据强调了与ASD相关的非编码染色体区域。将基因组编码到先前与ASD有关的三种类型的非编码区的二进制矩阵中,并将其分为ASD(病例)和未受影响(对照)样本。一个参与者指标,沙普利值,能够确定两组队列中的个人变量贡献。共发现30个非编码位置的球员得分显著提高,可能对ASD的遗传协调有重要贡献。交叉研究分析显示,一组突变的非编码区(均位于人类加速区(HARs))和相关基因参与了已知受自闭症影响的生物学途径或行为结果,这表明HARs中单核苷酸多态性(SNPs)在ASD中的重要性。这些发现支持使用CGT从大规模基因组数据中识别隐藏但有影响力的非编码参与者,以更好地了解复杂神经发育障碍(如自闭症)的确切基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Data-Driven Approach to Predicting Septic Shock in the Intensive Care Unit A Genome Model to Explain Major Features of Neurodevelopmental Disorders in Newborns. Mathematical Model for Computer-Assisted Modification of Medication Dosing Rules. Applying Supervised Machine Learning to Identify Which Patient Characteristics Identify the Highest Rates of Mortality Post-Interhospital Transfer. Coalitional Game Theory Facilitates Identification of Non-Coding Variants Associated With Autism.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1