Soyeon Kim, Yidi Qin, Hyun Jung Park, Rebecca I Caldino Bohn, Molin Yue, Zhongli Xu, Erick Forno, Wei Chen, Juan C Celedón
{"title":"MOSES:一种基于甲基化的基因关联方法,用于揭示与性状或疾病相关的环境调控基因。","authors":"Soyeon Kim, Yidi Qin, Hyun Jung Park, Rebecca I Caldino Bohn, Molin Yue, Zhongli Xu, Erick Forno, Wei Chen, Juan C Celedón","doi":"10.1186/s13148-024-01776-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>DNA methylation is a critical regulatory mechanism of gene expression, influencing various human diseases and traits. While traditional expression quantitative trait loci (eQTL) studies have helped elucidate the genetic regulation of gene expression, there is a growing need to explore environmental influences on gene expression. Existing methods such as PrediXcan and FUSION focus on genotype-based associations but overlook the impact of environmental factors. To address this gap, we present MOSES (methylation-based gene association), a novel approach that utilizes DNA methylation to identify environmentally regulated genes associated with traits or diseases without relying on measured gene expression.</p><p><strong>Results: </strong>MOSES involves training, imputation, and association testing. It employs elastic-net penalized regression models to estimate the influence of CpGs and SNPs (if available) on gene expression. We developed and compared four MOSES versions incorporating different methylation and genetic data: (1) cis-DNA methylation within 1 Mb of promoter regions, (2) both cis-SNPs and cis-CpGs, 3) both cis- and a part of trans- CpGs (±5Mb away) from promoter regions), and 4) long-range DNA methylation (±10 Mb away) from promoter regions. Our analysis using nasal epithelium and white blood cell data from the Epigenetic Variation and Childhood Asthma in Puerto Ricans (EVA-PR) study demonstrated that MOSES, particularly the version incorporating long-range CpGs (MOSES-DNAm 10 M), significantly outperformed existing methods like PrediXcan, MethylXcan, and Biomethyl in predicting gene expression. MOSES-DNAm 10 M identified more differentially expressed genes (DEGs) associated with atopic asthma, particularly those involved in immune pathways, highlighting its superior performance in uncovering environmentally regulated genes. Further application of MOSES to lung tissue data from idiopathic pulmonary fibrosis (IPF) patients confirmed its robustness and versatility across different diseases and tissues.</p><p><strong>Conclusion: </strong>MOSES represents an innovative advancement in gene association studies, leveraging DNA methylation to capture the influence of environmental factors on gene expression. By incorporating long-range CpGs, MOSES-DNAm 10 M provides superior predictive accuracy and gene association capabilities compared to traditional genotype-based methods. This novel approach offers valuable insights into the complex interplay between genetics and the environment, enhancing our understanding of disease mechanisms and potentially guiding therapeutic strategies. The user-friendly MOSES R package is publicly available to advance studies in various diseases, including immune-related conditions like asthma.</p>","PeriodicalId":10366,"journal":{"name":"Clinical Epigenetics","volume":"16 1","pages":"161"},"PeriodicalIF":4.8000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574994/pdf/","citationCount":"0","resultStr":"{\"title\":\"MOSES: a methylation-based gene association approach for unveiling environmentally regulated genes linked to a trait or disease.\",\"authors\":\"Soyeon Kim, Yidi Qin, Hyun Jung Park, Rebecca I Caldino Bohn, Molin Yue, Zhongli Xu, Erick Forno, Wei Chen, Juan C Celedón\",\"doi\":\"10.1186/s13148-024-01776-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>DNA methylation is a critical regulatory mechanism of gene expression, influencing various human diseases and traits. While traditional expression quantitative trait loci (eQTL) studies have helped elucidate the genetic regulation of gene expression, there is a growing need to explore environmental influences on gene expression. Existing methods such as PrediXcan and FUSION focus on genotype-based associations but overlook the impact of environmental factors. To address this gap, we present MOSES (methylation-based gene association), a novel approach that utilizes DNA methylation to identify environmentally regulated genes associated with traits or diseases without relying on measured gene expression.</p><p><strong>Results: </strong>MOSES involves training, imputation, and association testing. It employs elastic-net penalized regression models to estimate the influence of CpGs and SNPs (if available) on gene expression. We developed and compared four MOSES versions incorporating different methylation and genetic data: (1) cis-DNA methylation within 1 Mb of promoter regions, (2) both cis-SNPs and cis-CpGs, 3) both cis- and a part of trans- CpGs (±5Mb away) from promoter regions), and 4) long-range DNA methylation (±10 Mb away) from promoter regions. Our analysis using nasal epithelium and white blood cell data from the Epigenetic Variation and Childhood Asthma in Puerto Ricans (EVA-PR) study demonstrated that MOSES, particularly the version incorporating long-range CpGs (MOSES-DNAm 10 M), significantly outperformed existing methods like PrediXcan, MethylXcan, and Biomethyl in predicting gene expression. MOSES-DNAm 10 M identified more differentially expressed genes (DEGs) associated with atopic asthma, particularly those involved in immune pathways, highlighting its superior performance in uncovering environmentally regulated genes. Further application of MOSES to lung tissue data from idiopathic pulmonary fibrosis (IPF) patients confirmed its robustness and versatility across different diseases and tissues.</p><p><strong>Conclusion: </strong>MOSES represents an innovative advancement in gene association studies, leveraging DNA methylation to capture the influence of environmental factors on gene expression. By incorporating long-range CpGs, MOSES-DNAm 10 M provides superior predictive accuracy and gene association capabilities compared to traditional genotype-based methods. This novel approach offers valuable insights into the complex interplay between genetics and the environment, enhancing our understanding of disease mechanisms and potentially guiding therapeutic strategies. The user-friendly MOSES R package is publicly available to advance studies in various diseases, including immune-related conditions like asthma.</p>\",\"PeriodicalId\":10366,\"journal\":{\"name\":\"Clinical Epigenetics\",\"volume\":\"16 1\",\"pages\":\"161\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574994/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Epigenetics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13148-024-01776-x\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epigenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13148-024-01776-x","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
摘要
背景:DNA 甲基化是基因表达的重要调控机制,影响着人类的各种疾病和性状。虽然传统的表达量性状位点(eQTL)研究有助于阐明基因表达的遗传调控,但人们越来越需要探索环境对基因表达的影响。PrediXcan 和 FUSION 等现有方法侧重于基于基因型的关联,但忽略了环境因素的影响。为了弥补这一缺陷,我们提出了 MOSES(基于甲基化的基因关联),这是一种利用 DNA 甲基化来识别与性状或疾病相关的环境调控基因的新方法,而无需依赖测量的基因表达:结果:MOSES 包括训练、估算和关联测试。它采用弹性网惩罚回归模型来估计 CpGs 和 SNPs(如果有的话)对基因表达的影响。我们开发并比较了四种包含不同甲基化和遗传数据的 MOSES 版本:(1)启动子区域 1 Mb 范围内的顺式 DNA 甲基化;(2)顺式 SNP 和顺式 CpGs;(3)顺式 CpGs 和部分反式 CpGs(距离启动子区域 ±5 Mb);(4)距离启动子区域的长程 DNA 甲基化(距离启动子区域 ±10 Mb)。我们利用波多黎各人表观遗传变异和儿童哮喘(EVA-PR)研究中的鼻上皮细胞和白细胞数据进行的分析表明,MOSES,尤其是包含长程 CpGs 的版本(MOSES-DNAm 10 M),在预测基因表达方面明显优于 PrediXcan、MethylXcan 和 Biomethyl 等现有方法。MOSES-DNAm 10 M 发现了更多与特应性哮喘相关的差异表达基因(DEGs),尤其是那些参与免疫通路的基因,这凸显了它在发现环境调控基因方面的卓越性能。MOSES 在特发性肺纤维化(IPF)患者肺组织数据中的进一步应用证实了它在不同疾病和组织中的稳健性和通用性:MOSES代表了基因关联研究的创新进步,它利用DNA甲基化捕捉环境因素对基因表达的影响。通过结合长程 CpGs,MOSES-DNAm 10 M 与传统的基于基因型的方法相比,具有更高的预测准确性和基因关联能力。这种新方法为我们深入了解遗传与环境之间复杂的相互作用提供了宝贵的视角,增强了我们对疾病机制的理解,并有可能为治疗策略提供指导。用户友好的 MOSES R 软件包已公开发布,可用于推进各种疾病的研究,包括哮喘等免疫相关疾病。
MOSES: a methylation-based gene association approach for unveiling environmentally regulated genes linked to a trait or disease.
Background: DNA methylation is a critical regulatory mechanism of gene expression, influencing various human diseases and traits. While traditional expression quantitative trait loci (eQTL) studies have helped elucidate the genetic regulation of gene expression, there is a growing need to explore environmental influences on gene expression. Existing methods such as PrediXcan and FUSION focus on genotype-based associations but overlook the impact of environmental factors. To address this gap, we present MOSES (methylation-based gene association), a novel approach that utilizes DNA methylation to identify environmentally regulated genes associated with traits or diseases without relying on measured gene expression.
Results: MOSES involves training, imputation, and association testing. It employs elastic-net penalized regression models to estimate the influence of CpGs and SNPs (if available) on gene expression. We developed and compared four MOSES versions incorporating different methylation and genetic data: (1) cis-DNA methylation within 1 Mb of promoter regions, (2) both cis-SNPs and cis-CpGs, 3) both cis- and a part of trans- CpGs (±5Mb away) from promoter regions), and 4) long-range DNA methylation (±10 Mb away) from promoter regions. Our analysis using nasal epithelium and white blood cell data from the Epigenetic Variation and Childhood Asthma in Puerto Ricans (EVA-PR) study demonstrated that MOSES, particularly the version incorporating long-range CpGs (MOSES-DNAm 10 M), significantly outperformed existing methods like PrediXcan, MethylXcan, and Biomethyl in predicting gene expression. MOSES-DNAm 10 M identified more differentially expressed genes (DEGs) associated with atopic asthma, particularly those involved in immune pathways, highlighting its superior performance in uncovering environmentally regulated genes. Further application of MOSES to lung tissue data from idiopathic pulmonary fibrosis (IPF) patients confirmed its robustness and versatility across different diseases and tissues.
Conclusion: MOSES represents an innovative advancement in gene association studies, leveraging DNA methylation to capture the influence of environmental factors on gene expression. By incorporating long-range CpGs, MOSES-DNAm 10 M provides superior predictive accuracy and gene association capabilities compared to traditional genotype-based methods. This novel approach offers valuable insights into the complex interplay between genetics and the environment, enhancing our understanding of disease mechanisms and potentially guiding therapeutic strategies. The user-friendly MOSES R package is publicly available to advance studies in various diseases, including immune-related conditions like asthma.
期刊介绍:
Clinical Epigenetics, the official journal of the Clinical Epigenetics Society, is an open access, peer-reviewed journal that encompasses all aspects of epigenetic principles and mechanisms in relation to human disease, diagnosis and therapy. Clinical trials and research in disease model organisms are particularly welcome.