LungGENIE: the lung gene-expression and network imputation engine.

IF 3.7 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY BMC Genomics Pub Date : 2025-03-10 DOI:10.1186/s12864-025-11412-4
Auyon J Ghosh, Liam P Coyne, Sanchit Panda, Aravind A Menon, Matthew Moll, Michael A Archer, Jason Wallen, Frank A Middleton, Craig P Hersh, Stephen J Glatt, Jonathan L Hess
{"title":"LungGENIE: the lung gene-expression and network imputation engine.","authors":"Auyon J Ghosh, Liam P Coyne, Sanchit Panda, Aravind A Menon, Matthew Moll, Michael A Archer, Jason Wallen, Frank A Middleton, Craig P Hersh, Stephen J Glatt, Jonathan L Hess","doi":"10.1186/s12864-025-11412-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Few cohorts have study populations large enough to conduct molecular analysis of ex vivo lung tissue for genomic analyses. Transcriptome imputation is a non-invasive alternative with many potential applications. We present a novel transcriptome-imputation method called the Lung Gene Expression and Network Imputation Engine (LungGENIE) that uses principal components from blood gene-expression levels in a linear regression model to predict lung tissue-specific gene-expression.</p><p><strong>Methods: </strong>We use paired blood and lung RNA sequencing data from the Genotype-Tissue Expression (GTEx) project to train LungGENIE models. We replicate model performance in a unique dataset, where we generated RNA sequencing data from paired lung and blood samples available through the SUNY Upstate Biorepository (SUBR). We further demonstrate proof-of-concept application of LungGENIE models in an independent blood RNA sequencing data from the Genetic Epidemiology of COPD (COPDGene) study.</p><p><strong>Results: </strong>We show that LungGENIE prediction accuracies have higher correlation to measured lung tissue expression compared to existing cis-expression quantitative trait loci-based methods (median Pearson's r = 0.25, IQR 0.19-0.32), with close to half of the reliably predicted transcripts being replicated in the testing dataset. Finally, we demonstrate significant correlation of differential expression results in chronic obstructive pulmonary disease (COPD) from imputed lung tissue gene-expression and differential expression results experimentally determined from lung tissue.</p><p><strong>Conclusion: </strong>Our results demonstrate that LungGENIE provides complementary results to existing expression quantitative trait loci-based methods and outperforms direct blood to lung results across internal cross-validation, external replication, and proof-of-concept in an independent dataset. Taken together, we establish LungGENIE as a tool with many potential applications in the study of lung diseases.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"26 1","pages":"227"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11892309/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-025-11412-4","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Few cohorts have study populations large enough to conduct molecular analysis of ex vivo lung tissue for genomic analyses. Transcriptome imputation is a non-invasive alternative with many potential applications. We present a novel transcriptome-imputation method called the Lung Gene Expression and Network Imputation Engine (LungGENIE) that uses principal components from blood gene-expression levels in a linear regression model to predict lung tissue-specific gene-expression.

Methods: We use paired blood and lung RNA sequencing data from the Genotype-Tissue Expression (GTEx) project to train LungGENIE models. We replicate model performance in a unique dataset, where we generated RNA sequencing data from paired lung and blood samples available through the SUNY Upstate Biorepository (SUBR). We further demonstrate proof-of-concept application of LungGENIE models in an independent blood RNA sequencing data from the Genetic Epidemiology of COPD (COPDGene) study.

Results: We show that LungGENIE prediction accuracies have higher correlation to measured lung tissue expression compared to existing cis-expression quantitative trait loci-based methods (median Pearson's r = 0.25, IQR 0.19-0.32), with close to half of the reliably predicted transcripts being replicated in the testing dataset. Finally, we demonstrate significant correlation of differential expression results in chronic obstructive pulmonary disease (COPD) from imputed lung tissue gene-expression and differential expression results experimentally determined from lung tissue.

Conclusion: Our results demonstrate that LungGENIE provides complementary results to existing expression quantitative trait loci-based methods and outperforms direct blood to lung results across internal cross-validation, external replication, and proof-of-concept in an independent dataset. Taken together, we establish LungGENIE as a tool with many potential applications in the study of lung diseases.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LungGENIE:肺基因表达和网络植入引擎。
背景:很少有研究群体大到足以对离体肺组织进行基因组分析的分子分析。转录组植入是一种具有许多潜在应用的非侵入性替代方法。我们提出了一种新的转录组归算方法,称为肺基因表达和网络归算引擎(LungGENIE),它使用线性回归模型中血液基因表达水平的主成分来预测肺组织特异性基因表达。方法:我们使用来自基因型组织表达(GTEx)项目的配对血液和肺RNA测序数据来训练LungGENIE模型。我们在一个独特的数据集中复制了模型的性能,在这个数据集中,我们从配对的肺和血液样本中生成了RNA测序数据,这些样本可以通过纽约州立大学北部生物储存库(SUBR)获得。我们进一步展示了LungGENIE模型在COPD遗传流行病学(COPDGene)研究中独立血液RNA测序数据的概念验证应用。结果:我们发现,与现有的基于顺式表达的数量性状位点的方法相比,LungGENIE预测准确性与测量的肺组织表达具有更高的相关性(Pearson的中位数r = 0.25, IQR为0.19-0.32),接近一半的可靠预测转录本在测试数据集中被复制。最后,我们证明了慢性阻塞性肺疾病(COPD)中来自肺组织基因表达的差异表达结果与肺组织中实验确定的差异表达结果之间存在显著相关性。结论:我们的研究结果表明,LungGENIE为现有的基于基因座的表达定量性状方法提供了补充结果,并且在内部交叉验证、外部复制和独立数据集中的概念验证中优于直接血液到肺的结果。综上所述,我们建立了LungGENIE作为一种工具,在肺部疾病的研究中具有许多潜在的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Genomics
BMC Genomics 生物-生物工程与应用微生物
CiteScore
7.40
自引率
4.50%
发文量
769
审稿时长
6.4 months
期刊介绍: BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics. BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
期刊最新文献
Comparative analysis of venom genes in the chromosome-level genomes of two closely related cone snails. Genomic structures of rDNA clusters in Eimeria acervulina. Genetic recovery in an alpine climate-sensitive pika species during upward treeline shift. Further insights into the sialome switch of Amblyomma americanum adult females. Dissecting the population structure, diversity and genetic architecture of rust resistance in wild emmer wheat (Triticum turgidum subsp. dicoccoides).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1