Multimodal learning for mapping genotype–phenotype dynamics

IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Nature computational science Pub Date : 2025-01-28 DOI:10.1038/s43588-024-00765-7
Farhan Khodaee, Rohola Zandie, Elazer R. Edelman
{"title":"Multimodal learning for mapping genotype–phenotype dynamics","authors":"Farhan Khodaee, Rohola Zandie, Elazer R. Edelman","doi":"10.1038/s43588-024-00765-7","DOIUrl":null,"url":null,"abstract":"How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype–phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, uncovered potential cross-tissue biomarkers and provided contextualized embeddings to investigate the polyfunctionality of genes shown for the von Willebrand factor (VWF) gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity. A multimodal computational framework is proposed to integrate single-cell RNA sequencing data with phenotypic information to map complex genotype–phenotype relationships. This approach helps to refine cellular heterogeneity analysis, identify cross-tissue biomarkers and reveal polyfunctional characteristics of genes with cellular resolution.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 4","pages":"333-344"},"PeriodicalIF":18.3000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature computational science","FirstCategoryId":"1085","ListUrlMain":"https://www.nature.com/articles/s43588-024-00765-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype–phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, uncovered potential cross-tissue biomarkers and provided contextualized embeddings to investigate the polyfunctionality of genes shown for the von Willebrand factor (VWF) gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity. A multimodal computational framework is proposed to integrate single-cell RNA sequencing data with phenotypic information to map complex genotype–phenotype relationships. This approach helps to refine cellular heterogeneity analysis, identify cross-tissue biomarkers and reveal polyfunctional characteristics of genes with cellular resolution.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
绘制基因型-表型动力学的多模态学习。
复杂的表型是如何从复杂的基因表达模式中产生的,这是生物学中的一个基本问题。整合高含量的基因分型方法,如单细胞RNA测序和先进的学习方法,如语言模型,为剖析这种复杂的关系提供了机会。在这里,我们提出了一个计算集成遗传学框架,旨在同时分析和解释基因型及其相关表型的高维景观。我们应用这种方法开发了一个多模态基础模型,在细胞水平上探索人类转录组学的基因型-表型关系。分析这种联合歧管显示了细胞异质性的精细分辨率,揭示了潜在的跨组织生物标志物,并提供了上下文嵌入来研究内皮细胞中血管性血液病因子(VWF)基因的多功能性。总的来说,这项研究促进了我们对基因表达和表型表现之间动态相互作用的理解,并展示了综合遗传学在揭示细胞功能和复杂性的新维度方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.70
自引率
0.00%
发文量
0
期刊最新文献
BrainParc: unified lifespan brain parcellation from structural magnetic resonance images. De novo design of functional nucleic acids of aptamers. Choreographing molecular design with TANGO. High-throughput transition-state searches in zeolite nanopores. A meaningful map of the underexplored electrolyte universe.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1