Graham Heimberg, Tony Kuo, Daryle J. DePianto, Omar Salem, Tobias Heigl, Nathaniel Diamant, Gabriele Scalia, Tommaso Biancalani, Shannon J. Turley, Jason R. Rock, Héctor Corrada Bravo, Josh Kaminker, Jason A. Vander Heiden, Aviv Regev
{"title":"A cell atlas foundation model for scalable search of similar human cells","authors":"Graham Heimberg, Tony Kuo, Daryle J. DePianto, Omar Salem, Tobias Heigl, Nathaniel Diamant, Gabriele Scalia, Tommaso Biancalani, Shannon J. Turley, Jason R. Rock, Héctor Corrada Bravo, Josh Kaminker, Jason A. Vander Heiden, Aviv Regev","doi":"10.1038/s41586-024-08411-y","DOIUrl":null,"url":null,"abstract":"<p>Single-cell RNA-seq (scRNA-seq) has profiled hundreds of millions of human cells across organs, diseases, development, and perturbations to date. Mining these growing atlases could reveal cell-disease associations, discover cell states in unexpected tissue contexts, and relate <i>in vivo</i> biology to <i>in vitro</i> models. These require a common measure of cell similarity across the body and an efficient way to search. Here, we develop SCimilarity, a metric learning framework to learn a unified and interpretable representation that enables rapid queries of tens of millions of cell profiles from diverse studies for cells that are transcriptionally similar to an input cell profile or state. We use SCimilarity to query a 23.4 million cell atlas of 412 scRNA-seq studies for macrophage and fibroblast profiles from interstitial lung disease<sup>1</sup> and reveal similar cell profiles across other fibrotic diseases and tissues. The top scoring <i>in vitro</i> hit for the macrophage query was a 3D hydrogel system<sup>2</sup>, which we experimentally demonstrated reproduces this cell state. SCimilarity serves as a foundation model for single-cell profiles that enables researchers to query for similar cellular states across the human body, providing a powerful tool for generating biological insights from the Human Cell Atlas.</p>","PeriodicalId":18787,"journal":{"name":"Nature","volume":"19 1","pages":""},"PeriodicalIF":50.5000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41586-024-08411-y","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell RNA-seq (scRNA-seq) has profiled hundreds of millions of human cells across organs, diseases, development, and perturbations to date. Mining these growing atlases could reveal cell-disease associations, discover cell states in unexpected tissue contexts, and relate in vivo biology to in vitro models. These require a common measure of cell similarity across the body and an efficient way to search. Here, we develop SCimilarity, a metric learning framework to learn a unified and interpretable representation that enables rapid queries of tens of millions of cell profiles from diverse studies for cells that are transcriptionally similar to an input cell profile or state. We use SCimilarity to query a 23.4 million cell atlas of 412 scRNA-seq studies for macrophage and fibroblast profiles from interstitial lung disease1 and reveal similar cell profiles across other fibrotic diseases and tissues. The top scoring in vitro hit for the macrophage query was a 3D hydrogel system2, which we experimentally demonstrated reproduces this cell state. SCimilarity serves as a foundation model for single-cell profiles that enables researchers to query for similar cellular states across the human body, providing a powerful tool for generating biological insights from the Human Cell Atlas.
期刊介绍:
Nature is a prestigious international journal that publishes peer-reviewed research in various scientific and technological fields. The selection of articles is based on criteria such as originality, importance, interdisciplinary relevance, timeliness, accessibility, elegance, and surprising conclusions. In addition to showcasing significant scientific advances, Nature delivers rapid, authoritative, insightful news, and interpretation of current and upcoming trends impacting science, scientists, and the broader public. The journal serves a dual purpose: firstly, to promptly share noteworthy scientific advances and foster discussions among scientists, and secondly, to ensure the swift dissemination of scientific results globally, emphasizing their significance for knowledge, culture, and daily life.