Conor R. Walker, Xiaoting Li, Manav Chakravarthy, William Lounsbery-Scaife, Yoolim A. Choi, Ritambhara Singh, Gamze Gürsoy
Cell, published 2024-10-02. DOI: 10.1016/j.cell.2024.09.012 (https://doi.org/10.1016/j.cell.2024.09.012)
Private information leakage from single-cell count matrices
The increase in publicly available human single-cell datasets, encompassing millions of cells from many donors, has significantly enhanced our understanding of complex biological processes. However, the accessibility of these datasets raises significant privacy concerns. Due to the inherent noise in single-cell measurements and the scarcity of population-scale single-cell datasets, recent private information quantification studies have focused on bulk gene expression data sharing. To address this gap, we demonstrate that individuals in single-cell gene expression datasets are vulnerable to linking attacks, where attackers can infer their sensitive phenotypic information using publicly available tissue or cell-type-specific expression quantitative trait loci (eQTLs) information. We further develop a method for genotype prediction and genotype-phenotype linking that remains effective without relying on eQTL information. We show that variants from one study can be exploited to uncover private information about individuals in another study.
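To make the idea of an eQTL-based linking attack concrete, here is a toy sketch in Python. It is not the authors' method: the thresholding rule, the distance measure, and every name and number (`GENE_A`, `snp1`, `donor_1`, `person_X`, the counts) are hypothetical and chosen only for illustration. It assumes an attacker who averages each donor's cells into a pseudobulk profile, guesses alt-allele dosages from public eQTL directions, and then matches those guesses against a separate genotype panel.

```python
from statistics import mean, pstdev


def pseudobulk(cells):
    """Average per-cell counts into one expression value per gene."""
    return {gene: mean(cell[gene] for cell in cells) for gene in cells[0]}


def predict_dosages(expr_by_donor, eqtls):
    """Guess an alt-allele dosage (0, 1 or 2) per eQTL SNP.

    Assumption: expression is z-scored across donors and cut at +/-0.5,
    with the sign flipped when the alt allele lowers expression.
    """
    predicted = {donor: {} for donor in expr_by_donor}
    for gene, snp, direction in eqtls:
        values = [expr[gene] for expr in expr_by_donor.values()]
        mu = mean(values)
        sd = pstdev(values) or 1.0  # avoid division by zero
        for donor, expr in expr_by_donor.items():
            z = direction * (expr[gene] - mu) / sd
            predicted[donor][snp] = 0 if z < -0.5 else 2 if z > 0.5 else 1
    return predicted


def link(predicted, panel):
    """Match each donor to the panel individual with the closest genotype."""
    links = {}
    for donor, dosages in predicted.items():
        links[donor] = min(
            panel,
            key=lambda person: sum(
                abs(dosages[snp] - panel[person][snp]) for snp in dosages
            ),
        )
    return links


# Hypothetical public eQTLs: (gene, SNP, effect direction of the alt allele).
eqtls = [("GENE_A", "snp1", +1), ("GENE_B", "snp2", -1), ("GENE_C", "snp3", +1)]

# Hypothetical single-cell counts: two cells per donor.
cells_by_donor = {
    "donor_1": [{"GENE_A": 0, "GENE_B": 9, "GENE_C": 4},
                {"GENE_A": 2, "GENE_B": 11, "GENE_C": 6}],
    "donor_2": [{"GENE_A": 4, "GENE_B": 6, "GENE_C": 0},
                {"GENE_A": 6, "GENE_B": 4, "GENE_C": 2}],
    "donor_3": [{"GENE_A": 8, "GENE_B": 0, "GENE_C": 8},
                {"GENE_A": 10, "GENE_B": 2, "GENE_C": 10}],
}

# Hypothetical genotype panel held by the attacker (dosages per SNP).
panel = {
    "person_X": {"snp1": 2, "snp2": 2, "snp3": 2},
    "person_Y": {"snp1": 0, "snp2": 0, "snp3": 2},  # one SNP differs from the guess
    "person_Z": {"snp1": 1, "snp2": 1, "snp3": 0},
}

expr_by_donor = {d: pseudobulk(cells) for d, cells in cells_by_donor.items()}
predicted = predict_dosages(expr_by_donor, eqtls)
links = link(predicted, panel)
print(links)
```

In this toy setting each donor is matched to a single panel individual even though one predicted dosage is wrong, which illustrates why a link can survive noisy genotype predictions. Real single-cell data would involve far more variants, sparse counts, and statistical calibration that this sketch omits.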
About the journal:
Cells is an international, peer-reviewed, open access journal that focuses on cell biology, molecular biology, and biophysics. It is affiliated with several societies, including the Spanish Society for Biochemistry and Molecular Biology (SEBBM), Nordic Autophagy Society (NAS), Spanish Society of Hematology and Hemotherapy (SEHH), and Society for Regenerative Medicine (Russian Federation) (RPO).
The journal publishes research findings of significant importance in various areas of experimental biology, such as cell biology, molecular biology, neuroscience, immunology, virology, microbiology, cancer, human genetics, systems biology, signaling, and disease mechanisms and therapeutics. The primary criterion for considering papers is whether the results contribute to significant conceptual advances or raise thought-provoking questions and hypotheses related to interesting and important biological inquiries.
In addition to primary research articles presented in four formats, the journal also features review and opinion articles in its "Leading Edge" section, which discusses recent research advances and topics of interest to its wide readership.