PICOTEES: a privacy-preserving online service of phenotype exploration for genetic-diagnostic variants from Chinese children cohorts.

Xinran Dong, Yulan Lu, Lanting Guo, Chuan Li, Qi Ni, Bingbing Wu, Huijun Wang, Lin Yang, Songyang Wu, Qi Sun, Hao Zheng, Wenhao Zhou, Shuang Wang
Journal of genetics and genomics = Yi chuan xue bao, pages 243-251. Journal Article. Published 2024-02-01 (Epub 2023-09-13). DOI: 10.1016/j.jgg.2023.09.003 (https://doi.org/10.1016/j.jgg.2023.09.003)

Abstract

The growth in biomedical data resources has raised potential privacy concerns and risks of genetic information leakage. For instance, exome sequencing aids clinical decisions by comparing data through web services, but it requires significant trust between users and providers. To alleviate privacy concerns, the most commonly used strategy is to anonymize sensitive data. Unfortunately, studies have shown that anonymization is insufficient to protect against reidentification attacks. Recently, privacy-preserving technologies have been applied to preserve application utility while protecting the privacy of biomedical data. We present the PICOTEES framework, a privacy-preserving online service of phenotype exploration for genetic-diagnostic variants (https://birthdefectlab.cn:3000/). PICOTEES enables privacy-preserving queries of the phenotype spectrum for a single variant by utilizing trusted execution environment technology, which can protect the privacy of the user's query information, backend models, and data, as well as the final results. We demonstrate the utility and performance of PICOTEES by exploring a bioinformatics dataset. The dataset comes from a cohort of 20,909 patients who underwent genetic testing at the Children's Hospital of Fudan University in China, contains 3,152,508 variants, and is dominated by the Chinese Han population (>99.9%). Our query results reveal a large number of previously unreported diagnostic variants, along with variants of previously reported pathogenicity.
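
As a rough illustration of what a phenotype-spectrum query for a single variant might compute, the following is a minimal sketch, not the PICOTEES implementation. The record layout, the names Patient, phenotype_spectrum, and TOY_COHORT, and all toy records are assumptions made up for this example. The trusted execution environment itself (attestation, encrypted query and result transport) is not modelled; in a service like the one described, logic of this kind would run inside the enclave.

```python
from collections import Counter
from typing import Dict, List, NamedTuple, Tuple

# A variant is keyed as (chromosome, position, reference allele, alternate allele).
# This key format is an assumption for the sketch, not the service's actual schema.
Variant = Tuple[str, int, str, str]


class Patient(NamedTuple):
    """One hypothetical cohort record: variants carried and observed phenotype terms."""
    variants: frozenset
    phenotypes: Tuple[str, ...]


def phenotype_spectrum(cohort: List[Patient], query: Variant) -> Dict[str, float]:
    """For each phenotype term, return the fraction of carriers of `query` showing it."""
    carriers = [p for p in cohort if query in p.variants]
    if not carriers:
        return {}
    # Count each term at most once per carrier.
    counts = Counter(term for p in carriers for term in set(p.phenotypes))
    return {term: n / len(carriers) for term, n in counts.items()}


# Made-up toy records for illustration only (not taken from the PICOTEES cohort).
V1: Variant = ("chr1", 12345, "A", "G")
V2: Variant = ("chr2", 67890, "C", "T")
TOY_COHORT = [
    Patient(frozenset({V1}), ("seizure", "hypotonia")),
    Patient(frozenset({V1, V2}), ("seizure",)),
    Patient(frozenset({V2}), ("hearing loss",)),
]

spectrum = phenotype_spectrum(TOY_COHORT, V1)
print(spectrum)
```

The sketch returns carrier fractions rather than raw counts so that spectra for variants with different carrier numbers can be compared directly; a variant absent from the cohort yields an empty result.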
