Privacy enhancing and generalizable deep learning with synthetic data for mediastinal neoplasm diagnosis

IF 12.4 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES NPJ Digital Medicine Pub Date : 2024-10-20 DOI:10.1038/s41746-024-01290-7

Zhanping Zhou, Yuchen Guo, Ruijie Tang, Hengrui Liang, Jianxing He, Feng Xu

{"title":"Privacy enhancing and generalizable deep learning with synthetic data for mediastinal neoplasm diagnosis","authors":"Zhanping Zhou, Yuchen Guo, Ruijie Tang, Hengrui Liang, Jianxing He, Feng Xu","doi":"10.1038/s41746-024-01290-7","DOIUrl":null,"url":null,"abstract":"The success of deep learning (DL) relies heavily on training data from which DL models encapsulate information. Consequently, the development and deployment of DL models expose data to potential privacy breaches, which are particularly critical in data-sensitive contexts like medicine. We propose a new technique named DiffGuard that generates realistic and diverse synthetic medical images with annotations, even indistinguishable for experts, to replace real data for DL model training, which cuts off their direct connection and enhances privacy safety. We demonstrate that DiffGuard enhances privacy safety with much less data leakage and better resistance against privacy attacks on data and model. It also improves the accuracy and generalizability of DL models for segmentation and classification of mediastinal neoplasms in multi-center evaluation. We expect that our solution would enlighten the road to privacy-preserving DL for precision medicine, promote data and model sharing, and inspire more innovation on artificial-intelligence-generated-content technologies for medicine.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":" ","pages":"1-15"},"PeriodicalIF":12.4000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41746-024-01290-7.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Digital Medicine","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41746-024-01290-7","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

The success of deep learning (DL) relies heavily on training data from which DL models encapsulate information. Consequently, the development and deployment of DL models expose data to potential privacy breaches, which are particularly critical in data-sensitive contexts like medicine. We propose a new technique named DiffGuard that generates realistic and diverse synthetic medical images with annotations, even indistinguishable for experts, to replace real data for DL model training, which cuts off their direct connection and enhances privacy safety. We demonstrate that DiffGuard enhances privacy safety with much less data leakage and better resistance against privacy attacks on data and model. It also improves the accuracy and generalizability of DL models for segmentation and classification of mediastinal neoplasms in multi-center evaluation. We expect that our solution would enlighten the road to privacy-preserving DL for precision medicine, promote data and model sharing, and inspire more innovation on artificial-intelligence-generated-content technologies for medicine.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用合成数据进行纵隔肿瘤诊断的隐私增强型可泛化深度学习

深度学习（DL）的成功在很大程度上依赖于训练数据，DL 模型从中封装信息。因此，深度学习模型的开发和部署会使数据面临潜在的隐私泄露风险，这在医学等对数据敏感的环境中尤为重要。我们提出了一种名为 "DiffGuard "的新技术，它能生成真实、多样且带有注释的合成医学图像，专家甚至无法分辨这些图像，从而取代真实数据用于 DL 模型的训练，切断它们之间的直接联系，提高隐私安全性。我们证明了 DiffGuard 能增强隐私安全性，减少数据泄露，更好地抵御对数据和模型的隐私攻击。在多中心评估中，它还提高了用于纵隔肿瘤分割和分类的 DL 模型的准确性和通用性。我们期待我们的解决方案能为精准医疗的隐私保护DL之路带来启迪，促进数据和模型共享，并激发更多人工智能生成内容技术在医疗领域的创新。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

NPJ Digital Medicine Multiple-

CiteScore

25.10

自引率

3.30%

发文量

170

审稿时长

15 weeks

期刊介绍： npj Digital Medicine is an online open-access journal that focuses on publishing peer-reviewed research in the field of digital medicine. The journal covers various aspects of digital medicine, including the application and implementation of digital and mobile technologies in clinical settings, virtual healthcare, and the use of artificial intelligence and informatics. The primary goal of the journal is to support innovation and the advancement of healthcare through the integration of new digital and mobile technologies. When determining if a manuscript is suitable for publication, the journal considers four important criteria: novelty, clinical relevance, scientific rigor, and digital innovation.