Truong Giang Vu, Nursultan Makhanov, Nguyen Anh Tu, Kok-Seng Wong
{"title":"On the Trade-off Between Privacy Protection and Data Utility for Chest X-ray Images","authors":"Truong Giang Vu, Nursultan Makhanov, Nguyen Anh Tu, Kok-Seng Wong","doi":"10.1109/ATC55345.2022.9942970","DOIUrl":null,"url":null,"abstract":"The rising advancement in deep learning (DL) techniques has enabled machine learning (ML) models to assist practitioners in performing medical tasks with high accuracy. However, it also poses privacy concerns regarding how such models will proceed with medical data containing protected patient health information. Therefore, some efforts have been made to anonymize medical data to preserve data privacy while keeping the model performance high enough to avoid wrong decisions in the medical field. Nevertheless, the adversary can develop an ML model to re-identify a patient's identity by matching an arbitrary chest X-ray image with a public or leaked image dataset with high accuracy. This paper aims to find a trade-off between our privacy protection method and data utility for medical images. Specifically, we propose a solution to anonymize chest X-ray images by directly adding noise to the images to prevent verification attacks and evaluate how well those images can maintain good performance in the lung disease classification task. Simulation results on real-world datasets show that the proposed solution achieved a good trade-off between privacy protection and data utility.","PeriodicalId":135827,"journal":{"name":"2022 International Conference on Advanced Technologies for Communications (ATC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advanced Technologies for Communications (ATC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATC55345.2022.9942970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The rising advancement in deep learning (DL) techniques has enabled machine learning (ML) models to assist practitioners in performing medical tasks with high accuracy. However, it also poses privacy concerns regarding how such models will proceed with medical data containing protected patient health information. Therefore, some efforts have been made to anonymize medical data to preserve data privacy while keeping the model performance high enough to avoid wrong decisions in the medical field. Nevertheless, the adversary can develop an ML model to re-identify a patient's identity by matching an arbitrary chest X-ray image with a public or leaked image dataset with high accuracy. This paper aims to find a trade-off between our privacy protection method and data utility for medical images. Specifically, we propose a solution to anonymize chest X-ray images by directly adding noise to the images to prevent verification attacks and evaluate how well those images can maintain good performance in the lung disease classification task. Simulation results on real-world datasets show that the proposed solution achieved a good trade-off between privacy protection and data utility.