R. Zhang, W. Ni, N. Fu, L. Hou, D. Zhang, Y. Zhang
{"title":"DP-LTGAN: Differentially private trajectory publishing via Locally-aware Transformer-based GAN","authors":"R. Zhang, W. Ni, N. Fu, L. Hou, D. Zhang, Y. Zhang","doi":"10.1016/j.future.2024.107686","DOIUrl":null,"url":null,"abstract":"<div><div>Trajectory data has a wide range of applications in various domains but also raises serious privacy concerns. To address these concerns, the integration of deep learning with differential privacy for trajectory publication has gained widespread attention. However, existing solutions are mostly based on temporal neural networks and the Generative Adversarial Networks (GAN) framework, which intrinsically faces the ”forgetting” problem, leading to their failure to capture and simulate the multi-time-scale behavioral patterns of traffic participants, thereby reducing the utility of published trajectories. Moreover, the computational cost and the noise introduced by the widely used differentially private gradient perturbation method are proportional to the model size, which compromises model quality. To address these problems, we propose a Differentially Private Trajectory Publishing method via Locally-aware Transformer-based GAN (DP-LTGAN), achieving high-utility trajectory publishing while providing differential privacy protection. Specifically, our method features a Locally-aware Transformer, whose attention mechanism is refined by incorporating local state encoding and a multi-scale temporal encoding mechanism. This enhancement significantly improves the modeling of both long- and short-term trajectory patterns. Furthermore, a differentially private gradient perturbation method named Common Term Perturbation (CTP) has been developed, which effectively reduces the amount of noise and the computational cost by utilizing a designed local noise addition pattern and an adaptive noise addition mechanism. Extensive experiments on several real trajectory datasets show that our method enhances the utility and efficiency of synthetic trajectories by 57.7% and 46.88%, respectively, significantly outperforming current state-of-the-art approaches.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107686"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X24006502","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Trajectory data has a wide range of applications in various domains but also raises serious privacy concerns. To address these concerns, the integration of deep learning with differential privacy for trajectory publication has gained widespread attention. However, existing solutions are mostly based on temporal neural networks and the Generative Adversarial Networks (GAN) framework, which intrinsically faces the ”forgetting” problem, leading to their failure to capture and simulate the multi-time-scale behavioral patterns of traffic participants, thereby reducing the utility of published trajectories. Moreover, the computational cost and the noise introduced by the widely used differentially private gradient perturbation method are proportional to the model size, which compromises model quality. To address these problems, we propose a Differentially Private Trajectory Publishing method via Locally-aware Transformer-based GAN (DP-LTGAN), achieving high-utility trajectory publishing while providing differential privacy protection. Specifically, our method features a Locally-aware Transformer, whose attention mechanism is refined by incorporating local state encoding and a multi-scale temporal encoding mechanism. This enhancement significantly improves the modeling of both long- and short-term trajectory patterns. Furthermore, a differentially private gradient perturbation method named Common Term Perturbation (CTP) has been developed, which effectively reduces the amount of noise and the computational cost by utilizing a designed local noise addition pattern and an adaptive noise addition mechanism. Extensive experiments on several real trajectory datasets show that our method enhances the utility and efficiency of synthetic trajectories by 57.7% and 46.88%, respectively, significantly outperforming current state-of-the-art approaches.
轨迹数据在各个领域有着广泛的应用,但也引起了严重的隐私问题。为了解决这些问题,深度学习与差分隐私的融合得到了广泛的关注。然而,现有的解决方案大多基于时间神经网络和生成对抗网络(GAN)框架,这些解决方案本质上面临“遗忘”问题,导致它们无法捕获和模拟交通参与者的多时间尺度行为模式,从而降低了已发布轨迹的效用。此外,广泛使用的差分私有梯度摄动法的计算成本和引入的噪声与模型大小成正比,影响了模型质量。为了解决这些问题,我们提出了一种基于本地感知变压器的差分私有轨迹发布方法(DP-LTGAN),在提供差分隐私保护的同时实现高效用轨迹发布。具体来说,我们的方法采用了一个局部感知的变压器,其注意机制通过结合局部状态编码和多尺度时间编码机制来改进。这种增强显著地改善了长期和短期轨迹模式的建模。在此基础上,提出了一种差分私有梯度摄动方法——共项摄动(Common Term perturbation, CTP),该方法利用设计的局部噪声加加模式和自适应噪声加加机制,有效地降低了噪声量和计算量。在几个真实轨迹数据集上的大量实验表明,我们的方法将合成轨迹的效用和效率分别提高了57.7%和46.88%,显著优于目前最先进的方法。
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.