CATS: Conditional Adversarial Trajectory Synthesis for privacy-preserving trajectory data publication using deep learning approaches

IF 4.3 1区 地球科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Geographical Information Science Pub Date : 2023-10-09 DOI:10.1080/13658816.2023.2262550
Jinmeng Rao, Song Gao, Sijia Zhu
{"title":"CATS: Conditional Adversarial Trajectory Synthesis for privacy-preserving trajectory data publication using deep learning approaches","authors":"Jinmeng Rao, Song Gao, Sijia Zhu","doi":"10.1080/13658816.2023.2262550","DOIUrl":null,"url":null,"abstract":"AbstractThe prevalence of ubiquitous location-aware devices and mobile Internet enables us to collect massive individual-level trajectory dataset from users. Such trajectory big data bring new opportunities to human mobility research but also raise public concerns with regard to location privacy. In this work, we present the Conditional Adversarial Trajectory Synthesis (CATS), a deep-learning-based GeoAI methodological framework for privacy-preserving trajectory data generation and publication. CATS applies K-anonymity to the underlying spatiotemporal distributions of human movements, which provides a distributional-level strong privacy guarantee. By leveraging conditional adversarial training on K-anonymized human mobility matrices, trajectory global context learning using the attention-based mechanism, and recurrent bipartite graph matching of adjacent trajectory points, CATS is able to reconstruct trajectory topology from conditionally sampled locations and generate high-quality individual-level synthetic trajectory data, which can serve as supplements or alternatives to raw data for privacy-preserving trajectory data publication. The experiment results on over 90k GPS trajectories show that our method has a better performance in privacy preservation, spatiotemporal characteristic preservation, and downstream utility compared with baseline methods, which brings new insights into privacy-preserving human mobility research using generative AI techniques and explores data ethics issues in GIScience.Keywords: Geoprivacygenerative adversarial networkhuman mobilityGeoAIsynthetic data generation AcknowledgmentThe authors acknowledge the funding support provided by the American Family Insurance Data Science Institute Funding Initiative at the University of Wisconsin-Madison. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funder(s).Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available at the following link on figshare: https://doi.org/10.6084/m9.figshare.20760970. It is worth noting that due to the non-disclosure agreement with the data provider, we are not releasing the original individual-level GPS trajectory data but sharing the k-anonymized aggregated human mobility data used in our experiments.Additional informationNotes on contributorsJinmeng RaoJinmeng Rao is a research scientist at Mineral Earth Sciences. He received his PhD degree from the Department of Geography, University of Wisconsin-Madison. His research interests include GeoAI, Privacy-Preserving AI, and Location Privacy.Song GaoSong Gao is an associate professor in GIScience at the Department of Geography, University of Wisconsin-Madison. He holds a PhD in Geography at the University of California, Santa Barbara. His main research interests include place-based GIS, geospatial data science and GeoAI approaches to human mobility and social sensing.Sijia ZhuSijia Zhu is a Master student in Data Science at Columbia University. She received her bachelor degrees in Statistics and Economics from the University of Wisconsin-Madison.","PeriodicalId":14162,"journal":{"name":"International Journal of Geographical Information Science","volume":"46 1","pages":"0"},"PeriodicalIF":4.3000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Geographical Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/13658816.2023.2262550","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 2

Abstract

AbstractThe prevalence of ubiquitous location-aware devices and mobile Internet enables us to collect massive individual-level trajectory dataset from users. Such trajectory big data bring new opportunities to human mobility research but also raise public concerns with regard to location privacy. In this work, we present the Conditional Adversarial Trajectory Synthesis (CATS), a deep-learning-based GeoAI methodological framework for privacy-preserving trajectory data generation and publication. CATS applies K-anonymity to the underlying spatiotemporal distributions of human movements, which provides a distributional-level strong privacy guarantee. By leveraging conditional adversarial training on K-anonymized human mobility matrices, trajectory global context learning using the attention-based mechanism, and recurrent bipartite graph matching of adjacent trajectory points, CATS is able to reconstruct trajectory topology from conditionally sampled locations and generate high-quality individual-level synthetic trajectory data, which can serve as supplements or alternatives to raw data for privacy-preserving trajectory data publication. The experiment results on over 90k GPS trajectories show that our method has a better performance in privacy preservation, spatiotemporal characteristic preservation, and downstream utility compared with baseline methods, which brings new insights into privacy-preserving human mobility research using generative AI techniques and explores data ethics issues in GIScience.Keywords: Geoprivacygenerative adversarial networkhuman mobilityGeoAIsynthetic data generation AcknowledgmentThe authors acknowledge the funding support provided by the American Family Insurance Data Science Institute Funding Initiative at the University of Wisconsin-Madison. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funder(s).Disclosure statementNo potential conflict of interest was reported by the author(s).Data and codes availability statementThe data and codes that support the findings of this study are available at the following link on figshare: https://doi.org/10.6084/m9.figshare.20760970. It is worth noting that due to the non-disclosure agreement with the data provider, we are not releasing the original individual-level GPS trajectory data but sharing the k-anonymized aggregated human mobility data used in our experiments.Additional informationNotes on contributorsJinmeng RaoJinmeng Rao is a research scientist at Mineral Earth Sciences. He received his PhD degree from the Department of Geography, University of Wisconsin-Madison. His research interests include GeoAI, Privacy-Preserving AI, and Location Privacy.Song GaoSong Gao is an associate professor in GIScience at the Department of Geography, University of Wisconsin-Madison. He holds a PhD in Geography at the University of California, Santa Barbara. His main research interests include place-based GIS, geospatial data science and GeoAI approaches to human mobility and social sensing.Sijia ZhuSijia Zhu is a Master student in Data Science at Columbia University. She received her bachelor degrees in Statistics and Economics from the University of Wisconsin-Madison.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CATS:使用深度学习方法进行隐私保护轨迹数据发布的条件对抗轨迹综合
摘要无处不在的位置感知设备和移动互联网的普及,使我们能够从用户那里收集海量的个人层面的轨迹数据。这种轨迹大数据为人类移动研究带来了新的机遇,但也引起了公众对位置隐私的关注。在这项工作中,我们提出了条件对抗轨迹合成(CATS),这是一种基于深度学习的GeoAI方法框架,用于隐私保护轨迹数据的生成和发布。CATS将k -匿名应用于人类运动的底层时空分布,提供了分布级的强隐私保证。通过利用k匿名人类移动矩阵的条件对抗训练、基于注意机制的轨迹全局上下文学习以及相邻轨迹点的循环二部图匹配,CATS能够从有条件采样的位置重建轨迹拓扑,并生成高质量的个人级合成轨迹数据,这些数据可以作为原始数据的补充或替代,用于保护隐私的轨迹数据发布。在超过90k的GPS轨迹上的实验结果表明,与基线方法相比,我们的方法在隐私保护、时空特征保护和下游效用方面具有更好的性能,为利用生成式人工智能技术保护隐私的人类移动研究提供了新的见解,并探讨了GIScience中的数据伦理问题。关键词:地理隐私生成对抗网络人类移动性地理合成数据生成致谢作者感谢威斯康星大学麦迪逊分校美国家庭保险数据科学研究所资助计划提供的资金支持。本材料中表达的任何意见、发现、结论或建议均为作者的意见,并不一定反映资助者的观点。披露声明作者未报告潜在的利益冲突。数据和代码可用性声明支持本研究结果的数据和代码可从figshare的以下链接获得:https://doi.org/10.6084/m9.figshare.20760970。值得注意的是,由于与数据提供商的保密协议,我们不会发布原始的个人层面的GPS轨迹数据,而是共享我们实验中使用的k匿名聚合的人类移动数据。作者简介饶金梦,矿物地球科学研究所研究员。他获得美国威斯康星大学麦迪逊分校地理系博士学位。他的研究兴趣包括GeoAI、隐私保护AI和位置隐私。高松,美国威斯康星大学麦迪逊分校地理系地理科学副教授。他拥有加州大学圣巴巴拉分校地理学博士学位。他的主要研究兴趣包括基于地点的地理信息系统、地理空间数据科学和GeoAI方法在人类移动和社会感知方面的应用。朱思佳,哥伦比亚大学数据科学专业硕士研究生。她在威斯康星大学麦迪逊分校获得统计学和经济学学士学位。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.00
自引率
7.00%
发文量
81
审稿时长
9 months
期刊介绍: International Journal of Geographical Information Science provides a forum for the exchange of original ideas, approaches, methods and experiences in the rapidly growing field of geographical information science (GIScience). It is intended to interest those who research fundamental and computational issues of geographic information, as well as issues related to the design, implementation and use of geographical information for monitoring, prediction and decision making. Published research covers innovations in GIScience and novel applications of GIScience in natural resources, social systems and the built environment, as well as relevant developments in computer science, cartography, surveying, geography and engineering in both developed and developing countries.
期刊最新文献
GPU-accelerated parallel all-pair shortest path routing within stochastic road networks Collective flow-evolutionary patterns reveal the mesoscopic structure between snapshots of spatial network Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi’s domain adaptability Translating street view imagery to correct perspectives to enhance bikeability and walkability studies A multi-view ensemble machine learning approach for 3D modeling using geological and geophysical data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1