STCDM: Spatio-Temporal Contrastive Diffusion Model for Check-In Sequence Generation

IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Knowledge and Data Engineering Pub Date : 2025-01-10 DOI:10.1109/TKDE.2025.3525718
Letian Gong;Shengnan Guo;Yan Lin;Yichen Liu;Erwen Zheng;Yiwei Shuang;Youfang Lin;Jilin Hu;Huaiyu Wan
{"title":"STCDM: Spatio-Temporal Contrastive Diffusion Model for Check-In Sequence Generation","authors":"Letian Gong;Shengnan Guo;Yan Lin;Yichen Liu;Erwen Zheng;Yiwei Shuang;Youfang Lin;Jilin Hu;Huaiyu Wan","doi":"10.1109/TKDE.2025.3525718","DOIUrl":null,"url":null,"abstract":"Analyzing and comprehending check-in sequences is crucial for various applications in smart cities. However, publicly available check-in datasets are often limited in scale due to privacy concerns. This poses a significant obstacle to academic research and downstream applications. Thus, it is urgent to generate realistic check-in datasets. The denoising diffusion probabilistic model (DDPM) as one of the most capable generation methods is a good choice to achieve this goal. However, generating check-in sequences using DDPM is not an easy feat. The difficulties lie in handling check-in sequences of variable lengths and capturing the correlation from check-in sequences’ distinct characteristics. This paper addresses the challenges by proposing a Spatio-Temporal Contrastive Diffusion Model (STCDM). This model introduces a novel spatio-temporal lossless encoding method that effectively encodes check-in sequences into a suitable format with equal length. Furthermore, we capture the spatio-temporal correlations with two disentangled diffusion modules to reduce the impact of the difference between spatial and temporal characteristics. Finally, we incorporate contrastive learning to enhance the relationship between diffusion modules. We generate four realistic datasets in different scenarios using STCDM and design four metrics for comparison. Experiments demonstrate that our generated datasets are more realistic and free of privacy leakage.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 4","pages":"2141-2154"},"PeriodicalIF":10.4000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10836764/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Analyzing and comprehending check-in sequences is crucial for various applications in smart cities. However, publicly available check-in datasets are often limited in scale due to privacy concerns. This poses a significant obstacle to academic research and downstream applications. Thus, it is urgent to generate realistic check-in datasets. The denoising diffusion probabilistic model (DDPM) as one of the most capable generation methods is a good choice to achieve this goal. However, generating check-in sequences using DDPM is not an easy feat. The difficulties lie in handling check-in sequences of variable lengths and capturing the correlation from check-in sequences’ distinct characteristics. This paper addresses the challenges by proposing a Spatio-Temporal Contrastive Diffusion Model (STCDM). This model introduces a novel spatio-temporal lossless encoding method that effectively encodes check-in sequences into a suitable format with equal length. Furthermore, we capture the spatio-temporal correlations with two disentangled diffusion modules to reduce the impact of the difference between spatial and temporal characteristics. Finally, we incorporate contrastive learning to enhance the relationship between diffusion modules. We generate four realistic datasets in different scenarios using STCDM and design four metrics for comparison. Experiments demonstrate that our generated datasets are more realistic and free of privacy leakage.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
STCDM:用于生成签到序列的时空对比扩散模型
分析和理解入住顺序对于智慧城市的各种应用至关重要。然而,由于隐私问题,公开可用的签入数据集通常在规模上受到限制。这对学术研究和下游应用构成了重大障碍。因此,迫切需要生成真实的签入数据集。消噪扩散概率模型(DDPM)作为最有效的生成方法之一,是实现这一目标的良好选择。然而,使用DDPM生成签入序列并不是一件容易的事。难点在于处理可变长度的检入序列和从检入序列的不同特征中获取相关性。本文通过提出一个时空对比扩散模型(STCDM)来解决这些挑战。该模型引入了一种新颖的时空无损编码方法,将检入序列有效地编码为合适的等长度格式。此外,我们通过两个分离的扩散模块捕获时空相关性,以减少时空特征差异的影响。最后,我们结合对比学习来加强扩散模块之间的关系。我们使用STCDM在不同场景下生成了四个真实的数据集,并设计了四个指标进行比较。实验表明,我们生成的数据集更真实,没有隐私泄露。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering 工程技术-工程:电子与电气
CiteScore
11.70
自引率
3.40%
发文量
515
审稿时长
6 months
期刊介绍: The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.
期刊最新文献
Moon: A Modality Conversion-Based Efficient Multivariate Time Series Anomaly Detection Win-Win Approaches for Cross Dynamic Task Assignment in Spatial Crowdsourcing Property-Induced Partitioning for Graph Pattern Queries on Distributed RDF Systems Locally Differentially Private Truth Discovery for Sparse Crowdsensing Learnable Game-Theoretic Policy Optimization for Data-Centric Self-Explanation Rationalization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1