Generative Adversarial Network for Synthetic Time Series Data Generation in Smart Grids

Chi Zhang, S. Kuppannagari, R. Kannan, V. Prasanna
{"title":"Generative Adversarial Network for Synthetic Time Series Data Generation in Smart Grids","authors":"Chi Zhang, S. Kuppannagari, R. Kannan, V. Prasanna","doi":"10.1109/SmartGridComm.2018.8587464","DOIUrl":null,"url":null,"abstract":"The availability of fine grained time series data is a pre-requisite for research in smart-grids. While data for transmission systems is relatively easily obtainable, issues related to data collection, security and privacy hinder the widespread public availability/accessibility of such datasets at the distribution system level. This has prevented the larger research community from effectively applying sophisticated machine learning algorithms to significantly improve the distribution-level accuracy of predictions and increase the efficiency of grid operations. Synthetic dataset generation has proven to be a promising solution for addressing data availability issues in various domains such as computer vision, natural language processing and medicine. However, its exploration in the smart grid context remains unsatisfactory. Previous works have tried to generate synthetic datasets by modeling the underlying system dynamics: an approach which is difficult, time consuming, error prone and often times infeasible in many problems. In this work, we propose a novel data-driven approach to synthetic dataset generation by utilizing deep generative adversarial networks (GAN) to learn the conditional probability distribution of essential features in the real dataset and generate samples based on the learned distribution. To evaluate our synthetically generated dataset, we measure the maximum mean discrepancy (MMD) between real and synthetic datasets as probability distributions, and show that their sampling distance converges. To further validate our synthetic dataset, we perform common smart grid tasks such as k-means clustering and short-term prediction on both datasets. Experimental results show the efficacy of our synthetic dataset approach: the real and synthetic datasets are indistinguishable by solely examining the output of these tasks.","PeriodicalId":213523,"journal":{"name":"2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"81","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartGridComm.2018.8587464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 81

Abstract

The availability of fine grained time series data is a pre-requisite for research in smart-grids. While data for transmission systems is relatively easily obtainable, issues related to data collection, security and privacy hinder the widespread public availability/accessibility of such datasets at the distribution system level. This has prevented the larger research community from effectively applying sophisticated machine learning algorithms to significantly improve the distribution-level accuracy of predictions and increase the efficiency of grid operations. Synthetic dataset generation has proven to be a promising solution for addressing data availability issues in various domains such as computer vision, natural language processing and medicine. However, its exploration in the smart grid context remains unsatisfactory. Previous works have tried to generate synthetic datasets by modeling the underlying system dynamics: an approach which is difficult, time consuming, error prone and often times infeasible in many problems. In this work, we propose a novel data-driven approach to synthetic dataset generation by utilizing deep generative adversarial networks (GAN) to learn the conditional probability distribution of essential features in the real dataset and generate samples based on the learned distribution. To evaluate our synthetically generated dataset, we measure the maximum mean discrepancy (MMD) between real and synthetic datasets as probability distributions, and show that their sampling distance converges. To further validate our synthetic dataset, we perform common smart grid tasks such as k-means clustering and short-term prediction on both datasets. Experimental results show the efficacy of our synthetic dataset approach: the real and synthetic datasets are indistinguishable by solely examining the output of these tasks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
智能电网合成时间序列数据生成的生成对抗网络
细粒度时间序列数据的可用性是智能电网研究的先决条件。虽然传输系统的数据相对容易获得,但与数据收集、安全和隐私有关的问题阻碍了在配电系统一级广泛地向公众提供/获取这些数据集。这阻碍了更大的研究界有效地应用复杂的机器学习算法来显著提高预测的分布级准确性和提高电网运行的效率。合成数据集生成已被证明是解决计算机视觉、自然语言处理和医学等各个领域数据可用性问题的有前途的解决方案。然而,其在智能电网背景下的探索仍不尽人意。以前的工作试图通过对底层系统动力学建模来生成合成数据集:这种方法困难、耗时、容易出错,而且在许多问题中往往是不可行的。在这项工作中,我们提出了一种新的数据驱动的合成数据集生成方法,利用深度生成对抗网络(GAN)来学习真实数据集中基本特征的条件概率分布,并根据学习到的分布生成样本。为了评估我们合成的数据集,我们测量了真实数据集和合成数据集之间的最大平均差异(MMD)作为概率分布,并表明它们的采样距离收敛。为了进一步验证我们的合成数据集,我们在两个数据集上执行常见的智能电网任务,如k-means聚类和短期预测。实验结果表明了我们的合成数据集方法的有效性:通过单独检查这些任务的输出,真实数据集和合成数据集无法区分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Behind-the-Meter Solar Generation Disaggregation using Consumer Mixture Models Coordinated Planning of Multi-Energy System with District Heating Network A Cost-efficient Software Testbed for Cyber-Physical Security in IEC 61850-based Substations Joint Optimal Power Flow Routing and Decentralized Scheduling with Vehicle-to-Grid Regulation Service Energy Flexibility for Systems with large Thermal Masses with Applications to Shopping Centers
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1