{"title":"Data-driven modeling of noise time series with convolutional generative adversarial networks.","authors":"Adam Wunderlich, Jack Sklar","doi":"10.1088/2632-2153/acee44","DOIUrl":null,"url":null,"abstract":"<p><p>Random noise arising from physical processes is an inherent characteristic of measurements and a limiting factor for most signal processing and data analysis tasks. Given the recent interest in generative adversarial networks (GANs) for data-driven modeling, it is important to determine to what extent GANs can faithfully reproduce noise in target data sets. In this paper, we present an empirical investigation that aims to shed light on this issue for time series. Namely, we assess two general-purpose GANs for time series that are based on the popular deep convolutional GAN architecture, a direct time-series model and an image-based model that uses a short-time Fourier transform data representation. The GAN models are trained and quantitatively evaluated using distributions of simulated noise time series with known ground-truth parameters. Target time series distributions include a broad range of noise types commonly encountered in physical measurements, electronics, and communication systems: band-limited thermal noise, power law noise, shot noise, and impulsive noise. We find that GANs are capable of learning many noise types, although they predictably struggle when the GAN architecture is not well suited to some aspects of the noise, e.g. impulsive time-series with extreme outliers. Our findings provide insights into the capabilities and potential limitations of current approaches to time-series GANs and highlight areas for further research. In addition, our battery of tests provides a useful benchmark to aid the development of deep generative models for time series.</p>","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"4 3","pages":""},"PeriodicalIF":6.3000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484071/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning Science and Technology","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1088/2632-2153/acee44","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Random noise arising from physical processes is an inherent characteristic of measurements and a limiting factor for most signal processing and data analysis tasks. Given the recent interest in generative adversarial networks (GANs) for data-driven modeling, it is important to determine to what extent GANs can faithfully reproduce noise in target data sets. In this paper, we present an empirical investigation that aims to shed light on this issue for time series. Namely, we assess two general-purpose GANs for time series that are based on the popular deep convolutional GAN architecture, a direct time-series model and an image-based model that uses a short-time Fourier transform data representation. The GAN models are trained and quantitatively evaluated using distributions of simulated noise time series with known ground-truth parameters. Target time series distributions include a broad range of noise types commonly encountered in physical measurements, electronics, and communication systems: band-limited thermal noise, power law noise, shot noise, and impulsive noise. We find that GANs are capable of learning many noise types, although they predictably struggle when the GAN architecture is not well suited to some aspects of the noise, e.g. impulsive time-series with extreme outliers. Our findings provide insights into the capabilities and potential limitations of current approaches to time-series GANs and highlight areas for further research. In addition, our battery of tests provides a useful benchmark to aid the development of deep generative models for time series.
物理过程产生的随机噪声是测量的固有特征,也是大多数信号处理和数据分析任务的限制因素。鉴于最近人们对用于数据驱动建模的生成式对抗网络(GANs)的兴趣,确定 GANs 在多大程度上能忠实地再现目标数据集中的噪声非常重要。本文介绍了一项实证调查,旨在揭示时间序列的这一问题。也就是说,我们评估了两种基于流行的深度卷积 GAN 架构的通用时间序列 GAN,一种是直接的时间序列模型,另一种是使用短时傅立叶变换数据表示的基于图像的模型。这些 GAN 模型是利用具有已知真实参数的模拟噪声时间序列分布进行训练和定量评估的。目标时间序列分布包括物理测量、电子和通信系统中常见的各种噪声类型:带限热噪声、幂律噪声、射频噪声和脉冲噪声。我们发现,GAN 能够学习多种类型的噪声,不过当 GAN 架构不能很好地适应噪声的某些方面时,例如带有极端离群值的脉冲时间序列,GAN 就会陷入困境。我们的研究结果让我们深入了解了当前时间序列 GAN 方法的能力和潜在局限性,并突出了有待进一步研究的领域。此外,我们的一系列测试还为时间序列深度生成模型的开发提供了有用的基准。
期刊介绍:
Machine Learning Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights. Specifically, articles must fall into one of the following categories: advance the state of machine learning-driven applications in the sciences or make conceptual, methodological or theoretical advances in machine learning with applications to, inspiration from, or motivated by scientific problems.