GAN Training With Kernel Discriminators: What Parameters Control Convergence Rates?

IF 5.8 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Signal Processing Pub Date : 2024-12-12 DOI:10.1109/TSP.2024.3516083
Evan Becker;Parthe Pandit;Sundeep Rangan;Alyson K. Fletcher
{"title":"GAN Training With Kernel Discriminators: What Parameters Control Convergence Rates?","authors":"Evan Becker;Parthe Pandit;Sundeep Rangan;Alyson K. Fletcher","doi":"10.1109/TSP.2024.3516083","DOIUrl":null,"url":null,"abstract":"Generative Adversarial Networks (GANs) are widely used for modeling complex data. However, the dynamics of the gradient descent-ascent (GDA) algorithms, often used for training GANs, have been notoriously difficult to analyze. We study these dynamics in the case where the discriminator is kernel-based and the true distribution consists of discrete points in Euclidean space. Prior works have analyzed the GAN dynamics in such scenarios via simple linearization close to the equilibrium. In this work, we show that linearized analysis can be grossly inaccurate, even at moderate distances from the equilibrium. We then propose an alternative non-linear yet tractable <italic>second moment model</i>. The proposed model predicts the convergence behavior well and reveals new insights about the role of the kernel width on convergence rate, not apparent in the linearized analysis. These insights suggest certain shapes of the kernel offer both fast local convergence and improved global convergence. We corroborate our theoretical results through simulations.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"73 ","pages":"433-445"},"PeriodicalIF":5.8000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10795659/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Generative Adversarial Networks (GANs) are widely used for modeling complex data. However, the dynamics of the gradient descent-ascent (GDA) algorithms, often used for training GANs, have been notoriously difficult to analyze. We study these dynamics in the case where the discriminator is kernel-based and the true distribution consists of discrete points in Euclidean space. Prior works have analyzed the GAN dynamics in such scenarios via simple linearization close to the equilibrium. In this work, we show that linearized analysis can be grossly inaccurate, even at moderate distances from the equilibrium. We then propose an alternative non-linear yet tractable second moment model. The proposed model predicts the convergence behavior well and reveals new insights about the role of the kernel width on convergence rate, not apparent in the linearized analysis. These insights suggest certain shapes of the kernel offer both fast local convergence and improved global convergence. We corroborate our theoretical results through simulations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用核鉴别器训练GAN:什么参数控制收敛速率?
生成对抗网络(GANs)广泛用于复杂数据的建模。然而,梯度下降-上升(GDA)算法的动力学,通常用于训练gan,已经出了名的难以分析。我们研究了鉴别器是基于核的,且真分布由欧几里德空间中的离散点组成的情况下的这些动态。先前的工作已经通过接近平衡的简单线性化来分析这种情况下的GAN动力学。在这项工作中,我们表明线性化分析可能是非常不准确的,即使在中等距离的平衡。然后,我们提出了另一种非线性但易于处理的二阶矩模型。提出的模型很好地预测了收敛行为,并揭示了关于核宽度对收敛速率的作用的新见解,这在线性化分析中并不明显。这些见解表明,内核的某些形状既可以提供快速的局部收敛,又可以改进全局收敛。我们通过模拟证实了我们的理论结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Signal Processing
IEEE Transactions on Signal Processing 工程技术-工程:电子与电气
CiteScore
11.20
自引率
9.30%
发文量
310
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.
期刊最新文献
Robust Filtering and Learning in State-Space Models: Skewness and Heavy Tails Via Asymmetric Laplace Distribution Movable Antenna Enhanced Integrated Sensing and Communication Via Antenna Position Optimization Estimating Sequences with Memory for Minimizing Convex Non-smooth Composite Functions Cellular, Cell-less, and Everything in Between: A Unified Framework for Utility Region Analysis in Wireless Networks Dynamic Regret for Byzantine-Robust Online Federated Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1