首页 > 最新文献

IEEE journal on selected areas in information theory最新文献

英文 中文
Shannon Bounds for Quadratic Rate-Distortion Problems 二次速率失真问题的香农界值
Pub Date : 2024-09-20 DOI: 10.1109/JSAIT.2024.3465022
Michael Gastpar;Erixhen Sula
The Shannon lower bound has been the subject of several important contributions by Berger. This paper surveys Shannon bounds on rate-distortion problems under mean-squared error distortion with a particular emphasis on Berger’s techniques. Moreover, as a new result, the Gray-Wyner network is added to the canon of settings for which such bounds are known. In the Shannon bounding technique, elegant lower bounds are expressed in terms of the source entropy power. Moreover, there is often a complementary upper bound that involves the source variance in such a way that the bounds coincide in the special case of Gaussian statistics. Such pairs of bounds are sometimes referred to as Shannon bounds. The present paper puts Berger’s work on many aspects of this problem in the context of more recent developments, encompassing indirect and remote source coding such as the CEO problem, originally proposed by Berger, as well as the Gray-Wyner network as a new contribution.
香农下界是伯杰数次重要贡献的主题。本文研究了均方误差失真条件下速率失真问题的香农下界,并特别强调了伯杰的技术。此外,作为一项新成果,格雷-维纳网络也被添加到已知此类约束的环境中。在香农约束技术中,优雅的下限用源熵功率表示。此外,在高斯统计的特殊情况下,通常会有一个涉及源方差的互补上界,使两者的边界重合。这样的边界对有时被称为香农边界。本文将伯杰在这一问题的许多方面所做的工作与最近的发展结合起来,包括间接和远程源编码,如伯杰最初提出的 CEO 问题,以及作为新贡献的格雷-惠纳网络。
{"title":"Shannon Bounds for Quadratic Rate-Distortion Problems","authors":"Michael Gastpar;Erixhen Sula","doi":"10.1109/JSAIT.2024.3465022","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3465022","url":null,"abstract":"The Shannon lower bound has been the subject of several important contributions by Berger. This paper surveys Shannon bounds on rate-distortion problems under mean-squared error distortion with a particular emphasis on Berger’s techniques. Moreover, as a new result, the Gray-Wyner network is added to the canon of settings for which such bounds are known. In the Shannon bounding technique, elegant lower bounds are expressed in terms of the source entropy power. Moreover, there is often a complementary upper bound that involves the source variance in such a way that the bounds coincide in the special case of Gaussian statistics. Such pairs of bounds are sometimes referred to as Shannon bounds. The present paper puts Berger’s work on many aspects of this problem in the context of more recent developments, encompassing indirect and remote source coding such as the CEO problem, originally proposed by Berger, as well as the Gray-Wyner network as a new contribution.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"597-608"},"PeriodicalIF":0.0,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142452676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computation of Binary Arithmetic Sum Over an Asymmetric Diamond Network 通过非对称钻石网络计算二进制算术和
Pub Date : 2024-09-02 DOI: 10.1109/JSAIT.2024.3453273
Ruze Zhang;Xuan Guang;Shenghao Yang;Xueyan Niu;Bo Bai
In this paper, the problem of zero-error network function computation is considered, where in a directed acyclic network, a single sink node is required to compute with zero error a function of the source messages that are separately generated by multiple source nodes. From the information-theoretic point of view, we are interested in the fundamental computing capacity, which is defined as the average number of times that the function can be computed with zero error for one use of the network. The explicit characterization of the computing capacity in general is overwhelming difficult. The best known upper bound applicable to arbitrary network topologies and arbitrary target functions is the one proved by Guang et al. in using an approach of the cut-set strong partition. This bound is tight for all previously considered network function computation problems whose computing capacities are known. In this paper, we consider the model of computing the binary arithmetic sum over an asymmetric diamond network, which is of great importance to illustrate the combinatorial nature of network function computation problem. First, we prove a corrected upper bound 1 by using a linear programming approach, which corrects an invalid bound previously claimed in the literature. Nevertheless, this upper bound cannot bring any improvement over the best known upper bound for this model, which is also equal to 1. Further, by developing a different graph coloring approach, we obtain an improved upper bound ${}frac {1}{log _{3} 2+log 3-1}~(approx 0.822)$ . We thus show that the best known upper bound proved by Guang et al. is not tight for this model which answers the open problem that whether this bound in general is tight. On the other hand, we present an explicit code construction, which implies a lower bound ${}frac {1}{2}log _{3}6~(approx 0.815)$ on the computing capacity. Comparing the improved upper and lower bounds thus obtained, there exists a rough 0.007 gap between them.
本文考虑的是零误差网络函数计算问题,即在有向无环网络中,要求单个汇节点以零误差计算多个源节点分别生成的源信息的函数。从信息论的角度来看,我们感兴趣的是基本计算能力,它被定义为在一次网络使用中以零误差计算函数的平均次数。在一般情况下,计算能力的明确表征非常困难。目前已知的适用于任意网络拓扑和任意目标函数的最佳上界是 Guang 等人利用切集强分割方法证明的。对于之前考虑过的所有已知计算能力的网络函数计算问题,这个约束都很严格。在本文中,我们考虑了在非对称菱形网络上计算二进制算术和的模型,这对说明网络函数计算问题的组合性质具有重要意义。首先,我们利用线性规划方法证明了一个修正的上界 1,修正了之前文献中声称的一个无效上界。尽管如此,这个上界与该模型已知的最佳上界(也等于 1)相比并没有任何改进。此外,通过开发一种不同的图着色方法,我们得到了一个改进的上界 ${}frac {1}{log _{3}2+log 3-1}~(大约 0.822)$ 。因此,我们证明了由 Guang 等人证明的已知上界对于这个模型并不严密,这就回答了一个悬而未决的问题:这个上界在一般情况下是否严密。另一方面,我们提出了一种显式代码构造,它意味着计算能力的下界为 ${}frac {1}{2}log _{3}6~(approx 0.815)$ 。比较由此获得的改进上界和下界,两者之间大致存在 0.007 的差距。
{"title":"Computation of Binary Arithmetic Sum Over an Asymmetric Diamond Network","authors":"Ruze Zhang;Xuan Guang;Shenghao Yang;Xueyan Niu;Bo Bai","doi":"10.1109/JSAIT.2024.3453273","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3453273","url":null,"abstract":"In this paper, the problem of zero-error network function computation is considered, where in a directed acyclic network, a single sink node is required to compute with zero error a function of the source messages that are separately generated by multiple source nodes. From the information-theoretic point of view, we are interested in the fundamental computing capacity, which is defined as the average number of times that the function can be computed with zero error for one use of the network. The explicit characterization of the computing capacity in general is overwhelming difficult. The best known upper bound applicable to arbitrary network topologies and arbitrary target functions is the one proved by Guang et al. in using an approach of the cut-set strong partition. This bound is tight for all previously considered network function computation problems whose computing capacities are known. In this paper, we consider the model of computing the binary arithmetic sum over an asymmetric diamond network, which is of great importance to illustrate the combinatorial nature of network function computation problem. First, we prove a corrected upper bound 1 by using a linear programming approach, which corrects an invalid bound previously claimed in the literature. Nevertheless, this upper bound cannot bring any improvement over the best known upper bound for this model, which is also equal to 1. Further, by developing a different graph coloring approach, we obtain an improved upper bound \u0000<inline-formula> <tex-math>${}frac {1}{log _{3} 2+log 3-1}~(approx 0.822)$ </tex-math></inline-formula>\u0000. We thus show that the best known upper bound proved by Guang et al. is not tight for this model which answers the open problem that whether this bound in general is tight. On the other hand, we present an explicit code construction, which implies a lower bound \u0000<inline-formula> <tex-math>${}frac {1}{2}log _{3}6~(approx 0.815)$ </tex-math></inline-formula>\u0000 on the computing capacity. Comparing the improved upper and lower bounds thus obtained, there exists a rough 0.007 gap between them.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"585-596"},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142320442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Complexity Coding Techniques for Cloud Radio Access Networks 云无线接入网络的低复杂度编码技术
Pub Date : 2024-08-28 DOI: 10.1109/JSAIT.2024.3451240
Nadim Ghaddar;Lele Wang
The problem of coding for the uplink and downlink of cloud radio access networks (C-RAN’s) with K users and L relays is considered. It is shown that low-complexity coding schemes that achieve any point in the rate-fronthaul region of joint coding and compression can be constructed starting from at most $4(K+L)-2$ point-to-point codes designed for symmetric channels. This reduces the seemingly hard task of constructing good codes for C-RAN’s to the much better understood task of finding good codes for single-user channels. To show this result, an equivalence between the achievable rate-fronthaul regions of joint coding and successive coding is established. Then, rate-splitting and quantization-splitting techniques are used to show that the task of achieving a rate-fronthaul point in the joint coding region can be simplified to that of achieving a corner point in a higher-dimensional C-RAN problem. As a by-product, some interesting properties of the rate-fronthaul region of joint decoding for uplink C-RAN’s are also derived.
本文研究了具有 K 个用户和 L 个中继站的云无线接入网(C-RAN)的上下行链路编码问题。研究表明,可以从最多 $4(K+L)-2$ 为对称信道设计的点对点编码开始,构建低复杂度编码方案,以实现联合编码和压缩的速率-链路区域内的任意点。这就把为 C-RAN 构建良好编码这一看似艰巨的任务简化为为单用户信道寻找良好编码这一更好理解的任务。为了证明这一结果,我们建立了联合编码和连续编码的可实现速率-频带区域之间的等价关系。然后,利用速率拆分和量化拆分技术来说明,在联合编码区域实现速率-频带点的任务可以简化为在高维 C-RAN 问题中实现角点的任务。作为副产品,还得出了上行链路 C-RAN 联合解码区域的一些有趣特性。
{"title":"Low-Complexity Coding Techniques for Cloud Radio Access Networks","authors":"Nadim Ghaddar;Lele Wang","doi":"10.1109/JSAIT.2024.3451240","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3451240","url":null,"abstract":"The problem of coding for the uplink and downlink of cloud radio access networks (C-RAN’s) with K users and L relays is considered. It is shown that low-complexity coding schemes that achieve any point in the rate-fronthaul region of joint coding and compression can be constructed starting from at most \u0000<inline-formula> <tex-math>$4(K+L)-2$ </tex-math></inline-formula>\u0000 point-to-point codes designed for symmetric channels. This reduces the seemingly hard task of constructing good codes for C-RAN’s to the much better understood task of finding good codes for single-user channels. To show this result, an equivalence between the achievable rate-fronthaul regions of joint coding and successive coding is established. Then, rate-splitting and quantization-splitting techniques are used to show that the task of achieving a rate-fronthaul point in the joint coding region can be simplified to that of achieving a corner point in a higher-dimensional C-RAN problem. As a by-product, some interesting properties of the rate-fronthaul region of joint decoding for uplink C-RAN’s are also derived.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"572-584"},"PeriodicalIF":0.0,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142275002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum to “LightVeriFL: A Lightweight and Verifiable Secure Aggregation for Federated Learning” LightVeriFL:用于联合学习的轻量级可验证安全聚合"
Pub Date : 2024-08-26 DOI: 10.1109/JSAIT.2024.3413928
Baturalp Buyukates;Jinhyun So;Hessam Mahdavifar;Salman Avestimehr
This article addresses errors in [1]. Equation (2) contained an error wherein x was not bold. It is corrected below.
本文解决了 [1] 中的错误。公式(2)中有一处错误,其中 x 没有加粗。现更正如下。
{"title":"Erratum to “LightVeriFL: A Lightweight and Verifiable Secure Aggregation for Federated Learning”","authors":"Baturalp Buyukates;Jinhyun So;Hessam Mahdavifar;Salman Avestimehr","doi":"10.1109/JSAIT.2024.3413928","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3413928","url":null,"abstract":"This article addresses errors in \u0000<xref>[1]</xref>\u0000. \u0000<xref>Equation (2)</xref>\u0000 contained an error wherein x was not bold. It is corrected below.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"570-571"},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10648286","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142077637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
JPEG Compliant Compression for DNN Vision 符合 JPEG 标准的 DNN Vision 压缩技术
Pub Date : 2024-07-04 DOI: 10.1109/JSAIT.2024.3422011
Ahmed H. Salamah;Kaixiang Zheng;Linfeng Ye;En-Hui Yang
Conventional image compression techniques are primarily developed for the human visual system. However, with the extensive use of deep neural networks (DNNs) for computer vision, more and more images will be consumed by DNN-based intelligent machines, which makes it crucial to develop image compression techniques customized for DNN vision while being JPEG compliant. In this paper, we revisit the JPEG rate distortion theory for DNN vision. First, we propose a novel distortion measure, dubbed the sensitivity weighted error (SWE), for DNN vision. Second, we incorporate SWE into the soft decision quantization (SDQ) process of JPEG to trade SWE for rate. Finally, we develop an algorithm, called OptS, for designing optimal quantization tables for the luminance channel and chrominance channels, respectively. To test the performance of the resulting DNN-oriented compression framework and algorithm, experiments of image classification are conducted on the ImageNet dataset for four prevalent DNN models. Results demonstrate that our proposed framework and algorithm achieve better rate-accuracy (R-A) performance than the default JPEG. For some DNN models, our proposed framework and algorithm provide a significant reduction in the compression rate up to 67.84% with no accuracy loss compared to the default JPEG.
传统的图像压缩技术主要是针对人类视觉系统开发的。然而,随着深度神经网络(DNN)在计算机视觉领域的广泛应用,越来越多的图像将被基于 DNN 的智能机器所使用,因此,在符合 JPEG 标准的同时,开发专为 DNN 视觉定制的图像压缩技术至关重要。在本文中,我们重新审视了适用于 DNN 视觉的 JPEG 率失真理论。首先,我们为 DNN 视觉提出了一种新的失真测量方法,称为灵敏度加权误差(SWE)。其次,我们将 SWE 纳入 JPEG 的软决策量化(SDQ)过程,以 SWE 换取速率。最后,我们开发了一种名为 OptS 的算法,用于分别为亮度通道和色度通道设计最佳量化表。为了测试面向 DNN 的压缩框架和算法的性能,我们在 ImageNet 数据集上对四种流行的 DNN 模型进行了图像分类实验。结果表明,与默认的 JPEG 相比,我们提出的框架和算法实现了更好的速率-准确率(R-A)性能。对于某些 DNN 模型,与默认 JPEG 相比,我们提出的框架和算法大大降低了压缩率,最高可达 67.84%,且没有任何精度损失。
{"title":"JPEG Compliant Compression for DNN Vision","authors":"Ahmed H. Salamah;Kaixiang Zheng;Linfeng Ye;En-Hui Yang","doi":"10.1109/JSAIT.2024.3422011","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3422011","url":null,"abstract":"Conventional image compression techniques are primarily developed for the human visual system. However, with the extensive use of deep neural networks (DNNs) for computer vision, more and more images will be consumed by DNN-based intelligent machines, which makes it crucial to develop image compression techniques customized for DNN vision while being JPEG compliant. In this paper, we revisit the JPEG rate distortion theory for DNN vision. First, we propose a novel distortion measure, dubbed the sensitivity weighted error (SWE), for DNN vision. Second, we incorporate SWE into the soft decision quantization (SDQ) process of JPEG to trade SWE for rate. Finally, we develop an algorithm, called OptS, for designing optimal quantization tables for the luminance channel and chrominance channels, respectively. To test the performance of the resulting DNN-oriented compression framework and algorithm, experiments of image classification are conducted on the ImageNet dataset for four prevalent DNN models. Results demonstrate that our proposed framework and algorithm achieve better rate-accuracy (R-A) performance than the default JPEG. For some DNN models, our proposed framework and algorithm provide a significant reduction in the compression rate up to 67.84% with no accuracy loss compared to the default JPEG.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"520-533"},"PeriodicalIF":0.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141966001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Throughput and Latency Analysis for Line Networks With Outage Links 带中断链路的线路网络吞吐量和延迟分析
Pub Date : 2024-06-25 DOI: 10.1109/JSAIT.2024.3419054
Yanyan Dong;Shenghao Yang;Jie Wang;Fan Cheng
Wireless communication links suffer from outage events caused by fading and interference. To facilitate a tractable analysis of network communication throughput and latency, we propose an outage link model to represent a communication link in the slow fading phenomenon. For a line-topology network with outage links, we study three types of intermediate network node schemes: random linear network coding, store-and-forward, and hop-by-hop retransmission. We provide the analytical formulas for the maximum throughputs and the end-to-end latency for each scheme. To gain a more explicit understanding, we perform a scalability analysis of the throughput and latency as the network length increases. We observe that the same order of throughput/latency holds across a wide range of outage functions for each scheme. We illustrate how our exact formulae and scalability results can be applied to compare different schemes.
无线通信链路存在由衰落和干扰引起的中断事件。为了便于分析网络通信的吞吐量和延迟,我们提出了一个中断链路模型来表示慢衰落现象中的通信链路。对于具有中断链路的线路拓扑网络,我们研究了三种中间网络节点方案:随机线性网络编码、存储转发和逐跳重传。我们提供了每种方案的最大吞吐量和端到端延迟的解析公式。为了获得更清晰的理解,我们对网络长度增加时的吞吐量和延迟进行了可扩展性分析。我们发现,每种方案的吞吐量/延迟在各种中断函数中都保持相同的顺序。我们将说明如何应用我们的精确公式和可扩展性结果来比较不同的方案。
{"title":"Throughput and Latency Analysis for Line Networks With Outage Links","authors":"Yanyan Dong;Shenghao Yang;Jie Wang;Fan Cheng","doi":"10.1109/JSAIT.2024.3419054","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3419054","url":null,"abstract":"Wireless communication links suffer from outage events caused by fading and interference. To facilitate a tractable analysis of network communication throughput and latency, we propose an outage link model to represent a communication link in the slow fading phenomenon. For a line-topology network with outage links, we study three types of intermediate network node schemes: random linear network coding, store-and-forward, and hop-by-hop retransmission. We provide the analytical formulas for the maximum throughputs and the end-to-end latency for each scheme. To gain a more explicit understanding, we perform a scalability analysis of the throughput and latency as the network length increases. We observe that the same order of throughput/latency holds across a wide range of outage functions for each scheme. We illustrate how our exact formulae and scalability results can be applied to compare different schemes.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"464-477"},"PeriodicalIF":0.0,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10571545","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141966000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing GAN Training Instabilities via Tunable Classification Losses 通过可调分类损失解决 GAN 训练不稳定性问题
Pub Date : 2024-06-19 DOI: 10.1109/JSAIT.2024.3415670
Monica Welfert;Gowtham R. Kurri;Kyle Otstot;Lalitha Sankar
Generative adversarial networks (GANs), modeled as a zero-sum game between a generator (G) and a discriminator (D), allow generating synthetic data with formal guarantees. Noting that D is a classifier, we begin by reformulating the GAN value function using class probability estimation (CPE) losses. We prove a two-way correspondence between CPE loss GANs and f-GANs which minimize f-divergences. We also show that all symmetric f-divergences are equivalent in convergence. In the finite sample and model capacity setting, we define and obtain bounds on estimation and generalization errors. We specialize these results to $alpha $ -GANs, defined using $alpha $ -loss, a tunable CPE loss family parametrized by $alpha in (0,infty $ ]. We next introduce a class of dual-objective GANs to address training instabilities of GANs by modeling each player’s objective using $alpha $ -loss to obtain $(alpha _{D},alpha _{G})$ -GANs. We show that the resulting non-zero sum game simplifies to minimizing an f-divergence under appropriate conditions on $(alpha _{D},alpha _{G})$ . Generalizing this dual-objective formulation using CPE losses, we define and obtain upper bounds on an appropriately defined estimation error. Finally, we highlight the value of tuning $(alpha _{D},alpha _{G})$ in alleviating training instabilities for the synthetic 2D Gaussian mixture ring as well as the large publicly available Celeb-A and LSUN Classroom image datasets.
生成式对抗网络(GAN)被模拟为生成器(G)和判别器(D)之间的零和博弈,可以生成具有形式保证的合成数据。注意到 D 是一个分类器,我们首先使用类概率估计(CPE)损失重新表述了 GAN 的价值函数。我们证明了 CPE 损失 GAN 与 f-GAN 之间的双向对应关系,后者最大限度地减小了 f 分歧。我们还证明了所有对称的 f-divergences 在收敛性上是等价的。在有限样本和模型容量设置中,我们定义并获得了估计误差和泛化误差的界限。我们将这些结果专门应用于$alpha $ -GANs,使用$alpha $ -loss定义,$alpha in (0,infty $ ]是一个参数为$alpha in (0,infty $ ]的可调CPE损失族。接下来,我们引入了一类双目标 GAN,通过使用 $alpha $ -loss 对每个参与者的目标进行建模,得到 $(alpha _{D},alpha _{G})$ -GAN,从而解决 GAN 的训练不稳定性问题。我们证明,由此产生的非零和博弈在 $(alpha _{D},alpha _{G})$ 的适当条件下简化为最小化 f-发散。通过使用 CPE 损失对这一双目标表述进行推广,我们定义并获得了适当定义的估计误差上限。最后,我们强调了调整 $(alpha _{D},alpha _{G})$ 在缓解合成二维高斯混合环以及大型公开 Celeb-A 和 LSUN 课堂图像数据集的训练不稳定性方面的价值。
{"title":"Addressing GAN Training Instabilities via Tunable Classification Losses","authors":"Monica Welfert;Gowtham R. Kurri;Kyle Otstot;Lalitha Sankar","doi":"10.1109/JSAIT.2024.3415670","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3415670","url":null,"abstract":"Generative adversarial networks (GANs), modeled as a zero-sum game between a generator (G) and a discriminator (D), allow generating synthetic data with formal guarantees. Noting that D is a classifier, we begin by reformulating the GAN value function using class probability estimation (CPE) losses. We prove a two-way correspondence between CPE loss GANs and f-GANs which minimize f-divergences. We also show that all symmetric f-divergences are equivalent in convergence. In the finite sample and model capacity setting, we define and obtain bounds on estimation and generalization errors. We specialize these results to \u0000<inline-formula> <tex-math>$alpha $ </tex-math></inline-formula>\u0000-GANs, defined using \u0000<inline-formula> <tex-math>$alpha $ </tex-math></inline-formula>\u0000-loss, a tunable CPE loss family parametrized by \u0000<inline-formula> <tex-math>$alpha in (0,infty $ </tex-math></inline-formula>\u0000]. We next introduce a class of dual-objective GANs to address training instabilities of GANs by modeling each player’s objective using \u0000<inline-formula> <tex-math>$alpha $ </tex-math></inline-formula>\u0000-loss to obtain \u0000<inline-formula> <tex-math>$(alpha _{D},alpha _{G})$ </tex-math></inline-formula>\u0000-GANs. We show that the resulting non-zero sum game simplifies to minimizing an f-divergence under appropriate conditions on \u0000<inline-formula> <tex-math>$(alpha _{D},alpha _{G})$ </tex-math></inline-formula>\u0000. Generalizing this dual-objective formulation using CPE losses, we define and obtain upper bounds on an appropriately defined estimation error. Finally, we highlight the value of tuning \u0000<inline-formula> <tex-math>$(alpha _{D},alpha _{G})$ </tex-math></inline-formula>\u0000 in alleviating training instabilities for the synthetic 2D Gaussian mixture ring as well as the large publicly available Celeb-A and LSUN Classroom image datasets.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"534-553"},"PeriodicalIF":0.0,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141965916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long-Term Fairness in Sequential Multi-Agent Selection With Positive Reinforcement 带正向强化的连续多代理选择中的长期公平性
Pub Date : 2024-06-18 DOI: 10.1109/JSAIT.2024.3416078
Bhagyashree Puranik;Ozgur Guldogan;Upamanyu Madhow;Ramtin Pedarsani
While much of the rapidly growing literature on fair decision-making focuses on metrics for one-shot decisions, recent work has raised the intriguing possibility of designing sequential decision-making to positively impact long-term social fairness. In selection processes such as college admissions or hiring, biasing slightly towards applicants from under-represented groups is hypothesized to provide positive feedback that increases the pool of under-represented applicants in future selection rounds, thus enhancing fairness in the long term. In this paper, we examine this hypothesis and its consequences in a setting in which multiple agents are selecting from a common pool of applicants. We propose the Multi-agent Fair-Greedy policy, that balances greedy score maximization and fairness. Under this policy, we prove that the resource pool and the admissions converge to a long-term fairness target set by the agents when the score distributions across the groups in the population are identical. We provide empirical evidence of existence of equilibria under non-identical score distributions through synthetic and adapted real-world datasets. We then sound a cautionary note for more complex applicant pool evolution models, under which uncoordinated behavior by the agents can cause negative reinforcement, leading to a reduction in the fraction of under-represented applicants. Our results indicate that, while positive reinforcement is a promising mechanism for long-term fairness, policies must be designed carefully to be robust to variations in the evolution model, with a number of open issues that remain to be explored by algorithm designers, social scientists, and policymakers.
尽管快速增长的有关公平决策的文献大多侧重于一次性决策的衡量标准,但最近的研究提出了一种令人感兴趣的可能性,即通过设计连续决策来对长期社会公平性产生积极影响。在大学录取或招聘等选拔过程中,如果对来自代表性不足群体的申请人略有偏向,就会产生积极的反馈,从而在未来的选拔中增加代表性不足的申请人的数量,从而提高长期的公平性。在本文中,我们将在多个代理从一个共同的申请人库中进行遴选的情况下,对这一假设及其结果进行研究。我们提出了多代理公平-贪婪政策,在贪婪分数最大化和公平性之间取得了平衡。在这一政策下,我们证明了当群体中各组的分数分布相同时,资源池和录取率会趋同于代理设定的长期公平目标。我们通过合成和改编的现实世界数据集,提供了非相同分数分布下存在均衡的经验证据。然后,我们对更复杂的申请者群体演化模型提出了警告,在这种情况下,代理人的不协调行为可能会导致负强化,从而导致代表性不足的申请者比例下降。我们的研究结果表明,虽然正强化是一种有希望实现长期公平的机制,但政策的设计必须谨慎,以适应演化模型的变化,同时还有许多开放性问题有待算法设计者、社会科学家和政策制定者去探索。
{"title":"Long-Term Fairness in Sequential Multi-Agent Selection With Positive Reinforcement","authors":"Bhagyashree Puranik;Ozgur Guldogan;Upamanyu Madhow;Ramtin Pedarsani","doi":"10.1109/JSAIT.2024.3416078","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3416078","url":null,"abstract":"While much of the rapidly growing literature on fair decision-making focuses on metrics for one-shot decisions, recent work has raised the intriguing possibility of designing sequential decision-making to positively impact long-term social fairness. In selection processes such as college admissions or hiring, biasing slightly towards applicants from under-represented groups is hypothesized to provide positive feedback that increases the pool of under-represented applicants in future selection rounds, thus enhancing fairness in the long term. In this paper, we examine this hypothesis and its consequences in a setting in which multiple agents are selecting from a common pool of applicants. We propose the Multi-agent Fair-Greedy policy, that balances greedy score maximization and fairness. Under this policy, we prove that the resource pool and the admissions converge to a long-term fairness target set by the agents when the score distributions across the groups in the population are identical. We provide empirical evidence of existence of equilibria under non-identical score distributions through synthetic and adapted real-world datasets. We then sound a cautionary note for more complex applicant pool evolution models, under which uncoordinated behavior by the agents can cause negative reinforcement, leading to a reduction in the fraction of under-represented applicants. Our results indicate that, while positive reinforcement is a promising mechanism for long-term fairness, policies must be designed carefully to be robust to variations in the evolution model, with a number of open issues that remain to be explored by algorithm designers, social scientists, and policymakers.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"424-441"},"PeriodicalIF":0.0,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141624148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controlled Privacy Leakage Propagation Throughout Overlapping Grouped Learning 通过重叠分组学习控制隐私泄露传播
Pub Date : 2024-06-18 DOI: 10.1109/JSAIT.2024.3416089
Shahrzad Kiani;Franziska Boenisch;Stark C. Draper
Federated Learning (FL) is the standard protocol for collaborative learning. In FL, multiple workers jointly train a shared model. They exchange model updates calculated on their data, while keeping the raw data itself local. Since workers naturally form groups based on common interests and privacy policies, we are motivated to extend standard FL to reflect a setting with multiple, potentially overlapping groups. In this setup where workers can belong and contribute to more than one group at a time, complexities arise in understanding privacy leakage and in adhering to privacy policies. To address the challenges, we propose differential private overlapping grouped learning (DP-OGL), a novel method to implement privacy guarantees within overlapping groups. Under the honest-but-curious threat model, we derive novel privacy guarantees between arbitrary pairs of workers. These privacy guarantees describe and quantify two key effects of privacy leakage in DP-OGL: propagation delay, i.e., the fact that information from one group will leak to other groups only with temporal offset through the common workers and information degradation, i.e., the fact that noise addition over model updates limits information leakage between workers. Our experiments show that applying DP-OGL enhances utility while maintaining strong privacy compared to standard FL setups.
联合学习(FL)是协作学习的标准协议。在 FL 中,多个工作人员共同训练一个共享模型。他们交换根据各自数据计算的模型更新,同时保持原始数据本身的本地化。由于工作人员会根据共同的兴趣和隐私政策自然地组成小组,因此我们有动力对标准 FL 进行扩展,以反映具有多个潜在重叠小组的环境。在这种情况下,工人可以同时属于一个以上的小组并为其做出贡献,因此在理解隐私泄露和遵守隐私政策方面出现了复杂的问题。为了应对这些挑战,我们提出了差分隐私重叠分组学习(DP-OGL),这是一种在重叠组内实现隐私保证的新方法。在 "诚实但好奇 "的威胁模型下,我们得出了任意工人对之间的新型隐私保证。这些隐私保证描述并量化了 DP-OGL 中隐私泄漏的两个关键影响:传播延迟,即一个组的信息只有通过共同工作者的时间偏移才会泄漏到其他组;信息退化,即模型更新时的噪声增加限制了工作者之间的信息泄漏。我们的实验表明,与标准的 FL 设置相比,应用 DP-OGL 可以提高效用,同时保持较高的隐私性。
{"title":"Controlled Privacy Leakage Propagation Throughout Overlapping Grouped Learning","authors":"Shahrzad Kiani;Franziska Boenisch;Stark C. Draper","doi":"10.1109/JSAIT.2024.3416089","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3416089","url":null,"abstract":"Federated Learning (FL) is the standard protocol for collaborative learning. In FL, multiple workers jointly train a shared model. They exchange model updates calculated on their data, while keeping the raw data itself local. Since workers naturally form groups based on common interests and privacy policies, we are motivated to extend standard FL to reflect a setting with multiple, potentially overlapping groups. In this setup where workers can belong and contribute to more than one group at a time, complexities arise in understanding privacy leakage and in adhering to privacy policies. To address the challenges, we propose differential private overlapping grouped learning (DP-OGL), a novel method to implement privacy guarantees within overlapping groups. Under the honest-but-curious threat model, we derive novel privacy guarantees between arbitrary pairs of workers. These privacy guarantees describe and quantify two key effects of privacy leakage in DP-OGL: propagation delay, i.e., the fact that information from one group will leak to other groups only with temporal offset through the common workers and information degradation, i.e., the fact that noise addition over model updates limits information leakage between workers. Our experiments show that applying DP-OGL enhances utility while maintaining strong privacy compared to standard FL setups.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"442-463"},"PeriodicalIF":0.0,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141624126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information Velocity of Cascaded Gaussian Channels With Feedback 带反馈的级联高斯信道的信息速度
Pub Date : 2024-06-18 DOI: 10.1109/JSAIT.2024.3416310
Elad Domanovitz;Anatoly Khina;Tal Philosof;Yuval Kochman
We consider a line network of nodes, connected by additive white noise channels, equipped with local feedback. We study the velocity at which information spreads over this network. For transmission of a data packet, we give an explicit positive lower bound on the velocity, for any packet size. Furthermore, we consider streaming, that is, transmission of data packets generated at a given average arrival rate. We show that a positive velocity exists as long as the arrival rate is below the individual Gaussian channel capacity, and provide an explicit lower bound. Our analysis involves applying pulse-amplitude modulation to the data (successively in the streaming case), and using linear mean-squared error estimation at the network nodes. For general white noise, we derive exponential error-probability bounds. For single-packet transmission over channels with (sub-)Gaussian noise, we show a doubly-exponential behavior, which reduces to the celebrated Schalkwijk–Kailath scheme when considering a single node. Viewing the constellation as an “analog source”, we also provide bounds on the exponential decay of the mean-squared error of source transmission over the network.
我们考虑了一个由节点组成的线性网络,该网络通过加性白噪声信道连接,并配有局部反馈。我们研究信息在该网络中的传播速度。对于数据包的传输,无论数据包大小如何,我们都给出了速度的明确正下限。此外,我们还考虑了流式传输,即传输以给定平均到达率生成的数据包。我们证明,只要到达率低于单个高斯信道容量,就存在正速度,并给出了明确的下限。我们的分析包括对数据进行脉冲幅度调制(在流式情况下连续进行),并在网络节点使用线性均方误差估计。对于一般白噪声,我们推导出指数误差概率边界。对于在具有(亚)高斯噪声的信道上进行的单包传输,我们展示了一种双指数行为,当考虑单个节点时,这种行为简化为著名的 Schalkwijk-Kailath 方案。将星座视为 "模拟源",我们还提供了网络上源传输均方误差的指数衰减约束。
{"title":"Information Velocity of Cascaded Gaussian Channels With Feedback","authors":"Elad Domanovitz;Anatoly Khina;Tal Philosof;Yuval Kochman","doi":"10.1109/JSAIT.2024.3416310","DOIUrl":"https://doi.org/10.1109/JSAIT.2024.3416310","url":null,"abstract":"We consider a line network of nodes, connected by additive white noise channels, equipped with local feedback. We study the velocity at which information spreads over this network. For transmission of a data packet, we give an explicit positive lower bound on the velocity, for any packet size. Furthermore, we consider streaming, that is, transmission of data packets generated at a given average arrival rate. We show that a positive velocity exists as long as the arrival rate is below the individual Gaussian channel capacity, and provide an explicit lower bound. Our analysis involves applying pulse-amplitude modulation to the data (successively in the streaming case), and using linear mean-squared error estimation at the network nodes. For general white noise, we derive exponential error-probability bounds. For single-packet transmission over channels with (sub-)Gaussian noise, we show a doubly-exponential behavior, which reduces to the celebrated Schalkwijk–Kailath scheme when considering a single node. Viewing the constellation as an “analog source”, we also provide bounds on the exponential decay of the mean-squared error of source transmission over the network.","PeriodicalId":73295,"journal":{"name":"IEEE journal on selected areas in information theory","volume":"5 ","pages":"554-569"},"PeriodicalIF":0.0,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141980053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE journal on selected areas in information theory
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1