首页 > 最新文献

IEEE journal on selected areas in communications : a publication of the IEEE Communications Society最新文献

英文 中文
ZeTFRi—A Zero Trust-Based Free Rider Detection Framework for Next Generation Federated Learning Networks 基于零信任的下一代联邦学习网络搭便车检测框架
Shehan Edirimannage;Ibrahim Khalil;Charitha Elvitigala;Wathsara Daluwatta;Primal Wijesekera;Albert Y. Zomaya
With the rapid expansion of next-generation networking, Internet of Things (IoT) devices have become central components of federated learning (FL) networks. FL offers a paradigm for distributed training machine learning models while preserving user data privacy. However, existing network security measures often struggle to identify legitimate contributors from opportunistic free riders within these networks. The Free Rider (FR) problem arises when participants seek to benefit from the FL processes without contributing. In particular, free riders are known to exist within or outside of the network, whereas outside free riders can hardly be identified. The Zero Trust model proposes an environment where no entity, including the network itself, is inherently trusted, providing a foundation to counter external threats seeking to exploit the network. This study proposes a novel framework strengthened by the Zero Trust model to identify external free riders in FL networks. Leveraging a Deep Autoencoding Gaussian Mixture Model (DAGMM)-based technique for internal free rider detection, our framework demonstrates superior performance in identifying free riders across various FR scenarios compared to current state-of-the-art solutions. Through our proposed framework and the principles of Zero Trust, we establish a robust security guarantee for FL networks, ensuring the integrity of the learning process.
随着下一代网络的快速发展,物联网(IoT)设备已成为联邦学习(FL)网络的核心组件。FL为分布式训练机器学习模型提供了一个范例,同时保护了用户数据隐私。然而,现有的网络安全措施往往难以从这些网络中的投机搭便车者中识别合法贡献者。当参与者试图在没有贡献的情况下从FL过程中获益时,就会出现搭便车(FR)问题。特别是,网络内外都有搭便车者,而网络外的搭便车者很难被识别。零信任模型提出了一个环境,在这个环境中,包括网络本身在内的任何实体都不受信任,这为反击试图利用网络的外部威胁提供了基础。本研究提出了一种新的框架,通过零信任模型来识别FL网络中的外部搭便车者。利用基于深度自动编码高斯混合模型(DAGMM)的内部搭便车检测技术,与当前最先进的解决方案相比,我们的框架在识别各种FR场景中的搭便车者方面表现出卓越的性能。通过我们提出的框架和零信任原则,我们为FL网络建立了强大的安全保障,确保了学习过程的完整性。
{"title":"ZeTFRi—A Zero Trust-Based Free Rider Detection Framework for Next Generation Federated Learning Networks","authors":"Shehan Edirimannage;Ibrahim Khalil;Charitha Elvitigala;Wathsara Daluwatta;Primal Wijesekera;Albert Y. Zomaya","doi":"10.1109/JSAC.2025.3560013","DOIUrl":"10.1109/JSAC.2025.3560013","url":null,"abstract":"With the rapid expansion of next-generation networking, Internet of Things (IoT) devices have become central components of federated learning (FL) networks. FL offers a paradigm for distributed training machine learning models while preserving user data privacy. However, existing network security measures often struggle to identify legitimate contributors from opportunistic free riders within these networks. The Free Rider (FR) problem arises when participants seek to benefit from the FL processes without contributing. In particular, free riders are known to exist within or outside of the network, whereas outside free riders can hardly be identified. The Zero Trust model proposes an environment where no entity, including the network itself, is inherently trusted, providing a foundation to counter external threats seeking to exploit the network. This study proposes a novel framework strengthened by the Zero Trust model to identify external free riders in FL networks. Leveraging a Deep Autoencoding Gaussian Mixture Model (DAGMM)-based technique for internal free rider detection, our framework demonstrates superior performance in identifying free riders across various FR scenarios compared to current state-of-the-art solutions. Through our proposed framework and the principles of Zero Trust, we establish a robust security guarantee for FL networks, ensuring the integrity of the learning process.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 6","pages":"1938-1953"},"PeriodicalIF":0.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143822605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Digital-Analog Semantic Communications 混合数字模拟语义通信
Huiqiang Xie;Zhijin Qin;Zhu Han;Khaled B. Letaief
Digital and analog semantic communications (SemCom) face inherent limitations such as data security concerns in analog SemCom, as well as leveling-off and cliff-edge effects in digital SemCom. In order to overcome these challenges, we propose a novel SemCom framework and a corresponding system called HDA-DeepSC, which leverages a hybrid digital-analog approach for multimedia transmission. This is achieved through the introduction of analog-digital allocation and fusion modules. To strike a balance between data rate and distortion, we design new loss functions that take into account long-distance dependencies in the semantic distortion constraint, essential information recovery in the channel distortion constraint, and optimal bit stream generation in the rate constraint. Additionally, we propose denoising diffusion-based signal detection techniques, which involve carefully designed variance schedules and sampling algorithms to refine transmitted signals. Through extensive numerical experiments, we will demonstrate that HDA-DeepSC exhibits robustness to channel variations and is capable of supporting various communication scenarios. Our proposed framework outperforms existing benchmarks in terms of peak signal-to-noise ratio and multi-scale structural similarity, showcasing its superiority in semantic communication quality.
数字和模拟语义通信(SemCom)面临着固有的限制,例如模拟SemCom中的数据安全问题,以及数字SemCom中的平稳和悬崖边缘效应。为了克服这些挑战,我们提出了一种新的SemCom框架和相应的系统,称为HDA-DeepSC,它利用混合数字模拟方法进行多媒体传输。这是通过引入模拟数字分配和融合模块来实现的。为了在数据速率和失真之间取得平衡,我们设计了新的损失函数,该函数在语义失真约束中考虑了长距离依赖,在信道失真约束中考虑了必要信息恢复,在速率约束中考虑了最优比特流生成。此外,我们提出了基于扩散的去噪信号检测技术,该技术涉及精心设计的方差表和采样算法,以细化传输信号。通过广泛的数值实验,我们将证明HDA-DeepSC对信道变化具有鲁棒性,并且能够支持各种通信场景。我们提出的框架在峰值信噪比和多尺度结构相似度方面优于现有基准,显示了其在语义通信质量方面的优势。
{"title":"Hybrid Digital-Analog Semantic Communications","authors":"Huiqiang Xie;Zhijin Qin;Zhu Han;Khaled B. Letaief","doi":"10.1109/JSAC.2025.3559149","DOIUrl":"10.1109/JSAC.2025.3559149","url":null,"abstract":"Digital and analog semantic communications (SemCom) face inherent limitations such as data security concerns in analog SemCom, as well as leveling-off and cliff-edge effects in digital SemCom. In order to overcome these challenges, we propose a novel SemCom framework and a corresponding system called HDA-DeepSC, which leverages a hybrid digital-analog approach for multimedia transmission. This is achieved through the introduction of analog-digital allocation and fusion modules. To strike a balance between data rate and distortion, we design new loss functions that take into account long-distance dependencies in the semantic distortion constraint, essential information recovery in the channel distortion constraint, and optimal bit stream generation in the rate constraint. Additionally, we propose denoising diffusion-based signal detection techniques, which involve carefully designed variance schedules and sampling algorithms to refine transmitted signals. Through extensive numerical experiments, we will demonstrate that HDA-DeepSC exhibits robustness to channel variations and is capable of supporting various communication scenarios. Our proposed framework outperforms existing benchmarks in terms of peak signal-to-noise ratio and multi-scale structural similarity, showcasing its superiority in semantic communication quality.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2478-2492"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10960639","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Layered Randomized Quantization for Communication-Efficient and Privacy-Preserving Distributed Learning 基于分层随机量化的高效通信和隐私保护分布式学习
Guangfeng Yan;Tan Li;Kui Wu;Linqi Song
In distributed learning systems, ensuring efficient communication and privacy protection are two significant challenges. Although several existing works have attempted to address these challenges simultaneously, they often overlook essential learning-oriented features such as dynamic gradient and communication characteristics. In this paper, we propose a communication-efficient and privacy-preserving distributed SGD algorithm. Our proposed algorithm employs a layered randomized quantizer (LRQ) to reduce communication overhead, which also ensures that quantization errors follow an exact Gaussian distribution, thus achieving client-level differential privacy. We analyze the trade-off between convergence error, communication, and privacy under non-IID data distributions. Besides, we modify the algorithm to be training-adaptive by adjusting the per-round privacy budget allocation in response to i) dynamic gradient features and ii) real-time changing communication rounds. Both closed-form solutions are derived by solving the minimization problem of convergence error subject to the privacy budget constraint. Finally, we evaluate the effectiveness of our approach through extensive experiments on various datasets, including MNIST, CIFAR-10, and CIFAR-100, demonstrating its superiority in terms of communication cost, privacy protection, and model performance compared to state-of-the-art methods.
在分布式学习系统中,确保有效的通信和隐私保护是两个重要的挑战。虽然一些现有的作品试图同时解决这些挑战,但它们往往忽略了基本的学习导向特征,如动态梯度和交流特征。本文提出了一种通信高效、隐私保护的分布式SGD算法。我们提出的算法采用分层随机量化器(LRQ)来减少通信开销,这也确保量化误差遵循精确的高斯分布,从而实现客户端级差分隐私。我们分析了在非iid数据分布下收敛误差、通信和隐私之间的权衡。此外,我们通过调整每轮隐私预算分配来响应i)动态梯度特征和ii)实时变化的通信回合,将算法修改为训练自适应。通过求解隐私预算约束下的收敛误差最小化问题,推导出两种封闭解。最后,我们通过在各种数据集(包括MNIST、CIFAR-10和CIFAR-100)上进行的大量实验来评估我们方法的有效性,与最先进的方法相比,证明了其在通信成本、隐私保护和模型性能方面的优势。
{"title":"Layered Randomized Quantization for Communication-Efficient and Privacy-Preserving Distributed Learning","authors":"Guangfeng Yan;Tan Li;Kui Wu;Linqi Song","doi":"10.1109/JSAC.2025.3559136","DOIUrl":"10.1109/JSAC.2025.3559136","url":null,"abstract":"In distributed learning systems, ensuring efficient communication and privacy protection are two significant challenges. Although several existing works have attempted to address these challenges simultaneously, they often overlook essential learning-oriented features such as dynamic gradient and communication characteristics. In this paper, we propose a communication-efficient and privacy-preserving distributed SGD algorithm. Our proposed algorithm employs a layered randomized quantizer (LRQ) to reduce communication overhead, which also ensures that quantization errors follow an exact Gaussian distribution, thus achieving client-level differential privacy. We analyze the trade-off between convergence error, communication, and privacy under non-IID data distributions. Besides, we modify the algorithm to be training-adaptive by adjusting the per-round privacy budget allocation in response to i) dynamic gradient features and ii) real-time changing communication rounds. Both closed-form solutions are derived by solving the minimization problem of convergence error subject to the privacy budget constraint. Finally, we evaluate the effectiveness of our approach through extensive experiments on various datasets, including MNIST, CIFAR-10, and CIFAR-100, demonstrating its superiority in terms of communication cost, privacy protection, and model performance compared to state-of-the-art methods.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2684-2699"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Universal Joint Source-Channel Coding for Modulation-Agnostic Semantic Communication 调制不可知语义通信的通用联合源信道编码
Yoon Huh;Hyowoon Seo;Wan Choi
From the perspective of joint source-channel coding (JSCC), there has been significant research on utilizing semantic communication, which inherently possesses analog characteristics, within digital device environments. However, a single-model approach that operates modulation-agnostically across various digital modulation orders has not yet been established. This article presents the first attempt at such an approach by proposing a universal joint source-channel coding (uJSCC) system that utilizes a single-model encoder-decoder pair and trained vector quantization (VQ) codebooks. To support various modulation orders within a single model, the operation of every neural network (NN)-based module in the uJSCC system requires the selection of modulation orders according to signal-to-noise ratio (SNR) boundaries. To address the challenge of unequal output statistics from shared parameters across NN layers, we integrate multiple batch normalization (BN) layers, selected based on modulation order, after each NN layer. This integration occurs with minimal impact on the overall model size. Through a comprehensive series of experiments, we validate that the modulation-agnostic semantic communication framework demonstrates superiority over existing digital semantic communication approaches in terms of model complexity, communication efficiency, and task effectiveness.
从联合源信道编码(JSCC)的角度来看,在数字设备环境中利用固有的具有模拟特性的语义通信已经有了大量的研究。然而,尚未建立一种跨各种数字调制阶的调制不可知操作的单模型方法。本文提出了一种通用联合源信道编码(uJSCC)系统,该系统利用单模型编码器-解码器对和训练矢量量化(VQ)码本,提出了这种方法的第一次尝试。为了在单一模型中支持多种调制顺序,uJSCC系统中每个基于神经网络(NN)的模块的运行都需要根据信噪比(SNR)边界选择调制顺序。为了解决跨神经网络层共享参数的不相等输出统计的挑战,我们在每个神经网络层之后集成了多个基于调制顺序选择的批处理归一化(BN)层。这种集成对整体模型大小的影响最小。通过一系列全面的实验,我们验证了调制无关语义通信框架在模型复杂性,通信效率和任务有效性方面优于现有的数字语义通信方法。
{"title":"Universal Joint Source-Channel Coding for Modulation-Agnostic Semantic Communication","authors":"Yoon Huh;Hyowoon Seo;Wan Choi","doi":"10.1109/JSAC.2025.3559138","DOIUrl":"10.1109/JSAC.2025.3559138","url":null,"abstract":"From the perspective of joint source-channel coding (JSCC), there has been significant research on utilizing semantic communication, which inherently possesses analog characteristics, within digital device environments. However, a single-model approach that operates modulation-agnostically across various digital modulation orders has not yet been established. This article presents the first attempt at such an approach by proposing a universal joint source-channel coding (uJSCC) system that utilizes a single-model encoder-decoder pair and trained vector quantization (VQ) codebooks. To support various modulation orders within a single model, the operation of every neural network (NN)-based module in the uJSCC system requires the selection of modulation orders according to signal-to-noise ratio (SNR) boundaries. To address the challenge of unequal output statistics from shared parameters across NN layers, we integrate multiple batch normalization (BN) layers, selected based on modulation order, after each NN layer. This integration occurs with minimal impact on the overall model size. Through a comprehensive series of experiments, we validate that the modulation-agnostic semantic communication framework demonstrates superiority over existing digital semantic communication approaches in terms of model complexity, communication efficiency, and task effectiveness.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2560-2574"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming With Edge Computing VARFVV:基于边缘计算的自适应实时交互式自由视频流
Qiang Hu;Qihan He;Houqiang Zhong;Guo Lu;Xiaoyun Zhang;Guangtao Zhai;Yanfeng Wang
Free-view video (FVV) allows users to explore immersive video content from multiple views. However, delivering FVV poses significant challenges due to the uncertainty in view switching, combined with the substantial bandwidth and computational resources required to transmit and decode multiple video streams, which may result in frequent playback interruptions. Existing approaches, either client-based or cloud-based, struggle to meet high Quality of Experience (QoE) requirements under limited bandwidth and computational resources. To address these issues, we propose VARFVV, a bandwidth- and computationally-efficient system that enables real-time interactive FVV streaming with high QoE and low switching delay. Specifically, VARFVV introduces a low-complexity FVV generation scheme that reassembles multiview video frames at the edge server based on user-selected view tracks, eliminating the need for transcoding and significantly reducing computational overhead. This design makes it well-suited for large-scale, mobile-based UHD FVV experiences. Furthermore, we present a popularity-adaptive bit allocation method, leveraging a graph neural network, that predicts view popularity and dynamically adjusts bit allocation to maximize QoE within bandwidth constraints. We also construct an FVV dataset comprising 330 videos from 10 scenes, including basketball, opera, etc. Extensive experiments show that VARFVV surpasses existing methods in video quality, switching latency, computational efficiency, and bandwidth usage, supporting over 500 users on a single edge server with a switching delay of 71.5ms. Our code and dataset are available at https://github.com/qianghu-huber/VARFVV
自由观看视频(FVV)允许用户从多个角度探索沉浸式视频内容。然而,由于视图切换的不确定性,再加上传输和解码多个视频流所需的大量带宽和计算资源,传输FVV带来了巨大的挑战,这可能导致频繁的播放中断。现有的方法,无论是基于客户机的还是基于云的,都难以在有限的带宽和计算资源下满足高质量的体验(QoE)需求。为了解决这些问题,我们提出了VARFVV,这是一种带宽和计算效率高的系统,可以实现具有高QoE和低切换延迟的实时交互式FVV流。具体来说,VARFVV引入了一种低复杂性的FVV生成方案,该方案基于用户选择的视图轨道在边缘服务器上重新组装多视图视频帧,从而消除了转码的需要并显着降低了计算开销。这种设计使其非常适合大规模,基于移动的UHD FVV体验。此外,我们提出了一种流行度自适应比特分配方法,利用图神经网络,预测视图流行度并动态调整比特分配,以在带宽约束下最大化QoE。我们还构建了一个包含330个视频的FVV数据集,这些视频来自10个场景,包括篮球、歌剧等。大量实验表明,VARFVV在视频质量、切换延迟、计算效率和带宽使用方面都优于现有方法,在单个边缘服务器上支持500多个用户,切换延迟为71.5ms。我们的代码和数据集可在https://github.com/qianghu-huber/VARFVV上获得
{"title":"VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming With Edge Computing","authors":"Qiang Hu;Qihan He;Houqiang Zhong;Guo Lu;Xiaoyun Zhang;Guangtao Zhai;Yanfeng Wang","doi":"10.1109/JSAC.2025.3559140","DOIUrl":"10.1109/JSAC.2025.3559140","url":null,"abstract":"Free-view video (FVV) allows users to explore immersive video content from multiple views. However, delivering FVV poses significant challenges due to the uncertainty in view switching, combined with the substantial bandwidth and computational resources required to transmit and decode multiple video streams, which may result in frequent playback interruptions. Existing approaches, either client-based or cloud-based, struggle to meet high Quality of Experience (QoE) requirements under limited bandwidth and computational resources. To address these issues, we propose VARFVV, a bandwidth- and computationally-efficient system that enables real-time interactive FVV streaming with high QoE and low switching delay. Specifically, VARFVV introduces a low-complexity FVV generation scheme that reassembles multiview video frames at the edge server based on user-selected view tracks, eliminating the need for transcoding and significantly reducing computational overhead. This design makes it well-suited for large-scale, mobile-based UHD FVV experiences. Furthermore, we present a popularity-adaptive bit allocation method, leveraging a graph neural network, that predicts view popularity and dynamically adjusts bit allocation to maximize QoE within bandwidth constraints. We also construct an FVV dataset comprising 330 videos from 10 scenes, including basketball, opera, etc. Extensive experiments show that VARFVV surpasses existing methods in video quality, switching latency, computational efficiency, and bandwidth usage, supporting over 500 users on a single edge server with a switching delay of 71.5ms. Our code and dataset are available at <uri>https://github.com/qianghu-huber/VARFVV</uri>","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2620-2634"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple-Masks Error Correction Code Transformer for Short Block Codes 短分组码的多掩码纠错码变压器
Seong-Joon Park;Hee-Youl Kwak;Sang-Hyo Kim;Sunghwan Kim;Yongjune Kim;Jong-Seon No
With the broadening applications of deep learning, neural decoders have emerged as a key research focus, specifically aimed at improving the decoding performance of conventional decoding algorithms. In particular, error correction code transformer (ECCT), which utilizes the transformer architecture, has achieved state-of-the-art performance among neural network-based decoders. We present three technical contributions to significantly enhance the performance of ECCT. First, we propose a novel transformer architecture of ECCT, termed the multiple-masks ECCT (MM ECCT). We employ multiple masked self-attention blocks with different mask matrices in a parallel manner to learn diverse relationships among the codeword bits. Second, we discover that constructing mask matrices based on systematic parity check matrices (PCMs) can make the attention maps sparse, which not only enhances the decoding performance but also reduces computational complexity. Finally, we propose using complementary mask matrices derived from cyclic permutations of the systematic PCM. These complementary mask matrices are specifically designed to enhance the decoding of cyclic codes. Our extensive simulation results show that the proposed MM ECCT architecture with carefully designed mask matrices outperforms the original ECCT by a large margin, achieving state-of-the-art decoding performance among neural decoders. The source code is available at https://github.com/iil-postech/mm-ecct.
随着深度学习应用的不断扩大,神经解码器已经成为一个重要的研究热点,专门针对提高传统解码算法的解码性能。特别是,利用变压器结构的纠错码变压器(ECCT)在基于神经网络的解码器中取得了最先进的性能。我们提出了三个技术贡献,以显著提高ECCT的性能。首先,我们提出了一种新的变压器ECCT结构,称为多掩模ECCT (MM ECCT)。我们以并行的方式使用多个具有不同掩码矩阵的掩码自注意块来学习码字位之间的各种关系。其次,我们发现基于系统奇偶校验矩阵构造掩码矩阵可以使注意映射稀疏化,这不仅提高了解码性能,而且降低了计算复杂度。最后,我们建议使用由系统PCM的循环置换导出的互补掩模矩阵。这些互补掩码矩阵是专门设计来增强循环码的解码。我们广泛的仿真结果表明,采用精心设计的掩模矩阵的MM ECCT架构在很大程度上优于原始ECCT,在神经解码器中实现了最先进的解码性能。源代码可从https://github.com/iil-postech/mm-ecct获得。
{"title":"Multiple-Masks Error Correction Code Transformer for Short Block Codes","authors":"Seong-Joon Park;Hee-Youl Kwak;Sang-Hyo Kim;Sunghwan Kim;Yongjune Kim;Jong-Seon No","doi":"10.1109/JSAC.2025.3559154","DOIUrl":"10.1109/JSAC.2025.3559154","url":null,"abstract":"With the broadening applications of deep learning, neural decoders have emerged as a key research focus, specifically aimed at improving the decoding performance of conventional decoding algorithms. In particular, error correction code transformer (ECCT), which utilizes the transformer architecture, has achieved state-of-the-art performance among neural network-based decoders. We present three technical contributions to significantly enhance the performance of ECCT. First, we propose a novel transformer architecture of ECCT, termed the <italic>multiple-masks ECCT (MM ECCT)</i>. We employ multiple masked self-attention blocks with different mask matrices in a parallel manner to learn diverse relationships among the codeword bits. Second, we discover that constructing mask matrices based on systematic parity check matrices (PCMs) can make the attention maps <italic>sparse</i>, which not only enhances the decoding performance but also reduces computational complexity. Finally, we propose using complementary mask matrices derived from cyclic permutations of the systematic PCM. These complementary mask matrices are specifically designed to enhance the decoding of cyclic codes. Our extensive simulation results show that the proposed MM ECCT architecture with carefully designed mask matrices outperforms the original ECCT by a large margin, achieving state-of-the-art decoding performance among neural decoders. The source code is available at <uri>https://github.com/iil-postech/mm-ecct</uri>.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2518-2529"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Regression With Ensembles Communicating Over Noisy Channels 集成在噪声信道上通信的鲁棒回归
Yuval Ben-Hur;Yuval Cassuto
As machine-learning models grow in size, their implementation requirements cannot be met by a single computer system. This observation motivates distributed settings, in which intermediate computations are performed across a network of processing units, while the central node only aggregates their outputs. However, distributing inference tasks across low-precision or faulty edge devices, operating over a network of noisy communication channels, gives rise to serious reliability challenges. We study the problem of an ensemble of devices, implementing regression algorithms, that communicate through additive noisy channels in order to collaboratively perform a joint regression task. We define the problem formally, and develop methods for optimizing the aggregation coefficients for the parameters of the noise in the channels, which can potentially be correlated. Our results apply to the leading state-of-the-art ensemble regression methods: bagging and gradient boosting. We demonstrate the effectiveness of our algorithms on both synthetic and real-world datasets.
随着机器学习模型规模的增长,单个计算机系统无法满足其实现需求。这种观察激发了分布式设置,其中中间计算跨处理单元网络执行,而中心节点仅汇总它们的输出。然而,将推理任务分布在低精度或故障边缘设备上,在噪声通信信道网络上运行,会产生严重的可靠性挑战。我们研究了一个集成设备的问题,实现回归算法,通过加性噪声信道进行通信,以便协同执行联合回归任务。我们正式定义了这个问题,并开发了优化通道中噪声参数聚集系数的方法,这些参数可能是相关的。我们的结果适用于领先的最先进的集合回归方法:bagging和梯度提升。我们证明了我们的算法在合成和现实世界数据集上的有效性。
{"title":"Robust Regression With Ensembles Communicating Over Noisy Channels","authors":"Yuval Ben-Hur;Yuval Cassuto","doi":"10.1109/JSAC.2025.3559150","DOIUrl":"10.1109/JSAC.2025.3559150","url":null,"abstract":"As machine-learning models grow in size, their implementation requirements cannot be met by a single computer system. This observation motivates distributed settings, in which intermediate computations are performed across a network of processing units, while the central node only aggregates their outputs. However, distributing inference tasks across low-precision or faulty edge devices, operating over a network of noisy communication channels, gives rise to serious reliability challenges. We study the problem of an ensemble of devices, implementing regression algorithms, that communicate through additive noisy channels in order to collaboratively perform a joint regression task. We define the problem formally, and develop methods for optimizing the aggregation coefficients for the parameters of the noise in the channels, which can potentially be correlated. Our results apply to the leading state-of-the-art ensemble regression methods: bagging and gradient boosting. We demonstrate the effectiveness of our algorithms on both synthetic and real-world datasets.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2714-2727"},"PeriodicalIF":0.0,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Mathematical Theory for Learning Semantic Languages by Abstract Learners 抽象学习者学习语义语言的数学理论
Kuo-Yu Liao;Cheng-Shang Chang;Y.-W. Peter Hong
Recent advances in Large Language Models (LLMs) have demonstrated the emergence of capabilities (learned skills) when the number of system parameters and the size of training data surpass certain thresholds. The exact mechanisms behind such phenomena are not fully understood and remain a topic of active research. Inspired by the skill-text bipartite graph model proposed by Arora and Goyal for modeling semantic languages, we develop a mathematical theory to explain the emergence of learned skills, taking the learning (or training) process into account. Our approach models the learning process for skills in the skill-text bipartite graph as an iterative decoding process in Low-Density Parity Check (LDPC) codes and Irregular Repetition Slotted ALOHA (IRSA). Using density evolution analysis, we demonstrate the emergence of learned skills when the ratio of the number of training texts to the number of skills exceeds a certain threshold. Our analysis also yields a scaling law for testing errors relative to this ratio. Upon completion of the training, the association of learned skills can also be acquired to form a skill association graph. We use site percolation analysis to derive the conditions for the existence of a giant component in the skill association graph. Our analysis can also be extended to the setting with a hierarchy of skills, where a fine-tuned model is built upon a foundation model. It is also applicable to the setting with multiple classes of skills and texts. As an important application, we propose a method for semantic compression and discuss its connections to semantic communication.
大型语言模型(llm)的最新进展表明,当系统参数的数量和训练数据的大小超过一定的阈值时,能力(学习技能)就会出现。这种现象背后的确切机制尚不完全清楚,仍然是一个积极研究的主题。受Arora和Goyal提出的用于语义语言建模的技能-文本二部图模型的启发,我们开发了一种数学理论来解释学习技能的出现,并考虑到学习(或训练)过程。我们的方法将技能-文本二部图中的技能学习过程建模为低密度奇偶校验(LDPC)码和不规则重复开槽式ALOHA (IRSA)码的迭代解码过程。利用密度演化分析,我们证明了当训练文本的数量与技能数量的比例超过一定阈值时,习得技能的出现。我们的分析还得出了与该比率相关的测试误差的标度定律。训练完成后,还可以获得所学技能的关联,形成技能关联图。我们使用站点渗透分析来推导技能关联图中存在巨大组件的条件。我们的分析还可以扩展到具有技能层次结构的设置,其中一个微调模型是建立在基础模型之上的。它也适用于多类技能和文本的设置。作为一个重要的应用,我们提出了一种语义压缩方法,并讨论了它与语义通信的联系。
{"title":"A Mathematical Theory for Learning Semantic Languages by Abstract Learners","authors":"Kuo-Yu Liao;Cheng-Shang Chang;Y.-W. Peter Hong","doi":"10.1109/JSAC.2025.3559118","DOIUrl":"10.1109/JSAC.2025.3559118","url":null,"abstract":"Recent advances in Large Language Models (LLMs) have demonstrated the emergence of capabilities (learned skills) when the number of system parameters and the size of training data surpass certain thresholds. The exact mechanisms behind such phenomena are not fully understood and remain a topic of active research. Inspired by the skill-text bipartite graph model proposed by Arora and Goyal for modeling semantic languages, we develop a mathematical theory to explain the emergence of learned skills, taking the learning (or training) process into account. Our approach models the learning process for skills in the skill-text bipartite graph as an iterative decoding process in Low-Density Parity Check (LDPC) codes and Irregular Repetition Slotted ALOHA (IRSA). Using density evolution analysis, we demonstrate the emergence of learned skills when the ratio of the number of training texts to the number of skills exceeds a certain threshold. Our analysis also yields a scaling law for testing errors relative to this ratio. Upon completion of the training, the association of learned skills can also be acquired to form a skill association graph. We use site percolation analysis to derive the conditions for the existence of a giant component in the skill association graph. Our analysis can also be extended to the setting with a hierarchy of skills, where a fine-tuned model is built upon a foundation model. It is also applicable to the setting with multiple classes of skills and texts. As an important application, we propose a method for semantic compression and discuss its connections to semantic communication.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2700-2713"},"PeriodicalIF":0.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143813822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel-Optimized Strategic Quantization 渠道优化战略量化
Anju Anand;Emrah Akyol
This paper studies a quantization problem between an encoder and a decoder with misaligned objectives, where the quantization indices are transmitted over a noisy channel. Building on the prior results on the non-strategic counterpart of this problem, we characterize the encoding and decoding strategies and expected encoder and decoder distortions at the Stackelberg equilibrium, where the encoder is the leader, and the decoder is the follower. On the design side, we extend the gradient-descent-based solution framework developed for the noiseless setting to this noisy communication scenario, combined with uniformly randomized index mapping. We finally present numerical simulation results to demonstrate the efficacy of the proposed approach. The MATLAB codes associated with the design and evaluation of the proposed algorithm are provided at: https://github.com/strategic-quantization/channel-optimized-strategic-quantizer.
本文研究了目标不对准的编码器和解码器之间的量化问题,其中量化指标在噪声信道上传输。在此问题的非策略对应的先前结果的基础上,我们描述了编码和解码策略以及在Stackelberg均衡下的预期编码器和解码器扭曲,其中编码器是领导者,解码器是追随者。在设计方面,我们将为无噪声环境开发的基于梯度下降的解决方案框架扩展到这种有噪声的通信场景,并结合统一的随机索引映射。最后给出了数值模拟结果来验证所提方法的有效性。与所提出的算法的设计和评估相关的MATLAB代码提供在:https://github.com/strategic-quantization/channel-optimized-strategic-quantizer。
{"title":"Channel-Optimized Strategic Quantization","authors":"Anju Anand;Emrah Akyol","doi":"10.1109/JSAC.2025.3559117","DOIUrl":"10.1109/JSAC.2025.3559117","url":null,"abstract":"This paper studies a quantization problem between an encoder and a decoder with misaligned objectives, where the quantization indices are transmitted over a noisy channel. Building on the prior results on the non-strategic counterpart of this problem, we characterize the encoding and decoding strategies and expected encoder and decoder distortions at the Stackelberg equilibrium, where the encoder is the leader, and the decoder is the follower. On the design side, we extend the gradient-descent-based solution framework developed for the noiseless setting to this noisy communication scenario, combined with uniformly randomized index mapping. We finally present numerical simulation results to demonstrate the efficacy of the proposed approach. The MATLAB codes associated with the design and evaluation of the proposed algorithm are provided at: <uri>https://github.com/strategic-quantization/channel-optimized-strategic-quantizer</uri>.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2530-2542"},"PeriodicalIF":0.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143813546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Task-Oriented Lossy Compression With Data, Perception, and Classification Constraints 具有数据、感知和分类约束的面向任务的有损压缩
Yuhan Wang;Youlong Wu;Shuai Ma;Ying-Jun Angela Zhang
By extracting task-relevant information while maximally compressing the input, the information bottleneck (IB) principle has provided a guideline for learning effective and robust representations of the target inference. However, extending the idea to the multi-task learning scenario with joint consideration of generative tasks and traditional reconstruction tasks remains unexplored. This paper addresses this gap by reconsidering the lossy compression problem with diverse constraints on data reconstruction, perceptual quality, and classification accuracy. Firstly, we study two ternary relationships, namely, the rate-distortion-classification (RDC) and rate-perception-classification (RPC). For both RDC and RPC functions, we derive the closed-form expressions of the optimal rate for binary and Gaussian sources. These new results complement the IB principle and provide insights into effectively extracting task-oriented information to fulfill diverse objectives. Secondly, unlike prior research demonstrating a tradeoff between classification and perception in signal restoration problems, we prove that such a tradeoff does not exist in the RPC function and reveal that the source noise plays a decisive role in the classification-perception tradeoff. Finally, we implement a deep-learning-based image compression framework, incorporating multiple tasks related to distortion, perception, and classification. The experimental results coincide with the theoretical analysis and verify the effectiveness of our generalized IB in balancing various task objectives.
通过在最大限度地压缩输入的同时提取任务相关信息,信息瓶颈(IB)原则为学习目标推理的有效和鲁棒表示提供了指导。然而,将这一想法扩展到多任务学习场景中,同时考虑生成任务和传统重建任务,仍未得到探索。本文通过重新考虑对数据重建、感知质量和分类精度具有不同约束的有损压缩问题来解决这一差距。首先,我们研究了两种三元关系,即速率扭曲分类(RDC)和速率感知分类(RPC)。对于RDC和RPC函数,我们导出了二进制和高斯源的最优速率的封闭表达式。这些新结果补充了IB原则,并为有效提取面向任务的信息以实现不同目标提供了见解。其次,与先前研究表明在信号恢复问题中分类和感知之间存在权衡不同,我们证明了RPC函数中不存在这种权衡,并揭示了源噪声在分类-感知权衡中起决定性作用。最后,我们实现了一个基于深度学习的图像压缩框架,结合了与失真、感知和分类相关的多个任务。实验结果与理论分析一致,验证了我们的广义IB在平衡各种任务目标方面的有效性。
{"title":"Task-Oriented Lossy Compression With Data, Perception, and Classification Constraints","authors":"Yuhan Wang;Youlong Wu;Shuai Ma;Ying-Jun Angela Zhang","doi":"10.1109/JSAC.2025.3559164","DOIUrl":"10.1109/JSAC.2025.3559164","url":null,"abstract":"By extracting task-relevant information while maximally compressing the input, the information bottleneck (IB) principle has provided a guideline for learning effective and robust representations of the target inference. However, extending the idea to the multi-task learning scenario with joint consideration of generative tasks and traditional reconstruction tasks remains unexplored. This paper addresses this gap by reconsidering the lossy compression problem with diverse constraints on data reconstruction, perceptual quality, and classification accuracy. Firstly, we study two ternary relationships, namely, the <italic>rate-distortion-classification (RDC)</i> and <italic>rate-perception-classification (RPC)</i>. For both RDC and RPC functions, we derive the <italic>closed-form expressions</i> of the optimal rate for binary and Gaussian sources. These new results complement the IB principle and provide insights into effectively extracting task-oriented information to fulfill diverse objectives. Secondly, unlike prior research demonstrating a tradeoff between classification and perception in signal restoration problems, we prove that such a tradeoff does not exist in the RPC function and reveal that the source noise plays a decisive role in the classification-perception tradeoff. Finally, we implement a deep-learning-based image compression framework, incorporating multiple tasks related to distortion, perception, and classification. The experimental results coincide with the theoretical analysis and verify the effectiveness of our generalized IB in balancing various task objectives.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 7","pages":"2635-2650"},"PeriodicalIF":0.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143813481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE journal on selected areas in communications : a publication of the IEEE Communications Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1