首页 > 最新文献

IEEE Transactions on Broadcasting最新文献

英文 中文
Using Deep Reinforcement Learning (DRL) to Optimize Quality in 360-Degree Video Tile Management 使用深度强化学习(DRL)优化360度视频贴图管理的质量
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-11 DOI: 10.1109/TBC.2025.3541860
Chunguang Li;Dayoung Lee;Minseok Song
360-degree videos inherently require significant storage space because each segment consists of many tiles, each of which is further transcoded and stored in multiple versions. It is thus impractical to store all transcoded versions, which makes it essential to make effective use of limited storage space. However, the inefficiency of existing heuristic-based management schemes arises from the challenge of incorporating various factors, such as variable bandwidth requirements influenced by network conditions, tile access distribution, and video quality dependent on content. To address this, we propose a new storage space management scheme, which combines the dueling deep Q-network (DQN) algorithm based on the field-of-view (FoV) distribution and the greedy algorithm that considers the overall video popularity. We first model an environment in which the agent can determine the versions for each tile to achieve the best video quality under various storage limit conditions. The dueling DQN environment comprises 1) an action space determining version combinations for each tile within specified storage limits, 2) an observation space enabling the agent to learn variable bandwidths and tile access distributions, and 3) a reward model deriving the expected video quality for different actions. Building upon the dueling DQN model correlating storage limits with expected video quality, we present a greedy algorithm that selects versions among multiple videos within storage limits for the purpose of maximizing popularity-weighted video quality. Extensive simulations evaluated the proposed scheme under various storage limits, bandwidth changes, and FoV distributions, demonstrating an improvement in overall popularity-weighted video quality ranging from 0.49% to 37.77% (with an average improvement of 13.96%) compared to existing benchmark schemes.
360度视频本质上需要大量的存储空间,因为每个片段由许多块组成,每个块进一步转编码并存储为多个版本。因此,存储所有转码版本是不切实际的,因此必须有效利用有限的存储空间。然而,现有的基于启发式的管理方案的低效率源于整合各种因素的挑战,例如受网络条件影响的可变带宽需求、tile访问分布和依赖于内容的视频质量。为了解决这个问题,我们提出了一种新的存储空间管理方案,该方案结合了基于视场分布的决斗深度q -网络(DQN)算法和考虑整体视频流行度的贪婪算法。我们首先建立了一个环境模型,在这个环境中,智能体可以确定每个贴图的版本,以在各种存储限制条件下获得最佳的视频质量。决斗DQN环境包括:1)决定在指定存储限制内每个贴图版本组合的动作空间,2)使智能体能够学习可变带宽和贴图访问分布的观察空间,以及3)派生不同动作的预期视频质量的奖励模型。在将存储限制与期望视频质量相关联的决斗DQN模型的基础上,我们提出了一种贪婪算法,该算法在存储限制内的多个视频中选择版本,以最大化流行加权视频质量。大量的模拟评估了在各种存储限制、带宽变化和视场分布下提出的方案,表明与现有基准方案相比,总体流行加权视频质量的改善范围从0.49%到37.77%(平均改善13.96%)。
{"title":"Using Deep Reinforcement Learning (DRL) to Optimize Quality in 360-Degree Video Tile Management","authors":"Chunguang Li;Dayoung Lee;Minseok Song","doi":"10.1109/TBC.2025.3541860","DOIUrl":"https://doi.org/10.1109/TBC.2025.3541860","url":null,"abstract":"360-degree videos inherently require significant storage space because each segment consists of many tiles, each of which is further transcoded and stored in multiple versions. It is thus impractical to store all transcoded versions, which makes it essential to make effective use of limited storage space. However, the inefficiency of existing heuristic-based management schemes arises from the challenge of incorporating various factors, such as variable bandwidth requirements influenced by network conditions, tile access distribution, and video quality dependent on content. To address this, we propose a new storage space management scheme, which combines the dueling deep Q-network (DQN) algorithm based on the field-of-view (FoV) distribution and the greedy algorithm that considers the overall video popularity. We first model an environment in which the agent can determine the versions for each tile to achieve the best video quality under various storage limit conditions. The dueling DQN environment comprises 1) an action space determining version combinations for each tile within specified storage limits, 2) an observation space enabling the agent to learn variable bandwidths and tile access distributions, and 3) a reward model deriving the expected video quality for different actions. Building upon the dueling DQN model correlating storage limits with expected video quality, we present a greedy algorithm that selects versions among multiple videos within storage limits for the purpose of maximizing popularity-weighted video quality. Extensive simulations evaluated the proposed scheme under various storage limits, bandwidth changes, and FoV distributions, demonstrating an improvement in overall popularity-weighted video quality ranging from 0.49% to 37.77% (with an average improvement of 13.96%) compared to existing benchmark schemes.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"555-569"},"PeriodicalIF":3.2,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FTBM: A Fault-Tolerant BIER Multicast for MBMS in 5G/B5G Dynamic Edge Networks FTBM: 5G/B5G动态边缘网络中MBMS的容错BIER组播
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-07 DOI: 10.1109/TBC.2025.3541889
Honglin Fang;Peng Yu;Xinxiu Liu;Ying Wang;Wenjing Li;Xuesong Qiu;Zhaowei Qu
The evolution of 5G and Beyond 5G (B5G) networks has intensified the demand for efficient Multimedia Broadcast Multicast Services (MBMS), particularly in dynamic edge environments. The frequent alterations in network topology and multicast group configurations in these environments present substantial scalability challenges for traditional IP MultiCast (IPMC) mechanisms. Bit Index Explicit Replication (BIER) offers a stateless IPMC alternative that mitigates the limitations of traditional IPMC mechanisms. However, it still encounters fault tolerance issues in dynamic edge networks, where link faults occur frequently. This paper propose a Fault-Tolerant BIER Multicast (FTBM) mechanism specifically designed for MBMS in dynamic edge networks. FTBM optimizes BIER multicast paths by employing Multi-Agent Deep Reinforcement Learning (MADRL) to minimize transmission delays while addressing constraints such as random link faults, limited queue capacity, and forwarding restrictions. Extensive simulations demonstrate that FTBM significantly enhances multicast performance under varying traffic loads and dense fault conditions, leading to improved transmission efficiency and network load balancing. This work provides a resilient and scalable solution for next-generation MBMS in dynamic network environments.
5G和超5G (B5G)网络的发展加剧了对高效多媒体广播多播服务(MBMS)的需求,特别是在动态边缘环境中。在这些环境中,网络拓扑结构和组播组配置的频繁变化给传统的IP组播(IPMC)机制带来了巨大的可扩展性挑战。Bit Index Explicit Replication (BIER)提供了一种无状态的IPMC替代方案,减轻了传统IPMC机制的局限性。但是,在链路故障频繁发生的动态边缘网络中,仍然存在容错问题。针对动态边缘网络中的MBMS,提出了一种容错BIER组播(FTBM)机制。FTBM通过使用多智能体深度强化学习(MADRL)来优化BIER组播路径,以最大限度地减少传输延迟,同时解决诸如随机链路故障,有限队列容量和转发限制等约束。大量的仿真结果表明,在不同的流量负载和密集的故障条件下,FTBM可以显著提高组播性能,从而提高传输效率和网络负载均衡。这项工作为动态网络环境下的下一代MBMS提供了弹性和可扩展的解决方案。
{"title":"FTBM: A Fault-Tolerant BIER Multicast for MBMS in 5G/B5G Dynamic Edge Networks","authors":"Honglin Fang;Peng Yu;Xinxiu Liu;Ying Wang;Wenjing Li;Xuesong Qiu;Zhaowei Qu","doi":"10.1109/TBC.2025.3541889","DOIUrl":"https://doi.org/10.1109/TBC.2025.3541889","url":null,"abstract":"The evolution of 5G and Beyond 5G (B5G) networks has intensified the demand for efficient Multimedia Broadcast Multicast Services (MBMS), particularly in dynamic edge environments. The frequent alterations in network topology and multicast group configurations in these environments present substantial scalability challenges for traditional IP MultiCast (IPMC) mechanisms. Bit Index Explicit Replication (BIER) offers a stateless IPMC alternative that mitigates the limitations of traditional IPMC mechanisms. However, it still encounters fault tolerance issues in dynamic edge networks, where link faults occur frequently. This paper propose a Fault-Tolerant BIER Multicast (FTBM) mechanism specifically designed for MBMS in dynamic edge networks. FTBM optimizes BIER multicast paths by employing Multi-Agent Deep Reinforcement Learning (MADRL) to minimize transmission delays while addressing constraints such as random link faults, limited queue capacity, and forwarding restrictions. Extensive simulations demonstrate that FTBM significantly enhances multicast performance under varying traffic loads and dense fault conditions, leading to improved transmission efficiency and network load balancing. This work provides a resilient and scalable solution for next-generation MBMS in dynamic network environments.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"411-425"},"PeriodicalIF":3.2,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Publication Information IEEE广播出版信息汇刊
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-05 DOI: 10.1109/TBC.2025.3542624
{"title":"IEEE Transactions on Broadcasting Publication Information","authors":"","doi":"10.1109/TBC.2025.3542624","DOIUrl":"https://doi.org/10.1109/TBC.2025.3542624","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 1","pages":"C2-C2"},"PeriodicalIF":3.2,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10913473","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143553156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Information for Authors IEEE作者广播信息汇刊
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-05 DOI: 10.1109/TBC.2025.3542626
{"title":"IEEE Transactions on Broadcasting Information for Authors","authors":"","doi":"10.1109/TBC.2025.3542626","DOIUrl":"https://doi.org/10.1109/TBC.2025.3542626","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 1","pages":"C3-C4"},"PeriodicalIF":3.2,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10913472","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143553154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Once-Training-All-Fine: No-Reference Point Cloud Quality Assessment via Domain-Relevance Degradation Description 一次训练全精:基于域相关退化描述的无参考点云质量评估
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-04 DOI: 10.1109/TBC.2025.3541862
Yipeng Liu;Qi Yang;Yujie Zhang;Yiling Xu;Le Yang;Xiaozhong Xu;Shan Liu
The visual quality of point clouds plays a crucial role in the development and broadcasting of immersive media. Therefore, investigating point cloud quality assessment (PCQA) is instrumental in facilitating immersive media applications, including virtual reality and augmented reality applications. Considering reference point clouds are not available in many cases, no-reference (NR) metrics have become a research hotspot. Existing NR methods suffer from difficult training. To address this shortcoming, we propose a novel NR-PCQA method, Point Cloud Quality Assessment via Domain-relevance Degradation Description (D3-PCQA). First, we demonstrate our model’s interpretability by deriving the function of each module using a kernelized ridge regression model. Specifically, quality assessment can be characterized as a leap from the scattered perceptual domain (reflecting subjective perception) to the ordered quality domain (reflecting mean opinion score). Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from the human visual system (HVS), by considering the domain relevance among samples located in the perception domain and learning a structured latent space. The anchor features derived from the learned latent space are generated as cross-domain auxiliary information to promote domain transformation. Furthermore, the newly established description domain decomposes the NR-PCQA problem into two relevant stages. These stages include a classification stage that gives the degradation descriptions to point clouds and a regression stage to determine the confidence degrees of descriptions, providing a semantic explanation for the predicted quality scores. Experimental results demonstrate that D3-PCQA exhibits robust performance and outstanding generalization on several publicly available datasets.
点云的视觉质量在沉浸式媒体的开发和传播中起着至关重要的作用。因此,研究点云质量评估(PCQA)有助于促进沉浸式媒体应用,包括虚拟现实和增强现实应用。由于参考点云在很多情况下是不可用的,因此无参考度量(NR)成为研究热点。现有的NR方法训练困难。为了解决这一问题,我们提出了一种新的NR-PCQA方法,即基于域相关退化描述的点云质量评估(D3-PCQA)。首先,我们通过使用核脊回归模型推导每个模块的函数来证明我们模型的可解释性。具体来说,质量评估可以被描述为从分散的感知域(反映主观感知)到有序的质量域(反映平均意见得分)的飞跃。其次,为了减少显著的域差异,我们基于人类视觉系统(HVS)的洞察力,通过考虑位于感知域的样本之间的域相关性并学习结构化的潜在空间,建立了一个中间域,即描述域。从学习到的潜空间中提取锚点特征作为跨域辅助信息,促进域转换。此外,新建立的描述域将NR-PCQA问题分解为两个相关的阶段。这些阶段包括一个分类阶段,给出对点云的退化描述,以及一个回归阶段,确定描述的置信度,为预测的质量分数提供语义解释。实验结果表明,D3-PCQA在多个公开可用的数据集上表现出鲁棒性和出色的泛化性。
{"title":"Once-Training-All-Fine: No-Reference Point Cloud Quality Assessment via Domain-Relevance Degradation Description","authors":"Yipeng Liu;Qi Yang;Yujie Zhang;Yiling Xu;Le Yang;Xiaozhong Xu;Shan Liu","doi":"10.1109/TBC.2025.3541862","DOIUrl":"https://doi.org/10.1109/TBC.2025.3541862","url":null,"abstract":"The visual quality of point clouds plays a crucial role in the development and broadcasting of immersive media. Therefore, investigating point cloud quality assessment (PCQA) is instrumental in facilitating immersive media applications, including virtual reality and augmented reality applications. Considering reference point clouds are not available in many cases, no-reference (NR) metrics have become a research hotspot. Existing NR methods suffer from difficult training. To address this shortcoming, we propose a novel NR-PCQA method, Point Cloud Quality Assessment via Domain-relevance Degradation Description (D3-PCQA). First, we demonstrate our model’s interpretability by deriving the function of each module using a kernelized ridge regression model. Specifically, quality assessment can be characterized as a leap from the scattered perceptual domain (reflecting subjective perception) to the ordered quality domain (reflecting mean opinion score). Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from the human visual system (HVS), by considering the domain relevance among samples located in the perception domain and learning a structured latent space. The anchor features derived from the learned latent space are generated as cross-domain auxiliary information to promote domain transformation. Furthermore, the newly established description domain decomposes the NR-PCQA problem into two relevant stages. These stages include a classification stage that gives the degradation descriptions to point clouds and a regression stage to determine the confidence degrees of descriptions, providing a semantic explanation for the predicted quality scores. Experimental results demonstrate that D3-PCQA exhibits robust performance and outstanding generalization on several publicly available datasets.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"616-630"},"PeriodicalIF":3.2,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Deep Joint Source-Channel Coding for One-to-Many Wireless Image Transmission 一对多无线图像传输的自适应深度联合源信道编码
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-03-01 DOI: 10.1109/TBC.2025.3559003
Lei Luo;Ziyang He;Junjie Wu;Hongwei Guo;Ce Zhu
Deep learning based joint source-channel coding (DJSCC) has recently made significant progress and emerged as a potential solution for future wireless communication. However, there are still several crucial issues that necessitate further in-depth exploration to enhance the efficiency of DJSCC, such as channel quality adaptability, bandwidth adaptability, and the delicate balance between efficiency and complexity. This work proposes an adaptive deep joint source-channel coding scheme tailored for one-to-many wireless image transmission scenarios (ADMIT). First, to effectively improve transmission performance, neighboring attention is introduced as the backbone for the proposed ADMIT method. Second, a channel quality adaptive module (CQAM) is designed based on multi-scale feature fusion, which seamlessly adapts to fluctuating channel conditions across a wide range of channel signal-to-noise ratios (CSNRs). Third, to be precisely tailored to different bandwidth resources, the channel gained adaptive module (CGAM) dynamically adjusts the significance of individual channels within the latent space, which ensures seamless varying bandwidth accommodation with a single model through bandwidth adaptation and symbol completion. Additionally, to mitigate the imbalance of loss across multiple bandwidth ratios during the training process, the gradient normalization (GradNorm) based training strategy is leveraged to ensure adaptive balancing of loss decreasing. The extensive experimental results demonstrate that the proposed method significantly enhances transmission performance while maintaining relatively low computational complexity. The source codes are available at: https://github.com/llsurreal919/ADMIT.
基于深度学习的联合源信道编码(DJSCC)最近取得了重大进展,并成为未来无线通信的潜在解决方案。然而,要提高DJSCC的效率,仍有几个关键问题需要进一步深入探讨,如信道质量适应性、带宽适应性以及效率与复杂性之间的微妙平衡。本文提出了一种适合一对多无线图像传输场景(ADMIT)的自适应深度联合源信道编码方案。首先,为了有效提高传输性能,在提出的ADMIT方法中引入了相邻注意作为主干。其次,设计了基于多尺度特征融合的信道质量自适应模块(CQAM),该模块能够无缝适应大范围信道信噪比(CSNRs)波动的信道条件。第三,为了精确定制不同的带宽资源,信道增益自适应模块(CGAM)在潜在空间内动态调整单个信道的显著性,通过带宽自适应和符号补全,保证了单一模型对不同带宽的无缝适应。此外,为了减轻训练过程中多个带宽比损失的不平衡,利用基于梯度归一化(GradNorm)的训练策略来确保损失减少的自适应平衡。大量的实验结果表明,该方法在保持较低的计算复杂度的同时显著提高了传输性能。源代码可从https://github.com/llsurreal919/ADMIT获得。
{"title":"Adaptive Deep Joint Source-Channel Coding for One-to-Many Wireless Image Transmission","authors":"Lei Luo;Ziyang He;Junjie Wu;Hongwei Guo;Ce Zhu","doi":"10.1109/TBC.2025.3559003","DOIUrl":"https://doi.org/10.1109/TBC.2025.3559003","url":null,"abstract":"Deep learning based joint source-channel coding (DJSCC) has recently made significant progress and emerged as a potential solution for future wireless communication. However, there are still several crucial issues that necessitate further in-depth exploration to enhance the efficiency of DJSCC, such as channel quality adaptability, bandwidth adaptability, and the delicate balance between efficiency and complexity. This work proposes an <underline>a</u>daptive <underline>d</u>eep joint source-channel coding scheme tailored for one-to-<underline>m</u>any wireless <underline>i</u>mage <underline>t</u>ransmission scenarios (ADMIT). First, to effectively improve transmission performance, neighboring attention is introduced as the backbone for the proposed ADMIT method. Second, a channel quality adaptive module (CQAM) is designed based on multi-scale feature fusion, which seamlessly adapts to fluctuating channel conditions across a wide range of channel signal-to-noise ratios (CSNRs). Third, to be precisely tailored to different bandwidth resources, the channel gained adaptive module (CGAM) dynamically adjusts the significance of individual channels within the latent space, which ensures seamless varying bandwidth accommodation with a single model through bandwidth adaptation and symbol completion. Additionally, to mitigate the imbalance of loss across multiple bandwidth ratios during the training process, the gradient normalization (GradNorm) based training strategy is leveraged to ensure adaptive balancing of loss decreasing. The extensive experimental results demonstrate that the proposed method significantly enhances transmission performance while maintaining relatively low computational complexity. The source codes are available at: <uri>https://github.com/llsurreal919/ADMIT</uri>.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"914-929"},"PeriodicalIF":4.8,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring Invertible Encoding for Deep Video Compression 探索深度视频压缩的可逆编码
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-02-27 DOI: 10.1109/TBC.2025.3541869
Haifeng Guo;Sam Kwong;Mingliang Zhou
Deep video compression methods typically use autoencoder-style networks for encoding and decoding, which can result in the loss of information during encoding that cannot be retrieved during decoding. To address this issue, recent work has explored the use of invertible neural networks for enhanced invertible encoding, which has successfully mitigated spatial information loss for better image compression. In this paper, we propose a new approach that extends invertible encoding to temporal information and introduces an encoding-decoding network for deep video compression. Our network incorporates a novel attentive channel squeeze module to improve compression performance while also leveraging a conditional coding framework for motion compression. The entire framework is optimized via a single loss function that balances bit cost and frame quality. The experimental results demonstrate the effectiveness of our approach, which achieves 15.45%/57.92% bit savings in terms of PSNR/MS-SSIM compared with the high-efficiency video coding low-delay P configuration.
深度视频压缩方法通常使用自动编码器风格的网络进行编码和解码,这可能导致编码过程中的信息丢失,而解码过程中无法检索。为了解决这个问题,最近的工作已经探索了使用可逆神经网络来增强可逆编码,这已经成功地减轻了空间信息丢失,以获得更好的图像压缩。本文提出了一种将可逆编码扩展到时间信息的新方法,并介绍了一种用于深度视频压缩的编解码网络。我们的网络集成了一个新颖的专注信道压缩模块,以提高压缩性能,同时还利用条件编码框架进行运动压缩。整个框架通过平衡比特成本和帧质量的单个损失函数进行优化。实验结果证明了该方法的有效性,与高效视频编码低延迟P配置相比,在PSNR/MS-SSIM方面节省了15.45%/57.92%的比特。
{"title":"Exploring Invertible Encoding for Deep Video Compression","authors":"Haifeng Guo;Sam Kwong;Mingliang Zhou","doi":"10.1109/TBC.2025.3541869","DOIUrl":"https://doi.org/10.1109/TBC.2025.3541869","url":null,"abstract":"Deep video compression methods typically use autoencoder-style networks for encoding and decoding, which can result in the loss of information during encoding that cannot be retrieved during decoding. To address this issue, recent work has explored the use of invertible neural networks for enhanced invertible encoding, which has successfully mitigated spatial information loss for better image compression. In this paper, we propose a new approach that extends invertible encoding to temporal information and introduces an encoding-decoding network for deep video compression. Our network incorporates a novel attentive channel squeeze module to improve compression performance while also leveraging a conditional coding framework for motion compression. The entire framework is optimized via a single loss function that balances bit cost and frame quality. The experimental results demonstrate the effectiveness of our approach, which achieves 15.45%/57.92% bit savings in terms of PSNR/MS-SSIM compared with the high-efficiency video coding low-delay P configuration.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"517-528"},"PeriodicalIF":3.2,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10906771","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing 5G V2X URLLC Broadcast/Multicast Services With FL-Based Wireless Resource Allocation 基于fl的无线资源分配增强5G V2X URLLC广播/组播业务
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-02-25 DOI: 10.1109/TBC.2025.3541887
Qian Huang;Xiaoyin Yi;Fei Qi;Lei Liu;Qingming Xie;Qin Jiang;Chunxia Hu
This paper addresses the challenges of wireless resource allocation for 5G Ultra-reliable Low-latency Communication (URLLC) broadcast/multicast services in Vehicle-to-Everything (V2X) scenarios. It proposes three key algorithms: an iterative resource allocation approach that decomposes optimization into power and spectrum subproblems, a federated learning-based multicast resource allocation scheme that protects data privacy while enabling distributed training, and a cooperative multi-agent reinforcement learning solution that treats vehicles as intelligent nodes to jointly optimize system throughput, URLLC delivery rate, and multicast performance. Path loss models, mobility patterns, and interference scenarios are analyzed for both unicast and multicast transmissions. Simulation results demonstrate that the proposed algorithms achieve superior performance in terms of throughput, reliability, and latency compared to traditional and baseline approaches.
本文讨论了车对万物(V2X)场景下5G超可靠低延迟通信(URLLC)广播/多播服务无线资源分配的挑战。提出了三种关键算法:将优化分解为功率子问题和频谱子问题的迭代资源分配方法,在实现分布式训练的同时保护数据隐私的基于联邦学习的组播资源分配方案,以及将车辆视为智能节点共同优化系统吞吐量、URLLC投递率和组播性能的协作多智能体强化学习解决方案。分析了单播和组播传输的路径损耗模型、移动模式和干扰情况。仿真结果表明,与传统方法和基线方法相比,所提出的算法在吞吐量、可靠性和延迟方面具有优越的性能。
{"title":"Enhancing 5G V2X URLLC Broadcast/Multicast Services With FL-Based Wireless Resource Allocation","authors":"Qian Huang;Xiaoyin Yi;Fei Qi;Lei Liu;Qingming Xie;Qin Jiang;Chunxia Hu","doi":"10.1109/TBC.2025.3541887","DOIUrl":"https://doi.org/10.1109/TBC.2025.3541887","url":null,"abstract":"This paper addresses the challenges of wireless resource allocation for 5G Ultra-reliable Low-latency Communication (URLLC) broadcast/multicast services in Vehicle-to-Everything (V2X) scenarios. It proposes three key algorithms: an iterative resource allocation approach that decomposes optimization into power and spectrum subproblems, a federated learning-based multicast resource allocation scheme that protects data privacy while enabling distributed training, and a cooperative multi-agent reinforcement learning solution that treats vehicles as intelligent nodes to jointly optimize system throughput, URLLC delivery rate, and multicast performance. Path loss models, mobility patterns, and interference scenarios are analyzed for both unicast and multicast transmissions. Simulation results demonstrate that the proposed algorithms achieve superior performance in terms of throughput, reliability, and latency compared to traditional and baseline approaches.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"384-396"},"PeriodicalIF":3.2,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simplified Fast List PAC Decoder for Broadcasting Services in 6G: Algorithm and Implementation 用于6G广播业务的简化快速列表PAC解码器:算法与实现
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-02-17 DOI: 10.1109/TBC.2025.3534624
Jingxin Dai;Hang Yin;Yansong Lv;Yuhuan Wang;Yin Xu;Rui Lv
In the 6G network, integrating broadcasting and mobile networks will significantly improve the transmission capability. Considering the excellent error-correction performance, polarized-adjusted convolutional (PAC) codes are promising for ensuring reliable data transmission in 6G broadcasting services. However, the inherent high decoding latency of PAC codes poses challenges for seamless switching between broadcasting and mobile services. In this paper, we propose a simplified fast list (SFL) PAC decoder, which jointly exploits the node thresholds and adaptive path-pruning technology to reduce the decoding latency while maintaining high reliability. Firstly, we present a novel path expansion rule based on the node thresholds to avoid unnecessary computations. Secondly, the introduction of the adaptive path-pruning technology efficiently reduces the number of sorting operations. Moreover, we implement the proposed decoder on general purpose processors (GPPs) by software. Simulation results show that the proposed SFL decoding algorithm significantly reduces the decoding latency by up to 75.18% compared to the state-of-the-art (SOTA) work with no noticeable degradation in error-correction performance. Software implementation of the proposed decoder achieves an 18.80% improvement in throughput performance over the SOTA PAC software decoder.
在6G网络中,广播和移动网络的融合将显著提高传输能力。考虑到良好的纠错性能,极化调整卷积码(PAC)有望在6G广播业务中确保可靠的数据传输。然而,PAC码固有的高解码延迟给广播和移动业务之间的无缝切换带来了挑战。本文提出了一种简化的快速列表(SFL) PAC解码器,该解码器结合了节点阈值和自适应路径剪剪技术,在保持高可靠性的同时减少了解码延迟。首先,提出了一种基于节点阈值的路径扩展规则,避免了不必要的计算。其次,引入自适应路径修剪技术,有效地减少了排序操作的数量。此外,我们还通过软件在通用处理器(gpp)上实现了所提出的解码器。仿真结果表明,与现有的SOTA译码算法相比,SFL译码算法的译码延迟降低了75.18%,且纠错性能没有明显下降。该解码器的软件实现比SOTA PAC软件解码器的吞吐量性能提高了18.80%。
{"title":"Simplified Fast List PAC Decoder for Broadcasting Services in 6G: Algorithm and Implementation","authors":"Jingxin Dai;Hang Yin;Yansong Lv;Yuhuan Wang;Yin Xu;Rui Lv","doi":"10.1109/TBC.2025.3534624","DOIUrl":"https://doi.org/10.1109/TBC.2025.3534624","url":null,"abstract":"In the 6G network, integrating broadcasting and mobile networks will significantly improve the transmission capability. Considering the excellent error-correction performance, polarized-adjusted convolutional (PAC) codes are promising for ensuring reliable data transmission in 6G broadcasting services. However, the inherent high decoding latency of PAC codes poses challenges for seamless switching between broadcasting and mobile services. In this paper, we propose a simplified fast list (SFL) PAC decoder, which jointly exploits the node thresholds and adaptive path-pruning technology to reduce the decoding latency while maintaining high reliability. Firstly, we present a novel path expansion rule based on the node thresholds to avoid unnecessary computations. Secondly, the introduction of the adaptive path-pruning technology efficiently reduces the number of sorting operations. Moreover, we implement the proposed decoder on general purpose processors (GPPs) by software. Simulation results show that the proposed SFL decoding algorithm significantly reduces the decoding latency by up to 75.18% compared to the state-of-the-art (SOTA) work with no noticeable degradation in error-correction performance. Software implementation of the proposed decoder achieves an 18.80% improvement in throughput performance over the SOTA PAC software decoder.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"397-410"},"PeriodicalIF":3.2,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Heterogeneous Network Transmission Architecture Based on NOMA for Next-Generation Converged Communications and Broadcasting Systems 面向下一代融合通信广播系统的基于NOMA的异构网络传输体系结构
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-02-14 DOI: 10.1109/TBC.2025.3534620
Xiaowu Ou;Haoyang Li;Yin Xu;Dazhi He;Wenjun Zhang;Yiyan Wu
Inter-Tower Communication Network (ITCN), which supports communication between different base stations via wireless links, has excellent potential for simultaneous transmission of broadcast data and personalized data using Layered Division Multiplexing (LDM). In this paper, a heterogeneous ITCN architecture using LDM and wireless backhauling is proposed. In uplink transmission, the power-limited user devices transmit data to the secondary transmitters (STs), and the STs relay the data to the master transmitter (MT). In downlink transmission, the MT and multiple STs cooperatively transmit broadcast data using single frequency network (SFN) mode, and the STs relay the personalized data from the MT to users simultaneously. Considering the co-channel interference, this paper proposes a joint subchannel assignment and power allocation scheme for both uplink and downlink transmission. A mixed integer optimization problem is formulated, and an alternating optimization algorithm (AO) based on game theory and convex optimization is proposed. Simulation results are conducted with different system configurations to demonstrate the convergence and effectiveness of the proposed algorithms.
塔间通信网络(ITCN)通过无线链路支持不同基站之间的通信,具有利用分层分复用(LDM)同时传输广播数据和个性化数据的良好潜力。本文提出了一种基于LDM和无线回程的异构ITCN体系结构。在上行链路传输中,功率有限的用户设备将数据传输到二级发射机(STs),而二级发射机将数据中继到主发射机(MT)。在下行传输中,MT和多个st采用单频网络(SFN)模式协同传输广播数据,st同时将MT的个性化数据转发给用户。考虑到同信道干扰,本文提出了一种上下行传输的联合子信道分配和功率分配方案。提出了一个混合整数优化问题,并提出了一种基于博弈论和凸优化的交替优化算法。在不同的系统配置下进行了仿真,验证了算法的收敛性和有效性。
{"title":"A Heterogeneous Network Transmission Architecture Based on NOMA for Next-Generation Converged Communications and Broadcasting Systems","authors":"Xiaowu Ou;Haoyang Li;Yin Xu;Dazhi He;Wenjun Zhang;Yiyan Wu","doi":"10.1109/TBC.2025.3534620","DOIUrl":"https://doi.org/10.1109/TBC.2025.3534620","url":null,"abstract":"Inter-Tower Communication Network (ITCN), which supports communication between different base stations via wireless links, has excellent potential for simultaneous transmission of broadcast data and personalized data using Layered Division Multiplexing (LDM). In this paper, a heterogeneous ITCN architecture using LDM and wireless backhauling is proposed. In uplink transmission, the power-limited user devices transmit data to the secondary transmitters (STs), and the STs relay the data to the master transmitter (MT). In downlink transmission, the MT and multiple STs cooperatively transmit broadcast data using single frequency network (SFN) mode, and the STs relay the personalized data from the MT to users simultaneously. Considering the co-channel interference, this paper proposes a joint subchannel assignment and power allocation scheme for both uplink and downlink transmission. A mixed integer optimization problem is formulated, and an alternating optimization algorithm (AO) based on game theory and convex optimization is proposed. Simulation results are conducted with different system configurations to demonstrate the convergence and effectiveness of the proposed algorithms.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"372-383"},"PeriodicalIF":3.2,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Broadcasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1