首页 > 最新文献

IEEE Transactions on Broadcasting最新文献

英文 中文
Learning-Based Efficient Quantizer Selection for Fast HEVC Encoder 为快速 HEVC 编码器选择基于学习的高效量化器
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-19 DOI: 10.1109/TBC.2023.3333750
Motong Xu;Byeungwoo Jeon
The rate-distortion optimized quantization (RDOQ) in HEVC has improved the coding efficiency of the conventional uniform scalar quantization (SQ) very much. Since the RDOQ is computationally complex, in this paper, we investigate a way of performing RDOQ more efficiently in HEVC. Based on our statistical observation of non-trivial percentage of transform blocks (TB) for which RDOQ does not change their quantization results of SQ, we design a learning-based quantizer selection scheme which can tell in advance whether RDOQ is expected to modify the quantization levels calculated by SQ. Only those TBs likely to be changed by RDOQ are subject to the actual RDOQ process. For the remaining TBs, we design an improved SQ which adapts the dead-zone interval size and round offset based on coefficient group and entropy coding features. The proposed improved SQ has much lower computational complexity than RDOQ while achieving better coding efficiency than the conventional SQ. The experimental results show that our efficient quantization scheme respectively provides 9% and 34% of encoding and quantization time reduction by selectively performing RDOQ only for 21% of TBs. The average BDBR performances of Y, Cb, and Cr channels are respectively–0.03%, 0.48%, and 0.45%.
HEVC 中的速率失真优化量化(RDOQ)大大提高了传统均匀标量量化(SQ)的编码效率。由于 RDOQ 计算复杂,本文研究了一种在 HEVC 中更高效地执行 RDOQ 的方法。根据我们的统计观察,RDOQ 不会改变 SQ 量化结果的变换块(TB)所占的比例并不高,因此我们设计了一种基于学习的量化器选择方案,该方案可以提前判断 RDOQ 是否会修改 SQ 计算出的量化级别。只有那些可能会被 RDOQ 更改的 TB 才会进入实际的 RDOQ 流程。对于其余的 TB,我们设计了一种改进的 SQ,它能根据系数组和熵编码特征调整死区间隔大小和轮偏移。所提出的改进 SQ 比 RDOQ 的计算复杂度低得多,同时比传统 SQ 获得更好的编码效率。实验结果表明,我们的高效量化方案仅在 21% 的 TB 上选择性地执行 RDOQ,就分别减少了 9% 和 34% 的编码和量化时间。Y、Cb 和 Cr 信道的平均 BDBR 性能分别为-0.03%、0.48% 和 0.45%。
{"title":"Learning-Based Efficient Quantizer Selection for Fast HEVC Encoder","authors":"Motong Xu;Byeungwoo Jeon","doi":"10.1109/TBC.2023.3333750","DOIUrl":"https://doi.org/10.1109/TBC.2023.3333750","url":null,"abstract":"The rate-distortion optimized quantization (RDOQ) in HEVC has improved the coding efficiency of the conventional uniform scalar quantization (SQ) very much. Since the RDOQ is computationally complex, in this paper, we investigate a way of performing RDOQ more efficiently in HEVC. Based on our statistical observation of non-trivial percentage of transform blocks (TB) for which RDOQ does not change their quantization results of SQ, we design a learning-based quantizer selection scheme which can tell in advance whether RDOQ is expected to modify the quantization levels calculated by SQ. Only those TBs likely to be changed by RDOQ are subject to the actual RDOQ process. For the remaining TBs, we design an improved SQ which adapts the dead-zone interval size and round offset based on coefficient group and entropy coding features. The proposed improved SQ has much lower computational complexity than RDOQ while achieving better coding efficiency than the conventional SQ. The experimental results show that our efficient quantization scheme respectively provides 9% and 34% of encoding and quantization time reduction by selectively performing RDOQ only for 21% of TBs. The average BDBR performances of Y, Cb, and Cr channels are respectively–0.03%, 0.48%, and 0.45%.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"161-173"},"PeriodicalIF":4.5,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140052881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Curriculum Reinforcement Learning for Adaptive 360° Video Streaming With Two-Stage Training 利用深度课程强化学习进行自适应 360° 视频流两阶段训练
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-15 DOI: 10.1109/TBC.2023.3334137
Yuhong Xie;Yuan Zhang;Tao Lin
Deep reinforcement learning (DRL) has demonstrated remarkable potential within the domain of video adaptive bitrate (ABR) optimization. However, training a well-performing DRL agent in the two-tier 360° video streaming system is non-trivial. The conventional DRL training approach fails to enable the model to start learning from simpler environments and then progressively explore more challenging ones, leading to suboptimal asymptotic performance and poor long-tail performance. In this paper, we propose a novel approach called DCRL360, which seamlessly integrates automatic curriculum learning (ACL) with DRL techniques to enable adaptive decision-making for 360° video bitrate selection and chunk scheduling. To tackle the training issue, we introduce a structured two-stage training framework. The first stage focuses on the selection of tasks conducive to learning, guided by a newly introduced training metric called Pscore, to enhance asymptotic performance. The newly introduced metric takes into consideration multiple facets, including performance improvement potential, the risk of being forgotten, and the uncertainty of a decision, to encourage the agent to train in rewarding environments. The second stage utilizes existing rule-based techniques to identify challenging tasks for fine-tuning the model, thereby alleviating the long-tail effect. Our experimental results demonstrate that DCRL360 outperforms state-of-the-art algorithms under various network conditions - including 5G/LTE/Broadband - with a remarkable improvement of 6.51-20.86% in quality of experience (QoE), as well as a reduction in bandwidth wastage by 10.60-31.50%.
深度强化学习(DRL)在视频自适应比特率(ABR)优化领域展现出了巨大的潜力。然而,在双层 360° 视频流系统中训练一个性能良好的 DRL 代理并非易事。传统的 DRL 训练方法无法使模型从较简单的环境开始学习,然后逐步探索更具挑战性的环境,从而导致渐近性能不理想和长尾性能不佳。在本文中,我们提出了一种名为 DCRL360 的新方法,该方法将自动课程学习 (ACL) 与 DRL 技术无缝集成,实现了 360° 视频比特率选择和块调度的自适应决策。为了解决训练问题,我们引入了一个结构化的两阶段训练框架。第一阶段的重点是选择有利于学习的任务,以新引入的名为 Pscore 的训练指标为指导,提高渐近性能。新引入的指标考虑了多个方面,包括提高性能的潜力、被遗忘的风险和决策的不确定性,以鼓励代理在有回报的环境中进行训练。第二阶段利用现有的基于规则的技术来确定具有挑战性的任务,以便对模型进行微调,从而缓解长尾效应。我们的实验结果表明,在各种网络条件下(包括 5G/LTE/宽带),DCRL360 的性能优于最先进的算法,体验质量(QoE)显著提高了 6.51-20.86%,带宽浪费减少了 10.60-31.50%。
{"title":"Deep Curriculum Reinforcement Learning for Adaptive 360° Video Streaming With Two-Stage Training","authors":"Yuhong Xie;Yuan Zhang;Tao Lin","doi":"10.1109/TBC.2023.3334137","DOIUrl":"https://doi.org/10.1109/TBC.2023.3334137","url":null,"abstract":"Deep reinforcement learning (DRL) has demonstrated remarkable potential within the domain of video adaptive bitrate (ABR) optimization. However, training a well-performing DRL agent in the two-tier 360° video streaming system is non-trivial. The conventional DRL training approach fails to enable the model to start learning from simpler environments and then progressively explore more challenging ones, leading to suboptimal asymptotic performance and poor long-tail performance. In this paper, we propose a novel approach called DCRL360, which seamlessly integrates automatic curriculum learning (ACL) with DRL techniques to enable adaptive decision-making for 360° video bitrate selection and chunk scheduling. To tackle the training issue, we introduce a structured two-stage training framework. The first stage focuses on the selection of tasks conducive to learning, guided by a newly introduced training metric called Pscore, to enhance asymptotic performance. The newly introduced metric takes into consideration multiple facets, including performance improvement potential, the risk of being forgotten, and the uncertainty of a decision, to encourage the agent to train in rewarding environments. The second stage utilizes existing rule-based techniques to identify challenging tasks for fine-tuning the model, thereby alleviating the long-tail effect. Our experimental results demonstrate that DCRL360 outperforms state-of-the-art algorithms under various network conditions - including 5G/LTE/Broadband - with a remarkable improvement of 6.51-20.86% in quality of experience (QoE), as well as a reduction in bandwidth wastage by 10.60-31.50%.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 2","pages":"441-452"},"PeriodicalIF":4.5,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141292482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Multitask Learning for Pedestrian Location-Aware 5G Multicast/Broadcast Services 面向行人位置感知 5G 多播/广播服务的联合多任务学习
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-15 DOI: 10.1109/TBC.2023.3332012
Zexuan Jing;Junsheng Mu;Jian Jin;Zhenzhen Jiao;Peng Yu
5G multicast/broadcast services can provide transformative new opportunities as mobile devices proliferate. However, realizing the full potential of these services requires real-time pedestrian localization. We propose a federated multitask learning (FML) approach on smartphones to enable pedestrian location-aware 5G multicast/broadcast services. Our lightweight FML architecture provides accurate real-time localization while preserving privacy. The pedestrian location data enables adaptive 5G network planning, contextual location-based services, quality of service improvements, and load balancing. Simulations demonstrate the effectiveness of our FML scheme for accurate pedestrian localization. They also highlight significant enhancements to 5G multicast/broadcast services enabled by real-time pedestrian positioning. In summary, our work facilitates enhanced 5G multicast/broadcast services through federated on-device learning for real-time pedestrian localization.
随着移动设备的普及,5G 多播/广播服务可提供变革性的新机遇。然而,要充分发挥这些服务的潜力,需要对行人进行实时定位。我们提出了在智能手机上实现行人位置感知 5G 多播/广播服务的联合多任务学习(FML)方法。我们的轻量级 FML 架构可在保护隐私的同时提供准确的实时定位。行人位置数据可实现自适应 5G 网络规划、基于上下文位置的服务、服务质量改进和负载平衡。仿真证明了我们的 FML 方案在准确定位行人方面的有效性。模拟还突出了实时行人定位对 5G 多播/广播服务的重大提升。总之,我们的工作通过联合设备上学习实时行人定位,促进了增强型 5G 多播/广播服务。
{"title":"Federated Multitask Learning for Pedestrian Location-Aware 5G Multicast/Broadcast Services","authors":"Zexuan Jing;Junsheng Mu;Jian Jin;Zhenzhen Jiao;Peng Yu","doi":"10.1109/TBC.2023.3332012","DOIUrl":"https://doi.org/10.1109/TBC.2023.3332012","url":null,"abstract":"5G multicast/broadcast services can provide transformative new opportunities as mobile devices proliferate. However, realizing the full potential of these services requires real-time pedestrian localization. We propose a federated multitask learning (FML) approach on smartphones to enable pedestrian location-aware 5G multicast/broadcast services. Our lightweight FML architecture provides accurate real-time localization while preserving privacy. The pedestrian location data enables adaptive 5G network planning, contextual location-based services, quality of service improvements, and load balancing. Simulations demonstrate the effectiveness of our FML scheme for accurate pedestrian localization. They also highlight significant enhancements to 5G multicast/broadcast services enabled by real-time pedestrian positioning. In summary, our work facilitates enhanced 5G multicast/broadcast services through federated on-device learning for real-time pedestrian localization.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"66-77"},"PeriodicalIF":4.5,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140052873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cooperative Non-Orthogonal Broadcast and Unicast Transmission for Integrated Satellite–Terrestrial Network 综合卫星鈥揟陆地网络的合作非正交广播和单播传输
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-11 DOI: 10.1109/TBC.2023.3335815
Zhiqiang Li;Shuai Han;Liang Xiao;Mugen Peng
The integrated satellite-terrestrial network (ISTN) is gaining traction for providing seamless communication and various services, i.e., broadcast and unicast information services. However, meeting massive terminal access and diverse information services poses challenges due to limited spectrum resources and complex multiple access interference in ISTN. Recently, rate-splitting multiple access (RSMA) has emerged as a promising solution offering non-orthogonal transmission and robust interference management. Inspired by this, we design the non-orthogonal broadcast and unicast (NOBU) transmission model by utilizing the common and private data streams of RSMA. Taking different levels of cooperation between satellite and base station (BS) into consideration, we propose two cooperative NOBU transmission schemes, where one is that only broadcast messages are shared, and the other is that the broadcast message and the sub-common message split by terminals are shared and jointly encoded into a super-common stream. Building upon this, we formulate joint max-min rate optimization problems while satisfying the broadcast information rate requirement in ISTN. To address these non-convex problems, we introduce an improved alternating optimization algorithm based on weighted minimum mean square error. Simulation results validate the significant gains of cooperative NOBU schemes compared to various baseline schemes.
卫星-地面综合网络(ISTN)在提供无缝通信和各种服务(即广播和单播信息服务)方面日益受到重视。然而,由于 ISTN 的频谱资源有限和复杂的多重接入干扰,满足大规模终端接入和多样化信息服务的需求面临挑战。最近,速率分割多重接入(RSMA)作为一种有前途的解决方案出现了,它能提供非正交传输和稳健的干扰管理。受此启发,我们利用 RSMA 的公共和私有数据流设计了非正交广播和单播(NOBU)传输模型。考虑到卫星和基站(BS)之间不同程度的合作,我们提出了两种合作式 NOBU 传输方案,一种是只共享广播信息,另一种是共享广播信息和终端分出的子公共信息,并共同编码成超级公共流。在此基础上,我们提出了联合最大最小速率优化问题,同时满足 ISTN 中的广播信息速率要求。为了解决这些非凸问题,我们引入了一种基于加权最小均方误差的改进交替优化算法。仿真结果验证了与各种基线方案相比,合作 NOBU 方案的显著收益。
{"title":"Cooperative Non-Orthogonal Broadcast and Unicast Transmission for Integrated Satellite–Terrestrial Network","authors":"Zhiqiang Li;Shuai Han;Liang Xiao;Mugen Peng","doi":"10.1109/TBC.2023.3335815","DOIUrl":"10.1109/TBC.2023.3335815","url":null,"abstract":"The integrated satellite-terrestrial network (ISTN) is gaining traction for providing seamless communication and various services, i.e., broadcast and unicast information services. However, meeting massive terminal access and diverse information services poses challenges due to limited spectrum resources and complex multiple access interference in ISTN. Recently, rate-splitting multiple access (RSMA) has emerged as a promising solution offering non-orthogonal transmission and robust interference management. Inspired by this, we design the non-orthogonal broadcast and unicast (NOBU) transmission model by utilizing the common and private data streams of RSMA. Taking different levels of cooperation between satellite and base station (BS) into consideration, we propose two cooperative NOBU transmission schemes, where one is that only broadcast messages are shared, and the other is that the broadcast message and the sub-common message split by terminals are shared and jointly encoded into a super-common stream. Building upon this, we formulate joint max-min rate optimization problems while satisfying the broadcast information rate requirement in ISTN. To address these non-convex problems, we introduce an improved alternating optimization algorithm based on weighted minimum mean square error. Simulation results validate the significant gains of cooperative NOBU schemes compared to various baseline schemes.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"1052-1064"},"PeriodicalIF":3.2,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142207656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2023 Scott Helt Memorial Award for the Best Paper Published in the IEEE Transactions on Broadcasting 2023 年度《电气和电子工程师学会广播学报》(IEEE Transactions on Broadcasting)最佳论文 Scott Helt 纪念奖
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-08 DOI: 10.1109/TBC.2023.3336210
The 2023 Scott Helt Memorial Award was awarded to Hequn Zhang, Yue Zhang, John Cosmas, Nawar Jawad, Wei Li, Robert Muller, Tao Jiang for their paper, “mmWave Indoor Channel Measurement Campaign for 5G New Radio Indoor Broadcasting”. The papers appeared in the IEEE Transactions on Broadcasting, vol. 68, no. 2, pp. 331–344, June 2022. The purpose of the IEEE Scott Helt Memorial Award is to recognize exceptional publications in the field and to stimulate interest in and encourage contributions to the fields of interest of the Society.
2023 年斯科特-赫尔特纪念奖授予张贺群、张越、约翰-科斯马斯、纳瓦尔-贾瓦德、李伟、罗伯特-穆勒、蒋涛,以表彰他们的论文 "mmWave Indoor Channel Measurement Campaign for 5G New Radio Indoor Broadcasting"。论文发表于 2022 年 6 月出版的《电气和电子工程师学会广播学报》(IEEE Transactions on Broadcasting)第 68 卷第 2 期第 331-344 页。IEEE Scott Helt 纪念奖的目的是表彰在该领域发表的杰出论文,激发对学会感兴趣领域的兴趣并鼓励为这些领域做出贡献。
{"title":"2023 Scott Helt Memorial Award for the Best Paper Published in the IEEE Transactions on Broadcasting","authors":"","doi":"10.1109/TBC.2023.3336210","DOIUrl":"https://doi.org/10.1109/TBC.2023.3336210","url":null,"abstract":"The 2023 Scott Helt Memorial Award was awarded to Hequn Zhang, Yue Zhang, John Cosmas, Nawar Jawad, Wei Li, Robert Muller, Tao Jiang for their paper, “mmWave Indoor Channel Measurement Campaign for 5G New Radio Indoor Broadcasting”. The papers appeared in the IEEE Transactions on Broadcasting, vol. 68, no. 2, pp. 331–344, June 2022. The purpose of the IEEE Scott Helt Memorial Award is to recognize exceptional publications in the field and to stimulate interest in and encourage contributions to the fields of interest of the Society.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"69 4","pages":"979-980"},"PeriodicalIF":4.5,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10352330","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138558170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Information for Authors 电气和电子工程师学会(IEEE)《关于广播作者信息的论文集》(IEEE Transactions on Broadcasting Information for Authors
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-08 DOI: 10.1109/TBC.2023.3336429
{"title":"IEEE Transactions on Broadcasting Information for Authors","authors":"","doi":"10.1109/TBC.2023.3336429","DOIUrl":"https://doi.org/10.1109/TBC.2023.3336429","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"69 4","pages":"C3-C4"},"PeriodicalIF":4.5,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10352326","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138558193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Publication Information 电气和电子工程师学会《广播学报》出版信息
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-08 DOI: 10.1109/TBC.2023.3336433
{"title":"IEEE Transactions on Broadcasting Publication Information","authors":"","doi":"10.1109/TBC.2023.3336433","DOIUrl":"https://doi.org/10.1109/TBC.2023.3336433","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"69 4","pages":"C2-C2"},"PeriodicalIF":4.5,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10352327","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138558164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computationally Stable Low Sampling Rate Digital Predistortion for Non-Terrestrial Networks 用于非地面网络的计算稳定的低采样率数字预失真
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-05 DOI: 10.1109/TBC.2023.3334141
Boyan Li;Xin Hu;Naixin Kan;Weidong Wang;Fadhel M. Ghannouchi
With the advent of the fifth generation (5G) New Radio (NR), the Non-Terrestrial Network (NTN) stands out as a solution to enable wider coverage of broadcast satellites. NTN systems require higher data rates and bandwidth. Digital predistortion (DPD) is commonly adopted as an effective method to enhance the power efficiency of broadcast satellites’ NTN systems. With the continuous increase of signal bandwidth, the bandwidth of the feedback loop and the sampling rate of analog-to-digital converters (ADCs) need to be reduced so as to reduce the system cost. The computational complexity and overfitting effect of the existing band-limited DPD (BLDPD) method will raise as the decrease of feedback bandwidth. To address this issue, one deep neural network (DNN) assisted band-limited polynomial digital predistortion (DNN-BLP DPD) is proposed in this paper. This method reduces the computational complexity and the overfitting effect of the band-limited basis functions by grouping a small number of band-limited basis functions for online parameter identification while embedding the DNN in the parameter identification module. Compared with the conventional BLDPD, the experimental results show that the proposed method can achieve a low sampling rate and low computational complexity while ensuring modeling accuracy.
随着第五代(5G)新无线电(NR)技术的出现,非地面网络(NTN)脱颖而出,成为实现广播卫星更广泛覆盖的解决方案。NTN 系统需要更高的数据传输速率和带宽。数字预失真(DPD)被普遍采用,是提高广播卫星 NTN 系统功率效率的有效方法。随着信号带宽的不断增加,需要降低反馈回路的带宽和模数转换器(ADC)的采样率,以降低系统成本。随着反馈带宽的减小,现有带限 DPD(BLDPD)方法的计算复杂度和过拟合效果都会提高。针对这一问题,本文提出了一种深度神经网络(DNN)辅助带限多项式数字预失真(DNN-BLP DPD)。该方法通过将少量带限基函数分组进行在线参数识别,同时在参数识别模块中嵌入 DNN,从而降低了计算复杂度和带限基函数的过拟合效应。实验结果表明,与传统的 BLDPD 相比,所提出的方法在保证建模精度的同时,还能实现较低的采样率和较低的计算复杂度。
{"title":"Computationally Stable Low Sampling Rate Digital Predistortion for Non-Terrestrial Networks","authors":"Boyan Li;Xin Hu;Naixin Kan;Weidong Wang;Fadhel M. Ghannouchi","doi":"10.1109/TBC.2023.3334141","DOIUrl":"https://doi.org/10.1109/TBC.2023.3334141","url":null,"abstract":"With the advent of the fifth generation (5G) New Radio (NR), the Non-Terrestrial Network (NTN) stands out as a solution to enable wider coverage of broadcast satellites. NTN systems require higher data rates and bandwidth. Digital predistortion (DPD) is commonly adopted as an effective method to enhance the power efficiency of broadcast satellites’ NTN systems. With the continuous increase of signal bandwidth, the bandwidth of the feedback loop and the sampling rate of analog-to-digital converters (ADCs) need to be reduced so as to reduce the system cost. The computational complexity and overfitting effect of the existing band-limited DPD (BLDPD) method will raise as the decrease of feedback bandwidth. To address this issue, one deep neural network (DNN) assisted band-limited polynomial digital predistortion (DNN-BLP DPD) is proposed in this paper. This method reduces the computational complexity and the overfitting effect of the band-limited basis functions by grouping a small number of band-limited basis functions for online parameter identification while embedding the DNN in the parameter identification module. Compared with the conventional BLDPD, the experimental results show that the proposed method can achieve a low sampling rate and low computational complexity while ensuring modeling accuracy.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"325-333"},"PeriodicalIF":4.5,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140042849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Virtual-Competitors-Based Rate Control for 360-Degree Video Coding 基于虚拟竞争者的 360 度视频编码速率控制
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-04 DOI: 10.1109/TBC.2023.3332019
Jielian Lin;Hongbin Lin;Yiwen Xu;Yuanxun Kang;Tiesong Zhao
360 video applications are attracting more attention in broadcasting systems. In the case of limited bandwidth, the bit fluctuation will affect the perception quality when transmitting 360-degree videos. To further optimize the bit allocation and reduce fluctuation, this paper proposes a virtual-competitors-based Rate Control (RC) algorithm for 360-degree video coding. The virtual competitors’ concept is first proposed to balance the volatility of the bit between the adjacent GOPs. In addition, by introducing game theory, the virtual competitors-based frame-level bit allocation model is instructed. Furthermore, we propose a GOP-level bit allocation scheme with the average encoded bits of the historical GOP, the remaining bits of the encoded video sequence, and the number of unencoded GOPs. Based on the designed frame-level and GOP-level bit allocation scheme, the overall bit allocation method is proposed to implement the lower GOP-level bit fluctuation. Experimental results indicate the proposed method with the optimal RC Error, Bjøntegaard Delta Peak-Signal-to-Noise-Ratio, Bjøntegaard Delta Bit Rate, and bit fluctuation than the benchmarks, which validates the efficiency of the proposed method.
360 视频应用在广播系统中越来越受到关注。在带宽有限的情况下,比特波动会影响 360 度视频传输的感知质量。为了进一步优化比特分配并减少波动,本文提出了一种基于虚拟竞争者的 360 度视频编码速率控制(RC)算法。首先提出了虚拟竞争者的概念,以平衡相邻 GOP 之间的比特波动。此外,通过引入博弈论,指导了基于虚拟竞争者的帧级比特分配模型。此外,我们还利用历史 GOP 的平均编码比特、已编码视频序列的剩余比特以及未编码 GOP 的数量,提出了一种 GOP 级比特分配方案。基于所设计的帧级和 GOP 级比特分配方案,我们提出了整体比特分配方法,以实现较低的 GOP 级比特波动。实验结果表明,所提方法的 RC 误差、Bjøntegaard Delta 峰值信号与噪声比、Bjøntegaard Delta 比特率和比特波动均优于基准值,验证了所提方法的效率。
{"title":"Virtual-Competitors-Based Rate Control for 360-Degree Video Coding","authors":"Jielian Lin;Hongbin Lin;Yiwen Xu;Yuanxun Kang;Tiesong Zhao","doi":"10.1109/TBC.2023.3332019","DOIUrl":"https://doi.org/10.1109/TBC.2023.3332019","url":null,"abstract":"360 video applications are attracting more attention in broadcasting systems. In the case of limited bandwidth, the bit fluctuation will affect the perception quality when transmitting 360-degree videos. To further optimize the bit allocation and reduce fluctuation, this paper proposes a virtual-competitors-based Rate Control (RC) algorithm for 360-degree video coding. The virtual competitors’ concept is first proposed to balance the volatility of the bit between the adjacent GOPs. In addition, by introducing game theory, the virtual competitors-based frame-level bit allocation model is instructed. Furthermore, we propose a GOP-level bit allocation scheme with the average encoded bits of the historical GOP, the remaining bits of the encoded video sequence, and the number of unencoded GOPs. Based on the designed frame-level and GOP-level bit allocation scheme, the overall bit allocation method is proposed to implement the lower GOP-level bit fluctuation. Experimental results indicate the proposed method with the optimal RC Error, Bjøntegaard Delta Peak-Signal-to-Noise-Ratio, Bjøntegaard Delta Bit Rate, and bit fluctuation than the benchmarks, which validates the efficiency of the proposed method.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"357-365"},"PeriodicalIF":4.5,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140042835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Misaligned RGB-Depth Boundary Identification and Correction for Depth Image Recovery 用于深度图像复原的 RGB 深度边界错位识别与校正
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-11-29 DOI: 10.1109/TBC.2023.3332014
Meng Yang;Lulu Zhang;Delong Suzhang;Ce Zhu;Nanning Zheng
Raw depth images generally contain a large number of erroneous pixels near object boundaries due to the limitation of depth sensors. It induces misalignment of object boundaries between RGB and depth pairs. Most existing methods do not explicitly study such RGB-Depth misalignment problem. Thereby, depth boundaries cannot be accurately recovered. In this paper, a simple yet effective model is developed based on the guided filter (GF) to identify misaligned object boundaries of a raw depth image. Using GF to filter a raw depth image with the guidance of a reference RGB image, structure of the RGB image can be progressively transferred to filtered depth images as the window size of GF increases. Therefore, misaligned object boundaries in raw depth image can be identified from residuals of filtered depth images from large-size and small-size GFs. The model is embedded into Markov random field to correct misaligned object boundaries. It is restricted in fixed-width regions around depth boundaries to avoid texture-copy artifacts. The optimization problem is solved efficiently in an iterative way. Quantitative and visual results on three RGB-Depth datasets verify that the proposed method achieves the best results compared with recent optimization-based or learning-based baselines. In addition, the proposed method is effectively applied in no-reference depth quality assessment, depth super-resolution, and depth estimation enhancement.
由于深度传感器的限制,原始深度图像通常在物体边界附近包含大量错误像素。这会导致 RGB 和深度对之间的物体边界错位。大多数现有方法都没有明确研究这种 RGB-Depth 错位问题。因此,深度边界无法准确恢复。本文基于引导滤波器(GF)开发了一个简单而有效的模型,用于识别原始深度图像中错位的物体边界。在参考 RGB 图像的引导下使用 GF 滤波原始深度图像,随着 GF 窗口大小的增加,RGB 图像的结构可逐步转移到滤波深度图像中。因此,原始深度图像中错位的物体边界可以从大尺寸和小尺寸 GF 过滤深度图像的残差中识别出来。该模型被嵌入马尔可夫随机场,以修正错位的物体边界。它被限制在深度边界周围的固定宽度区域,以避免纹理复制伪影。优化问题以迭代方式高效解决。在三个 RGB 深度数据集上的定量和视觉结果证实,与最近基于优化或基于学习的基线方法相比,所提出的方法取得了最佳效果。此外,提出的方法还有效地应用于无参照深度质量评估、深度超分辨率和深度估计增强。
{"title":"Misaligned RGB-Depth Boundary Identification and Correction for Depth Image Recovery","authors":"Meng Yang;Lulu Zhang;Delong Suzhang;Ce Zhu;Nanning Zheng","doi":"10.1109/TBC.2023.3332014","DOIUrl":"https://doi.org/10.1109/TBC.2023.3332014","url":null,"abstract":"Raw depth images generally contain a large number of erroneous pixels near object boundaries due to the limitation of depth sensors. It induces misalignment of object boundaries between RGB and depth pairs. Most existing methods do not explicitly study such RGB-Depth misalignment problem. Thereby, depth boundaries cannot be accurately recovered. In this paper, a simple yet effective model is developed based on the guided filter (GF) to identify misaligned object boundaries of a raw depth image. Using GF to filter a raw depth image with the guidance of a reference RGB image, structure of the RGB image can be progressively transferred to filtered depth images as the window size of GF increases. Therefore, misaligned object boundaries in raw depth image can be identified from residuals of filtered depth images from large-size and small-size GFs. The model is embedded into Markov random field to correct misaligned object boundaries. It is restricted in fixed-width regions around depth boundaries to avoid texture-copy artifacts. The optimization problem is solved efficiently in an iterative way. Quantitative and visual results on three RGB-Depth datasets verify that the proposed method achieves the best results compared with recent optimization-based or learning-based baselines. In addition, the proposed method is effectively applied in no-reference depth quality assessment, depth super-resolution, and depth estimation enhancement.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"183-196"},"PeriodicalIF":4.5,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140052874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Broadcasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1