首页 > 最新文献

IEEE Transactions on Broadcasting最新文献

英文 中文
LDPC-Coded LDM Systems Employing Non-Uniform Injection Level for Combining Broadcast and Multicast/Unicast Services 采用非均匀注入水平的 LDPC-Coded LDM 系统,用于组合广播和多播/联播服务
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-16 DOI: 10.1109/TBC.2024.3394296
Hao Ju;Yin Xu;Ruiqi Liu;Dazhi He;Sungjun Ahn;Namho Hur;Sung-Ik Park;Wenjun Zhang;Yiyan Wu
Layered Division Multiplexing (LDM) is a Power-based Non-Orthogonal Multiplexing (P-NOM) technique that has been implemented in the Advanced Television System Committee (ATSC) 3.0 terrestrial TV physical layer to effectively multiplex services with different robustness and data rate requirements. As communication systems quickly evolve, the services to be delivered are becoming more diverse and versatile. Up to now, the LDM system adopted in the terrestrial TV system uses a uniform injection level for the lower-level (or Layer 2) signal injection. This paper investigates the non-uniform injection level LDM (NULDM). The proposed technique can explore the Unequal Error Protection (UEP) property of Low-Density Parity-Check (LDPC) codes and the flexible power allocation nature of the NULDM to improve the system performance and spectrum efficiency. NULDM enables the seamless integration of broadcast/multicast and unicast services in one RF channel, where the unicast signal can assign different resources (power, frequency, and time) based on the UE distance and service requirements. Meanwhile, more power could be allocated to improve the upper layer (or Layer 1) broadcast and datacast services. To make better use of the UEP property of LDPC codes in NULDM, the extended Gaussian mixture approximation (EGMA) method is used to design bit interleaving patterns. Additionally, inspired by the channel order of polar codes, this paper proposes an LDPC sub-block interleaving order (SBIO) scheme that performs similarly to the EGMA interleaving model, while better adapting to the diverse needs of proposed mixed service delivery scenarios for convergence of broadband wireless communications and broadcasting systems.
分层多路复用(LDM)是一种基于功率的非正交多路复用(P-NOM)技术,已在先进电视系统委员会(ATSC)3.0 地面电视物理层中实施,以有效地复用具有不同稳健性和数据速率要求的服务。随着通信系统的快速发展,所要提供的服务也越来越多样化和多功能化。迄今为止,地面电视系统采用的 LDM 系统对底层(或第 2 层)信号注入采用统一的注入电平。本文研究了非均匀注入电平 LDM(NULDM)。所提出的技术可利用低密度奇偶校验码(LDPC)的不等差保护(UEP)特性和 NULDM 灵活的功率分配特性来提高系统性能和频谱效率。NULDM 可在一个射频信道中无缝集成广播/多播和单播服务,其中单播信号可根据 UE 的距离和服务要求分配不同的资源(功率、频率和时间)。同时,还可分配更多功率,以改善上层(或第一层)广播和数据广播服务。为了在 NULDM 中更好地利用 LDPC 码的 UEP 特性,使用了扩展高斯混合近似(EGMA)方法来设计比特交错模式。此外,受极性编码信道顺序的启发,本文提出了一种 LDPC 子块交错顺序(SBIO)方案,其性能与 EGMA 交错模式类似,同时能更好地适应宽带无线通信和广播系统融合中混合服务交付场景的不同需求。
{"title":"LDPC-Coded LDM Systems Employing Non-Uniform Injection Level for Combining Broadcast and Multicast/Unicast Services","authors":"Hao Ju;Yin Xu;Ruiqi Liu;Dazhi He;Sungjun Ahn;Namho Hur;Sung-Ik Park;Wenjun Zhang;Yiyan Wu","doi":"10.1109/TBC.2024.3394296","DOIUrl":"10.1109/TBC.2024.3394296","url":null,"abstract":"Layered Division Multiplexing (LDM) is a Power-based Non-Orthogonal Multiplexing (P-NOM) technique that has been implemented in the Advanced Television System Committee (ATSC) 3.0 terrestrial TV physical layer to effectively multiplex services with different robustness and data rate requirements. As communication systems quickly evolve, the services to be delivered are becoming more diverse and versatile. Up to now, the LDM system adopted in the terrestrial TV system uses a uniform injection level for the lower-level (or Layer 2) signal injection. This paper investigates the non-uniform injection level LDM (NULDM). The proposed technique can explore the Unequal Error Protection (UEP) property of Low-Density Parity-Check (LDPC) codes and the flexible power allocation nature of the NULDM to improve the system performance and spectrum efficiency. NULDM enables the seamless integration of broadcast/multicast and unicast services in one RF channel, where the unicast signal can assign different resources (power, frequency, and time) based on the UE distance and service requirements. Meanwhile, more power could be allocated to improve the upper layer (or Layer 1) broadcast and datacast services. To make better use of the UEP property of LDPC codes in NULDM, the extended Gaussian mixture approximation (EGMA) method is used to design bit interleaving patterns. Additionally, inspired by the channel order of polar codes, this paper proposes an LDPC sub-block interleaving order (SBIO) scheme that performs similarly to the EGMA interleaving model, while better adapting to the diverse needs of proposed mixed service delivery scenarios for convergence of broadband wireless communications and broadcasting systems.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"1032-1043"},"PeriodicalIF":3.2,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141060278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Innovative Adaptive Web-Based Solution for Improved Remote Co-Creation and Delivery of Artistic Performances 改进艺术表演远程共同创作和交付的创新型自适应网络解决方案
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-13 DOI: 10.1109/TBC.2024.3363455
Mohammed Amine Togou;Anderson Augusto Simiscuka;Rohit Verma;Noel E. O’Connor;Iñigo Tamayo;Stefano Masneri;Mikel Zorrilla;Gabriel-Miro Muntean
Due to the COVID-19 pandemic, most arts and cultural activities have moved online. This has contributed to the surge in development of artistic tools that enable professional artists to produce engaging and immersive shows remotely. This article introduces TRACTION Co-Creation Stage (TCS), a novel Web-based solution, designed and developed in the context of the EU Horizon 2020 TRACTION project, which allows for remote creation and delivery of artistic shows. TCS supports multiple artists performing simultaneously, either live or pre-recorded, on multiple stages at different geographical locations. It employs a client-server approach. The client has two major components: Control and Display. The former is used by the production teams to create shows by specifying layouts, scenes, and media sources to be included. The latter is used by viewers to watch the various shows. To ensure viewers’ good quality of experience (QoE) levels, TCS employs adaptive streaming based on a novel Prioritised Adaptation solution based on the DASH standard for pre-recorded content delivery (PADA), which is introduced in this paper. User tests and experiments are carried out to evaluate the performance of TCS’ Control and Display applications and that of PADA algorithm when creating and distributing opera shows.
由于 COVID-19 的流行,大多数艺术和文化活动都转移到了网上。这推动了艺术工具的迅猛发展,使专业艺术家能够远程制作引人入胜、身临其境的节目。本文介绍了 "TRACTION 协同创作舞台"(TCS),这是一种基于网络的新型解决方案,由欧盟地平线 2020 TRACTION 项目设计和开发,可实现艺术表演的远程创作和交付。TCS 支持多位艺术家在不同地理位置的多个舞台上同时进行现场或预录制表演。它采用客户端-服务器方式。客户端有两个主要组件:控制和显示。前者供制作团队使用,通过指定布局、场景和媒体资源来制作节目。后者供观众观看各种节目。为确保观众获得良好的体验质量(QoE),TCS 采用了自适应流媒体技术,该技术基于本文介绍的预录内容传输 DASH 标准(PADA),是一种新颖的优先自适应解决方案。通过用户测试和实验,评估了 TCS 的控制和显示应用程序以及 PADA 算法在创建和分发歌剧节目时的性能。
{"title":"An Innovative Adaptive Web-Based Solution for Improved Remote Co-Creation and Delivery of Artistic Performances","authors":"Mohammed Amine Togou;Anderson Augusto Simiscuka;Rohit Verma;Noel E. O’Connor;Iñigo Tamayo;Stefano Masneri;Mikel Zorrilla;Gabriel-Miro Muntean","doi":"10.1109/TBC.2024.3363455","DOIUrl":"10.1109/TBC.2024.3363455","url":null,"abstract":"Due to the COVID-19 pandemic, most arts and cultural activities have moved online. This has contributed to the surge in development of artistic tools that enable professional artists to produce engaging and immersive shows remotely. This article introduces TRACTION Co-Creation Stage (TCS), a novel Web-based solution, designed and developed in the context of the EU Horizon 2020 TRACTION project, which allows for remote creation and delivery of artistic shows. TCS supports multiple artists performing simultaneously, either live or pre-recorded, on multiple stages at different geographical locations. It employs a client-server approach. The client has two major components: Control and Display. The former is used by the production teams to create shows by specifying layouts, scenes, and media sources to be included. The latter is used by viewers to watch the various shows. To ensure viewers’ good quality of experience (QoE) levels, TCS employs adaptive streaming based on a novel Prioritised Adaptation solution based on the DASH standard for pre-recorded content delivery (PADA), which is introduced in this paper. User tests and experiments are carried out to evaluate the performance of TCS’ Control and Display applications and that of PADA algorithm when creating and distributing opera shows.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 2","pages":"719-730"},"PeriodicalIF":4.5,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10472407","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140125226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep-Learning-Based Classifier With Custom Feature-Extraction Layers for Digitally Modulated Signals 基于深度学习的分类器,具有针对数字调制信号的自定义特征提取层
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-10 DOI: 10.1109/TBC.2024.3391056
John A. Snoap;Dimitrie C. Popescu;Chad M. Spooner
The paper presents a novel deep-learning (DL) based classifier for digitally modulated signals that uses a capsule network (CAP) with custom-designed feature extraction layers. The classifier takes the in-phase/quadrature (I/Q) components of the digitally modulated signal as input, and the feature extraction layers are inspired by cyclostationary signal processing (CSP) techniques, which extract the cyclic cumulant (CC) features that are employed by conventional CSP-based approaches to blind modulation classification and signal identification. Specifically, the feature extraction layers implement a proxy of the mathematical functions used in the calculation of the CC features and include a squaring layer, a raise-to-the-power-of-three layer, and a fast-Fourier-transform (FFT) layer, along with additional normalization and warping layers to ensure that the relative signal powers are retained and to prevent the trainable neural network (NN) layers from diverging in the training process. The classification performance and the generalization abilities of the proposed CAP are tested using two distinct datasets that contain similar classes of digitally modulated signals but that have been generated independently, and numerical results obtained reveal that the proposed CAP with novel feature extraction layers achieves high classification accuracy while also outperforming alternative DL-based approaches for signal classification in terms of both classification accuracy and generalization abilities.
本文介绍了一种基于深度学习(DL)的新型数字调制信号分类器,该分类器使用带有定制设计特征提取层的胶囊网络(CAP)。该分类器将数字调制信号的同相/正交(I/Q)分量作为输入,而特征提取层则受到环静止信号处理(CSP)技术的启发,该技术可提取循环累积(CC)特征,这些特征被传统的基于 CSP 的方法用于盲调制分类和信号识别。具体来说,特征提取层实现了用于计算 CC 特征的数学函数的代理,包括一个平方层、一个三倍功率层和一个快速傅里叶变换(FFT)层,以及额外的归一化和翘曲层,以确保保留相对信号功率,并防止可训练神经网络(NN)层在训练过程中发散。使用两个不同的数据集测试了所提出的 CAP 的分类性能和泛化能力,这两个数据集包含类似类别的数字调制信号,但都是独立生成的。数值结果表明,所提出的 CAP 连同新颖的特征提取层实现了较高的分类精度,同时在分类精度和泛化能力方面也优于其他基于 DL 的信号分类方法。
{"title":"Deep-Learning-Based Classifier With Custom Feature-Extraction Layers for Digitally Modulated Signals","authors":"John A. Snoap;Dimitrie C. Popescu;Chad M. Spooner","doi":"10.1109/TBC.2024.3391056","DOIUrl":"10.1109/TBC.2024.3391056","url":null,"abstract":"The paper presents a novel deep-learning (DL) based classifier for digitally modulated signals that uses a capsule network (CAP) with custom-designed feature extraction layers. The classifier takes the in-phase/quadrature (I/Q) components of the digitally modulated signal as input, and the feature extraction layers are inspired by cyclostationary signal processing (CSP) techniques, which extract the cyclic cumulant (CC) features that are employed by conventional CSP-based approaches to blind modulation classification and signal identification. Specifically, the feature extraction layers implement a proxy of the mathematical functions used in the calculation of the CC features and include a squaring layer, a raise-to-the-power-of-three layer, and a fast-Fourier-transform (FFT) layer, along with additional normalization and warping layers to ensure that the relative signal powers are retained and to prevent the trainable neural network (NN) layers from diverging in the training process. The classification performance and the generalization abilities of the proposed CAP are tested using two distinct datasets that contain similar classes of digitally modulated signals but that have been generated independently, and numerical results obtained reveal that the proposed CAP with novel feature extraction layers achieves high classification accuracy while also outperforming alternative DL-based approaches for signal classification in terms of both classification accuracy and generalization abilities.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"763-773"},"PeriodicalIF":3.2,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140934724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Removing Banding Artifacts in HDR Videos Generated From Inverse Tone Mapping 消除由反色调映射生成的 HDR 视频中的带状伪影
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-10 DOI: 10.1109/TBC.2024.3394297
Fei Zhou;Zikang Zheng;Guoping Qiu
Displaying standard dynamic range (SDR) videos on high dynamic range (HDR) devices requires inverse tone mapping (ITM). However, such mapping can introduce banding artifacts. This paper presents a banding removal method for inversely tone mapped HDR videos based on deep convolutional neural networks (DCNNs) and adaptive filtering. Three banding relevant feature maps are first extracted and then fed to two DCNNs, a ShapeNet and a PositionNet. The PositionNet learns a soft mask indicating the locations where banding is likely to have occurred and filtering is required while the ShapeNet predicts the filter shapes appropriate for different locations. An advantage of the method is that the adaptive filters can be jointly optimized with a learning-based ITM algorithm for creating high-quality HDR videos. Experimental results show that our method outperforms state-of-the-art algorithms qualitatively and quantitatively.
在高动态范围(HDR)设备上显示标准动态范围(SDR)视频需要反色调映射(ITM)。然而,这种映射会引入带状伪影。本文提出了一种基于深度卷积神经网络(DCNN)和自适应滤波的反色调映射 HDR 视频带状消除方法。首先提取三个与色带相关的特征图,然后将其馈送给两个 DCNN,一个是形状网络(ShapeNet),另一个是位置网络(PositionNet)。PositionNet 学习软掩码,指出可能发生带状化并需要滤波的位置,而 ShapeNet 则预测适合不同位置的滤波器形状。该方法的优势在于,自适应滤波器可与基于学习的 ITM 算法联合优化,以创建高质量的 HDR 视频。实验结果表明,我们的方法在质量和数量上都优于最先进的算法。
{"title":"Removing Banding Artifacts in HDR Videos Generated From Inverse Tone Mapping","authors":"Fei Zhou;Zikang Zheng;Guoping Qiu","doi":"10.1109/TBC.2024.3394297","DOIUrl":"10.1109/TBC.2024.3394297","url":null,"abstract":"Displaying standard dynamic range (SDR) videos on high dynamic range (HDR) devices requires inverse tone mapping (ITM). However, such mapping can introduce banding artifacts. This paper presents a banding removal method for inversely tone mapped HDR videos based on deep convolutional neural networks (DCNNs) and adaptive filtering. Three banding relevant feature maps are first extracted and then fed to two DCNNs, a ShapeNet and a PositionNet. The PositionNet learns a soft mask indicating the locations where banding is likely to have occurred and filtering is required while the ShapeNet predicts the filter shapes appropriate for different locations. An advantage of the method is that the adaptive filters can be jointly optimized with a learning-based ITM algorithm for creating high-quality HDR videos. Experimental results show that our method outperforms state-of-the-art algorithms qualitatively and quantitatively.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 2","pages":"753-762"},"PeriodicalIF":4.5,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140934722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal OFDM-IM Signals With Constant PAPR 具有恒定 PAPR 的最佳 OFDM-IM 信号
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-10 DOI: 10.1109/TBC.2024.3394292
Jiabo Hu;Yajun Wang;Zhuxian Lian;Yinjie Su;Zhibin Xie
Orthogonal frequency division multiplexing indexed modulation (OFDM-IM), an emerging multi-carrier modulation technique, offers significant advantages over traditional OFDM. The OFDM-IM scheme exhibits superior performance in terms of bit error rate (BER) at low and medium data rates, while also enhancing resilience to inter-carrier interference in dynamically changing channels. However, the challenge of a high peak-to-average ratio (PAPR) also persists in OFDM-IM. In this study, we propose a novel approach to mitigate PAPR by introducing a small dither signal to the idle subcarrier, leveraging the inherent characteristics of OFDM-IM. Subsequently, we address the nonconvex and non-smooth optimization problem of minimizing the maximum amplitude of dither signals while maintaining a constant PAPR constraint. To effectively tackle this challenging optimization task, we adopt the linearized alternating direction multiplier method (LADMM), referred to as the LADMM-direct algorithm, which provides a simple closed-form solution for each subproblem encountered during the optimization process. To improve the convergence rate of the LADMM-direct algorithm, a LADMM-relax algorithm is also proposed to address the PAPR problem. Simulation results demonstrate that our proposed LADMM-direct and LADMM-relax algorithms significantly reduce computational complexity and achieve superior performance in terms of both PAPR and bit error rate (BER) compared to state-of-the-art algorithms.
正交频分复用索引调制(OFDM-IM)是一种新兴的多载波调制技术,与传统的 OFDM 相比具有显著优势。OFDM-IM 方案在中低数据速率下的误码率(BER)方面表现出卓越的性能,同时还增强了对动态变化信道中载波间干扰的抗干扰能力。然而,OFDM-IM 仍然面临峰均比(PAPR)过高的挑战。在本研究中,我们提出了一种新方法,利用 OFDM-IM 的固有特性,通过在空闲子载波上引入小抖动信号来缓解 PAPR。随后,我们解决了一个非凸和非平滑的优化问题,即在保持恒定 PAPR 约束的同时,最小化抖动信号的最大振幅。为有效解决这一具有挑战性的优化任务,我们采用了线性化交替方向乘法器方法(LADMM),即 LADMM-direct算法,该算法可为优化过程中遇到的每个子问题提供简单的闭式解。为了提高 LADMM-direct 算法的收敛速度,还提出了一种 LADMM-relax 算法来解决 PAPR 问题。仿真结果表明,与最先进的算法相比,我们提出的 LADMM-直接算法和 LADMM-relax 算法大大降低了计算复杂度,并在 PAPR 和误码率 (BER) 方面取得了优异的性能。
{"title":"Optimal OFDM-IM Signals With Constant PAPR","authors":"Jiabo Hu;Yajun Wang;Zhuxian Lian;Yinjie Su;Zhibin Xie","doi":"10.1109/TBC.2024.3394292","DOIUrl":"10.1109/TBC.2024.3394292","url":null,"abstract":"Orthogonal frequency division multiplexing indexed modulation (OFDM-IM), an emerging multi-carrier modulation technique, offers significant advantages over traditional OFDM. The OFDM-IM scheme exhibits superior performance in terms of bit error rate (BER) at low and medium data rates, while also enhancing resilience to inter-carrier interference in dynamically changing channels. However, the challenge of a high peak-to-average ratio (PAPR) also persists in OFDM-IM. In this study, we propose a novel approach to mitigate PAPR by introducing a small dither signal to the idle subcarrier, leveraging the inherent characteristics of OFDM-IM. Subsequently, we address the nonconvex and non-smooth optimization problem of minimizing the maximum amplitude of dither signals while maintaining a constant PAPR constraint. To effectively tackle this challenging optimization task, we adopt the linearized alternating direction multiplier method (LADMM), referred to as the LADMM-direct algorithm, which provides a simple closed-form solution for each subproblem encountered during the optimization process. To improve the convergence rate of the LADMM-direct algorithm, a LADMM-relax algorithm is also proposed to address the PAPR problem. Simulation results demonstrate that our proposed LADMM-direct and LADMM-relax algorithms significantly reduce computational complexity and achieve superior performance in terms of both PAPR and bit error rate (BER) compared to state-of-the-art algorithms.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"945-954"},"PeriodicalIF":3.2,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140934688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Mode and CU Splitting Decision for Intra Prediction in VVC SCC 针对 VVC SCC 内部预测的快速模式和 CU 分割决策
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-10 DOI: 10.1109/TBC.2024.3394288
Dayong Wang;Junyi Yu;Xin Lu;Frederic Dufaux;Bo Hang;Hui Guo;Ce Zhu
Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs a quad-tree plus multi-type tree (QTMT) coding structure for Coding Unit (CU) partitioning and screen content Coding Modes (CMs) selection. While VVC SCC achieves high coding efficiency, its coding complexity poses a significant obstacle to the further widespread adoption of screen content video. Hence, it is crucial to enhance the coding speed of VVC SCC. In this paper, we propose a fast mode and splitting decision for Intra prediction in VVC SCC. Specifically, we initially exploit deep learning techniques to predict content types for all CUs. Subsequently, we examine CM distributions of different content types to predict candidate CMs for CUs. We then introduce early skip and early terminate CM decisions for different content types of CUs to further eliminate unlikely CMs. Finally, we develop Block-based Differential Pulse-Code Modulation (BDPCM) early termination and CU splitting early termination to improve coding speed. Experimental results demonstrate that the proposed algorithm improves coding speed on average by 41.14%, with the BDBR increasing by 1.17%.
目前,屏幕内容视频应用在我们的日常生活中越来越广泛。最新的屏幕内容编码(SCC)标准,即多功能视频编码(VVC)SCC,采用四叉树加多类型树(QTMT)编码结构进行编码单元(CU)划分和屏幕内容编码模式(CM)选择。虽然 VVC SCC 实现了较高的编码效率,但其编码复杂性对屏幕内容视频的进一步广泛应用构成了重大障碍。因此,提高 VVC SCC 的编码速度至关重要。本文提出了 VVC SCC 中内预测的快速模式和分割决策。具体来说,我们首先利用深度学习技术来预测所有 CU 的内容类型。随后,我们检查不同内容类型的 CM 分布,预测 CU 的候选 CM。然后,我们针对不同内容类型的 CU 引入早期跳过和早期终止 CM 的决策,以进一步消除不可能的 CM。最后,我们开发了基于块的差分脉冲编码调制(BDPCM)早期终止和 CU 分割早期终止,以提高编码速度。实验结果表明,所提算法的编码速度平均提高了 41.14%,BDBR 提高了 1.17%。
{"title":"Fast Mode and CU Splitting Decision for Intra Prediction in VVC SCC","authors":"Dayong Wang;Junyi Yu;Xin Lu;Frederic Dufaux;Bo Hang;Hui Guo;Ce Zhu","doi":"10.1109/TBC.2024.3394288","DOIUrl":"10.1109/TBC.2024.3394288","url":null,"abstract":"Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs a quad-tree plus multi-type tree (QTMT) coding structure for Coding Unit (CU) partitioning and screen content Coding Modes (CMs) selection. While VVC SCC achieves high coding efficiency, its coding complexity poses a significant obstacle to the further widespread adoption of screen content video. Hence, it is crucial to enhance the coding speed of VVC SCC. In this paper, we propose a fast mode and splitting decision for Intra prediction in VVC SCC. Specifically, we initially exploit deep learning techniques to predict content types for all CUs. Subsequently, we examine CM distributions of different content types to predict candidate CMs for CUs. We then introduce early skip and early terminate CM decisions for different content types of CUs to further eliminate unlikely CMs. Finally, we develop Block-based Differential Pulse-Code Modulation (BDPCM) early termination and CU splitting early termination to improve coding speed. Experimental results demonstrate that the proposed algorithm improves coding speed on average by 41.14%, with the BDBR increasing by 1.17%.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"872-883"},"PeriodicalIF":3.2,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140934479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment 用于人工智能生成的盲图像质量评估的自适应混合尺度特征融合网络
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-06 DOI: 10.1109/TBC.2024.3391060
Tianwei Zhou;Songbai Tan;Wei Zhou;Yu Luo;Yuan-Gen Wang;Guanghui Yue
With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a novel blind image quality assessment (IQA) network, named AMFF-Net, for AGIs. AMFF-Net evaluates AGI quality from three dimensions, i.e., “visual quality”, “authenticity”, and “consistency”. Specifically, inspired by the characteristics of the human visual system and motivated by the observation that “visual quality” and “authenticity” are characterized by both local and global aspects, AMFF-Net scales the image up and down and takes the scaled images and original-sized image as the inputs to obtain multi-scale features. After that, an Adaptive Feature Fusion (AFF) block is used to adaptively fuse the multi-scale features with learnable weights. In addition, considering the correlation between the image and prompt, AMFF-Net compares the semantic features from text encoder and image encoder to evaluate the text-to-image alignment. We carry out extensive experiments on three AGI quality assessment databases, and the experimental results show that our AMFF-Net obtains better performance than nine state-of-the-art blind IQA methods. The results of ablation experiments further demonstrate the effectiveness of the proposed multi-scale input strategy and AFF block.
随着文本到图像和图像到图像生成模型的日益成熟,人工智能生成的图像(AGIs)在广告、娱乐、教育、社交媒体等领域显示出巨大的应用潜力。尽管在生成模型方面已经取得了令人瞩目的进展,但很少有人致力于设计相关的质量评估模型。在本文中,我们为 AGIs 提出了一种新型盲图像质量评估(IQA)网络,名为 AMFF-Net。AMFF-Net 从三个维度评估 AGI 质量,即 "视觉质量"、"真实性 "和 "一致性"。具体来说,AMFF-Net 受人类视觉系统特征的启发,并观察到 "视觉质量 "和 "真实性 "具有局部和全局两个方面的特征,因此将图像进行上下缩放,并将缩放后的图像和原始大小的图像作为输入,从而获得多尺度特征。然后,使用自适应特征融合(AFF)模块,利用可学习权重对多尺度特征进行自适应融合。此外,考虑到图像和提示之间的相关性,AMFF-Net 还会比较来自文本编码器和图像编码器的语义特征,以评估文本到图像的对齐情况。我们在三个 AGI 质量评估数据库上进行了大量实验,实验结果表明我们的 AMFF-Net 比九种最先进的盲 IQA 方法获得了更好的性能。消融实验结果进一步证明了所提出的多尺度输入策略和 AFF 块的有效性。
{"title":"Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment","authors":"Tianwei Zhou;Songbai Tan;Wei Zhou;Yu Luo;Yuan-Gen Wang;Guanghui Yue","doi":"10.1109/TBC.2024.3391060","DOIUrl":"10.1109/TBC.2024.3391060","url":null,"abstract":"With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a novel blind image quality assessment (IQA) network, named AMFF-Net, for AGIs. AMFF-Net evaluates AGI quality from three dimensions, i.e., “visual quality”, “authenticity”, and “consistency”. Specifically, inspired by the characteristics of the human visual system and motivated by the observation that “visual quality” and “authenticity” are characterized by both local and global aspects, AMFF-Net scales the image up and down and takes the scaled images and original-sized image as the inputs to obtain multi-scale features. After that, an Adaptive Feature Fusion (AFF) block is used to adaptively fuse the multi-scale features with learnable weights. In addition, considering the correlation between the image and prompt, AMFF-Net compares the semantic features from text encoder and image encoder to evaluate the text-to-image alignment. We carry out extensive experiments on three AGI quality assessment databases, and the experimental results show that our AMFF-Net obtains better performance than nine state-of-the-art blind IQA methods. The results of ablation experiments further demonstrate the effectiveness of the proposed multi-scale input strategy and AFF block.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"833-843"},"PeriodicalIF":3.2,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140886564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Publication Information 电气和电子工程师学会《广播学报》出版信息
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-05 DOI: 10.1109/TBC.2024.3364857
{"title":"IEEE Transactions on Broadcasting Publication Information","authors":"","doi":"10.1109/TBC.2024.3364857","DOIUrl":"10.1109/TBC.2024.3364857","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"C2-C2"},"PeriodicalIF":4.5,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10460256","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140046594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Information for Authors 电气和电子工程师学会(IEEE)《关于广播作者信息的论文集》(IEEE Transactions on Broadcasting Information for Authors
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-03-05 DOI: 10.1109/TBC.2024.3364859
{"title":"IEEE Transactions on Broadcasting Information for Authors","authors":"","doi":"10.1109/TBC.2024.3364859","DOIUrl":"https://doi.org/10.1109/TBC.2024.3364859","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 1","pages":"C3-C4"},"PeriodicalIF":4.5,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10460268","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140042909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Assessment for LDM Transmission Based on DVB Satellite Standard 基于 DVB 卫星标准的 LDM 传输性能评估
IF 4.5 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-02-29 DOI: 10.1109/TBC.2024.3349789
Pansoo Kim;Hyuncheol Park
Variable Coding and Modulation (VCM), one of the Orthogonal Multiple Access (OMA) schemes and also known as channel adaptive Time-sharing Division Multiplexing (TDM) in satellite broadcasting and communication systems, has been widely utilized to mitigate heavy rain fading in Ku/Ka-band to enhance link availability in DVB-S2x (Digital Video Broadcasting - Satellite 2nd generation eXtension) standard. For next-generation satellite broadcasting and communication, we exploit Layer Division Multiplexing (LDM) technology, which is also referred to as Non-Orthogonal Multiple Access (NOMA). We conduct a performance assessment of VCM in terms of the total achievable data rate under Additive White Gaussian Noise (AWGN) channel and nonlinear satellite High Power Amplifier (HPA) impairments. In addition, we consider realistic Radio Frequency (RF) inaccuracies characterized by timing and carrier offsets. Through the identification of the performance impacts, we propose a robust carrier phase synchronization scheme to mitigate phase noise impairment. Numerical results demonstrate that our proposed scheme can enhance Packet Error Rate (PER) performance compared to the conventional one in a phase noise environment.
可变编码和调制(VCM)是卫星广播和通信系统中的正交多路存取(OMA)方案之一,也被称为信道自适应分时多路复用(TDM),在 DVB-S2x(数字视频广播-卫星第二代扩展)标准中被广泛用于缓解 Ku/Ka 波段的暴雨衰落,以提高链路可用性。对于下一代卫星广播和通信,我们采用了层分复用(LDM)技术,该技术也被称为非正交多址接入(NOMA)。我们根据加性白高斯噪声(AWGN)信道和非线性卫星高功率放大器(HPA)损伤条件下的总可实现数据速率对 VCM 进行了性能评估。此外,我们还考虑了以定时和载波偏移为特征的现实射频(RF)误差。通过识别性能影响,我们提出了一种稳健的载波相位同步方案,以减轻相位噪声的损害。数值结果表明,与相位噪声环境下的传统方案相比,我们提出的方案可以提高数据包错误率(PER)性能。
{"title":"Performance Assessment for LDM Transmission Based on DVB Satellite Standard","authors":"Pansoo Kim;Hyuncheol Park","doi":"10.1109/TBC.2024.3349789","DOIUrl":"10.1109/TBC.2024.3349789","url":null,"abstract":"Variable Coding and Modulation (VCM), one of the Orthogonal Multiple Access (OMA) schemes and also known as channel adaptive Time-sharing Division Multiplexing (TDM) in satellite broadcasting and communication systems, has been widely utilized to mitigate heavy rain fading in Ku/Ka-band to enhance link availability in DVB-S2x (Digital Video Broadcasting - Satellite 2nd generation eXtension) standard. For next-generation satellite broadcasting and communication, we exploit Layer Division Multiplexing (LDM) technology, which is also referred to as Non-Orthogonal Multiple Access (NOMA). We conduct a performance assessment of VCM in terms of the total achievable data rate under Additive White Gaussian Noise (AWGN) channel and nonlinear satellite High Power Amplifier (HPA) impairments. In addition, we consider realistic Radio Frequency (RF) inaccuracies characterized by timing and carrier offsets. Through the identification of the performance impacts, we propose a robust carrier phase synchronization scheme to mitigate phase noise impairment. Numerical results demonstrate that our proposed scheme can enhance Packet Error Rate (PER) performance compared to the conventional one in a phase noise environment.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 2","pages":"382-390"},"PeriodicalIF":4.5,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140009014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Broadcasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1