首页 > 最新文献

IEEE Transactions on Broadcasting最新文献

英文 中文
LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution LFIC-DRASC:基于非纠缠表示和非对称条卷积的深光场图像压缩
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-07-03 DOI: 10.1109/TBC.2025.3579225
Shiyu Feng;Yun Zhang;Linwei Zhu;Sam Kwong
Light-Field (LF) image is emerging 4D data of light rays that is capable of realistically presenting spatial and angular information of 3D scene. However, the large data volume of LF images becomes the most challenging issue in real-time processing, transmission, and storage. In this paper, we propose an end-to-end deep LF Image Compression method Using Disentangled Representation and Asymmetrical Strip Convolution (LFIC-DRASC) to improve coding efficiency. Firstly, we formulate the LF image compression problem as learning a disentangled LF representation network and an image encoding-decoding network. Secondly, we propose two novel feature extractors that leverage the structural prior of LF data by integrating features across different dimensions. Meanwhile, disentangled LF representation network is proposed to enhance the LF feature disentangling and decoupling. Thirdly, we propose the LFIC-DRASC for LF image compression, where two Asymmetrical Strip Convolution (ASC) operators, i.e., horizontal and vertical, are proposed to capture long-range correlation in LF feature space. These two ASC operators can be combined with the square convolution to further decouple LF features, which enhances the model’s ability in representing intricate spatial relationships. Experimental results demonstrate that the proposed LFIC-DRASC achieves an average of 20.5% bit rate reductions compared with the state-of-the-art methods. Source code and pre-trained models of LFIC-DRASC are available at https://github.com/SYSU-Video/LFIC-DRASC.
光场图像是一种能够真实呈现三维场景空间和角度信息的新兴的光线四维数据。然而,LF图像的大数据量在实时处理、传输和存储方面成为最具挑战性的问题。为了提高编码效率,本文提出了一种基于非纠缠表示和非对称条卷积(LFIC-DRASC)的端到端深度LF图像压缩方法。首先,我们将LF图像压缩问题表述为学习一个解纠缠的LF表示网络和一个图像编解码网络。其次,我们提出了两种新的特征提取器,通过整合不同维度的特征来利用LF数据的结构先验。同时,提出解纠缠LF表示网络,增强LF特征的解纠缠和解耦性。第三,我们提出了LF图像压缩的LFIC-DRASC算法,其中提出了水平和垂直两个不对称条卷积算子(ASC)来捕获LF特征空间中的远程相关性。这两种ASC算子可以与平方卷积相结合,进一步解耦LF特征,增强了模型表示复杂空间关系的能力。实验结果表明,与目前最先进的方法相比,LFIC-DRASC平均降低了20.5%的比特率。LFIC-DRASC的源代码和预训练模型可在https://github.com/SYSU-Video/LFIC-DRASC上获得。
{"title":"LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution","authors":"Shiyu Feng;Yun Zhang;Linwei Zhu;Sam Kwong","doi":"10.1109/TBC.2025.3579225","DOIUrl":"https://doi.org/10.1109/TBC.2025.3579225","url":null,"abstract":"Light-Field (LF) image is emerging 4D data of light rays that is capable of realistically presenting spatial and angular information of 3D scene. However, the large data volume of LF images becomes the most challenging issue in real-time processing, transmission, and storage. In this paper, we propose an end-to-end deep LF Image Compression method Using Disentangled Representation and Asymmetrical Strip Convolution (LFIC-DRASC) to improve coding efficiency. Firstly, we formulate the LF image compression problem as learning a disentangled LF representation network and an image encoding-decoding network. Secondly, we propose two novel feature extractors that leverage the structural prior of LF data by integrating features across different dimensions. Meanwhile, disentangled LF representation network is proposed to enhance the LF feature disentangling and decoupling. Thirdly, we propose the LFIC-DRASC for LF image compression, where two Asymmetrical Strip Convolution (ASC) operators, i.e., horizontal and vertical, are proposed to capture long-range correlation in LF feature space. These two ASC operators can be combined with the square convolution to further decouple LF features, which enhances the model’s ability in representing intricate spatial relationships. Experimental results demonstrate that the proposed LFIC-DRASC achieves an average of 20.5% bit rate reductions compared with the state-of-the-art methods. Source code and pre-trained models of LFIC-DRASC are available at <uri>https://github.com/SYSU-Video/LFIC-DRASC</uri>.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"889-902"},"PeriodicalIF":4.8,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144997131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MMSE Precoding for Reliability Enhancement in Downlink MISO-RSMA Systems 基于MMSE预编码的下行MISO-RSMA系统可靠性提高
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-27 DOI: 10.1109/TBC.2025.3579251
Xuehan Wang;Jintao Wang;Jinhong Yuan;Jian Song
Rate splitting multiple access (RSMA) has been regarded as one of the most promising technologies for the next-generation broadcasting and mobile communication systems. Many prior designs for RSMA systems focused on the capacity optimization from the information-theoretic analysis, while the reliability in realistic deployment is less considered. To this end, the linear precoder of downlink multiple-input single-output (MISO)-RSMA systems is elaborately designed in this paper by minimizing the mean square error (MSE) associated with the worst user equipment (UE). The optimization problem is first formulated by investigating the MSE for each UE from the view of signal processing, where the zero-forcing (ZF) precoding is utilized for the private messages. The minimum MSE (MMSE) precoding is then obtained by utilizing the semi-definite relaxation (SDR) for the common precoding vector and deriving the closed-form optimal power allocation coefficient between private and common messages. A heuristic closed-form solution is then developed to reduce the complexity caused by the semi-definite programming (SDP). Simulation results demonstrate the reliability superiority of the proposed schemes beyond space-division multiple access (SDMA) and conventional RSMA approaches based on the max-min fairness (MMF) rate optimization even though the near-optimal weighted MMSE algorithm can be deployed.
速率分割多址(RSMA)被认为是下一代广播和移动通信系统中最有前途的技术之一。以往的RSMA系统设计多侧重于从信息论分析出发的容量优化,而对实际部署中的可靠性考虑较少。为此,本文通过最小化与最差用户设备(UE)相关的均方误差(MSE),精心设计了下行链路多输入单输出(MISO)-RSMA系统的线性预编码器。优化问题首先从信号处理的角度研究每个UE的MSE,其中对私有消息使用零强制(ZF)预编码。然后利用公共预编码向量的半定松弛(SDR),推导出私有和公共消息之间的封闭式最优功率分配系数,得到最小MSE (MMSE)预编码。然后提出了一种启发式封闭解,以降低半确定规划(SDP)所带来的复杂性。仿真结果表明,即使可以部署接近最优的加权MMSE算法,所提出的方案也比基于最大最小公平性(MMF)速率优化的空分多址(SDMA)和传统的RSMA方法具有可靠性优势。
{"title":"MMSE Precoding for Reliability Enhancement in Downlink MISO-RSMA Systems","authors":"Xuehan Wang;Jintao Wang;Jinhong Yuan;Jian Song","doi":"10.1109/TBC.2025.3579251","DOIUrl":"https://doi.org/10.1109/TBC.2025.3579251","url":null,"abstract":"Rate splitting multiple access (RSMA) has been regarded as one of the most promising technologies for the next-generation broadcasting and mobile communication systems. Many prior designs for RSMA systems focused on the capacity optimization from the information-theoretic analysis, while the reliability in realistic deployment is less considered. To this end, the linear precoder of downlink multiple-input single-output (MISO)-RSMA systems is elaborately designed in this paper by minimizing the mean square error (MSE) associated with the worst user equipment (UE). The optimization problem is first formulated by investigating the MSE for each UE from the view of signal processing, where the zero-forcing (ZF) precoding is utilized for the private messages. The minimum MSE (MMSE) precoding is then obtained by utilizing the semi-definite relaxation (SDR) for the common precoding vector and deriving the closed-form optimal power allocation coefficient between private and common messages. A heuristic closed-form solution is then developed to reduce the complexity caused by the semi-definite programming (SDP). Simulation results demonstrate the reliability superiority of the proposed schemes beyond space-division multiple access (SDMA) and conventional RSMA approaches based on the max-min fairness (MMF) rate optimization even though the near-optimal weighted MMSE algorithm can be deployed.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"732-740"},"PeriodicalIF":4.8,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144997920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure Video Quality Assessment Resisting Adversarial Attacks 安全视频质量评估抵御对抗性攻击
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-24 DOI: 10.1109/TBC.2025.3575339
Ao-Xiang Zhang;Yuan-Gen Wang;Yu Ran;Weixuan Tang;Qingxiao Guan;Chunsheng Yang
The exponential surge in video traffic has intensified the imperative for Video Quality Assessment (VQA). Leveraging cutting-edge architectures, current VQA models have achieved human-comparable accuracy. However, recent studies have revealed the vulnerability of existing VQA models against adversarial attacks. To establish a reliable and practical assessment system, a secure VQA model capable of resisting such malicious attacks is urgently demanded. Unfortunately, no attempt has been made to explore this issue. This paper first attempts to investigate general adversarial defense principles, aiming to endow existing VQA models with security. Specifically, we first introduce random spatial grid sampling on the video frame for intra-frame defense. Then, we design pixel-wise randomization through a guardian map, globally neutralizing adversarial perturbations. Meanwhile, we extract temporal information from the video sequence as compensation for inter-frame defense. Building upon these principles, we present a novel VQA framework from a security-oriented perspective, termed SecureVQA. Extensive experiments indicate that SecureVQA sets a new benchmark in security while achieving competitive VQA performance compared with state-of-the-art models. Ablation studies delve deeper into analyzing the principles of SecureVQA, demonstrating their generalization and contributions to the security of leading VQA models. The code is available at https://github.com/GZHU-DVL/SecureVQA.
视频流量呈指数级增长,使得视频质量评估(VQA)变得更加迫切。利用尖端的架构,当前的VQA模型已经达到了与人类相当的精度。然而,最近的研究揭示了现有VQA模型在对抗对抗性攻击时的脆弱性。为了建立一个可靠实用的评估体系,迫切需要一种能够抵御此类恶意攻击的安全VQA模型。不幸的是,没有人尝试去探讨这个问题。本文首先尝试研究一般的对抗性防御原理,旨在赋予现有VQA模型以安全性。具体来说,我们首先在视频帧上引入随机空间网格采样来进行帧内防御。然后,我们通过守护图设计逐像素随机化,全局中和对抗性扰动。同时,我们从视频序列中提取时间信息作为帧间防御的补偿。在这些原则的基础上,我们从面向安全的角度提出了一个新的VQA框架,称为SecureVQA。广泛的实验表明,SecureVQA在安全性方面树立了新的基准,同时与最先进的模型相比,实现了具有竞争力的VQA性能。消融研究更深入地分析了SecureVQA的原理,展示了它们的泛化和对领先VQA模型安全性的贡献。代码可在https://github.com/GZHU-DVL/SecureVQA上获得。
{"title":"Secure Video Quality Assessment Resisting Adversarial Attacks","authors":"Ao-Xiang Zhang;Yuan-Gen Wang;Yu Ran;Weixuan Tang;Qingxiao Guan;Chunsheng Yang","doi":"10.1109/TBC.2025.3575339","DOIUrl":"https://doi.org/10.1109/TBC.2025.3575339","url":null,"abstract":"The exponential surge in video traffic has intensified the imperative for Video Quality Assessment (VQA). Leveraging cutting-edge architectures, current VQA models have achieved human-comparable accuracy. However, recent studies have revealed the vulnerability of existing VQA models against adversarial attacks. To establish a reliable and practical assessment system, a secure VQA model capable of resisting such malicious attacks is urgently demanded. Unfortunately, no attempt has been made to explore this issue. This paper first attempts to investigate general adversarial defense principles, aiming to endow existing VQA models with security. Specifically, we first introduce random spatial grid sampling on the video frame for intra-frame defense. Then, we design pixel-wise randomization through a guardian map, globally neutralizing adversarial perturbations. Meanwhile, we extract temporal information from the video sequence as compensation for inter-frame defense. Building upon these principles, we present a novel VQA framework from a security-oriented perspective, termed SecureVQA. Extensive experiments indicate that SecureVQA sets a new benchmark in security while achieving competitive VQA performance compared with state-of-the-art models. Ablation studies delve deeper into analyzing the principles of SecureVQA, demonstrating their generalization and contributions to the security of leading VQA models. The code is available at <uri>https://github.com/GZHU-DVL/SecureVQA</uri>.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"821-832"},"PeriodicalIF":4.8,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
When Multipath QUIC Meets Model Predictive Control and Band Sparse Network Coding: A Novel Multipathing Solution for Video Streaming Over Heterogeneous Wireless Networks 当多路径QUIC满足模型预测控制和频带稀疏网络编码:一种新的异构无线网络视频流多路径解决方案
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-16 DOI: 10.1109/TBC.2025.3570838
Yuanlong Cao;Haopeng Zhang;Ming Jiang;Yirui Jiang;Jinquan Nie
Multipath Quick UDP Internet Connections (MPQUIC) integrated with network coding offers a promising approach to improving the Quality of Experience (QoE) for video services over heterogeneous wireless networks. However, a significant challenge arises when encoding nodes transmit potentially redundant packets while awaiting decoding acknowledgments (ACKs) from endpoints. This behavior can limit effective transmission rates, thereby degrading real-time streaming performance and user QoE. In this paper, we propose MP2-QUIC, which addresses these challenges through a novel adaptive Model Predictive Control (MPC) framework for MPQUIC that optimizes both congestion window and encoding redundancy parameters via a discrete state transition model. By incorporating operating point linearization and leveraging the Central Limit Theorem, MP2-QUIC effectively enhances the control performance and effective throughput of the model in heterogeneous wireless network environments. MP2-QUIC further employs Band-Sparse Network Coding (Band-SNC) to minimize computational complexity at endpoints, while utilizing queuing theory principles to determine optimal encoded packet quantities. This integrated approach significantly enhances end-user QoE, and the experimental results demonstrate MP2-QUIC’s superior performance compared to existing MPQUIC encoding solutions, yielding a 68.85% reduction in peak decoding overhead and marked improvements in Peak Signal-to-Noise Ratio (PSNR).
与网络编码相结合的多路径快速UDP互联网连接(MPQUIC)为异构无线网络上视频服务的体验质量(QoE)提高提供了一种很有前途的方法。然而,当编码节点在等待端点的解码确认(ack)时传输可能冗余的数据包时,会出现一个重大挑战。这种行为会限制有效的传输速率,从而降低实时流性能和用户QoE。在本文中,我们提出MP2-QUIC,它通过MPQUIC的一种新的自适应模型预测控制(MPC)框架来解决这些挑战,该框架通过离散状态转换模型优化拥塞窗口和编码冗余参数。MP2-QUIC通过引入工作点线性化并利用中心极限定理,有效地提高了模型在异构无线网络环境下的控制性能和有效吞吐量。MP2-QUIC进一步采用带稀疏网络编码(Band-SNC)来最小化端点的计算复杂度,同时利用排队论原理来确定最佳编码包数量。这种集成方法显著提高了终端用户的QoE,实验结果表明MP2-QUIC与现有的MPQUIC编码方案相比性能优越,峰值解码开销降低了68.85%,峰值信噪比(PSNR)显著提高。
{"title":"When Multipath QUIC Meets Model Predictive Control and Band Sparse Network Coding: A Novel Multipathing Solution for Video Streaming Over Heterogeneous Wireless Networks","authors":"Yuanlong Cao;Haopeng Zhang;Ming Jiang;Yirui Jiang;Jinquan Nie","doi":"10.1109/TBC.2025.3570838","DOIUrl":"https://doi.org/10.1109/TBC.2025.3570838","url":null,"abstract":"Multipath Quick UDP Internet Connections (MPQUIC) integrated with network coding offers a promising approach to improving the Quality of Experience (QoE) for video services over heterogeneous wireless networks. However, a significant challenge arises when encoding nodes transmit potentially redundant packets while awaiting decoding acknowledgments (ACKs) from endpoints. This behavior can limit effective transmission rates, thereby degrading real-time streaming performance and user QoE. In this paper, we propose MP2-QUIC, which addresses these challenges through a novel adaptive Model Predictive Control (MPC) framework for MPQUIC that optimizes both congestion window and encoding redundancy parameters via a discrete state transition model. By incorporating operating point linearization and leveraging the Central Limit Theorem, MP2-QUIC effectively enhances the control performance and effective throughput of the model in heterogeneous wireless network environments. MP2-QUIC further employs Band-Sparse Network Coding (Band-SNC) to minimize computational complexity at endpoints, while utilizing queuing theory principles to determine optimal encoded packet quantities. This integrated approach significantly enhances end-user QoE, and the experimental results demonstrate MP2-QUIC’s superior performance compared to existing MPQUIC encoding solutions, yielding a 68.85% reduction in peak decoding overhead and marked improvements in Peak Signal-to-Noise Ratio (PSNR).","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"756-773"},"PeriodicalIF":4.8,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Protograph-Based Raptor-Like LDPC Codes With an Add-On Structure for Reliable Communications 基于原型的猛禽类LDPC代码,具有可靠通信的附加结构
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-12 DOI: 10.1109/TBC.2025.3575341
Hyejin Ro;Junghyun Kim;Hosung Park;Sang-Hyo Kim;Seok-Ki Ahn;Sung-Ik Park
The 5G multicast and broadcast service (MBS) has been discussed since 3GPP Release 17, emphasizing resource-efficient transmission for multiple users. A primary focus of 5G MBS is enhancing reliability, even for the broadcast mode without retransmissions. In discussing 6G, the hyper reliable communication is also an important use case. In this context, the design of channel codes with low error floors is crucial to ensure robust communication for such demanding scenarios. Protograph-based raptor-like (PBRL) low-density parity-check (LDPC) codes have good error-correcting performance and rate-compatibility but the construction has focused on waterfall rather than error floor. In this paper, we propose an add-on structure for PBRL LDPC codes to have low error floors, which consists of edges added on the protographs of original PBRL LDPC codes. The added edges play a role of boosting up the reliability of weak variable nodes in the original PBRL LDPC codes. We propose two construction algorithms, one for use at a fixed rate and the other for rate-compatible use. It is shown via simulations that the proposed codes have lower error floors than the original PBRL LDPC codes for various rates. Since the edge addition does not change the existing edge connections in the protograph, an adaptive use with/without the add-on structure has an effect of implementing two PBRL LDPC codes for high-speed and reliable communications in an efficient way while keeping the system backward-compatible with the original PBRL LDPC code.
5G组播和广播业务(MBS)从3GPP Release 17开始讨论,强调多用户的资源高效传输。5G MBS的主要焦点是提高可靠性,即使在没有重传的广播模式下也是如此。在讨论6G时,超可靠通信也是一个重要的用例。在这种情况下,设计具有低错误层的信道编码对于确保这种要求苛刻的场景的健壮通信至关重要。基于原型的类猛禽(PBRL)低密度奇偶校验(LDPC)码具有良好的纠错性能和码率兼容性,但其结构侧重于瀑布而非错误层。本文提出了一种低错误层的PBRL LDPC码附加结构,该附加结构由在原始PBRL LDPC码的原型图上添加的边组成。在原PBRL LDPC编码中,增加的边对弱变量节点的可靠性起到了提高的作用。我们提出了两种构造算法,一种用于固定速率,另一种用于速率兼容使用。仿真结果表明,在不同的速率下,本文提出的码比原PBRL LDPC码具有更低的误差层。由于边缘添加不会改变原型机中现有的边缘连接,因此有/没有附加结构的自适应使用具有实现高速可靠通信的两个PBRL LDPC码的效果,同时保持系统与原始PBRL LDPC码的向后兼容。
{"title":"Protograph-Based Raptor-Like LDPC Codes With an Add-On Structure for Reliable Communications","authors":"Hyejin Ro;Junghyun Kim;Hosung Park;Sang-Hyo Kim;Seok-Ki Ahn;Sung-Ik Park","doi":"10.1109/TBC.2025.3575341","DOIUrl":"https://doi.org/10.1109/TBC.2025.3575341","url":null,"abstract":"The 5G multicast and broadcast service (MBS) has been discussed since 3GPP Release 17, emphasizing resource-efficient transmission for multiple users. A primary focus of 5G MBS is enhancing reliability, even for the broadcast mode without retransmissions. In discussing 6G, the hyper reliable communication is also an important use case. In this context, the design of channel codes with low error floors is crucial to ensure robust communication for such demanding scenarios. Protograph-based raptor-like (PBRL) low-density parity-check (LDPC) codes have good error-correcting performance and rate-compatibility but the construction has focused on waterfall rather than error floor. In this paper, we propose an add-on structure for PBRL LDPC codes to have low error floors, which consists of edges added on the protographs of original PBRL LDPC codes. The added edges play a role of boosting up the reliability of weak variable nodes in the original PBRL LDPC codes. We propose two construction algorithms, one for use at a fixed rate and the other for rate-compatible use. It is shown via simulations that the proposed codes have lower error floors than the original PBRL LDPC codes for various rates. Since the edge addition does not change the existing edge connections in the protograph, an adaptive use with/without the add-on structure has an effect of implementing two PBRL LDPC codes for high-speed and reliable communications in an efficient way while keeping the system backward-compatible with the original PBRL LDPC code.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"717-731"},"PeriodicalIF":4.8,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAIP: A Multi-Attribute Informativeness Proxy for Image Semantic Broadcasting Communication MAIP:图像语义广播通信的多属性信息代理
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-11 DOI: 10.1109/TBC.2025.3573144
Zhuo Zhang;Shuai Xiao;Guipeng Lan;Meng Xi;Jiabao Wen;Jiachen Yang
In the image semantic broadcasting communication system, the resources of the channel are limited, which restricts the transmission and broadcasting of large-scale image data. This paper proposed a deep learning assisted image semantic broadcasting scheme to improve source efficiency and alleviate communication resource pressure at the transmission terminal. We adopt an image informativeness evaluation method to screen high information image data and implement this data-driven source optimization scheme. Specifically, we propose a Multi Attribute Information Proxy (MAIP) method that integrates fine-grained information attributes such as uncertainty, novelty, and diversity to evaluate and screen image semantic broadcast data. Used to support the formation of optimal image data broadcast transmission strategies. To demonstrate the effectiveness of the proposed MAIP, we compared it with state-of-the-art over three benchmarks CIFAR-10, mini ImageNet and Fashion Minst based on active learning as a validation experiment.
在图像语义广播通信系统中,由于信道资源有限,限制了大规模图像数据的传输和广播。本文提出了一种深度学习辅助图像语义广播方案,以提高源效率,缓解传输终端的通信资源压力。采用图像信息量评价方法筛选高信息量的图像数据,实现数据驱动的数据源优化方案。具体而言,我们提出了一种多属性信息代理(MAIP)方法,该方法集成了不确定性、新颖性和多样性等细粒度信息属性,以评估和筛选图像语义广播数据。用于支持形成最优的图像数据广播传输策略。为了证明所提出的MAIP的有效性,我们将其与基于主动学习的最先进的三个基准CIFAR-10, mini ImageNet和Fashion Minst进行了比较,作为验证实验。
{"title":"MAIP: A Multi-Attribute Informativeness Proxy for Image Semantic Broadcasting Communication","authors":"Zhuo Zhang;Shuai Xiao;Guipeng Lan;Meng Xi;Jiabao Wen;Jiachen Yang","doi":"10.1109/TBC.2025.3573144","DOIUrl":"https://doi.org/10.1109/TBC.2025.3573144","url":null,"abstract":"In the image semantic broadcasting communication system, the resources of the channel are limited, which restricts the transmission and broadcasting of large-scale image data. This paper proposed a deep learning assisted image semantic broadcasting scheme to improve source efficiency and alleviate communication resource pressure at the transmission terminal. We adopt an image informativeness evaluation method to screen high information image data and implement this data-driven source optimization scheme. Specifically, we propose a Multi Attribute Information Proxy (MAIP) method that integrates fine-grained information attributes such as uncertainty, novelty, and diversity to evaluate and screen image semantic broadcast data. Used to support the formation of optimal image data broadcast transmission strategies. To demonstrate the effectiveness of the proposed MAIP, we compared it with state-of-the-art over three benchmarks CIFAR-10, mini ImageNet and Fashion Minst based on active learning as a validation experiment.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"903-913"},"PeriodicalIF":4.8,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Information for Authors IEEE作者广播信息汇刊
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-09 DOI: 10.1109/TBC.2025.3569995
{"title":"IEEE Transactions on Broadcasting Information for Authors","authors":"","doi":"10.1109/TBC.2025.3569995","DOIUrl":"https://doi.org/10.1109/TBC.2025.3569995","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"C3-C4"},"PeriodicalIF":3.2,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11027898","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Broadcasting Publication Information IEEE广播出版信息汇刊
IF 3.2 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-09 DOI: 10.1109/TBC.2025.3569993
{"title":"IEEE Transactions on Broadcasting Publication Information","authors":"","doi":"10.1109/TBC.2025.3569993","DOIUrl":"https://doi.org/10.1109/TBC.2025.3569993","url":null,"abstract":"","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 2","pages":"C2-C2"},"PeriodicalIF":3.2,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11027900","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fusion Prediction Model of Broadcast Radio Signal Propagation Based on the Coefficient of Variation Method 基于变异系数法的广播无线电信号传播融合预测模型
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-04 DOI: 10.1109/TBC.2025.3570860
Yulong Hao;Jiaxuan Weng;Jian Wang;Zhongle Wu;Cheng Yang
To support the planning and development of broadcasting, we first develop a novel fusion prediction model by introducing the coefficient of variation method (CVM) in radio wave propagation prediction to enhance the accuracy of the broadcast propagation model and reduce the complexity of the fusion modeling method. The main contributions of this paper are as follows: (1) The CVM is introduced into the field of channel modeling for the first time, and a fusion modeling approach with high accuracy and low complexity based on this method is proposed. (2) A systematic analysis of the CVM and the fusion modeling approach is conducted, establishing a fusion channel model based on an improved CVM. Experimental results indicate that compared to the ITU-R P.1546, ITU-R P.2001, and ITM models, the improves the prediction accuracy of the proposed by 50.39%, 60.47%, and 55.98%, respectively.
为了支持广播事业的规划和发展,我们首先在无线电波传播预测中引入变异系数法(CVM),建立了一种新的融合预测模型,以提高广播传播模型的准确性,降低融合建模方法的复杂性。本文的主要贡献如下:(1)首次将CVM引入信道建模领域,并在此基础上提出了一种高精度、低复杂度的融合建模方法。(2)系统分析了CVM和融合建模方法,建立了基于改进CVM的融合通道模型。实验结果表明,与ITU-R P.1546、ITU-R P.2001和ITM模型相比,所提模型的预测精度分别提高了50.39%、60.47%和55.98%。
{"title":"Fusion Prediction Model of Broadcast Radio Signal Propagation Based on the Coefficient of Variation Method","authors":"Yulong Hao;Jiaxuan Weng;Jian Wang;Zhongle Wu;Cheng Yang","doi":"10.1109/TBC.2025.3570860","DOIUrl":"https://doi.org/10.1109/TBC.2025.3570860","url":null,"abstract":"To support the planning and development of broadcasting, we first develop a novel fusion prediction model by introducing the coefficient of variation method (CVM) in radio wave propagation prediction to enhance the accuracy of the broadcast propagation model and reduce the complexity of the fusion modeling method. The main contributions of this paper are as follows: (1) The CVM is introduced into the field of channel modeling for the first time, and a fusion modeling approach with high accuracy and low complexity based on this method is proposed. (2) A systematic analysis of the CVM and the fusion modeling approach is conducted, establishing a fusion channel model based on an improved CVM. Experimental results indicate that compared to the ITU-R P.1546, ITU-R P.2001, and ITM models, the improves the prediction accuracy of the proposed by 50.39%, 60.47%, and 55.98%, respectively.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"774-783"},"PeriodicalIF":4.8,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social VR With Holographic Comms: Enablers for New Engaging Experiences Within the TV / Video Consumption Landscape 带有全息通信的社交VR:电视/视频消费领域中新的引人入胜的体验的推动者
IF 4.8 1区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-06-02 DOI: 10.1109/TBC.2025.3570869
Mario Montagud Climent;Marc Martos;Álvaro Egea;Sergi Fernández Langa
Social Virtual Reality (VR) enables shared media experiences between remote people inside immersive and realistic 3D spaces, providing richer and more natural interactions than in classical 2D social conferencing tools. Likewise, the benefits and engagement can even be magnified by integrating realistic and volumetric user representations (i.e., 3D holograms) in these virtual environments rather than synthetic avatars. This paper presents the design and evaluation of an interactive Social VR scenario for a joint and collaborative exploration of a catalogue of professional video clips by a broadcaster. On the one hand, the scenario includes a control panel to select the desired year and clip. After the year selection, a time travel through a lift effect is enforced to teleport users through a multi-level semi-open building in which each level / floor represents one year, and its look-and-feel is customized to resemble that year. On the other hand, the scenario allows the integration of up to four users represented as 3D holograms (full-body and full volume Point Clouds), each one with his/her own screen for video consumption, and arranged in a cross 360° shape to allow for a natural visual interaction among themselves. The evaluation results with N=48 professionals of the broadcast sector not only provide relevant insights about the technical requirements and obtained performance, but confirm the satisfactory user experience (in terms of presence, togetherness, quality of interaction) provided by the presented technology and VR scenario and, most importantly, reveal and contribute to identifying the potential and opportunities of Social VR in the broadcast / video consumption landscape.
社交虚拟现实(VR)使远程人员能够在沉浸式和逼真的3D空间中共享媒体体验,提供比传统2D社交会议工具更丰富、更自然的交互。同样,通过在这些虚拟环境中集成逼真的和立体的用户表示(即3D全息图),而不是合成的虚拟形象,甚至可以放大这些好处和用户粘性。本文介绍了一个交互式社交VR场景的设计和评估,该场景用于联合和协作探索广播公司的专业视频剪辑目录。一方面,该场景包括一个控制面板,用于选择所需的年份和剪辑。在年份选择之后,通过电梯效果进行时间旅行,将用户传送到多层半开放式建筑中,其中每层/楼层代表一年,其外观和感觉被定制为与该年相似。另一方面,该场景允许将多达四个用户集成为3D全息图(全身和全体积点云),每个用户都有自己的屏幕用于视频消费,并以交叉360°的形状排列,以允许他们之间的自然视觉交互。N=48名广播行业专业人士的评估结果不仅提供了有关技术要求和获得的性能的相关见解,而且证实了所呈现的技术和VR场景提供的令人满意的用户体验(在存在,团聚,交互质量方面),最重要的是,揭示并有助于识别社交VR在广播/视频消费领域的潜力和机会。
{"title":"Social VR With Holographic Comms: Enablers for New Engaging Experiences Within the TV / Video Consumption Landscape","authors":"Mario Montagud Climent;Marc Martos;Álvaro Egea;Sergi Fernández Langa","doi":"10.1109/TBC.2025.3570869","DOIUrl":"https://doi.org/10.1109/TBC.2025.3570869","url":null,"abstract":"Social Virtual Reality (VR) enables shared media experiences between remote people inside immersive and realistic 3D spaces, providing richer and more natural interactions than in classical 2D social conferencing tools. Likewise, the benefits and engagement can even be magnified by integrating realistic and volumetric user representations (i.e., 3D holograms) in these virtual environments rather than synthetic avatars. This paper presents the design and evaluation of an interactive Social VR scenario for a joint and collaborative exploration of a catalogue of professional video clips by a broadcaster. On the one hand, the scenario includes a control panel to select the desired year and clip. After the year selection, a time travel through a lift effect is enforced to teleport users through a multi-level semi-open building in which each level / floor represents one year, and its look-and-feel is customized to resemble that year. On the other hand, the scenario allows the integration of up to four users represented as 3D holograms (full-body and full volume Point Clouds), each one with his/her own screen for video consumption, and arranged in a cross 360° shape to allow for a natural visual interaction among themselves. The evaluation results with N=48 professionals of the broadcast sector not only provide relevant insights about the technical requirements and obtained performance, but confirm the satisfactory user experience (in terms of presence, togetherness, quality of interaction) provided by the presented technology and VR scenario and, most importantly, reveal and contribute to identifying the potential and opportunities of Social VR in the broadcast / video consumption landscape.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 3","pages":"793-807"},"PeriodicalIF":4.8,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Broadcasting
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1