{"title":"aVCSR: Adaptive Video Compressive Sensing Using Region-of-Interest Detection in the Compressed Domain","authors":"Jian Yang, Haixin Wang, Ittetsu Taniguchi, Yibo Fan, Jinjia Zhou","doi":"10.1109/mmul.2023.3342062","DOIUrl":null,"url":null,"abstract":"Existing video compressive sensing (CS) techniques with fixed sampling rates can deliver satisfactory reconstructed quality but necessitate large transmission bandwidth. To overcome this challenge, region-of-interest (ROI)-based CS algorithms have been introduced to allocate different coding resources between ROI and non-ROI segments. However, neglecting non-ROI excessively in these algorithms leads to unsatisfactory average quality for the eventual reconstruction. In this article, we integrate the ideas of these methods and propose a novel adaptive video CS approach using a low-complexity ROI detection method in the compressed domain. The ROI is detected and sampled by calculating the measurement variance between the reference frame and the subsequent frames. Conversely, the non-ROI is not transmitted but will be reconstructed by utilizing the reference frame through the corresponding position information. In addition, we present a compact method for adapting the threshold value, which allows each frame of a video to have a unique threshold rather than an artificially predetermined fixed value. Moreover, a reference-frame-updating strategy is developed to improve the versatility of the entire framework. Compared to state-of-the-art counterparts, extensive experimental results have demonstrated that our proposed methods achieve superior performance while tackling diverse scenes and using a lower sampling rate.","PeriodicalId":13240,"journal":{"name":"IEEE MultiMedia","volume":"1 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE MultiMedia","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/mmul.2023.3342062","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Existing video compressive sensing (CS) techniques with fixed sampling rates can deliver satisfactory reconstructed quality but necessitate large transmission bandwidth. To overcome this challenge, region-of-interest (ROI)-based CS algorithms have been introduced to allocate different coding resources between ROI and non-ROI segments. However, neglecting non-ROI excessively in these algorithms leads to unsatisfactory average quality for the eventual reconstruction. In this article, we integrate the ideas of these methods and propose a novel adaptive video CS approach using a low-complexity ROI detection method in the compressed domain. The ROI is detected and sampled by calculating the measurement variance between the reference frame and the subsequent frames. Conversely, the non-ROI is not transmitted but will be reconstructed by utilizing the reference frame through the corresponding position information. In addition, we present a compact method for adapting the threshold value, which allows each frame of a video to have a unique threshold rather than an artificially predetermined fixed value. Moreover, a reference-frame-updating strategy is developed to improve the versatility of the entire framework. Compared to state-of-the-art counterparts, extensive experimental results have demonstrated that our proposed methods achieve superior performance while tackling diverse scenes and using a lower sampling rate.
现有的固定采样率视频压缩传感(CS)技术可以提供令人满意的重建质量,但需要很大的传输带宽。为了克服这一难题,人们引入了基于感兴趣区域(ROI)的 CS 算法,在感兴趣区域和非感兴趣区域段之间分配不同的编码资源。然而,在这些算法中过度忽略非 ROI 会导致最终重建的平均质量不尽如人意。在本文中,我们综合了这些方法的思想,提出了一种新的自适应视频 CS 方法,在压缩域中使用低复杂度 ROI 检测方法。ROI 是通过计算参考帧和后续帧之间的测量方差来检测和采样的。相反,非 ROI 不会被传输,而是通过相应的位置信息利用参考帧进行重建。此外,我们还提出了一种自适应阈值的紧凑方法,它允许视频的每一帧都有一个独特的阈值,而不是人为预先确定的固定值。此外,我们还开发了一种参考帧更新策略,以提高整个框架的通用性。与最先进的同类方法相比,大量实验结果表明,我们提出的方法在处理不同场景和使用较低采样率时性能更优。
期刊介绍:
The magazine contains technical information covering a broad range of issues in multimedia systems and applications. Articles discuss research as well as advanced practice in hardware/software and are expected to span the range from theory to working systems. Especially encouraged are papers discussing experiences with new or advanced systems and subsystems. To avoid unnecessary overlap with existing publications, acceptable papers must have a significant focus on aspects unique to multimedia systems and applications. These aspects are likely to be related to the special needs of multimedia information compared to other electronic data, for example, the size requirements of digital media and the importance of time in the representation of such media. The following list is not exhaustive, but is representative of the topics that are covered: Hardware and software for media compression, coding & processing; Media representations & standards for storage, editing, interchange, transmission & presentation; Hardware platforms supporting multimedia applications; Operating systems suitable for multimedia applications; Storage devices & technologies for multimedia information; Network technologies, protocols, architectures & delivery techniques intended for multimedia; Synchronization issues; Multimedia databases; Formalisms for multimedia information systems & applications; Programming paradigms & languages for multimedia; Multimedia user interfaces; Media creation integration editing & management; Creation & modification of multimedia applications.