基于异构多核平台的HEVC帧内编码帧级并行性能评价

M. Mohamed, Bahri Nejmeddine, Batel Noureddine, Toubal Abdelmoughni, Masmoudi Nouri
{"title":"基于异构多核平台的HEVC帧内编码帧级并行性能评价","authors":"M. Mohamed, Bahri Nejmeddine, Batel Noureddine, Toubal Abdelmoughni, Masmoudi Nouri","doi":"10.1109/ICASS.2018.8652076","DOIUrl":null,"url":null,"abstract":"High Efficiency Video Coding (HEVC) is the latest video coding standard released as a successor of H.264/AVC, it expected to reduce the bitrate by 50% for the same perceptual quality. One of the major contributors to the higher compression performance of HEVC is the introduction of larger Coding Units (CU) with recursive partitioning mechanisms. This achievement in performance is accompanied by a high computational complexity, making this new standard very difficult to be embedded in current multimedia services and broadcast platforms. In this paper, a performance evaluation of All-Intra (AI) parallel realization of HEVC encoder is proposed, using a heterogeneous Octa-core CubieBoard4 platform that includes two quad-core ARM A7 and ARM A15. We used the OpenMP paradigm for parallel realization where each thread is assigned to a core processor to encode a separate frame. AI configuration is used to break coding dependencies between successive frames, which allow the parallel processing of a set of images. Experimental results shows that the proposed parallel realization of HEVC encoder, using eight threads, reduces the computational complexity to about 4.35×, without any loss in coding performance. These results do not match to the expected acceleration due to the heterogeneity of the platform.","PeriodicalId":358814,"journal":{"name":"2018 International Conference on Applied Smart Systems (ICASS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Evaluation of Frame-level Parallelization in HEVC Intra Coding Using Heterogeneous Multicore Platforms\",\"authors\":\"M. Mohamed, Bahri Nejmeddine, Batel Noureddine, Toubal Abdelmoughni, Masmoudi Nouri\",\"doi\":\"10.1109/ICASS.2018.8652076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High Efficiency Video Coding (HEVC) is the latest video coding standard released as a successor of H.264/AVC, it expected to reduce the bitrate by 50% for the same perceptual quality. One of the major contributors to the higher compression performance of HEVC is the introduction of larger Coding Units (CU) with recursive partitioning mechanisms. This achievement in performance is accompanied by a high computational complexity, making this new standard very difficult to be embedded in current multimedia services and broadcast platforms. In this paper, a performance evaluation of All-Intra (AI) parallel realization of HEVC encoder is proposed, using a heterogeneous Octa-core CubieBoard4 platform that includes two quad-core ARM A7 and ARM A15. We used the OpenMP paradigm for parallel realization where each thread is assigned to a core processor to encode a separate frame. AI configuration is used to break coding dependencies between successive frames, which allow the parallel processing of a set of images. Experimental results shows that the proposed parallel realization of HEVC encoder, using eight threads, reduces the computational complexity to about 4.35×, without any loss in coding performance. These results do not match to the expected acceleration due to the heterogeneity of the platform.\",\"PeriodicalId\":358814,\"journal\":{\"name\":\"2018 International Conference on Applied Smart Systems (ICASS)\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Applied Smart Systems (ICASS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASS.2018.8652076\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Applied Smart Systems (ICASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASS.2018.8652076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

高效视频编码(High Efficiency Video Coding, HEVC)是继H.264/AVC之后发布的最新视频编码标准,它有望在相同的感知质量下将比特率降低50%。HEVC压缩性能提高的主要原因之一是引入了带有递归分区机制的更大的编码单元(CU)。这种性能上的成就伴随着较高的计算复杂度,使得这个新标准很难嵌入到当前的多媒体服务和广播平台中。本文利用异构八核CubieBoard4平台(包含两个四核ARM A7和ARM A15),对HEVC编码器的AI并行实现进行了性能评估。我们使用OpenMP范式进行并行实现,其中每个线程被分配给一个核心处理器来编码一个单独的帧。AI配置用于打破连续帧之间的编码依赖,从而允许并行处理一组图像。实验结果表明,采用8个线程并行实现HEVC编码器,在不影响编码性能的前提下,将计算复杂度降低到4.35倍左右。由于平台的不均匀性,这些结果与预期的加速度不匹配。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Performance Evaluation of Frame-level Parallelization in HEVC Intra Coding Using Heterogeneous Multicore Platforms
High Efficiency Video Coding (HEVC) is the latest video coding standard released as a successor of H.264/AVC, it expected to reduce the bitrate by 50% for the same perceptual quality. One of the major contributors to the higher compression performance of HEVC is the introduction of larger Coding Units (CU) with recursive partitioning mechanisms. This achievement in performance is accompanied by a high computational complexity, making this new standard very difficult to be embedded in current multimedia services and broadcast platforms. In this paper, a performance evaluation of All-Intra (AI) parallel realization of HEVC encoder is proposed, using a heterogeneous Octa-core CubieBoard4 platform that includes two quad-core ARM A7 and ARM A15. We used the OpenMP paradigm for parallel realization where each thread is assigned to a core processor to encode a separate frame. AI configuration is used to break coding dependencies between successive frames, which allow the parallel processing of a set of images. Experimental results shows that the proposed parallel realization of HEVC encoder, using eight threads, reduces the computational complexity to about 4.35×, without any loss in coding performance. These results do not match to the expected acceleration due to the heterogeneity of the platform.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Artificial Neural Network Modeling of Sustained Antihypertensive Drug Delivery using Polyelectrolyte Complex based on Carboxymethyl-kappa-carrageenan and Chitosan as Prospective Carriers Scaled artificial bee colony programming Semi-Supervised Learning for Medical Application: A Survey Simulation of 2D Optical Code Division Multiple Access System with Different Receiver Structures using Multi-Wavelength Optical Orthogonal Codes EPBT and CO2 emission from solar PV monocrystaline silicon
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1