{"title":"Low complexity disparity estimation for immersive 3D video transmission","authors":"B. Micallef, C. J. Debono, R. Farrugia","doi":"10.1109/ICCW.2013.6649306","DOIUrl":null,"url":null,"abstract":"Bandwidth-limited channels demand the transmission of the per-pixel depth maps with the texture data to provide immersive 3D video services that allow arbitrary 3D viewpoint reconstruction. This auxiliary depth data offers geometric information, which together with the multi-view and epipolar geometries, can be exploited during 3D video coding to calculate geometric positions for the search areas of disparity estimation. These positions represent a more accurate estimate match to compensate the current macro-block from than those provided by the median adopted by the H.264/MVC standard. The result is smaller search areas that reduce the encoder's computational requirement. In this work, we exploit this fact together with the largest depth variation within the macro-block to encode, to calculate and adaptively adjust these areas along the epipolar lines. The proposed solution achieves a speedup gain of up-to 32 times over the original disparity estimation, with negligible influence on the rate-distortion performance of 3D video coding. This highly reduces the computational cost of the H.264/MVC encoder and eases its need to be implemented on highly expensive systems that are otherwise necessary to meets the stringent latency requirement of broadcast transmissions. Moreover, it provides similar coding efficiencies required for such scenarios.","PeriodicalId":252497,"journal":{"name":"2013 IEEE International Conference on Communications Workshops (ICC)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference on Communications Workshops (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCW.2013.6649306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Bandwidth-limited channels demand the transmission of the per-pixel depth maps with the texture data to provide immersive 3D video services that allow arbitrary 3D viewpoint reconstruction. This auxiliary depth data offers geometric information, which together with the multi-view and epipolar geometries, can be exploited during 3D video coding to calculate geometric positions for the search areas of disparity estimation. These positions represent a more accurate estimate match to compensate the current macro-block from than those provided by the median adopted by the H.264/MVC standard. The result is smaller search areas that reduce the encoder's computational requirement. In this work, we exploit this fact together with the largest depth variation within the macro-block to encode, to calculate and adaptively adjust these areas along the epipolar lines. The proposed solution achieves a speedup gain of up-to 32 times over the original disparity estimation, with negligible influence on the rate-distortion performance of 3D video coding. This highly reduces the computational cost of the H.264/MVC encoder and eases its need to be implemented on highly expensive systems that are otherwise necessary to meets the stringent latency requirement of broadcast transmissions. Moreover, it provides similar coding efficiencies required for such scenarios.