Region-of-interest based H.264 encoding parameter allocation for low power video communication
Minghui Wang, T. Zhang, Chen Liu, S. Goto
2009 5th International Colloquium on Signal Processing & Its Applications, 2009-03-06. DOI: 10.1109/CSPA.2009.5069223
H.264 is the state of the art among modern video compression standards. Its very high compression ratio meets the requirements of video communication between portable devices. Because power is limited in portable devices, the heavy computational load of H.264 is a critical problem for hardware design. According to research on the human visual system (HVS), human vision can focus on only one area of a frame at a time, which is defined as the region of interest (ROI). In most cases, the human face attracts most of the device user's attention. This makes it possible to code macroblocks unequally. In this work, the ROI is detected with a fast, encoder-oriented algorithm that uses chrominance and texture contrast features. Once the ROI is detected, the encoder allocates coding parameters separately to the ROI and non-ROI. As a result, it maintains fine quality in the ROI, saves much throughput in the non-ROI, and greatly reduces computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.
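The abstract does not give the detection rule or the parameters that are allocated, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes 4:2:0 chroma planes, marks a macroblock as ROI when at least half of its chroma samples fall inside a commonly cited skin-tone box (Cb 77 to 127, Cr 133 to 173), and uses the quantization parameter (QP) as the single allocated coding parameter. The texture contrast feature is omitted, and every name, threshold, and QP value here is a hypothetical choice for illustration.

```python
MB_SIZE = 16             # an H.264 macroblock covers 16x16 luma samples
CHROMA_MB = MB_SIZE // 2 # 8x8 chroma samples per macroblock in 4:2:0

# Assumed skin-tone chrominance box (not taken from the paper).
CB_RANGE = (77, 127)
CR_RANGE = (133, 173)


def is_skin(cb, cr):
    """Return True if one chroma sample pair lies inside the assumed skin box."""
    return CB_RANGE[0] <= cb <= CB_RANGE[1] and CR_RANGE[0] <= cr <= CR_RANGE[1]


def roi_mask(cb_plane, cr_plane, min_fraction=0.5):
    """Classify each macroblock as ROI (True) or non-ROI (False).

    cb_plane and cr_plane are 2D lists of chroma samples whose dimensions
    are multiples of 8. min_fraction is a hypothetical decision threshold.
    """
    rows = len(cb_plane) // CHROMA_MB
    cols = len(cb_plane[0]) // CHROMA_MB
    mask = []
    for r in range(rows):
        row = []
        for c in range(cols):
            hits = sum(
                is_skin(cb_plane[y][x], cr_plane[y][x])
                for y in range(r * CHROMA_MB, (r + 1) * CHROMA_MB)
                for x in range(c * CHROMA_MB, (c + 1) * CHROMA_MB)
            )
            row.append(hits >= min_fraction * CHROMA_MB * CHROMA_MB)
        mask.append(row)
    return mask


def allocate_qp(mask, qp_roi=26, qp_non_roi=34):
    """Assign a lower QP (finer quantization) to ROI macroblocks.

    The two QP values are illustrative assumptions, not reported results.
    """
    return [[qp_roi if in_roi else qp_non_roi for in_roi in row] for row in mask]


# Example: a 2x2-macroblock frame whose top-left macroblock is skin-colored.
cb = [[100 if (y < 8 and x < 8) else 128 for x in range(16)] for y in range(16)]
cr = [[150 if (y < 8 and x < 8) else 128 for x in range(16)] for y in range(16)]
mask = roi_mask(cb, cr)
qp_map = allocate_qp(mask)
```

A per-macroblock map like `qp_map` is the kind of input an encoder could use to spend bits on the face region and quantize the background more coarsely; a real design would also vary costlier tools such as motion search range or mode decision between the two regions to reduce computation.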