{"title":"H. 264关键模块的深度流水线DSP解决方案","authors":"Jinxiu Zhu, Ning Cao, Yushan Chen, Guoxuan Li","doi":"10.1109/SEC.2008.44","DOIUrl":null,"url":null,"abstract":"With the rapid development of microprocessor, embedded multimedia products are gradually becoming the mainstream in the market. However, the high coding efficiency enabled by the H.264 video compression standard comes with substantially greater algorithmic complexity as compared to that of existing standards. And this additional complexity results in many difficulties in the implementation and optimization tasks. This paper analyzes the algorithms of the two time-consuming modules of integer transform and motion estimation in H.264. Then optimizes the two modules based on the extended instruction set of C64x/C64x+. Finally, deeply pipelined DSP solutions to two modules are presented in this paper. The experiment results show that optimizing parallel assembly can make the codes more efficient.","PeriodicalId":231129,"journal":{"name":"2008 Fifth IEEE International Symposium on Embedded Computing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deeply Pipelined DSP Solution to Key Modules in H. 264\",\"authors\":\"Jinxiu Zhu, Ning Cao, Yushan Chen, Guoxuan Li\",\"doi\":\"10.1109/SEC.2008.44\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of microprocessor, embedded multimedia products are gradually becoming the mainstream in the market. However, the high coding efficiency enabled by the H.264 video compression standard comes with substantially greater algorithmic complexity as compared to that of existing standards. And this additional complexity results in many difficulties in the implementation and optimization tasks. This paper analyzes the algorithms of the two time-consuming modules of integer transform and motion estimation in H.264. Then optimizes the two modules based on the extended instruction set of C64x/C64x+. Finally, deeply pipelined DSP solutions to two modules are presented in this paper. The experiment results show that optimizing parallel assembly can make the codes more efficient.\",\"PeriodicalId\":231129,\"journal\":{\"name\":\"2008 Fifth IEEE International Symposium on Embedded Computing\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Fifth IEEE International Symposium on Embedded Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SEC.2008.44\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Fifth IEEE International Symposium on Embedded Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEC.2008.44","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deeply Pipelined DSP Solution to Key Modules in H. 264
With the rapid development of microprocessor, embedded multimedia products are gradually becoming the mainstream in the market. However, the high coding efficiency enabled by the H.264 video compression standard comes with substantially greater algorithmic complexity as compared to that of existing standards. And this additional complexity results in many difficulties in the implementation and optimization tasks. This paper analyzes the algorithms of the two time-consuming modules of integer transform and motion estimation in H.264. Then optimizes the two modules based on the extended instruction set of C64x/C64x+. Finally, deeply pipelined DSP solutions to two modules are presented in this paper. The experiment results show that optimizing parallel assembly can make the codes more efficient.