Yang Kun, Zhang Chun, Du Guoze, Xie Jiangxiang, W. Zhihua
{"title":"H.264/AVG解码器的软硬件协同设计","authors":"Yang Kun, Zhang Chun, Du Guoze, Xie Jiangxiang, W. Zhihua","doi":"10.1109/ASSCC.2006.357866","DOIUrl":null,"url":null,"abstract":"A single chip decoder SOC for H.264 baseline profile, called OR264 (OR1K based H264 decoder), is presented in this paper. The chip has mixed hardware/software architecture to combine performance and flexibility. It is partitioned that the hardware is used to boost the performance and efficiency of key operations in H.264 decoder while the software is used to control the decoding flow and to synchronize the hardware modules. All hardware units operate in parallel. The hardware can decode a MB in 851 clock cycles under ideal condition. The chip is fabricated using UMC 0.18-mum 6-layers metal CMOS process. It contains 1.5 M transistors and 176k bits embedded SRAM. The die size is 4.8 mm x 4.8 mm and the critical path is 10 ns.","PeriodicalId":142478,"journal":{"name":"2006 IEEE Asian Solid-State Circuits Conference","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"A Hardware-Software Co-design for H.264/AVG Decoder\",\"authors\":\"Yang Kun, Zhang Chun, Du Guoze, Xie Jiangxiang, W. Zhihua\",\"doi\":\"10.1109/ASSCC.2006.357866\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A single chip decoder SOC for H.264 baseline profile, called OR264 (OR1K based H264 decoder), is presented in this paper. The chip has mixed hardware/software architecture to combine performance and flexibility. It is partitioned that the hardware is used to boost the performance and efficiency of key operations in H.264 decoder while the software is used to control the decoding flow and to synchronize the hardware modules. All hardware units operate in parallel. The hardware can decode a MB in 851 clock cycles under ideal condition. The chip is fabricated using UMC 0.18-mum 6-layers metal CMOS process. It contains 1.5 M transistors and 176k bits embedded SRAM. The die size is 4.8 mm x 4.8 mm and the critical path is 10 ns.\",\"PeriodicalId\":142478,\"journal\":{\"name\":\"2006 IEEE Asian Solid-State Circuits Conference\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE Asian Solid-State Circuits Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASSCC.2006.357866\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE Asian Solid-State Circuits Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSCC.2006.357866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
摘要
本文提出了一种用于H.264基线配置的单片解码器SOC,称为OR264(基于OR1K的H264解码器)。该芯片采用混合硬件/软件架构,将性能和灵活性结合起来。硬件部分主要用于提高H.264解码器关键操作的性能和效率,软件部分主要用于控制解码流程和实现各硬件模块的同步。所有硬件单元并行运行。在理想情况下,硬件可以在851个时钟周期内解码一个MB。该芯片采用UMC 0.18-mum 6层金属CMOS工艺制造。它包含1.5 M晶体管和176k位嵌入式SRAM。模具尺寸为4.8 mm × 4.8 mm,关键路径为10ns。
A Hardware-Software Co-design for H.264/AVG Decoder
A single chip decoder SOC for H.264 baseline profile, called OR264 (OR1K based H264 decoder), is presented in this paper. The chip has mixed hardware/software architecture to combine performance and flexibility. It is partitioned that the hardware is used to boost the performance and efficiency of key operations in H.264 decoder while the software is used to control the decoding flow and to synchronize the hardware modules. All hardware units operate in parallel. The hardware can decode a MB in 851 clock cycles under ideal condition. The chip is fabricated using UMC 0.18-mum 6-layers metal CMOS process. It contains 1.5 M transistors and 176k bits embedded SRAM. The die size is 4.8 mm x 4.8 mm and the critical path is 10 ns.