{"title":"High Throughput VLSI Architecture for Image Pyramid Generation in Computer Vision","authors":"Mihir Mody, Rajshekar Allu, Niraj Nandan, Hetual Sanghavi, Ankur Baranwal","doi":"10.1109/CONECCT55679.2022.9865803","DOIUrl":null,"url":null,"abstract":"Image Pyramids i.e. set of down-sampled images from high-resolution input is used in computer vision processing pipe to account for unknown distance of objects from Camera. The image pyramid can be set of Octave (down-sampling by 2) and/or Generic scaling (arbitrary downscaling ratio). The prior literature addresses generating all down-scaled images using either input or previous output as an overall scaling architecture and consists of \"N\" set of independent scalers, which are separately tuned to Octave and generic scaling. The disadvantage of the above approach is higher silicon area and DRAM Bandwidth proportional to number of independent scalars. The paper proposes a novel solution on top of traditional poly-phase filtering which consists of new concepts e.g. Re-scale from previous octave scale architecture, Multi-thread processing with flexible mapping of shared Scalers, Unconventional processing order for 2D scaling without line buffers, shared coefficients and flexible Region of Interest (ROI). The proposed solution is implemented as HW IP with 0.2 mm2 in 16nm process node with 720 Mpix/sec throughput, which is 3.5X lower in the area and 40% lower DRAM bandwidth compared to prior literature.","PeriodicalId":380005,"journal":{"name":"2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONECCT55679.2022.9865803","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Image Pyramids i.e. set of down-sampled images from high-resolution input is used in computer vision processing pipe to account for unknown distance of objects from Camera. The image pyramid can be set of Octave (down-sampling by 2) and/or Generic scaling (arbitrary downscaling ratio). The prior literature addresses generating all down-scaled images using either input or previous output as an overall scaling architecture and consists of "N" set of independent scalers, which are separately tuned to Octave and generic scaling. The disadvantage of the above approach is higher silicon area and DRAM Bandwidth proportional to number of independent scalars. The paper proposes a novel solution on top of traditional poly-phase filtering which consists of new concepts e.g. Re-scale from previous octave scale architecture, Multi-thread processing with flexible mapping of shared Scalers, Unconventional processing order for 2D scaling without line buffers, shared coefficients and flexible Region of Interest (ROI). The proposed solution is implemented as HW IP with 0.2 mm2 in 16nm process node with 720 Mpix/sec throughput, which is 3.5X lower in the area and 40% lower DRAM bandwidth compared to prior literature.