A software engineering methodology to optimize caching in multi-processor DSP architectures: TMS320C80 results towards the real-time execution of low level image processing
{"title":"A software engineering methodology to optimize caching in multi-processor DSP architectures: TMS320C80 results towards the real-time execution of low level image processing","authors":"F. Lohier, P. Garda","doi":"10.1109/ASSET.1999.756763","DOIUrl":null,"url":null,"abstract":"This paper introduces an original software engineering methodology we developed while focusing on the implementation of a low-level image processing library targeted for a shared memory multi-processor DSP architecture: the TMS320C80. Real-time constraints led us to concentrate on the enhancement of data locality thanks to the software managing of caches based on an advanced multi-dimensional DMA. This contribution compares to other existing C80's image processing libraries in terms of genericity, flexibility and performance improvement. Our approach allows for the composing of concurrent processing chains grounded on a modular library gathering basic processing operators. Generic mechanisms allow to address all basic operator's requirements as well as to quickly expand the library thanks to a re-usable and well defined framework. Flexibility allows to dynamically re-configure a chain or to modify the region of interest and the number of processors. We finally demonstrate experimentally that our approach allows significant performance improvements.","PeriodicalId":340666,"journal":{"name":"Proceedings 1999 IEEE Symposium on Application-Specific Systems and Software Engineering and Technology. ASSET'99 (Cat. No.PR00122)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1999 IEEE Symposium on Application-Specific Systems and Software Engineering and Technology. ASSET'99 (Cat. No.PR00122)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSET.1999.756763","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper introduces an original software engineering methodology we developed while focusing on the implementation of a low-level image processing library targeted for a shared memory multi-processor DSP architecture: the TMS320C80. Real-time constraints led us to concentrate on the enhancement of data locality thanks to the software managing of caches based on an advanced multi-dimensional DMA. This contribution compares to other existing C80's image processing libraries in terms of genericity, flexibility and performance improvement. Our approach allows for the composing of concurrent processing chains grounded on a modular library gathering basic processing operators. Generic mechanisms allow to address all basic operator's requirements as well as to quickly expand the library thanks to a re-usable and well defined framework. Flexibility allows to dynamically re-configure a chain or to modify the region of interest and the number of processors. We finally demonstrate experimentally that our approach allows significant performance improvements.