{"title":"重构图像和视频处理程序的流程以增加指令级并行性","authors":"M. Maresca, N. Zingirian","doi":"10.1109/EMPDP.2001.905066","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of preparing efficient implementations of Image Processing (IP) tasks for Instruction Level Parallel (ILP, i.e., superscalar and pipelined) architectures. First it shows an accurate analysis of ILP architectures and IP task structures. This analysis allows identifying specific sources of inefficiency that affect typical implementations of IP programs for ILP architectures. Then, it introduces a novel processing model, named Bucket Processing (BP), aimed at reducing the inefficiencies of IP programs characterized by the presence of nested loops, typical of image processing, and by the presence of conditional statements in the innermost loop bodies. Finally, it describes how BP restructures the program flow in such a way to deliver significant speed up in programs running on real ILP platforms.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Restructuring the flow of image and video processing programs to increase instruction level parallelism\",\"authors\":\"M. Maresca, N. Zingirian\",\"doi\":\"10.1109/EMPDP.2001.905066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper addresses the problem of preparing efficient implementations of Image Processing (IP) tasks for Instruction Level Parallel (ILP, i.e., superscalar and pipelined) architectures. First it shows an accurate analysis of ILP architectures and IP task structures. This analysis allows identifying specific sources of inefficiency that affect typical implementations of IP programs for ILP architectures. Then, it introduces a novel processing model, named Bucket Processing (BP), aimed at reducing the inefficiencies of IP programs characterized by the presence of nested loops, typical of image processing, and by the presence of conditional statements in the innermost loop bodies. Finally, it describes how BP restructures the program flow in such a way to deliver significant speed up in programs running on real ILP platforms.\",\"PeriodicalId\":262971,\"journal\":{\"name\":\"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EMPDP.2001.905066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMPDP.2001.905066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Restructuring the flow of image and video processing programs to increase instruction level parallelism
This paper addresses the problem of preparing efficient implementations of Image Processing (IP) tasks for Instruction Level Parallel (ILP, i.e., superscalar and pipelined) architectures. First it shows an accurate analysis of ILP architectures and IP task structures. This analysis allows identifying specific sources of inefficiency that affect typical implementations of IP programs for ILP architectures. Then, it introduces a novel processing model, named Bucket Processing (BP), aimed at reducing the inefficiencies of IP programs characterized by the presence of nested loops, typical of image processing, and by the presence of conditional statements in the innermost loop bodies. Finally, it describes how BP restructures the program flow in such a way to deliver significant speed up in programs running on real ILP platforms.