GPU Implementation of the Affine Transform for 3D Image Registration
D. Crookes, K. Boyle, P. Miller, C. Gillan
2009 13th International Machine Vision and Image Processing Conference, 2009-09-02
DOI: 10.1109/IMVIP.2009.34 (https://doi.org/10.1109/IMVIP.2009.34)
Citations: 6
Abstract
Recent developments in 3D low-light-level CCD (L3CCD) image capture produce vast volumes of data in real time, all of which require image registration. At these data volumes, accelerating the processing is essential. A key step in one iterative registration algorithm is applying an affine transform to every plane of a 3D image. This paper presents details and performance results for several parallelized implementations of the affine transform on the NVIDIA 8800 GPU series. It shows that the transform runs 128 times faster on the GPU than a C++ version on a PC, or 54 times faster when data transfer between the GPU and the host PC is included.
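To make the core operation concrete, the following is a minimal CPU-side C++ sketch of applying one 2D affine transform to every plane of a 3D image. It is not the authors' implementation: the abstract does not give the matrix convention, the interpolation scheme, or the border handling, so the `Affine2D` struct, the output-to-source (inverse) mapping, bilinear interpolation, zero fill outside the image, and the function names are all assumptions made for illustration. The point of the sketch is that each output pixel is computed independently of every other, which is the property that lets the loop body map naturally onto one GPU thread per pixel.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical 2x3 affine matrix (an assumption, not taken from the paper).
// It maps an OUTPUT pixel (x, y) to SOURCE coordinates (sx, sy):
//   sx = a00 * x + a01 * y + tx
//   sy = a10 * x + a11 * y + ty
struct Affine2D {
    float a00, a01, tx;
    float a10, a11, ty;
};

// Bilinear sample of a single plane; samples outside the plane count as 0.
static float sampleBilinear(const float* plane, int width, int height,
                            float sx, float sy) {
    const int x0 = static_cast<int>(std::floor(sx));
    const int y0 = static_cast<int>(std::floor(sy));
    const float fx = sx - static_cast<float>(x0);
    const float fy = sy - static_cast<float>(y0);
    auto at = [&](int x, int y) -> float {
        const bool inside = x >= 0 && x < width && y >= 0 && y < height;
        return inside ? plane[static_cast<std::size_t>(y) * width + x] : 0.0f;
    };
    const float top = (1.0f - fx) * at(x0, y0) + fx * at(x0 + 1, y0);
    const float bottom = (1.0f - fx) * at(x0, y0 + 1) + fx * at(x0 + 1, y0 + 1);
    return (1.0f - fy) * top + fy * bottom;
}

// Applies the same affine transform to all planes of a volume stored
// plane-major, then row-major within each plane.
std::vector<float> affineTransformVolume(const std::vector<float>& src,
                                         int planes, int height, int width,
                                         const Affine2D& m) {
    const std::size_t planeSize = static_cast<std::size_t>(height) * width;
    assert(src.size() == planeSize * static_cast<std::size_t>(planes));
    std::vector<float> dst(src.size(), 0.0f);
    for (int z = 0; z < planes; ++z) {
        const float* in = src.data() + z * planeSize;
        float* out = dst.data() + z * planeSize;
        // Every (x, y) iteration is independent of the others, so on a GPU
        // this loop body could become the work of a single thread.
        for (int y = 0; y < height; ++y) {
            for (int x = 0; x < width; ++x) {
                const float sx = m.a00 * x + m.a01 * y + m.tx;
                const float sy = m.a10 * x + m.a11 * y + m.ty;
                out[static_cast<std::size_t>(y) * width + x] =
                    sampleBilinear(in, width, height, sx, sy);
            }
        }
    }
    return dst;
}
```

Under these assumptions, calling `affineTransformVolume` with the identity matrix `{1, 0, 0, 0, 1, 0}` returns the input unchanged, and setting `tx = 1` shifts each plane one pixel to the left, filling the vacated column with zeros. Inverse mapping is chosen here because it guarantees that every output pixel is written exactly once, which avoids holes and write conflicts when the work is split across parallel threads.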