L. Domanski, Changming Sun, Raquibul Hassan, P. Vallotton, Dadong Wang
{"title":"gpu的线性特征检测","authors":"L. Domanski, Changming Sun, Raquibul Hassan, P. Vallotton, Dadong Wang","doi":"10.1109/DICTA.2010.112","DOIUrl":null,"url":null,"abstract":"The acceleration of an existing linear feature detection algorithm for 2D images using GPUs is discussed. The two most time consuming components of this process are implemented on the GPU, namely, linear feature detection using dual-peak directional non-maximum suppression, and a gap filling process that joins disconnected feature masks to rectify false negatives. Multiple steps or image filters in each component are combined into a single GPU kernel to minimise data transfers to off-chip GPU RAM, and issues relating to on-chip memory utilisation, caching, and memory coalescing are considered. The presented algorithm is useful for applications needing to analyse complex linear structures, and examples are given for dense neurite images from the biotech domain.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Linear Feature Detection on GPUs\",\"authors\":\"L. Domanski, Changming Sun, Raquibul Hassan, P. Vallotton, Dadong Wang\",\"doi\":\"10.1109/DICTA.2010.112\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The acceleration of an existing linear feature detection algorithm for 2D images using GPUs is discussed. The two most time consuming components of this process are implemented on the GPU, namely, linear feature detection using dual-peak directional non-maximum suppression, and a gap filling process that joins disconnected feature masks to rectify false negatives. Multiple steps or image filters in each component are combined into a single GPU kernel to minimise data transfers to off-chip GPU RAM, and issues relating to on-chip memory utilisation, caching, and memory coalescing are considered. The presented algorithm is useful for applications needing to analyse complex linear structures, and examples are given for dense neurite images from the biotech domain.\",\"PeriodicalId\":246460,\"journal\":{\"name\":\"2010 International Conference on Digital Image Computing: Techniques and Applications\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Digital Image Computing: Techniques and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2010.112\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Digital Image Computing: Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2010.112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The acceleration of an existing linear feature detection algorithm for 2D images using GPUs is discussed. The two most time consuming components of this process are implemented on the GPU, namely, linear feature detection using dual-peak directional non-maximum suppression, and a gap filling process that joins disconnected feature masks to rectify false negatives. Multiple steps or image filters in each component are combined into a single GPU kernel to minimise data transfers to off-chip GPU RAM, and issues relating to on-chip memory utilisation, caching, and memory coalescing are considered. The presented algorithm is useful for applications needing to analyse complex linear structures, and examples are given for dense neurite images from the biotech domain.