25 January 2011 Performance evaluation of Canny edge detection on a tiled multicore architecture
In the last few years, a variety of multicore architectures have been used to parallelize image processing applications. In this paper, we assess the parallel speed-ups of different Canny edge detection parallelization strategies on the Tile64, a tiled multicore architecture developed by the Tilera Corporation. The strategies differ both in how Canny edge detection is parallelized and in how data is managed. The two parallelization strategies examined were loop-level parallelism and domain decomposition. Loop-level parallelism is achieved through the use of OpenMP [1], which distributes the range of values over which a loop iterates across threads. Domain decomposition breaks an image into subimages, each of which is processed independently and in parallel. The results show that, for the same number of threads, programmer-implemented domain decomposition exhibits higher speed-ups than compiler-managed loop-level parallelism implemented with OpenMP.
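As an illustration of the domain decomposition idea described above, the following is a minimal sketch and not the authors' Tile64 implementation. It assumes the image is split into horizontal strips, that each strip is extended by one halo row on each side so a 3x3 operator sees the same neighbours it would in the full image, and that only the gradient stage of Canny is computed (smoothing, non-maximum suppression and hysteresis are omitted). The function names `gradient_magnitude`, `split_rows`, `process_strip` and `gradient_stage_parallel` are hypothetical, and Python threads are used only to show the structure; they are not expected to reproduce the speed-ups reported in the paper.

```python
from concurrent.futures import ThreadPoolExecutor


def gradient_magnitude(img):
    """Sobel gradient magnitude (|gx| + |gy|) for interior pixels; borders stay 0."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            out[y][x] = abs(gx) + abs(gy)
    return out


def split_rows(height, parts):
    """Return (start, stop) row ranges covering [0, height) as evenly as possible."""
    base, extra = divmod(height, parts)
    ranges, start = [], 0
    for i in range(parts):
        stop = start + base + (1 if i < extra else 0)
        ranges.append((start, stop))
        start = stop
    return ranges


def process_strip(img, start, stop):
    """Compute output rows [start, stop) from a subimage padded with halo rows."""
    lo, hi = max(start - 1, 0), min(stop + 1, len(img))
    sub = gradient_magnitude(img[lo:hi])
    # Drop the halo rows so each strip returns only the rows it owns.
    return sub[start - lo:start - lo + (stop - start)]


def gradient_stage_parallel(img, workers=4):
    """Process each strip independently, then stitch the strips back together."""
    ranges = split_rows(len(img), workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        strips = pool.map(lambda r: process_strip(img, *r), ranges)
        return [row for strip in strips for row in strip]
```

In this sketch the halo rows are what let each subimage be processed without communicating with its neighbours; loop-level parallelism would instead keep the whole image shared and divide the iterations of the row loop in `gradient_magnitude` among threads.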
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Andrew Z. Brethorst, Nehal Desai, Douglas P. Enright, Ronald Scrofano, "Performance evaluation of Canny edge detection on a tiled multicore architecture", Proc. SPIE 7872, Parallel Processing for Imaging Applications, 78720F (25 January 2011); doi: 10.1117/12.873004; https://doi.org/10.1117/12.873004

