Presentation + Paper
9 October 2021 Adaptation of algorithms for efficient execution on GPUs
Vadim G. Bulavintsev, Dmitry D. Zhdanov
Author Affiliations +
Abstract
We propose a generalized method for adapting and optimizing algorithms for efficient execution on modern graphics processing units (GPU). The method consists of several steps. First, build a control flow graph (CFG) of the algorithm. Next, transform the CFG into a tree of loops and merge non-parallelizable loops into parallelizable ones. Finally, map the resulting loops tree to the tree of GPU computational units, unrolling the algorithm’s loops as necessary for the match. The method provides a convenient and robust mental framework and strategy for GPU code optimization. We demonstrate the method by adapting a backtracking search algorithm to the GPU platform and building an optimized implementation of the ResNeXt-50 neural network.
Conference Presentation
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Vadim G. Bulavintsev and Dmitry D. Zhdanov "Adaptation of algorithms for efficient execution on GPUs", Proc. SPIE 11895, Optical Design and Testing XI, 118950T (9 October 2021); https://doi.org/10.1117/12.2601619
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Neural networks

Computer architecture

Graphics processing units

Back to Top