29 May 2013 Productive high-performance software for OpenCL devices
Author Affiliations +
Over the last three decades, CPUs have continued to produce large performance improvements from one generation to the next. However, CPUs have recently hit a performance wall and need parallel computing to move forward. Parallel computing over the next decade will become increasingly defined by heterogeneous computing, involving the use of accelerators in addition to CPUs to get computational tasks done. In order to use an accelerator, software changes must be made. Regular x86-based compilers cannot compile code to run on accelerators without these needed changes. The amount of software change required varies depending upon the availability of and reliance upon software tools that increase performance and productivity. Writing software that leverages the best parallel computing hardware, adapts well to the rapid pace of hardware updates, and minimizes developer muscle is the industry’s goal. OpenCL is the standard around which developers are able to achieve parallel performance. OpenCL itself is too difficult to program to receive general adoptions, but productive high-performing software libraries are becoming increasingly popular and capable in delivering lasting value to user applications.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
John M. Melonakos, John M. Melonakos, Pavan Yalamanchili, Pavan Yalamanchili, Chris McClanahan, Chris McClanahan, Umar Arshad, Umar Arshad, Michael Landes, Michael Landes, Shivapriya Jamboti, Shivapriya Jamboti, Abhijit Joshi, Abhijit Joshi, Shehzan Mohammed, Shehzan Mohammed, Kyle Spafford, Kyle Spafford, Vishwanath Venugopalakrishnan, Vishwanath Venugopalakrishnan, James Malcolm, James Malcolm, "Productive high-performance software for OpenCL devices", Proc. SPIE 8752, Modeling and Simulation for Defense Systems and Applications VIII, 87520C (29 May 2013); doi: 10.1117/12.2016216; https://doi.org/10.1117/12.2016216


Real-time supercomputing on iWarp
Proceedings of SPIE (April 29 1992)
A VLSI implementation of CAVLC for H.264/AVC
Proceedings of SPIE (October 30 2009)

Back to Top