Starlight: A kernel optimizer for GPU processing

Published: 01 Jan 2024, Last Modified: 22 Oct 2024
Venue: J. Parallel Distributed Comput. 2024
License: CC BY-SA 4.0
Abstract: Highlights
• We enrich the incomplete information provided by NVIDIA profilers.
• Starlight can support the development of an application from the ground up.
• Starlight predicts potential performance enhancements before altering the source code.
• Automatic Roofline Model generation for any CUDA-capable GPU.
• A qualitative overview of the various state-of-the-art solutions for GPU kernel optimization and Roofline Model generation.
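
For readers unfamiliar with the Roofline Model mentioned in the highlights, the following is a minimal sketch of the standard formulation, in which attainable performance is the minimum of the compute peak and the product of memory bandwidth and arithmetic intensity. It is not Starlight's implementation. The function names and the hardware figures (10000 GFLOP/s peak, 500 GB/s bandwidth) are hypothetical values chosen for illustration and do not describe any particular GPU.

```python
def roofline_attainable_gflops(peak_gflops, peak_bandwidth_gbs, arithmetic_intensity):
    """Attainable performance (GFLOP/s) under the basic Roofline Model.

    arithmetic_intensity is measured in FLOP per byte moved to or from memory.
    """
    if arithmetic_intensity < 0:
        raise ValueError("arithmetic intensity must be non-negative")
    # The kernel is capped either by the compute peak or by the memory roof.
    return min(peak_gflops, peak_bandwidth_gbs * arithmetic_intensity)


def ridge_point(peak_gflops, peak_bandwidth_gbs):
    """Arithmetic intensity (FLOP/byte) at which the two roofs intersect."""
    return peak_gflops / peak_bandwidth_gbs


def classify_kernel(peak_gflops, peak_bandwidth_gbs, arithmetic_intensity):
    """Label a kernel by which roof limits it at the given intensity."""
    if arithmetic_intensity < ridge_point(peak_gflops, peak_bandwidth_gbs):
        return "memory-bound"
    return "compute-bound"


if __name__ == "__main__":
    # Hypothetical device figures, assumed for illustration only.
    PEAK_GFLOPS = 10000.0
    PEAK_BANDWIDTH_GBS = 500.0

    for ai in (2.0, 50.0):
        bound = roofline_attainable_gflops(PEAK_GFLOPS, PEAK_BANDWIDTH_GBS, ai)
        label = classify_kernel(PEAK_GFLOPS, PEAK_BANDWIDTH_GBS, ai)
        print(f"AI={ai} FLOP/byte -> {bound} GFLOP/s ({label})")
```

With these assumed figures the ridge point sits at 20 FLOP/byte, so a kernel at 2 FLOP/byte is limited by memory bandwidth while one at 50 FLOP/byte is limited by the compute peak.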