Automated OpenCL GPU kernel fusion for Stan Math

Published: 01 Jan 2020, Last Modified: 30 Sept 2024IWOCL 2020EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We developed an OpenCL GPU kernel fusion library for the Stan software for Bayesian statistics. The library automatically combines kernels, optimizes computation, and is simple to use. The practical utility of the library is that it speeds up the development of new GPU kernels while keeping the performance of automatically combined kernels comparable to hand crafted kernels. We demonstrate this with experiments on basic operations and a linear regression model likelihood.
Loading