Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Implements the epilogue phase of the GEMM kernel that efficiently updates global memory with the computed matrix product. More...
Go to the source code of this file.
Classes | |
struct | cutlass::gemm::GemmEpilogue< GemmEpilogueTraits_ > |
Namespaces | |
cutlass | |
cutlass::gemm | |