|
Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Implements the BLAS linear scaling function alpha*AB + beta*C. More...
#include "cutlass/fragment_multiply_add.h"Go to the source code of this file.
Classes | |
| struct | cutlass::gemm::LinearScaling< Scalar_, FragmentMultiplyAdd_ > |
| Functor to compute linear combination of fragments. More... | |
| struct | cutlass::gemm::LinearScaling< Scalar_, FragmentMultiplyAdd_ >::Params |
| The parameters. More... | |
Namespaces | |
| cutlass | |
| cutlass::gemm | |
Functions | |
| template<typename T > | |
| CUTLASS_DEVICE bool | cutlass::gemm::is_zero (T x) |
| CUTLASS_DEVICE bool | cutlass::gemm::is_zero (half x) |
1.8.14