Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
Classes | Namespaces
hgemm_swizzle.h File Reference

Transposes a tile of 16b elements. Used by HGEMM to construct a K-strided layout in shared memory for multiplicands. More...

#include <cuda_fp16.h>
#include "cutlass/fragment.h"

Go to the source code of this file.

Classes

struct  cutlass::gemm::HgemmSwizzle< GlobalIterator_ >
 

Namespaces

 cutlass
 
 cutlass::gemm