Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
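cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ > Struct Template Reference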
#include <hgemm_global_tile.h>
Classes | |
| struct | ThreadOffset |
| Computes the thread offset in (H, W) based on the thread ID. | |
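This page does not show the body of ThreadOffset, so the following is only a minimal host-side sketch of one plausible mapping from a linear thread ID to an (H, W) offset. The helper `thread_offset`, the `Offset` struct, and the parameters `threads_w`, `delta_h`, and `delta_w` are hypothetical names introduced for illustration; they are assumed to play the roles of Threads::kW, ThreadsDelta::kH, and ThreadsDelta::kW, and the arithmetic is an assumption rather than the library's implementation.

```cpp
// Hypothetical sketch, not the CUTLASS implementation.
// Result of the mapping: an offset along H and along W.
struct Offset {
  int h;
  int w;
};

// thread_id : linear thread index (threadIdx.x in a kernel; passed explicitly here)
// threads_w : number of threads laid out along W (assumed role of Threads::kW)
// delta_h   : H spacing between neighbouring thread rows (assumed role of ThreadsDelta::kH)
// delta_w   : W spacing between neighbouring threads (assumed role of ThreadsDelta::kW)
constexpr Offset thread_offset(int thread_id, int threads_w, int delta_h, int delta_w) {
  return Offset{(thread_id / threads_w) * delta_h,   // row of the thread, scaled by delta_h
                (thread_id % threads_w) * delta_w};  // column of the thread, scaled by delta_w
}
```

With these assumed parameters, threads that share a row differ only in their W offset, and each new row of threads advances the H offset by `delta_h`.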
Public Types | |
| typedef GemmGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ > | Base |
| The base class. | |
| typedef Base::Threads | Threads |
| The threads shape, as defined by the base class. | |
| typedef Shape< 1, 2, Base::VectorizedTile::kC > | ThreadsDelta |
| The thread strides. | |
| typedef Shape< Base::Threads::kH * 2, 1, Base::Threads::kW, Base::kAccessSize > | Delta |
| The strides in each dimension between different loads/stores. | |
| typedef Shape< Base::VectorizedTile::kH / Base::Threads::kH / 2, 2, Base::VectorizedTile::kW / Base::Threads::kW, Base::VectorizedTile::kC / Base::kAccessSize > | Iterations |
| The number of iterations needed to load/store the tile. | |
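To make the Delta and Iterations expressions above concrete, the sketch below evaluates them at compile time for one set of example values. `Shape` here is a small stand-in with members kD, kH, kW, kC (assumed to match the argument order of the library's Shape), and `CrosswiseSketch`, `ExampleTile`, and `ExampleThreads` are hypothetical names; the tile and thread sizes are illustrative assumptions, not values taken from CUTLASS.

```cpp
// Stand-in for a 4-D compile-time shape (D, H, W, C); hypothetical, not the CUTLASS header.
template <int D = 1, int H = 1, int W = 1, int C = 1>
struct Shape {
  static constexpr int kD = D;
  static constexpr int kH = H;
  static constexpr int kW = W;
  static constexpr int kC = C;
};

// Mirrors the ThreadsDelta, Delta, and Iterations typedefs listed above.
// The base-class quantities (VectorizedTile, Threads, kAccessSize) are passed in directly.
template <typename VectorizedTile, typename Threads, int kAccessSize>
struct CrosswiseSketch {
  typedef Shape<1, 2, VectorizedTile::kC> ThreadsDelta;
  typedef Shape<Threads::kH * 2, 1, Threads::kW, kAccessSize> Delta;
  typedef Shape<VectorizedTile::kH / Threads::kH / 2, 2,
                VectorizedTile::kW / Threads::kW,
                VectorizedTile::kC / kAccessSize> Iterations;
};

// Assumed example values: a 16 x 64 vectorized tile with 2 scalars per access,
// covered by a 4 x 64 arrangement of threads.
typedef Shape<1, 16, 64, 2> ExampleTile;
typedef Shape<1, 4, 64, 1> ExampleThreads;
typedef CrosswiseSketch<ExampleTile, ExampleThreads, 2> Example;

// Delta evaluates to Shape<8, 1, 64, 2>; Iterations evaluates to Shape<2, 2, 1, 1>.
static_assert(Example::Delta::kD == 8, "Threads::kH * 2");
static_assert(Example::Iterations::kD == 2, "VectorizedTile::kH / Threads::kH / 2");
```

For these example values the four Iterations extents multiply to 4, the same total as the base-class expression VectorizedTile::kH / Threads::kH would give; the crosswise traits split that count into a D extent and an H extent of 2.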
Public Types inherited from cutlass::gemm::GemmGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ > | |
| typedef platform::remove_const< Scalar_ >::type | Scalar |
| The scalar. | |
| typedef Scalar_ * | Pointer |
| The pointer. | |
| typedef Tile_ | Tile |
| The tile shape. | |
| typedef ReshapeTile< Tile_, kAccessSize_ >::Tile | VectorizedTile |
| The vectorized tile shape. | |
| typedef ReshapeThreads< VectorizedTile, Threads_ >::Threads | Threads |
| The threads shape. | |
| typedef Shape< 1, 1, VectorizedTile::kC > | ThreadsDelta |
| The relative offset between two elements in the H/W dimension in adjacent threads. | |
| typedef Shape< 0, Threads::kH, Threads::kW * kAccessSize > | Delta |
| The strides in each dimension between different loads/stores. | |
| typedef Shape< 0, 0, Threads::kW * ThreadsDelta::kW, kAccessSize > | ImmediateOffsetStrides |
| Strides for immediate offset computation. | |
| typedef Shape< 1, VectorizedTile::kH / Threads::kH, VectorizedTile::kW / Threads::kW, VectorizedTile::kC / kAccessSize > | Iterations |
| The number of iterations needed to load/store the tile. | |
| typedef GemmMultiplicandTraits< Tile, kOperand, kLayout > | MultiplicandTraits |
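The inherited shape typedefs can be evaluated the same way. The sketch below is a stand-alone illustration under assumed values: `Shape`, `BaseSketch`, and `access_count` are hypothetical names, the tile and thread sizes are made up for the example, and VectorizedTile and Threads are supplied directly rather than derived through ReshapeTile and ReshapeThreads, whose behaviour is not described on this page.

```cpp
// Stand-in for a 4-D compile-time shape (D, H, W, C); hypothetical, not the CUTLASS header.
template <int D = 1, int H = 1, int W = 1, int C = 1>
struct Shape {
  static constexpr int kD = D;
  static constexpr int kH = H;
  static constexpr int kW = W;
  static constexpr int kC = C;
};

// Mirrors the inherited ThreadsDelta, Delta, ImmediateOffsetStrides, and Iterations typedefs.
template <typename VectorizedTile, typename Threads, int kAccessSize>
struct BaseSketch {
  typedef Shape<1, 1, VectorizedTile::kC> ThreadsDelta;
  typedef Shape<0, Threads::kH, Threads::kW * kAccessSize> Delta;
  typedef Shape<0, 0, Threads::kW * ThreadsDelta::kW, kAccessSize> ImmediateOffsetStrides;
  typedef Shape<1, VectorizedTile::kH / Threads::kH,
                VectorizedTile::kW / Threads::kW,
                VectorizedTile::kC / kAccessSize> Iterations;
};

// Product of the four extents of an Iterations shape: the number of
// load/store steps one thread would perform under this sketch.
template <typename S>
constexpr int access_count() {
  return S::kD * S::kH * S::kW * S::kC;
}

// Assumed example values: a 16 x 64 vectorized tile with 2 scalars per access,
// covered by a 4 x 64 arrangement of threads.
typedef BaseSketch<Shape<1, 16, 64, 2>, Shape<1, 4, 64, 1>, 2> BaseExample;

// Iterations evaluates to Shape<1, 4, 1, 1>, so each thread performs 4 accesses.
static_assert(access_count<BaseExample::Iterations>() == 4, "16 / 4 iterations along H");
```

Note that Shape arguments left unspecified default to 1 in this stand-in, so the three-argument ThreadsDelta and Delta shapes have kC equal to 1.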
Additional Inherited Members | |
Static Public Attributes inherited from cutlass::gemm::GemmGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ > | |
| static GemmOperand::Kind const | kOperand = kOperand_ |
| Identity of the operand. | |
| static MatrixLayout::Kind const | kLayout = kLayout_ |
| The layout. | |
| static int const | kAccessSize = kAccessSize_ |
| The number of scalars per LDG/STG. | |
| static MemorySpace::Kind const | kMemorySpace = MemorySpace::kGlobal |
| The memory space. | |
typedef GemmGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ > cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ >::Base
typedef Shape< Base::Threads::kH * 2, 1, Base::Threads::kW, Base::kAccessSize > cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ >::Delta
typedef Shape< Base::VectorizedTile::kH / Base::Threads::kH / 2, 2, Base::VectorizedTile::kW / Base::Threads::kW, Base::VectorizedTile::kC / Base::kAccessSize > cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ >::Iterations
typedef Base::Threads cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ >::Threads
typedef Shape< 1, 2, Base::VectorizedTile::kC > cutlass::gemm::HgemmCrosswiseGlobalTileTraits< kOperand_, kLayout_, Scalar_, Tile_, Threads_, kAccessSize_ >::ThreadsDelta