|
Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Defines tile traits for several tile partitioning arrangements of threads expected to achieve efficient streaming performance. More...
#include "cutlass/tile_iterator.h"Go to the source code of this file.
Classes | |
| struct | cutlass::TiledThreadOffset< ThreadShape > |
| Basic thread offset function computed from a thread shape. More... | |
| struct | cutlass::TileTraitsStrideMajor< Tile_, Threads > |
| struct | cutlass::TileTraitsContiguousMajor< Tile_, Threads > |
| struct | cutlass::TileTraitsWarpRake< Tile_, Threads > |
| Tiling in which warps rake across the contiguous dimension. More... | |
| struct | cutlass::TileTraitsWarpRake< Tile_, Threads >::ThreadOffset |
| Computes the thread offset in (H, W) based on thread ID. More... | |
| struct | cutlass::TileTraitsStandard< Tile_, Threads > |
| Chooses 'best' shape to enable warp raking along contiguous dimension if possible. More... | |
Namespaces | |
| cutlass | |
1.8.14