Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
Basic thread offset function computed from a thread shape.
#include <tile_traits_standard.h>
Public Member Functions
CUTLASS_HOST_DEVICE Coord< 4 > operator() () const [inline]
Computes the logical coordinate from thread shape.
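The following host-only sketch illustrates one way a logical 4-D coordinate could be derived from a thread shape. It is not the CUTLASS implementation: `Coord4`, `ThreadShape`, and `ThreadOffsetSketch` are hypothetical stand-ins, the (D, H, W, C) ordering with C varying fastest is an assumption, and the linear thread index is passed as an explicit argument (the documented `operator()` takes none) so the example can run without a GPU.

```cpp
#include <cassert>

// Hypothetical stand-in for cutlass::Coord<4>; not the library type.
struct Coord4 {
  int idx[4];
  int operator[](int i) const { return idx[i]; }
};

// Hypothetical compile-time thread shape, assumed to be ordered (D, H, W, C).
template <int D, int H, int W, int C>
struct ThreadShape {
  static const int kD = D;
  static const int kH = H;
  static const int kW = W;
  static const int kC = C;
};

// Sketch of a thread-offset functor: decomposes a linear thread index into
// a (d, h, w, c) coordinate, assuming C varies fastest and D slowest.
template <typename Shape_>
struct ThreadOffsetSketch {
  Coord4 operator()(int thread_id) const {
    // The index must address a thread that lies inside the shape.
    assert(thread_id >= 0 &&
           thread_id < Shape_::kD * Shape_::kH * Shape_::kW * Shape_::kC);

    int c = thread_id % Shape_::kC;
    int w = (thread_id / Shape_::kC) % Shape_::kW;
    int h = (thread_id / (Shape_::kC * Shape_::kW)) % Shape_::kH;
    int d = thread_id / (Shape_::kC * Shape_::kW * Shape_::kH);

    Coord4 result = {{d, h, w, c}};
    return result;
  }
};
```

Under these assumptions, a thread shape of (1, 4, 8, 1) arranges 32 threads as 4 rows of 8, and linear thread index 19 maps to row 2, column 3.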