Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Computes the thread offset in (H, W) based on thread ID.
#include <hgemm_global_tile.h>
Public Member Functions | |
CUTLASS_HOST_DEVICE Coord< 4 > | operator() () const |
|
inline |