Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
#include <vector.h>
Public Types | |
enum | { kLanes = kLanes_ } |
The number of elements in the vector. More... | |
enum | { kVectorSize = kLanes * (int)sizeof(Scalar) } |
The size of the vector. More... | |
enum | |
The number of registers needed to store the vector. More... | |
typedef half | Scalar |
The scalar type. More... | |
Public Member Functions | |
CUTLASS_HOST_DEVICE Scalar const & | operator[] (uint32_t i) const |
Accessor to the ith lane. More... | |
CUTLASS_HOST_DEVICE Scalar & | operator[] (uint32_t i) |
Accessor to the ith lane. More... | |
Public Attributes | |
AlignedStruct< kVectorSize > | aligned_ |
The aligned storage to make sure we have good alignment. More... | |
uint16_t | scalars [kLanes] |
The associated array of scalars. More... | |
uint32_t | registers [kRegisters] |
The data in registers. More... | |
typedef half cutlass::Vector< half, kLanes_ >::Scalar |
anonymous enum |
|
inline |
|
inline |
AlignedStruct<kVectorSize> cutlass::Vector< half, kLanes_ >::aligned_ |
uint32_t cutlass::Vector< half, kLanes_ >::registers[kRegisters] |
uint16_t cutlass::Vector< half, kLanes_ >::scalars[kLanes] |