Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
#include <vector.h>
Public Types | |
| enum | { kLanes = kLanes_ } |
| The number of elements (lanes) in the vector. | |
| enum | { kVectorSize = kLanes * (int)sizeof(Scalar) } |
| The size of the vector in bytes. | |
| enum | |
| The number of 32-bit registers needed to store the vector. | |
| typedef half | Scalar |
| The scalar type. | |
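The size arithmetic behind these constants can be sketched in plain C++. This is a minimal illustration, not the library's code: `HalfBits` and `HalfVectorTraits` are hypothetical names, `HalfBits` only stands in for the 16-bit `half` type, and the `kRegisters` formula (one 32-bit register per 4 bytes, with a minimum of one) is an assumption, since this page does not show that enum's value.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the 16-bit `half` scalar; only its size matters here.
struct HalfBits {
  std::uint16_t bits;
};

// Hypothetical traits struct that mirrors the documented enums.
template <int kLanes_>
struct HalfVectorTraits {
  typedef HalfBits Scalar;

  // The number of lanes.
  enum { kLanes = kLanes_ };
  // The size in bytes: lanes times the 2-byte scalar.
  enum { kVectorSize = kLanes * (int)sizeof(Scalar) };
  // ASSUMPTION: one 32-bit register per 4 bytes, and at least one register.
  enum { kRegisters = kVectorSize < 4 ? 1 : kVectorSize / 4 };
};

// Usage: the constants are ordinary compile-time integers.
inline void check_sizes() {
  assert(HalfVectorTraits<1>::kVectorSize == 2);  // a single lane still needs a register
  assert(HalfVectorTraits<1>::kRegisters == 1);
  assert(HalfVectorTraits<8>::kVectorSize == 16);
  assert(HalfVectorTraits<8>::kRegisters == 4);
}
```

Under these assumptions, two lanes pack into a single 32-bit register, which is why the register count is half the lane count for even lane counts.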
Public Member Functions | |
| CUTLASS_HOST_DEVICE Scalar const & | operator[] (uint32_t i) const |
| Accessor to the i-th lane (read-only). | |
| CUTLASS_HOST_DEVICE Scalar & | operator[] (uint32_t i) |
| Accessor to the i-th lane (mutable). | |
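The pairing of a const and a non-const `operator[]` can be illustrated with a simplified host-only sketch. All names here (`HalfBits`, `LaneVector`, `fill`) are hypothetical; the sketch stores lanes directly as the scalar type rather than as raw 16-bit words, and the bounds `assert` is an addition of the sketch, not something this page documents.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for `half`: a 16-bit payload without arithmetic.
struct HalfBits {
  std::uint16_t bits;
};

// Hypothetical, simplified vector with the two documented accessor overloads.
template <int kLanes_>
struct LaneVector {
  typedef HalfBits Scalar;
  enum { kLanes = kLanes_ };

  Scalar lanes[kLanes];

  // Read-only access to lane i; selected when the vector is const.
  Scalar const& operator[](std::uint32_t i) const {
    assert(i < static_cast<std::uint32_t>(kLanes));  // sketch-only bounds check
    return lanes[i];
  }

  // Mutable access to lane i.
  Scalar& operator[](std::uint32_t i) {
    assert(i < static_cast<std::uint32_t>(kLanes));  // sketch-only bounds check
    return lanes[i];
  }
};

// Writes the same bit pattern to every lane through the mutable accessor.
template <int kLanes_>
void fill(LaneVector<kLanes_>& v, std::uint16_t bits) {
  for (std::uint32_t i = 0; i < static_cast<std::uint32_t>(kLanes_); ++i) {
    v[i].bits = bits;
  }
}
```

For example, `fill(v, 0x3C00)` sets every lane to the IEEE 754 half-precision bit pattern for 1.0, and a `LaneVector<4> const&` view of the same object can then read the lanes but not modify them.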
Public Attributes | |
| AlignedStruct< kVectorSize > | aligned_ |
| The aligned storage to make sure we have good alignment. | |
| uint16_t | scalars [kLanes] |
| The associated array of scalars. | |
| uint32_t | registers [kRegisters] |
| The data in registers. | |
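The relationship between `scalars` and `registers` can be sketched as two views of the same bytes. This is an illustration under stated assumptions: it assumes the attributes overlay one another as in a union, it reuses an assumed `kRegisters` formula, it omits the `aligned_` member, and `PackedHalfStorage` and `register_word` are hypothetical names. The register view is read with `std::memcpy`, which is the portable way to reinterpret bytes in standard C++.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical storage type. ASSUMPTION: both arrays share the same bytes.
template <int kLanes_>
union PackedHalfStorage {
  enum { kLanes = kLanes_ };
  enum { kVectorSize = kLanes * (int)sizeof(std::uint16_t) };
  // ASSUMPTION: one 32-bit register per 4 bytes, and at least one register.
  enum { kRegisters = kVectorSize < 4 ? 1 : kVectorSize / 4 };

  std::uint16_t scalars[kLanes];        // one 16-bit word per lane
  std::uint32_t registers[kRegisters];  // the same bytes, 32 bits at a time
};

// Returns the r-th 32-bit word of the storage by copying its bytes out.
template <int kLanes_>
std::uint32_t register_word(PackedHalfStorage<kLanes_> const& v, int r) {
  assert(r >= 0 && r < PackedHalfStorage<kLanes_>::kRegisters);
  std::uint32_t word;
  std::memcpy(&word, reinterpret_cast<unsigned char const*>(&v) + 4 * r,
              sizeof(word));
  return word;
}
```

With four lanes the storage occupies 8 bytes, so lanes 0 and 1 form the first 32-bit word and lanes 2 and 3 form the second. Which lane lands in the low half of each word depends on the byte order of the machine.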
| typedef half cutlass::Vector< half, kLanes_ >::Scalar |
| anonymous enum |
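| CUTLASS_HOST_DEVICE Scalar const & cutlass::Vector< half, kLanes_ >::operator[] (uint32_t i) const | inline |
| CUTLASS_HOST_DEVICE Scalar & cutlass::Vector< half, kLanes_ >::operator[] (uint32_t i) | inline |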
| AlignedStruct<kVectorSize> cutlass::Vector< half, kLanes_ >::aligned_ |
| uint32_t cutlass::Vector< half, kLanes_ >::registers[kRegisters] |
| uint16_t cutlass::Vector< half, kLanes_ >::scalars[kLanes] |