Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
Vector definition for the 4-bit signed integer data type (int4_t).
#include <vector.h>
Public Types
enum | { kLanes = kLanes_ }
The number of elements (4-bit lanes) in the vector.
enum | { kVectorSize = kLanes / 2 }
The size of the vector in bytes; two 4-bit lanes share one byte.
enum |
The number of registers needed to store the vector (kRegisters, which sizes the registers array).
typedef int4_t | Scalar
The scalar type.
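The size relationships above can be sketched in a few lines. This is only an illustration: `Int4VectorSizes` and `kWords` are hypothetical names that do not appear in CUTLASS, the `static_assert` is a check added for this sketch, and the rounding rule used for `kWords` is an assumption, since this page does not show how `kRegisters` is computed.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper that mirrors the documented enums; not part of CUTLASS.
template <int kLanes_>
struct Int4VectorSizes {
  // Sketch-only check: an odd lane count would leave half a byte unused.
  static_assert(kLanes_ >= 2 && kLanes_ % 2 == 0,
                "lane count must be a positive multiple of 2");

  // Number of 4-bit lanes, as documented.
  enum { kLanes = kLanes_ };

  // Size in bytes, as documented: two 4-bit lanes fit in one byte.
  enum { kVectorSize = kLanes / 2 };

  // ASSUMPTION: number of 32-bit words, rounded up so that small vectors
  // still occupy one whole word. The real kRegisters formula may differ.
  enum { kWords = (kVectorSize + 3) / 4 };
};
```

Under these assumptions, `Int4VectorSizes<8>` occupies 4 bytes and one 32-bit word, while `Int4VectorSizes<16>` occupies 8 bytes and two words.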
Public Member Functions
CUTLASS_HOST_DEVICE | Vector ()
Default constructor.
CUTLASS_HOST_DEVICE | Vector (uint32_t value)
Constructor that converts from a uint32_t value.
CUTLASS_HOST_DEVICE int | operator[] (uint32_t i) const
Accessor for the i-th lane; the lane value is returned as an int.
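Since each lane is a signed 4-bit value stored in `uint32_t` words, reading lane `i` comes down to a shift, a mask, and a sign extension. The sketch below shows one way to do this. It is not the CUTLASS implementation: `extract_int4_lane` is a hypothetical free function, and the layout (eight lanes per word, lane 0 in the lowest nibble) is an assumption that this page does not confirm.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical free function; not the CUTLASS operator[] implementation.
// ASSUMPTION: eight lanes per 32-bit word, lane 0 in the lowest nibble.
inline int extract_int4_lane(const uint32_t* words, uint32_t i) {
  // Select the word that holds lane i.
  const uint32_t word = words[i / 8];

  // Shift the lane down to bit 0 and keep only its 4 bits.
  const uint32_t nibble = (word >> ((i % 8) * 4)) & 0xFu;

  // Sign-extend: nibble values 8..15 represent -8..-1 in two's complement.
  return static_cast<int>(nibble) - ((nibble & 0x8u) ? 16 : 0);
}
```

With this layout, the word `0x800000F7` holds 7 in lane 0, -1 in lane 1, and -8 in lane 7.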
Public Attributes
AlignedStruct< kVectorSize > | aligned_
The aligned storage to make sure we have good alignment.
uint32_t | registers [kRegisters]
The data in registers.
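To make the packed `registers` storage more concrete, the sketch below builds one 32-bit word from eight small signed integers, which is the kind of value the `uint32_t` constructor accepts. `pack_int4x8` is a hypothetical helper, not a CUTLASS function, and the lane order (lane `i` in bits `4*i` to `4*i+3`) is an assumption.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper; not part of CUTLASS.
// ASSUMPTION: lane i occupies bits [4*i, 4*i + 3] of the word.
// Each input is expected to lie in [-8, 7]; higher bits are discarded.
inline uint32_t pack_int4x8(const int (&lanes)[8]) {
  uint32_t word = 0;
  for (int i = 0; i < 8; ++i) {
    // Keep the low 4 bits (the two's complement nibble) and place them
    // in the lane's position.
    word |= (static_cast<uint32_t>(lanes[i]) & 0xFu) << (i * 4);
  }
  return word;
}
```

Packing the lanes `{7, -1, 0, 0, 0, 0, 0, -8}` yields `0x800000F7` under this layout.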
typedef int4_t cutlass::Vector< int4_t, kLanes_ >::Scalar |
AlignedStruct<kVectorSize> cutlass::Vector< int4_t, kLanes_ >::aligned_ |
uint32_t cutlass::Vector< int4_t, kLanes_ >::registers[kRegisters] |