Core Functionality
Functionality to catch exceptions, dispatch kernels for different dtypes, and write kernels.
-
template<typename T, int N>
class augpy::array
A simple fixed-size array. Importantly, sizeof(array) gives the actual size of the array in bytes, so it can be used as an argument in function calls etc.
Template Parameters
T: element type
N: length of array
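The sizeof property described above can be illustrated with a minimal sketch. The names fixed_array and sum_by_value are hypothetical stand-ins written for this example; they are not the actual augpy::array implementation, which may differ in members and qualifiers.

```cpp
#include <cstddef>

// Hypothetical stand-in for augpy::array (an assumption, not the library's code).
// The struct's only member is a plain C array, so it stores N elements of T
// and nothing else.
template <typename T, int N>
struct fixed_array {
    T data[N];

    T& operator[](int i) { return data[i]; }
    const T& operator[](int i) const { return data[i]; }
    constexpr int size() const { return N; }
};

// On common platforms the wrapper adds no padding, so sizeof reports
// exactly the bytes occupied by the elements.
static_assert(sizeof(fixed_array<float, 4>) == 4 * sizeof(float),
              "wrapper should add no overhead");

// A raw C array parameter would decay to a pointer; wrapped in a struct,
// all N elements are copied when the argument is passed by value.
template <typename T, int N>
T sum_by_value(fixed_array<T, N> a) {
    T total = T();
    for (int i = 0; i < N; ++i) {
        total += a[i];
    }
    return total;
}
```

Because the size is known at compile time and the elements travel with the object, a type like this can be handed to a function (or, presumably, a kernel) as a single by-value argument.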
-
template<class T, class ...Tail>
array<T, 1 + sizeof...(Tail)> augpy::make_array(T head, Tail... tail)
Similar to std::make_tuple, but makes a fixed-length array instead.
Example:
auto tensors = make_array(tensor1, tensor2, tensor3);
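To show how a signature of this shape can deduce the array length from the argument count, here is a minimal sketch that uses std::array as a stand-in for augpy::array. The function name make_std_array is hypothetical, and converting the tail arguments to the type of the first argument is an assumption of this sketch, not confirmed behavior of augpy::make_array.

```cpp
#include <array>
#include <cstddef>

// Hypothetical analogue of make_array built on std::array (an assumption;
// the real function returns augpy::array).
// The element type is taken from the first argument, and the length is
// 1 (the head) plus the number of tail arguments.
template <class T, class... Tail>
std::array<T, 1 + sizeof...(Tail)> make_std_array(T head, Tail... tail) {
    // Each tail value is converted to T explicitly so that mixed argument
    // types do not trigger a narrowing error in the braced initializer.
    return {{head, static_cast<T>(tail)...}};
}
```

Calling make_std_array(1, 2, 3) in this sketch yields a std::array<int, 3>, so the caller never has to spell out the length.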
-
BLOCKS_PER_SM
Default number of blocks to generate per SM. Used to calculate the kernel launch config. Currently 8.
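The source does not say how this constant enters the launch calculation. One common pattern, sketched here purely as an assumption, is to cap the grid size at blocks-per-SM times the number of SMs and let each thread process several elements. The names BLOCKS_PER_SM_SKETCH and grid_size are hypothetical and are not part of augpy.

```cpp
#include <algorithm>
#include <cstdint>

// Mirrors the documented default of 8; the name is hypothetical.
constexpr int BLOCKS_PER_SM_SKETCH = 8;

// Hypothetical helper: choose how many blocks to launch for num_elements
// items, given the block size and the number of SMs on the device.
inline std::int64_t grid_size(std::int64_t num_elements,
                              int threads_per_block,
                              int num_sms) {
    // Blocks needed if every thread handled exactly one element (ceiling division).
    std::int64_t needed =
        (num_elements + threads_per_block - 1) / threads_per_block;
    // Upper bound on blocks: a fixed number per SM.
    std::int64_t cap =
        static_cast<std::int64_t>(BLOCKS_PER_SM_SKETCH) * num_sms;
    // Launch at least one block, and never more than the cap.
    return std::max<std::int64_t>(1, std::min(needed, cap));
}
```

In this sketch a small input uses only the blocks it needs, while a large input is limited to 8 blocks per SM, which assumes the kernel loops over the remaining elements (for example with a grid-stride loop).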