CUBLASLt gemm for the candle ML framework
Owned by Nicolas Patry.
#171 in #tensor
457 downloads per month
31KB 752 lines
CublasLt Matmul operation for the Candle ML framework. Allows for bias and Relu/Gelu fusing.
~16–26MB ~411K SLoC