Topic: "reduced-precision"
KernelTuner/kernel_float
CUDA/HIP header-only library for writing vectorized and low-precision (16 bit, 8 bit) GPU kernels
Language: C++ - Size: 7.23 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1
