GitHub topics: reduced-precision
KernelTuner/kernel_float
CUDA/HIP header-only library for writing vectorized and low-precision (16-bit, 8-bit) GPU kernels
Language: C++ - Size: 7.23 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 7 - Forks: 1
