GitHub topics: cublaslt
Bruce-Lee-LY/cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
Language: C - Size: 717 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 150 - Forks: 41

Bruce-Lee-LY/cutlass_gemm
Multiple GEMM operators are constructed with cutlass to support LLM inference.
Language: C++ - Size: 2.14 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 17 - Forks: 2

zhaocc1106/cuxx-programing
一些cuda库的样例,cuda、cublas、cublaslt、cusparse...
Language: Cuda - Size: 54.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nghiapq77/face-recognition-cpp-tensorrt
Face Recognition with RetinaFace and ArcFace.
Language: C++ - Size: 490 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 59 - Forks: 16

vadimkantorov/fastmlp
[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP
Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
