An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cublaslt

Bruce-Lee-LY/cuda_hook

Hooked CUDA-related dynamic libraries by using automated code generation tools.

Language: C - Size: 717 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 150 - Forks: 41

Bruce-Lee-LY/cutlass_gemm

Multiple GEMM operators are constructed with cutlass to support LLM inference.

Language: C++ - Size: 2.14 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 17 - Forks: 2

zhaocc1106/cuxx-programing

一些cuda库的样例,cuda、cublas、cublaslt、cusparse...

Language: Cuda - Size: 54.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nghiapq77/face-recognition-cpp-tensorrt

Face Recognition with RetinaFace and ArcFace.

Language: C++ - Size: 490 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 59 - Forks: 16

vadimkantorov/fastmlp

[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP

Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0