GitHub topics: cusparse
cupy/cupy
NumPy & SciPy for GPU
Language: Python - Size: 43.3 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 10,334 - Forks: 924

NVIDIA/CUDALibrarySamples
CUDA Library Samples
Language: Cuda - Size: 35.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,010 - Forks: 396

Bruce-Lee-LY/cuda_hook
Hooks CUDA-related dynamic libraries using automated code generation tools.
Language: C - Size: 717 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 150 - Forks: 41

zhaocc1106/cuxx-programing
cuda, cublas, cublaslt, cusparse...
Language: Cuda - Size: 82 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Qeolvdxu/pcg_precision_comparison
Compares the preconditioned conjugate gradient algorithm across different sparse matrices, floating-point precision levels, and implementations
Language: C - Size: 17.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

tmrob2/cuda2rust_sandpit
Minimal examples to get CUDA linear algebra programs working with Rust using CC & FFI.
Language: Rust - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

thilinarmtb/lsbench
Repository for benchmarking linear solvers on GPU.
Language: C - Size: 9.56 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 2

zishun/cuSolverRf-batch
A complete example of batched refactorization in cuSOLVER.
Language: C++ - Size: 247 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

chenxuhao/caffe-escoin
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Language: C++ - Size: 37.8 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 2

Ending2015a/StableFluid-CUDA
A really old project that implements Stable Fluids using CUDA, cuBLAS and cuSPARSE
Language: C++ - Size: 114 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

pmontalb/CudaLightKernels
Collection of CUDA wrappers for a simplified kernel call
Language: Cuda - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0
