Topic: "cusolver"
cupy/cupy
NumPy & SciPy for GPU
Language: Python - Size: 40.6 MB - Last synced at: 5 days ago - Pushed at: 17 days ago - Stars: 10,198 - Forks: 910

NVIDIA/CUDALibrarySamples
CUDA Library Samples
Language: Cuda - Size: 35.1 MB - Last synced at: about 12 hours ago - Pushed at: 6 days ago - Stars: 1,933 - Forks: 389

lebedov/scikit-cuda
Python interface to GPU-powered libraries
Language: Python - Size: 2.44 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 991 - Forks: 181

KinglittleQ/torch-batch-svd
A 100x faster SVD for PyTorch⚡️
Language: C++ - Size: 48.8 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 465 - Forks: 36

Bruce-Lee-LY/cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
Language: C - Size: 717 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 150 - Forks: 41

conradsnicta/bandicoot-code
Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io
Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 5

jagennath-hari/CUDA-Accelerated-Visual-Inertial-Odometry-Fusion
Harness the power of GPU acceleration for fusing visual odometry and IMU data with an advanced Unscented Kalman Filter (UKF) implementation. Developed in C++ and utilizing CUDA, cuBLAS, and cuSOLVER, this system offers unparalleled real-time performance in state and covariance estimation for robotics and autonomous system applications.
Language: Cuda - Size: 211 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 25 - Forks: 2

hpjeonGIT/Solve_Ax_b
Sample CMake template solving Ax=b
Language: C++ - Size: 920 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

zishun/cuSolverRf-batch
A complete example of batched refactorization in cuSOLVER.
Language: C++ - Size: 247 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

VORTICITY-INC/VTensor
VTensor, a C++ library, facilitates tensor manipulation on GPUs, emulating the python-numpy style for ease of use. It leverages RMM (RAPIDS Memory Manager) for efficient device memory management. It also supports xtensor for host memory operations.
Language: C++ - Size: 6.65 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

maximilianbehr/cuexpm
Matrix Exponential Approximation using CUDA
Language: Cuda - Size: 57.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

maximilianbehr/cuPolar
Newton's and Hayley's Method for the Matrix Polar Decomposition using CUDA
Language: Cuda - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

yessenbayev/kmeans
K-means Clustering in CUDA with OpenGL Visualization
Language: C++ - Size: 779 MB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

mnovak42/leuven
Framework, toolkit and ready-to-use applications for numerical linear algebra dependent machine learning algorithms.
Language: C++ - Size: 45.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

vxj9800/cuda-matrix-lib
Language: Cuda - Size: 7.28 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

dendisuhubdy/cupy Fork of cupy/cupy
NumPy-like API accelerated with CUDA
Language: Python - Size: 13.6 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

zqy767/cl-matrix
matrix operation using gpu
Language: Common Lisp - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0
