GitHub topics: gemv
Bruce-Lee-LY/cuda_hgemv
Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.
Language: Cuda - Size: 459 KB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 61 - Forks: 5

nsomatilda/Matilda
Matilda is a library to repeatedly multiply a constant matrix with a variable vector
Language: C++ - Size: 38.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yzhaiustc/Optimizing-SGEMV-on-NVIDIA-GPUs
An implementation of SGEMV with performance comparable to cuBLAS.
Language: Cuda - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 4

yzhaiustc/Optimizing-DGEMV-on-Intel-CPUs
Highly optimized DGEMV on CPU with both serial and parallel performance better than MKL and OpenBLAS.
Language: C - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1
