Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: matmul
jmaczan/tinyconvnet
Convolutional Neural Network from scratch in CuPy and tinygrad
Language: Shell - Size: 3.91 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0
martins0n/matmul
Matrix-matrix multiplication implementations benchmarking
Language: Rust - Size: 43.9 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
jhson989/matmul_cublas
cuBLAS GEMM Example for FP32 MatMul
Language: Cuda - Size: 7.81 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
jhson989/SYCL-heterogeneous
CPU, GPU, and FPGA matrix multiplication examples via SYCL
Language: C++ - Size: 3.91 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
eth-cscs/COSMA
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Language: C++ - Size: 8.35 MB - Last synced: 28 days ago - Pushed: 3 months ago - Stars: 177 - Forks: 26
eth-cscs/Tiled-MM
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
Language: C++ - Size: 759 KB - Last synced: 26 days ago - Pushed: 3 months ago - Stars: 21 - Forks: 6
gha3mi/formatmul
ForMatmul - A Fortran library that overloads the matmul function to enable efficient matrix multiplication with/without coarray.
Language: Fortran - Size: 11.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 1
akifejaz/matmul-testbench
This is the simple script that generate matrixes of size 4 by 4, for testing Matmul.
Language: Python - Size: 21 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
akifejaz/HwVerification
This repo contains the python scripts for MatMul's all modules testing.
Language: Python - Size: 30.2 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
LaserBorg/circuitpython_benchmark
Raspberry Pi Pico (RP2040) and Adafruit Metro M7 (NXP IMXRT10XX) benchmark
Language: Python - Size: 8.79 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
digital-nomad-cheng/matmul_cuda_kernel_tvm
Generate optimized MatMul cuda kernel automatically using tvm auto schedule
Language: Jupyter Notebook - Size: 48.8 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
LRZ-BADW/OMMOP
OpenMP Matrix Multiplication Offloading Playground
Language: C++ - Size: 31.3 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1
paxbun/float-matmul
Floating-point matrix multiplication implementation (arbitrary precision)
Language: Verilog - Size: 37.1 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 2
WilliamSpanfelner/day-76-computation_with_numpy
Check out the power of NumPy
Language: Jupyter Notebook - Size: 2.35 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
alprn42/Instruction-Counter
In this project, ınstruction numbers from a c program are counted with pin and c++.
Language: C++ - Size: 19.5 KB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 0