Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: matmul

jmaczan/tinyconvnet

Convolutional Neural Network from scratch in CuPy and tinygrad

Language: Shell - Size: 3.91 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

martins0n/matmul

Benchmarking of matrix-matrix multiplication implementations

Language: Rust - Size: 43.9 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

jhson989/matmul_cublas

cuBLAS GEMM Example for FP32 MatMul

Language: Cuda - Size: 7.81 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

jhson989/SYCL-heterogeneous

CPU, GPU, and FPGA matrix multiplication examples via SYCL

Language: C++ - Size: 3.91 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

eth-cscs/COSMA

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

Language: C++ - Size: 8.35 MB - Last synced: 28 days ago - Pushed: 3 months ago - Stars: 177 - Forks: 26

eth-cscs/Tiled-MM

Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.

Language: C++ - Size: 759 KB - Last synced: 26 days ago - Pushed: 3 months ago - Stars: 21 - Forks: 6

gha3mi/formatmul

ForMatmul - A Fortran library that overloads the matmul function to enable efficient matrix multiplication with/without coarray.

Language: Fortran - Size: 11.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 1

akifejaz/matmul-testbench

A simple script that generates 4 by 4 matrices for testing Matmul.

Language: Python - Size: 21 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

akifejaz/HwVerification

This repo contains the Python scripts for testing all of MatMul's modules.

Language: Python - Size: 30.2 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

LaserBorg/circuitpython_benchmark

Raspberry Pi Pico (RP2040) and Adafruit Metro M7 (NXP IMXRT10XX) benchmark

Language: Python - Size: 8.79 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

digital-nomad-cheng/matmul_cuda_kernel_tvm

Generate an optimized MatMul CUDA kernel automatically using TVM auto schedule

Language: Jupyter Notebook - Size: 48.8 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

LRZ-BADW/OMMOP

OpenMP Matrix Multiplication Offloading Playground

Language: C++ - Size: 31.3 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

paxbun/float-matmul

Floating-point matrix multiplication implementation (arbitrary precision)

Language: Verilog - Size: 37.1 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 2

WilliamSpanfelner/day-76-computation_with_numpy

Check out the power of NumPy

Language: Jupyter Notebook - Size: 2.35 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

alprn42/Instruction-Counter

In this project, the instructions of a C program are counted with Pin and C++.

Language: C++ - Size: 19.5 KB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 0