GitHub / yzhaiustc / Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F
Stepwise optimizations of DGEMM on CPU, eventually outperforming Intel MKL, even under multithreading.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yzhaiustc%2FOptimizing-DGEMM-on-Intel-CPUs-with-AVX512F
PURL: pkg:github/yzhaiustc/Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F
Stars: 65
Forks: 16
Open issues: 0
License: gpl-3.0
Language: C
Size: 3.33 MB
Dependencies parsed at: Pending
Created at: almost 5 years ago
Updated at: over 1 year ago
Pushed at: over 3 years ago
Last synced at: over 1 year ago
Topics: avx512, blas, gemm, mkl, openmp, simd