GitHub topics: simd-programming
jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Language: C++ - Size: 57 MB - Last synced at: about 13 hours ago - Pushed at: 2 days ago - Stars: 1,176 - Forks: 60

google/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
Language: C++ - Size: 28.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,553 - Forks: 342

gnuradio/volk
The Vector Optimized Library of Kernels
Language: C++ - Size: 3.69 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 579 - Forks: 211

VcDevel/Vc
SIMD Vector Classes for C++
Language: C++ - Size: 11.2 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 1,479 - Forks: 151

Dawoodoz/DFPSR
Fast realtime softare rendering library for C++14 using SSE/AVX/NEON. 2D, 3D and isometric rendering with minimal system dependencies.
Language: C++ - Size: 9.87 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 94 - Forks: 7

agenium-scale/nsimd
Agenium Scale vectorization library for CPUs and GPUs
Language: C - Size: 6.92 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 333 - Forks: 30

dthuerck/mapmap_cpu
A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.
Language: C++ - Size: 1.02 MB - Last synced at: about 12 hours ago - Pushed at: almost 2 years ago - Stars: 105 - Forks: 51

bsgbryan/roc
A thoroughly-modern real-time simulation engine
Language: TypeScript - Size: 326 KB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

gyrdym/ml_linalg
SIMD-based linear algebra and statistics for data science with dart
Language: Dart - Size: 1.1 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 83 - Forks: 9

cloudflare/sliceslice-rs
A fast implementation of single-pattern substring search using SIMD acceleration.
Language: Rust - Size: 350 KB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 96 - Forks: 18

Maratyszcza/psimd 📦
Portable 128-bit SIMD intrinsics
Language: C++ - Size: 21.5 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 58 - Forks: 34

andyD123/DR3
DR3 enables users to write vectorised code using generic lambdas and filters. Switch instruction set just by changing enclosing namespace
Language: C++ - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 32 - Forks: 4

db-tu-dresden/TSL
Template SIMD Library (+Generator)
Language: HTML - Size: 58.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 8

AmirhosseinRHT/Parallel-Programming-Course-Projects
This is parallel programing course projects
Language: C++ - Size: 63.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ZhiyuZhangA/Cortex-plus-plus
Cortex++ is a light weight deep learning framework made in C++
Language: C++ - Size: 76.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

lemire/despacer
C library to remove white space from strings as fast as possible
Language: C - Size: 1.25 MB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 152 - Forks: 15

edanor/umesimd
UME::SIMD A library for explicit simd vectorization.
Language: C++ - Size: 5.89 MB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 91 - Forks: 16

ms0g/vml
SIMD-accelerated Vector math lib
Language: Assembly - Size: 29.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

spiraldb/zig-cycleclock
Cross-platform CPU cycle clock based on Google Benchmark
Language: Zig - Size: 13.7 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

mtumilowicz/java17-mesi-false-sharing-processor-optimisations-workshop
Introduction to cache coherence: false sharing, MESI protocol and vectorization
Language: Java - Size: 428 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

agenium-scale/boost.simd
Boost SIMD
Size: 192 KB - Last synced at: 15 days ago - Pushed at: about 6 years ago - Stars: 232 - Forks: 48

vamsi-sripathi/simd
Collection of optimized SIMD implementations for popular patterns
Language: C - Size: 21.5 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

UCL-ARC/cluster_club_accelerated_python Fork of tkoskela/hpc_lecture_notes
Materials for ARC's cluster club session on accelerating scientific python codes
Language: Jupyter Notebook - Size: 3.69 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

gregyjames/tsunami
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
Language: C# - Size: 363 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

g1n0st/AyaRay
A Modern C++ Windows-platform physically based renderer.
Language: C++ - Size: 31.8 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 37 - Forks: 2

falkosch/edu.schwabe.raytracer
SSE/AVX accelerated implementation of recursive raytracing (a.k.a. Whitted Raytracing). Creative commons CC-BY-NC-SA licensed
Language: C++ - Size: 28.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

alighanbari2002/Parallel-Programming-Course-Projects
Parallel Programming course projects demonstrating various parallelism techniques with SIMD SSE3, OMP, and POSIX threads, including Intel Parallel Studio for analysis and parallelization.
Language: C++ - Size: 7.32 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sahmad98/vstring
Vectroized String Helper Functions
Language: C++ - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

alainesp/simd-function
Python library to metaprogram C/C++ functions using SIMD instruction sets
Size: 145 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BagritsevichStepan/moore-curve-with-simd
Fast algorithm for finding points of the Moore curve using SIMD operations
Language: C - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NodixBlockchain/nodix_dApps
Collection of application on nodix blockchain
Language: HTML - Size: 16.6 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

DasudaRunner/DeltaCV
A high performance library for image processing
Language: C++ - Size: 9.05 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 130 - Forks: 12

andrelrt/SimdTTM
Simd To The Masses
Size: 80.1 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

Legendar11/GOST-Kuznyechik-Cipher-optimization
Master dissertation: optimization of the algorithm GOST 34.12 '15 (Kuznechik)
Language: C - Size: 349 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Nkszk/Pymorph
Extended morphological filters using ARM SIMD instructions for Python on Raspberry Pi (Deep network for Gaussian denoising and image completion are included.)
Language: C - Size: 1.09 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

andrelrt/litesimd
Litesimd is a no overhead, header only, C++ library for SIMD processing, specialized on SIMD comparison and data shuffle.
Language: C++ - Size: 335 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 0

samkenxstream/SAMkenXMODel Fork of aws/serverless-application-model
🛡️📃PhoenixDOMString🚀SAMkenX📃AWS Serverless Application Model (SAM) is an open-source framework for building serverless applications https://aws.amazon.com/serverless/sam/
Language: Python - Size: 60 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

filipmu/Bat-Sonar
Bat like sonar sensor that can track multiple targets and estimate angle of arrival using chirps and cross correlation in near real time.
Language: C - Size: 7.58 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 1

zamronypj/simd
Simple pascal demo project to show how to use Single Instruction Multiple Data (SIMD) using Intel SSE instruction
Language: Pascal - Size: 110 KB - Last synced at: 12 months ago - Pushed at: about 8 years ago - Stars: 6 - Forks: 1

ramesh-adhikari/HPC
High Performance Computing exercises
Language: C - Size: 10.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SamKenX-Hub-Community/SAMkenxsimdjson Fork of samkenxstream/SAMkenxsimdjson
Parsing gigabytes of JSON per second
Language: C++ - Size: 20.6 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

denishoornaert/ELECH473
Language: C - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

maddsua/simd
AVX2 and SSE2 usecases and benchmarks
Language: C++ - Size: 131 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

PatwinchIR/ultra-sort
DSL for SIMD Sorting on AVX2 & AVX512
Language: C++ - Size: 6.43 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 30 - Forks: 2

parsa-hn/parallel-programming
Parallel programming course projects
Language: C++ - Size: 6.14 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

t0re199/ARCHP_PROJECT
C & Assembly optimized version of the Stochastic Gradient Descent x SoftSVM x Polynomial Kernel Method algorithm
Language: C - Size: 32.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

chamatht/mandelbrot-simd-rs
Rust implementation of mandelbrot set with simd
Language: Rust - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 1

favreau/SIMDVoxelizer
Simple voxelizer make use of CPU SIMD units
Language: C++ - Size: 303 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

edponce/masprng
Vector accelerated SPRNG library
Language: C - Size: 2.16 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

jeffamstutz/psimd
(experiments with) pragma-based SIMD C++ types
Language: C++ - Size: 231 KB - Last synced at: 27 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

AlexSabourinDev/CranberryKing_OptimizationPlayground
Optimization Playground
Language: C - Size: 162 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

JoaquinVillagra/Laboratorio_1_HPC
Laboratorio 1 de la asignatura de Computación de alto rendimiento. Universidad de Santiago de Chile, segundo semestre 2017.
Language: C - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
