Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: avx
Dr-Noob/peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Language: C++ - Size: 244 KB - Last synced: about 5 hours ago - Pushed: about 6 hours ago - Stars: 56 - Forks: 13
bgin/Radar-ElectroOptical-Simulation
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Language: C++ - Size: 28.3 MB - Last synced: about 11 hours ago - Pushed: about 12 hours ago - Stars: 51 - Forks: 16
Dioarya/mandelbrotset-image-generator
Rewrite of a personal project from back in December 2023.
Language: C++ - Size: 328 KB - Last synced: about 21 hours ago - Pushed: 1 day ago - Stars: 0 - Forks: 0
nidud/asmc
Masm compatible assembler
Language: Assembly - Size: 67.9 MB - Last synced: about 3 hours ago - Pushed: 1 day ago - Stars: 12 - Forks: 4
path-racer/pathlib
Lightweight AVX-optimized containers and routines for the Path game engine.
Language: C - Size: 102 MB - Last synced: about 23 hours ago - Pushed: 1 day ago - Stars: 1 - Forks: 0
Nemandza82/Symd
C++ header only template library designed to make it easier to write high-performance SIMD (SSE, AVX, Neon) and multi-threaded code.
Language: C++ - Size: 861 KB - Last synced: about 22 hours ago - Pushed: 2 days ago - Stars: 4 - Forks: 3
microsoft/DirectXMath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Language: C++ - Size: 2.18 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1,485 - Forks: 227
kfrlib/kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Language: C++ - Size: 12 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 1,596 - Forks: 246
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language: C++ - Size: 13.5 MB - Last synced: 4 days ago - Pushed: 11 days ago - Stars: 2,828 - Forks: 249
redorav/hlslpp
Math library using hlsl syntax with SSE/NEON support
Language: C++ - Size: 7.61 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 451 - Forks: 39
lssfau/ExaStencils
Mirror of the official ExaStencils Project repository. Please open pull requests on GitLab: https://i10git.cs.fau.de/exastencils/exastencils
Language: Scala - Size: 299 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 3 - Forks: 1
spnda/fastgltf
A modern C++17 glTF 2.0 library focused on speed, correctness, and usability
Language: C++ - Size: 2.19 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 224 - Forks: 27
recp/cglm
π½ Highly Optimized 2D / 3D Graphics Math (glm) for C
Language: C - Size: 2.51 MB - Last synced: 10 days ago - Pushed: 19 days ago - Stars: 2,050 - Forks: 216
HugeONotation/AVEL
Another Vector Extensions Library
Language: C++ - Size: 1.21 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0
HanabishiRecca/bin-cpuflags-x86
A small CLI tool to detect CPU flags (instruction sets) of X86 binaries.
Language: Rust - Size: 32.2 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 11 - Forks: 0
ermig1979/Simd
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
Language: C++ - Size: 38.3 MB - Last synced: 6 days ago - Pushed: 8 days ago - Stars: 1,977 - Forks: 403
RRZE-HPC/OSACA
Open Source Architecture Code Analyzer
Language: Jupyter Notebook - Size: 8.19 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 274 - Forks: 15
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Language: C - Size: 297 MB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 795 - Forks: 181
aff3ct/MIPP
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).
Language: C++ - Size: 2.01 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 463 - Forks: 86
mrecachinas/hexhamming
:heavy_division_sign: SIMD-accelerated bitwise hamming distance Python module for hexadecimal strings
Language: C++ - Size: 556 KB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 17 - Forks: 4
pypy/fast-utf8-methods
Fast UTF-8 utility methods
Language: HTML - Size: 1.26 MB - Last synced: 10 days ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 0
Alex313031/atom-ng Fork of atom/atom
:atom: The hyper-hackable text editor - Compiler Optimized, Community Maintained Fork
Language: JavaScript - Size: 337 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 26 - Forks: 1
VcDevel/Vc
SIMD Vector Classes for C++
Language: C++ - Size: 11 MB - Last synced: 9 days ago - Pushed: 3 months ago - Stars: 1,420 - Forks: 150
shibatch/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Language: C - Size: 5.08 MB - Last synced: 10 days ago - Pushed: 15 days ago - Stars: 590 - Forks: 120
ClaudiuHKS/Se-Capabilities
Se Capabilities
Language: C++ - Size: 9.77 KB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
tlk00/BitMagic
BitMagic Library
Language: C++ - Size: 62.1 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 399 - Forks: 46
Erkaman/sse-avx-rasterization
Triangle rasterization routines accelerated by SSE and AVX
Language: C++ - Size: 23.4 KB - Last synced: 9 days ago - Pushed: over 6 years ago - Stars: 65 - Forks: 10
Alex313031/geany-ng Fork of geany/geany
The flyweight IDE - Compiler Optimized Builds
Language: C - Size: 64.6 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 6 - Forks: 0
manodeep/Corrfunc
β‘οΈβ‘οΈβ‘οΈBlazing fast correlation functions on the CPU.
Language: C - Size: 150 MB - Last synced: 7 days ago - Pushed: about 1 month ago - Stars: 162 - Forks: 49
simd-everywhere/simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Language: C - Size: 35 MB - Last synced: 18 days ago - Pushed: 20 days ago - Stars: 2,168 - Forks: 225
VectorChief/QuadRay-engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Language: C - Size: 14.6 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 25 - Forks: 4
VectorChief/UniSIMD-assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Language: C - Size: 9.11 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 85 - Forks: 7
Alex313031/Thorium-Win
Chromium fork for Windows named after radioactive element No. 90; Windows builds of https://github.com/Alex313031/Thorium
Language: Batchfile - Size: 2.45 MB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 1,149 - Forks: 30
JohT/convolution-benchmarks
Benchmark convolution implementations in C++ with Catch2 visualized with Vega-Lite
Language: C++ - Size: 5.94 MB - Last synced: 25 days ago - Pushed: 26 days ago - Stars: 1 - Forks: 1
Alex313031/Mercury
Firefox fork with compiler optimizations and patches from Librewolf, Waterfox, and GNU IceCat.
Language: JavaScript - Size: 7.67 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 925 - Forks: 21
lemire/despacer
C library to remove white space from strings as fast as possible
Language: C - Size: 1.25 MB - Last synced: about 20 hours ago - Pushed: 5 months ago - Stars: 147 - Forks: 15
RobRich999/Chromium_Clang
Chromium browser compiled with the Clang/LLVM compiler.
Size: 1.62 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 144 - Forks: 10
jcmfernandes/ob64
A fast Base64 encoder and decoder as a Ruby gem. :racehorse:
Language: Ruby - Size: 63.5 KB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0
Alex313031/thorium
Chromium fork named after radioactive element No. 90. Windows and MacOS/Raspi/Android/Special builds are in different repositories, links are towards the top of the README.md.
Language: C++ - Size: 222 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 3,947 - Forks: 130
jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Language: C++ - Size: 44.2 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 842 - Forks: 51
xtensor-stack/xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Language: C++ - Size: 3.64 MB - Last synced: 28 days ago - Pushed: about 1 month ago - Stars: 2,018 - Forks: 245
google/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
Language: C++ - Size: 22.5 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 3,609 - Forks: 291
Balta-Stefan/Mandelbrot-viewer
Mandelbrot set viewer made in Qt (C++)
Language: C++ - Size: 3.46 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
Balta-Stefan/BMP-blurrer
Language: C++ - Size: 7.81 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
MarioSieg/Corium π¦
Corium is a modern scripting language which combines simple, safe and efficient programming.
Language: C++ - Size: 248 MB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 18 - Forks: 4
thatsimo/progetto-21-22
Parallel FSS algorithm implementation in assembly-x86 (SSE, AVX, OpenMP)
Language: C - Size: 78.5 MB - Last synced: 28 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
Avereniect/AVEL
AVEL: Another Vector Extensions Library
Language: C++ - Size: 2.27 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0
bluescarni/rakau
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Language: C++ - Size: 1.26 MB - Last synced: 10 days ago - Pushed: almost 4 years ago - Stars: 20 - Forks: 5
BoringBoredom/Linpack-Extended
Linpack Extended is a stress test for 64-bit Intel processors. It is based on the Intel Math Kernel Library.
Language: HTML - Size: 52.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 1
mkn/mkn.avx
C++ AVX wrappers for manual SIMD
Language: C++ - Size: 29.3 KB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1
Alex313031/Mercury-Win7
Windows 7 builds of Mercury Browser (Based on ESR115 rather than stable tip-of-tree)
Language: JavaScript - Size: 6.45 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 25 - Forks: 1
oysteijo/simd_neuralnet
Feed-forward neural network implementation in C with SIMD instructions
Language: C - Size: 397 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 0
pq-crystals/dilithium
Language: C - Size: 454 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 327 - Forks: 112
sahmad98/vstring
Vectroized String Helper Functions
Language: C++ - Size: 61.5 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 6 - Forks: 0
ihhub/penguinV
Computer vision library with focus on heterogeneous systems
Language: C++ - Size: 3.89 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 118 - Forks: 88
Maged152/Intel-Intrinsics-CPP-Wrapper
Intel Intrinsics C++ Wrapper
Language: C++ - Size: 199 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
cristian-bicheru/detect-simd
Python library to detect CPU SIMD capabilities.
Language: C - Size: 31.3 KB - Last synced: 6 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 0
ltlollo/lattice
Vectorized primitives on Intel AVX/AVX2 for some Ring-LWE problems
Language: C - Size: 45.9 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0
lucas-inocencio/computer-architecture
Some projects about computer architecture: dgemm problem, vectorial adder and cpu risc-v.
Language: C - Size: 5.98 MB - Last synced: 2 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
JishinMaster/simd_utils
A header only library implementing common mathematical functions using SIMD intrinsics
Language: C - Size: 1.59 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 75 - Forks: 18
Alex313031/beaker-ng Fork of beakerbrowser/beaker
An experimental peer-to-peer Web browser - Compiler optimized, community maintained fork.
Language: JavaScript - Size: 44 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
powturbo/Turbo-Base64
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Language: C - Size: 439 KB - Last synced: 2 months ago - Pushed: 9 months ago - Stars: 245 - Forks: 36
Alex313031/Thorium-Special
Special builds of Thorium for SSE3 and different processors.
Size: 204 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 180 - Forks: 5
mind/wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Size: 39.1 KB - Last synced: about 1 month ago - Pushed: almost 5 years ago - Stars: 888 - Forks: 109
opencodewin/libmidi
midi player base on timidity and imgui
Language: C - Size: 15.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 61 - Forks: 11
guzba/nimsimd
Pleasant Nim bindings for SIMD instruction sets.
Language: Nim - Size: 65.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 60 - Forks: 6
Alex313031/Thorium-Linux-AVX2
Repo to serve AVX2 Linux builds of Thorium. https://github.com/Alex313031/Thorium/
Size: 9.77 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 26 - Forks: 0
minio/sha256-simd
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Language: Go - Size: 171 KB - Last synced: 3 months ago - Pushed: 12 months ago - Stars: 919 - Forks: 118
dzaima/intrinsics-viewer
x86-64, ARM, and RVV intrinsics viewer
Language: JavaScript - Size: 727 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 16 - Forks: 1
Geolm/math_intrinsics
One header file library that implement missing transcendental math functions (cos, sin, acos, and more....) using 100% AVX/Neon instructions (no branching)
Language: C - Size: 213 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
PaddlePaddle/FlyCV
FlyCV is a high-performance library for processing computer visual tasks.
Language: C++ - Size: 28.1 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 559 - Forks: 56
NIR3X/FastXor.cpp
FastXor - SIMD-based XOR Encryption
Language: C++ - Size: 24.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
romz-pl/matrix-matrix-multiply
Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512
Language: C++ - Size: 55.7 KB - Last synced: 25 days ago - Pushed: almost 3 years ago - Stars: 10 - Forks: 2
swojtasiak/fcml-lib
A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).
Language: C - Size: 22.9 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 81 - Forks: 24
Alex313031/Thorium-Win-AVX2
Repo to serve AVX2 Windows builds of Thorium. https://github.com/Alex313031/Thorium/
Language: Batchfile - Size: 294 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 358 - Forks: 8
PoC-Consortium/engraver
PoCC Burstcoin Reference Plotter
Language: Rust - Size: 427 KB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 62 - Forks: 39
tk-yoshimura/AvxUInt
AVX Accelerated BigUInt Arithmetic Implements
Language: C# - Size: 236 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
anas-899/l2_distance_SIMD
NEON, AVX, SSE, C implementations for l2 distance
Language: C++ - Size: 5.86 KB - Last synced: 4 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
cjmcv/hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Language: C++ - Size: 1.82 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 49 - Forks: 5
alainesp/simd-function
Python library to metaprogram C/C++ functions using SIMD instruction sets
Size: 145 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
FCLC/AdvancedCiderXtensions
Measure accelerate BLAS performance
Language: Swift - Size: 65.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 5 - Forks: 1
IvanMzk/culib
Culib - library to work with CUDA using STL-like abstract types and algorithms
Language: C++ - Size: 443 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
falkosch/edu.schwabe.raytracer
SSE/AVX accelerated implementation of recursive raytracing (a.k.a. Whitted Raytracing). Creative commons CC-BY-NC-SA licensed
Language: C++ - Size: 28.7 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
alignedalignof/avx-image-integral
Image integral calculation using AVX
Language: C++ - Size: 131 KB - Last synced: 5 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
alignedalignof/avx-4x8-filter
Small fixed size image correlation filter implemented with AVX
Language: C++ - Size: 2.93 KB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
bgin/Radar_ElectroOptical_Simulation
(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
Language: Fortran - Size: 39.2 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 45 - Forks: 14
Geolm/simd
Neon/AVX simd library, vector size agnostic
Language: C - Size: 269 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
nevinbaiju/transformer_cpp_ITCS-5182
Optimization of Attention layers for efficient inferencing on the CPU and GPU. It covers optimizations for AVX and CUDA also efficient memory processing techniques.
Language: C++ - Size: 96.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
manticore-projects/fpng-java
Java Wrapper for the fast, native FPNG Encoder
Language: C++ - Size: 28.5 MB - Last synced: 25 days ago - Pushed: 5 months ago - Stars: 2 - Forks: 2
pcineverdies/FFT-AVX-512 π¦
Fast Fourier Transform implementation though x86 AVX-512 SIMD extension
Language: C++ - Size: 7.81 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 1
sergcpp/math
Vector math library
Language: C++ - Size: 633 KB - Last synced: 5 months ago - Pushed: almost 6 years ago - Stars: 4 - Forks: 0
blackccpie/fastconv
fast 2D convolution implementation benchmark
Language: C++ - Size: 16.6 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 2
Jacob-C-Smith/vectorize
High level abstractions for vectorized computing
Language: C - Size: 97.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
whypet/Hedra
A fast SIMD-optimized C++ 3D software renderer
Language: C++ - Size: 20.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
l33tlamer/mongodb-without-avx Fork of rnsc/mongodb-without-avx
MongoDB v5/6 without AVX CPU requirement (Docker Image)
Language: Shell - Size: 44.9 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
GregoryIstratov/mdb
Framework for making computation on CPU
Language: C - Size: 336 KB - Last synced: 6 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
Mathieu-Le-Gouill/Neural_Network
From scratch C++ Neural Network based on MNIST dataset using templated Tensors with SIMD intrinsics
Language: C++ - Size: 18.5 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
gaasedelen/microavx
An AVX Lifter for the Hex-Rays Decompiler
Language: Python - Size: 102 KB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 240 - Forks: 28
Steppenwolfe65/CEX
The CEX Cryptographic library in C++
Language: HTML - Size: 3.42 GB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 55 - Forks: 25
Martinsos/opal
SIMD C/C++ library for massive optimal sequence alignment (local/SW, infix, overlap, global)
Language: C++ - Size: 19 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 28 - Forks: 8