Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: avx

Dr-Noob/peakperf

Achieve peak performance on x86 CPUs and NVIDIA GPUs

Language: C++ - Size: 244 KB - Last synced: about 5 hours ago - Pushed: about 6 hours ago - Stars: 56 - Forks: 13

bgin/Radar-ElectroOptical-Simulation

(REOS) Radar and Electro-Optical Simulation Framework written in C++.

Language: C++ - Size: 28.3 MB - Last synced: about 11 hours ago - Pushed: about 12 hours ago - Stars: 51 - Forks: 16

Dioarya/mandelbrotset-image-generator

Rewrite of a personal project from back in December 2023.

Language: C++ - Size: 328 KB - Last synced: about 21 hours ago - Pushed: 1 day ago - Stars: 0 - Forks: 0

nidud/asmc

Masm compatible assembler

Language: Assembly - Size: 67.9 MB - Last synced: about 3 hours ago - Pushed: 1 day ago - Stars: 12 - Forks: 4

path-racer/pathlib

Lightweight AVX-optimized containers and routines for the Path game engine.

Language: C - Size: 102 MB - Last synced: about 23 hours ago - Pushed: 1 day ago - Stars: 1 - Forks: 0

Nemandza82/Symd

C++ header only template library designed to make it easier to write high-performance SIMD (SSE, AVX, Neon) and multi-threaded code.

Language: C++ - Size: 861 KB - Last synced: about 22 hours ago - Pushed: 2 days ago - Stars: 4 - Forks: 3

microsoft/DirectXMath

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps

Language: C++ - Size: 2.18 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1,485 - Forks: 227

kfrlib/kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

Language: C++ - Size: 12 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 1,596 - Forks: 246

OpenNMT/CTranslate2

Fast inference engine for Transformer models

Language: C++ - Size: 13.5 MB - Last synced: 4 days ago - Pushed: 11 days ago - Stars: 2,828 - Forks: 249

redorav/hlslpp

Math library using hlsl syntax with SSE/NEON support

Language: C++ - Size: 7.61 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 451 - Forks: 39

lssfau/ExaStencils

Mirror of the official ExaStencils Project repository. Please open pull requests on GitLab: https://i10git.cs.fau.de/exastencils/exastencils

Language: Scala - Size: 299 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 3 - Forks: 1

spnda/fastgltf

A modern C++17 glTF 2.0 library focused on speed, correctness, and usability

Language: C++ - Size: 2.19 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 224 - Forks: 27

recp/cglm

📽 Highly Optimized 2D / 3D Graphics Math (glm) for C

Language: C - Size: 2.51 MB - Last synced: 10 days ago - Pushed: 19 days ago - Stars: 2,050 - Forks: 216

HugeONotation/AVEL

Another Vector Extensions Library

Language: C++ - Size: 1.21 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

HanabishiRecca/bin-cpuflags-x86

A small CLI tool to detect CPU flags (instruction sets) of X86 binaries.

Language: Rust - Size: 32.2 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 11 - Forks: 0

ermig1979/Simd

C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, NEON for ARM.

Language: C++ - Size: 38.3 MB - Last synced: 6 days ago - Pushed: 8 days ago - Stars: 1,977 - Forks: 403

RRZE-HPC/OSACA

Open Source Architecture Code Analyzer

Language: Jupyter Notebook - Size: 8.19 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 274 - Forks: 15

libxsmm/libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

Language: C - Size: 297 MB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 795 - Forks: 181

aff3ct/MIPP

MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).

Language: C++ - Size: 2.01 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 463 - Forks: 86

mrecachinas/hexhamming

:heavy_division_sign: SIMD-accelerated bitwise hamming distance Python module for hexadecimal strings

Language: C++ - Size: 556 KB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 17 - Forks: 4

pypy/fast-utf8-methods

Fast UTF-8 utility methods

Language: HTML - Size: 1.26 MB - Last synced: 10 days ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 0

Alex313031/atom-ng Fork of atom/atom

:atom: The hyper-hackable text editor - Compiler Optimized, Community Maintained Fork

Language: JavaScript - Size: 337 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 26 - Forks: 1

VcDevel/Vc

SIMD Vector Classes for C++

Language: C++ - Size: 11 MB - Last synced: 9 days ago - Pushed: 3 months ago - Stars: 1,420 - Forks: 150

shibatch/sleef

SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT

Language: C - Size: 5.08 MB - Last synced: 10 days ago - Pushed: 15 days ago - Stars: 590 - Forks: 120

ClaudiuHKS/Se-Capabilities

Se Capabilities

Language: C++ - Size: 9.77 KB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

tlk00/BitMagic

BitMagic Library

Language: C++ - Size: 62.1 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 399 - Forks: 46

Erkaman/sse-avx-rasterization

Triangle rasterization routines accelerated by SSE and AVX

Language: C++ - Size: 23.4 KB - Last synced: 9 days ago - Pushed: over 6 years ago - Stars: 65 - Forks: 10

Alex313031/geany-ng Fork of geany/geany

The flyweight IDE - Compiler Optimized Builds

Language: C - Size: 64.6 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 6 - Forks: 0

manodeep/Corrfunc

⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.

Language: C - Size: 150 MB - Last synced: 7 days ago - Pushed: about 1 month ago - Stars: 162 - Forks: 49

simd-everywhere/simde

Implementations of SIMD instruction sets for systems which don't natively support them.

Language: C - Size: 35 MB - Last synced: 18 days ago - Pushed: 20 days ago - Stars: 2,168 - Forks: 225

VectorChief/QuadRay-engine

Realtime raytracer using SIMD on ARM, MIPS, PPC and x86

Language: C - Size: 14.6 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 25 - Forks: 4

VectorChief/UniSIMD-assembler

SIMD macro assembler unified for ARM, MIPS, PPC and x86

Language: C - Size: 9.11 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 85 - Forks: 7

Alex313031/Thorium-Win

Chromium fork for Windows named after radioactive element No. 90; Windows builds of https://github.com/Alex313031/Thorium

Language: Batchfile - Size: 2.45 MB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 1,149 - Forks: 30

JohT/convolution-benchmarks

Benchmark convolution implementations in C++ with Catch2 visualized with Vega-Lite

Language: C++ - Size: 5.94 MB - Last synced: 25 days ago - Pushed: 26 days ago - Stars: 1 - Forks: 1

Alex313031/Mercury

Firefox fork with compiler optimizations and patches from Librewolf, Waterfox, and GNU IceCat.

Language: JavaScript - Size: 7.67 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 925 - Forks: 21

lemire/despacer

C library to remove white space from strings as fast as possible

Language: C - Size: 1.25 MB - Last synced: about 20 hours ago - Pushed: 5 months ago - Stars: 147 - Forks: 15

RobRich999/Chromium_Clang

Chromium browser compiled with the Clang/LLVM compiler.

Size: 1.62 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 144 - Forks: 10

jcmfernandes/ob64

A fast Base64 encoder and decoder as a Ruby gem. :racehorse:

Language: Ruby - Size: 63.5 KB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

Alex313031/thorium

Chromium fork named after radioactive element No. 90. Windows and MacOS/Raspi/Android/Special builds are in different repositories, links are towards the top of the README.md.

Language: C++ - Size: 222 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 3,947 - Forks: 130

jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

Language: C++ - Size: 44.2 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 842 - Forks: 51

xtensor-stack/xsimd

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)

Language: C++ - Size: 3.64 MB - Last synced: 28 days ago - Pushed: about 1 month ago - Stars: 2,018 - Forks: 245

google/highway

Performance-portable, length-agnostic SIMD with runtime dispatch

Language: C++ - Size: 22.5 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 3,609 - Forks: 291

Balta-Stefan/Mandelbrot-viewer

Mandelbrot set viewer made in Qt (C++)

Language: C++ - Size: 3.46 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Balta-Stefan/BMP-blurrer

Language: C++ - Size: 7.81 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

MarioSieg/Corium 📦

Corium is a modern scripting language which combines simple, safe and efficient programming.

Language: C++ - Size: 248 MB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 18 - Forks: 4

thatsimo/progetto-21-22

Parallel FSS algorithm implementation in assembly-x86 (SSE, AVX, OpenMP)

Language: C - Size: 78.5 MB - Last synced: 28 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

Avereniect/AVEL

AVEL: Another Vector Extensions Library

Language: C++ - Size: 2.27 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0

bluescarni/rakau

C++17 N-body Barnes-Hut on heterogeneous hardware architectures

Language: C++ - Size: 1.26 MB - Last synced: 10 days ago - Pushed: almost 4 years ago - Stars: 20 - Forks: 5

BoringBoredom/Linpack-Extended

Linpack Extended is a stress test for 64-bit Intel processors. It is based on the Intel Math Kernel Library.

Language: HTML - Size: 52.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 1

mkn/mkn.avx

C++ AVX wrappers for manual SIMD

Language: C++ - Size: 29.3 KB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1

Alex313031/Mercury-Win7

Windows 7 builds of Mercury Browser (Based on ESR115 rather than stable tip-of-tree)

Language: JavaScript - Size: 6.45 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 25 - Forks: 1

oysteijo/simd_neuralnet

Feed-forward neural network implementation in C with SIMD instructions

Language: C - Size: 397 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 13 - Forks: 0

pq-crystals/dilithium

Language: C - Size: 454 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 327 - Forks: 112

sahmad98/vstring

Vectorized String Helper Functions

Language: C++ - Size: 61.5 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 6 - Forks: 0

ihhub/penguinV

Computer vision library with focus on heterogeneous systems

Language: C++ - Size: 3.89 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 118 - Forks: 88

Maged152/Intel-Intrinsics-CPP-Wrapper

Intel Intrinsics C++ Wrapper

Language: C++ - Size: 199 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

cristian-bicheru/detect-simd

Python library to detect CPU SIMD capabilities.

Language: C - Size: 31.3 KB - Last synced: 6 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 0

ltlollo/lattice

Vectorized primitives on Intel AVX/AVX2 for some Ring-LWE problems

Language: C - Size: 45.9 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

lucas-inocencio/computer-architecture

Some projects about computer architecture: the DGEMM problem, a vector adder, and a RISC-V CPU.

Language: C - Size: 5.98 MB - Last synced: 2 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

JishinMaster/simd_utils

A header only library implementing common mathematical functions using SIMD intrinsics

Language: C - Size: 1.59 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 75 - Forks: 18

Alex313031/beaker-ng Fork of beakerbrowser/beaker

An experimental peer-to-peer Web browser - Compiler optimized, community maintained fork.

Language: JavaScript - Size: 44 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

powturbo/Turbo-Base64

Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!

Language: C - Size: 439 KB - Last synced: 2 months ago - Pushed: 9 months ago - Stars: 245 - Forks: 36

Alex313031/Thorium-Special

Special builds of Thorium for SSE3 and different processors.

Size: 204 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 180 - Forks: 5

mind/wheels

Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)

Size: 39.1 KB - Last synced: about 1 month ago - Pushed: almost 5 years ago - Stars: 888 - Forks: 109

opencodewin/libmidi

MIDI player based on TiMidity and ImGui

Language: C - Size: 15.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 61 - Forks: 11

guzba/nimsimd

Pleasant Nim bindings for SIMD instruction sets.

Language: Nim - Size: 65.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 60 - Forks: 6

Alex313031/Thorium-Linux-AVX2

Repo to serve AVX2 Linux builds of Thorium. https://github.com/Alex313031/Thorium/

Size: 9.77 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 26 - Forks: 0

minio/sha256-simd

Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.

Language: Go - Size: 171 KB - Last synced: 3 months ago - Pushed: 12 months ago - Stars: 919 - Forks: 118

dzaima/intrinsics-viewer

x86-64, ARM, and RVV intrinsics viewer

Language: JavaScript - Size: 727 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 16 - Forks: 1

Geolm/math_intrinsics

Single-header library that implements missing transcendental math functions (cos, sin, acos, and more) using 100% AVX/Neon instructions (no branching)

Language: C - Size: 213 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

PaddlePaddle/FlyCV

FlyCV is a high-performance library for processing computer visual tasks.

Language: C++ - Size: 28.1 MB - Last synced: 3 months ago - Pushed: 11 months ago - Stars: 559 - Forks: 56

NIR3X/FastXor.cpp

FastXor - SIMD-based XOR Encryption

Language: C++ - Size: 24.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

romz-pl/matrix-matrix-multiply

Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512

Language: C++ - Size: 55.7 KB - Last synced: 25 days ago - Pushed: almost 3 years ago - Stars: 10 - Forks: 2

swojtasiak/fcml-lib

A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).

Language: C - Size: 22.9 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 81 - Forks: 24

Alex313031/Thorium-Win-AVX2

Repo to serve AVX2 Windows builds of Thorium. https://github.com/Alex313031/Thorium/

Language: Batchfile - Size: 294 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 358 - Forks: 8

PoC-Consortium/engraver

PoCC Burstcoin Reference Plotter

Language: Rust - Size: 427 KB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 62 - Forks: 39

tk-yoshimura/AvxUInt

AVX Accelerated BigUInt Arithmetic Implements

Language: C# - Size: 236 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

anas-899/l2_distance_SIMD

NEON, AVX, SSE, C implementations for l2 distance

Language: C++ - Size: 5.86 KB - Last synced: 4 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

cjmcv/hpc

Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )

Language: C++ - Size: 1.82 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 49 - Forks: 5

alainesp/simd-function

Python library to metaprogram C/C++ functions using SIMD instruction sets

Size: 145 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

FCLC/AdvancedCiderXtensions

Measure Accelerate BLAS performance

Language: Swift - Size: 65.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 5 - Forks: 1

IvanMzk/culib

Culib - library to work with CUDA using STL-like abstract types and algorithms

Language: C++ - Size: 443 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

falkosch/edu.schwabe.raytracer

SSE/AVX accelerated implementation of recursive raytracing (a.k.a. Whitted Raytracing). Creative commons CC-BY-NC-SA licensed

Language: C++ - Size: 28.7 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

alignedalignof/avx-image-integral

Image integral calculation using AVX

Language: C++ - Size: 131 KB - Last synced: 5 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

alignedalignof/avx-4x8-filter

Small fixed size image correlation filter implemented with AVX

Language: C++ - Size: 2.93 KB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

bgin/Radar_ElectroOptical_Simulation

(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.

Language: Fortran - Size: 39.2 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 45 - Forks: 14

Geolm/simd

Neon/AVX simd library, vector size agnostic

Language: C - Size: 269 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

nevinbaiju/transformer_cpp_ITCS-5182

Optimization of attention layers for efficient inference on the CPU and GPU. It covers optimizations for AVX and CUDA, as well as efficient memory processing techniques.

Language: C++ - Size: 96.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

manticore-projects/fpng-java

Java Wrapper for the fast, native FPNG Encoder

Language: C++ - Size: 28.5 MB - Last synced: 25 days ago - Pushed: 5 months ago - Stars: 2 - Forks: 2

pcineverdies/FFT-AVX-512 📦

Fast Fourier Transform implementation through the x86 AVX-512 SIMD extension

Language: C++ - Size: 7.81 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 1

sergcpp/math

Vector math library

Language: C++ - Size: 633 KB - Last synced: 5 months ago - Pushed: almost 6 years ago - Stars: 4 - Forks: 0

blackccpie/fastconv

fast 2D convolution implementation benchmark

Language: C++ - Size: 16.6 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 2

Jacob-C-Smith/vectorize

High level abstractions for vectorized computing

Language: C - Size: 97.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

whypet/Hedra

A fast SIMD-optimized C++ 3D software renderer

Language: C++ - Size: 20.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

l33tlamer/mongodb-without-avx Fork of rnsc/mongodb-without-avx

MongoDB v5/6 without AVX CPU requirement (Docker Image)

Language: Shell - Size: 44.9 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

GregoryIstratov/mdb

Framework for making computation on CPU

Language: C - Size: 336 KB - Last synced: 6 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

Mathieu-Le-Gouill/Neural_Network

From scratch C++ Neural Network based on MNIST dataset using templated Tensors with SIMD intrinsics

Language: C++ - Size: 18.5 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

gaasedelen/microavx

An AVX Lifter for the Hex-Rays Decompiler

Language: Python - Size: 102 KB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 240 - Forks: 28

Steppenwolfe65/CEX

The CEX Cryptographic library in C++

Language: HTML - Size: 3.42 GB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 55 - Forks: 25

Martinsos/opal

SIMD C/C++ library for massive optimal sequence alignment (local/SW, infix, overlap, global)

Language: C++ - Size: 19 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 28 - Forks: 8