GitHub topics: intel-intrinsics
DLTcollab/sse2neon
A translator from Intel SSE intrinsics to Arm/AArch64 NEON implementation
Language: C++ - Size: 2.8 MB - Last synced at: 11 days ago - Pushed at: 14 days ago - Stars: 1,370 - Forks: 219

z1skgr/SIMD-instruction-MPI-PTHREADS-parallism
Parallelism standards for accelerating the calculations used to detect positive selection in DNA
Language: C - Size: 866 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

z1skgr/OpenMP-pthreads-parallelComputing
Parallelization protocols for accelerating algorithm performance
Language: C - Size: 6.45 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

FilippoSiri/mandelbrot-optimization
Project that aims to optimize an algorithm that generates the Mandelbrot set, using parallelization, vectorization, and CUDA
Language: C++ - Size: 1.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

m3y54m/sobel-simd-opencv
Using SIMD instructions for image processing with OpenCV
Language: C++ - Size: 114 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

unikraft/lib-psimd
Unikraft port of psimd, portable SIMD intrinsics
Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

AntoinePassemiers/Cythrinsic
Header files for allowing Intel intrinsics in Cython
Language: Python - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

unikraft/lib-intel-intrinsics
Unikraft port of Intel intrinsics
Language: C - Size: 551 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 4

borisfoko/Matrix-Multiplication-SIMD-Intrinsics-and-FPU
NxN Matrix Multiplication using SIMD with Intrinsics (MMX, SSE, SSE2, AVX, etc.) and FPU as inline ASM in C
Language: C - Size: 8.71 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

ste7en/AVX2_Vectorization_Project
CS Engineering Project - Code vectorization for AVX/AVX2 platforms with Intel Intrinsics - developed at Politecnico di Milano as a BSc student
Language: C - Size: 17 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

ell-hol/simd-parallelized-haar-transform
8x speedup of 1D Haar transform using Intel SIMD intrinsics
Language: C - Size: 116 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

ValenYamamoto/matrix-multiply-optimization
Summer internship 2020, LLNL HPCCEA. Used Intel MSRs to evaluate and optimize different matrix multiplication algorithms.
Language: C - Size: 54.7 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
