GitHub topics: gpgpu-computing
planetis-m/compute-sim
Learn and understand compute shader operations and control flow.
Language: Nim - Size: 245 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 19 - Forks: 0

d3p1/thr2pxl
A 3D (thr) model to pixel transformation with motion effect
Language: TypeScript - Size: 51.8 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

gpuweb/gpuweb
Where the GPU for the Web work happens!
Language: Bikeshed - Size: 140 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,998 - Forks: 326

mikeroyal/GPU-Guide
Graphics Processing Unit (GPU) Architecture Guide
Language: Shell - Size: 815 KB - Last synced at: about 17 hours ago - Pushed at: over 3 years ago - Stars: 203 - Forks: 16

deepakkumar1984/Amplifier.NET
Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Language: C# - Size: 3.65 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 21

m4rs-mt/ILGPU
ILGPU JIT Compiler for high-performance .Net GPU programs
Language: C# - Size: 11.1 MB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 1,534 - Forks: 129

eyalroz/cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
Language: C++ - Size: 2.85 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 837 - Forks: 83

Xeanos7913/vkgpu
Vulkan Compute shader powered General Purpose GPU programming library.
Language: C++ - Size: 40 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

ProjectPhysX/OpenCL-Wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Language: C++ - Size: 300 KB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 390 - Forks: 40

mikeroyal/Metal-Guide
Metal Guide
Language: Swift - Size: 78.1 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 128 - Forks: 11

naavis/haloray
GPU-accelerated atmospheric ice crystal halo simulator
Language: C - Size: 19.7 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 2

DoubangoTelecom/compv
Insanely fast Open Source Computer Vision library for ARM and x86 devices (Up to #50 times faster than OpenCV)
Language: C++ - Size: 323 MB - Last synced at: 29 days ago - Pushed at: 30 days ago - Stars: 198 - Forks: 44

mikeroyal/CUDA-Guide
CUDA Guide
Language: Cuda - Size: 83 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 7

inonitz/compute-shader-fluid-2d
Implementation of GPU Gems 38 using OpenGL Compute Shaders
Language: C++ - Size: 3.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

cea-hpc/HARP
Small tool for profiling the performance of hardware-accelerated Rust code using OpenCL and CUDA
Language: Rust - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

jeronimosg/gpgpu-rs
Simple experimental async GPGPU framework for Rust
Language: Rust - Size: 1.58 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 144 - Forks: 7

pleiszenburg/gravitation
n-body-simulation performance test suite
Language: Python - Size: 2.22 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 1

jinekgames/PTX4CPU
PTX interpreter which lets you run CUDA code on CPU
Language: C++ - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Bakkode/Swarm
Parallel processing abstraction based on CUDA and OpenCL (and HIP) for Java
Language: Java - Size: 341 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

OGRECave/ogre-gpgpu
GPGPU compute with Ogre using CUDA or OpenCL
Language: C++ - Size: 3.76 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 4

piellardj/water-webgpu
WebGPU water simulation handling up to a million particles.
Language: TypeScript - Size: 77.9 MB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 42 - Forks: 0

denyskryvytskyi/capgemini-simd
SIMD usage for vector additon, matrix multiplication, dot product, and substring search
Language: Assembly - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

DLR-AMR/t8gpu
Header-only finite volume library targetting GPUs using t8code as meshing backend.
Language: C++ - Size: 2.99 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

marianoktm/GVirtuS Fork of gvirtus/GVirtuS
A GPGPU Transparent Virtualization Component for High Performance Computing Clouds.
Language: C++ - Size: 11.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

scloin/local_node_connectivity_with_GPGPU
Speed up the computation of local node connectivity with CUDA
Language: Cuda - Size: 2.83 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

cdeterman/gpuR
R interface to use GPU's
Language: R - Size: 12 MB - Last synced at: 6 months ago - Pushed at: almost 5 years ago - Stars: 241 - Forks: 26

mspronesti/baylib
High-performance library for approximate inference on discrete Bayesian networks on GPU and CPU
Language: C++ - Size: 1.67 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 1

Glavnokoman/vuh
Vulkan compute for people
Language: C++ - Size: 705 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 340 - Forks: 34

DrSnowbird/blazegraph 📦
blazegraph
Language: Java - Size: 89.8 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

MPSQUARK/BAVCL
Hardware-accelerated Vector Compute Library for .NET Containing Quality of life improvements and functionality intended for data science, graphical processing and GPGPU.
Language: C# - Size: 1.77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

ipapadop/accxx
A library for using accelerators (CUDA and OpenCL) in modern C++
Language: C++ - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

b0nes164/ShaderOneSweep 📦
A compute shader implementation of the OneSweep sorting algorithm.
Language: HLSL - Size: 93.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 57 - Forks: 6

coldfunction/qCUDA
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
Language: C - Size: 89.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 31

bastie/GPGPU
GPGPU 2024
Language: Swift - Size: 63.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MrMagnifico/cuda-fluid-sim
2D fluid simulation in CUDA
Language: C++ - Size: 4.86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

codeplaysoftware/LPGPU2-CodeXL 📦
LPGPU2 CodeXL power performance analysis and feedback tool for GPUs
Language: C++ - Size: 985 MB - Last synced at: 6 months ago - Pushed at: about 6 years ago - Stars: 35 - Forks: 12

AbhishekRS4/cuda_parallel_programs
Cuda Parallel Programming Kernels
Language: Cuda - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gyatskov/radix-sort
GPU optimized implementation of Radix Sort via OpenCL
Language: C++ - Size: 2.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

law-dwg/LSQR-CUDA
This is a LSQR-CUDA implementation written by Lawrence Ayers under the supervision of Stefan Guthe of the GRIS institute at the Technische Universität Darmstadt. The LSQR library was authored Chris Paige and Michael Saunders.
Language: Cuda - Size: 117 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

yasuohasegawa/UnityGPGPUSample
This sample code is experimental. It's too many batches and the Setpass calls. It does work as the GPGPU.
Language: C# - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Konrad-Ziarko/Levenshtein
Sniffer, KeyLogger, Clipboard listener, USB scanner with ADS support; Computes Levenshtein minimum edit-distance between two strings
Language: C# - Size: 1.26 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 2

crazybiocomputing/times
Tiny Image Processing in ECMAScript
Language: JavaScript - Size: 1.85 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 7

mmattioli/OpenCL-Adventures
Learning how to design heterogeneous compute applications using OpenCL with an emphasis on GPU acceleration
Language: C++ - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

akhan3/mms-gpu
Micromagnetic simulator on CUDA
Language: C++ - Size: 8.65 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

architector1324/EasyCL
OpenCL based lightweight c++ computing library
Language: C++ - Size: 311 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

Qazalbash/CUDA_Spring2023 Fork of mmmovania/CUDA_Spring2023
The companion git repo for the Spring 2023 CUDA course
Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

OlivierSohn/gpgpu-experiments
experiments with OpenCL to do GPGPU.
Language: C++ - Size: 55.7 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

xLilia/CelularAutomataOpenGL_gpgpu
Celular automata | gpgpu computing
Language: C++ - Size: 2.23 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

KaoCC/Orochi-recipe
Orochi-recipe
Language: C++ - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

harubaru/vkcl
Vulkan Compute Library
Language: C++ - Size: 661 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

architector1324/EasyCL2
OpenCL based lightweight c computing library
Language: C - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qalshidi/comfi
Collisional Multi-Fluid ion MHD code
Language: C++ - Size: 1000 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

conn-team/cuda-fractals
CUDA-accelerated fractals deep zooming.
Language: C++ - Size: 3.46 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

pengc99/nvidia-gp-gpu
Bare config for GP-GPU server using nVidia video cards in Debian Linux.
Language: Shell - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

ShreyTiwari/CUDA-Programming
Contains a few basic examples to get started with CUDA parallel programming models.
Language: Cuda - Size: 8.5 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

DoubangoTelecom/ultimateText-SDK
Realtime text detection and recognition in natural scene images (in the wild) using artificial-intelligence
Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

simonboots/OpenSteerCUDA
CUDA port of OpenSteer
Language: C++ - Size: 1.02 MB - Last synced at: about 2 years ago - Pushed at: over 15 years ago - Stars: 5 - Forks: 1

dafadey/GPGPU_OpenCL_vs_CUDA
This is a repository with sample codes for testing memory bandwidth, arithmetic latency hiding and shared/local memory performance on AMD and nVidia devices
Language: C++ - Size: 41 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

VadimGush/Saku
Tool for parallel computing
Language: C++ - Size: 74.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

SamChatfield/dpc
University of Birmingham - Distributed and Parallel Computing
Language: Cuda - Size: 38.1 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

111qqz/CS344
Introduction to Parallel Programming class code
Language: Cuda - Size: 22 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

cbekar/Kuda
A complete x86 race checker accelerated by Cuda GPU
Language: Cuda - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0
