An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpgpu-computing

planetis-m/compute-sim

Learn and understand compute shader operations and control flow.

Language: Nim - Size: 245 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 19 - Forks: 0

d3p1/thr2pxl

A 3D (thr) model to pixel transformation with motion effect

Language: TypeScript - Size: 51.8 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

gpuweb/gpuweb

Where the GPU for the Web work happens!

Language: Bikeshed - Size: 140 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,998 - Forks: 326

mikeroyal/GPU-Guide

Graphics Processing Unit (GPU) Architecture Guide

Language: Shell - Size: 815 KB - Last synced at: about 17 hours ago - Pushed at: over 3 years ago - Stars: 203 - Forks: 16

deepakkumar1984/Amplifier.NET

Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.

Language: C# - Size: 3.65 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 21

m4rs-mt/ILGPU

ILGPU JIT Compiler for high-performance .Net GPU programs

Language: C# - Size: 11.1 MB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 1,534 - Forks: 129

eyalroz/cuda-api-wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

Language: C++ - Size: 2.85 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 837 - Forks: 83

Xeanos7913/vkgpu

Vulkan Compute shader powered General Purpose GPU programming library.

Language: C++ - Size: 40 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 300 KB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 390 - Forks: 40

mikeroyal/Metal-Guide

Metal Guide

Language: Swift - Size: 78.1 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 128 - Forks: 11

naavis/haloray

GPU-accelerated atmospheric ice crystal halo simulator

Language: C - Size: 19.7 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 2

DoubangoTelecom/compv

Insanely fast Open Source Computer Vision library for ARM and x86 devices (Up to #50 times faster than OpenCV)

Language: C++ - Size: 323 MB - Last synced at: 29 days ago - Pushed at: 30 days ago - Stars: 198 - Forks: 44

mikeroyal/CUDA-Guide

CUDA Guide

Language: Cuda - Size: 83 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 7

inonitz/compute-shader-fluid-2d

Implementation of GPU Gems 38 using OpenGL Compute Shaders

Language: C++ - Size: 3.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

cea-hpc/HARP

Small tool for profiling the performance of hardware-accelerated Rust code using OpenCL and CUDA

Language: Rust - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

jeronimosg/gpgpu-rs

Simple experimental async GPGPU framework for Rust

Language: Rust - Size: 1.58 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 144 - Forks: 7

pleiszenburg/gravitation

n-body-simulation performance test suite

Language: Python - Size: 2.22 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 1

jinekgames/PTX4CPU

PTX interpreter which lets you run CUDA code on CPU

Language: C++ - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Bakkode/Swarm

Parallel processing abstraction based on CUDA and OpenCL (and HIP) for Java

Language: Java - Size: 341 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

OGRECave/ogre-gpgpu

GPGPU compute with Ogre using CUDA or OpenCL

Language: C++ - Size: 3.76 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 4

piellardj/water-webgpu

WebGPU water simulation handling up to a million particles.

Language: TypeScript - Size: 77.9 MB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 42 - Forks: 0

denyskryvytskyi/capgemini-simd

SIMD usage for vector additon, matrix multiplication, dot product, and substring search

Language: Assembly - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

DLR-AMR/t8gpu

Header-only finite volume library targetting GPUs using t8code as meshing backend.

Language: C++ - Size: 2.99 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

marianoktm/GVirtuS Fork of gvirtus/GVirtuS

A GPGPU Transparent Virtualization Component for High Performance Computing Clouds.

Language: C++ - Size: 11.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

scloin/local_node_connectivity_with_GPGPU

Speed up the computation of local node connectivity with CUDA

Language: Cuda - Size: 2.83 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

cdeterman/gpuR

R interface to use GPU's

Language: R - Size: 12 MB - Last synced at: 6 months ago - Pushed at: almost 5 years ago - Stars: 241 - Forks: 26

mspronesti/baylib

High-performance library for approximate inference on discrete Bayesian networks on GPU and CPU

Language: C++ - Size: 1.67 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 1

Glavnokoman/vuh

Vulkan compute for people

Language: C++ - Size: 705 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 340 - Forks: 34

DrSnowbird/blazegraph 📦

blazegraph

Language: Java - Size: 89.8 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

MPSQUARK/BAVCL

Hardware-accelerated Vector Compute Library for .NET Containing Quality of life improvements and functionality intended for data science, graphical processing and GPGPU.

Language: C# - Size: 1.77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

ipapadop/accxx

A library for using accelerators (CUDA and OpenCL) in modern C++

Language: C++ - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

b0nes164/ShaderOneSweep 📦

A compute shader implementation of the OneSweep sorting algorithm.

Language: HLSL - Size: 93.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 57 - Forks: 6

coldfunction/qCUDA

qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization

Language: C - Size: 89.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 31

bastie/GPGPU

GPGPU 2024

Language: Swift - Size: 63.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MrMagnifico/cuda-fluid-sim

2D fluid simulation in CUDA

Language: C++ - Size: 4.86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

codeplaysoftware/LPGPU2-CodeXL 📦

LPGPU2 CodeXL power performance analysis and feedback tool for GPUs

Language: C++ - Size: 985 MB - Last synced at: 6 months ago - Pushed at: about 6 years ago - Stars: 35 - Forks: 12

AbhishekRS4/cuda_parallel_programs

Cuda Parallel Programming Kernels

Language: Cuda - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gyatskov/radix-sort

GPU optimized implementation of Radix Sort via OpenCL

Language: C++ - Size: 2.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

law-dwg/LSQR-CUDA

This is a LSQR-CUDA implementation written by Lawrence Ayers under the supervision of Stefan Guthe of the GRIS institute at the Technische Universität Darmstadt. The LSQR library was authored Chris Paige and Michael Saunders.

Language: Cuda - Size: 117 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

yasuohasegawa/UnityGPGPUSample

This sample code is experimental. It's too many batches and the Setpass calls. It does work as the GPGPU.

Language: C# - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Konrad-Ziarko/Levenshtein

Sniffer, KeyLogger, Clipboard listener, USB scanner with ADS support; Computes Levenshtein minimum edit-distance between two strings

Language: C# - Size: 1.26 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 2

crazybiocomputing/times

Tiny Image Processing in ECMAScript

Language: JavaScript - Size: 1.85 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 7

mmattioli/OpenCL-Adventures

Learning how to design heterogeneous compute applications using OpenCL with an emphasis on GPU acceleration

Language: C++ - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

akhan3/mms-gpu

Micromagnetic simulator on CUDA

Language: C++ - Size: 8.65 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

architector1324/EasyCL

OpenCL based lightweight c++ computing library

Language: C++ - Size: 311 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

Qazalbash/CUDA_Spring2023 Fork of mmmovania/CUDA_Spring2023

The companion git repo for the Spring 2023 CUDA course

Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

OlivierSohn/gpgpu-experiments

experiments with OpenCL to do GPGPU.

Language: C++ - Size: 55.7 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

xLilia/CelularAutomataOpenGL_gpgpu

Celular automata | gpgpu computing

Language: C++ - Size: 2.23 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

KaoCC/Orochi-recipe

Orochi-recipe

Language: C++ - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

harubaru/vkcl

Vulkan Compute Library

Language: C++ - Size: 661 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

architector1324/EasyCL2

OpenCL based lightweight c computing library

Language: C - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qalshidi/comfi

Collisional Multi-Fluid ion MHD code

Language: C++ - Size: 1000 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

conn-team/cuda-fractals

CUDA-accelerated fractals deep zooming.

Language: C++ - Size: 3.46 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

pengc99/nvidia-gp-gpu

Bare config for GP-GPU server using nVidia video cards in Debian Linux.

Language: Shell - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

ShreyTiwari/CUDA-Programming

Contains a few basic examples to get started with CUDA parallel programming models.

Language: Cuda - Size: 8.5 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

DoubangoTelecom/ultimateText-SDK

Realtime text detection and recognition in natural scene images (in the wild) using artificial-intelligence

Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

simonboots/OpenSteerCUDA

CUDA port of OpenSteer

Language: C++ - Size: 1.02 MB - Last synced at: about 2 years ago - Pushed at: over 15 years ago - Stars: 5 - Forks: 1

dafadey/GPGPU_OpenCL_vs_CUDA

This is a repository with sample codes for testing memory bandwidth, arithmetic latency hiding and shared/local memory performance on AMD and nVidia devices

Language: C++ - Size: 41 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

VadimGush/Saku

Tool for parallel computing

Language: C++ - Size: 74.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

SamChatfield/dpc

University of Birmingham - Distributed and Parallel Computing

Language: Cuda - Size: 38.1 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

111qqz/CS344

Introduction to Parallel Programming class code

Language: Cuda - Size: 22 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

cbekar/Kuda

A complete x86 race checker accelerated by Cuda GPU

Language: Cuda - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0