An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpu-programming

romitjain/learning-gpu-programming

Learnings and experimentation with GPU programming

Language: Cuda - Size: 226 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 1 - Forks: 0

Nicolas-Ferre/wgso

WebGPU Shader Orchestrator to create GPU-native applications

Language: Rust - Size: 125 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Alan-Rock-GS/GpuScript

GpuScript allows you to write C# programs that run at supercomputer speeds on a single GPU. Learn it in 30 minutes. Write & debug large and complex projects specifically designed to run on the GPU.

Size: 142 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 191 - Forks: 18

arminkz/VulkanEngine

Vulkan boilerplate / examples

Language: C++ - Size: 164 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

geomstats/geomstats

Computations and statistics on manifolds with geometric structures.

Language: Python - Size: 211 MB - Last synced at: about 7 hours ago - Pushed at: 2 months ago - Stars: 1,346 - Forks: 260

Young-TW/hippp

Write GPU program with RAII

Language: C++ - Size: 4.88 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

EmbarkStudios/rust-gpu

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Language: Rust - Size: 248 MB - Last synced at: about 1 hour ago - Pushed at: 6 months ago - Stars: 7,482 - Forks: 249

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

Language: C++ - Size: 137 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10,810 - Forks: 1,272

maya-undefined/gpu-desktop-calculator

Language: Cuda - Size: 48.8 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 0

Rust-GPU/Rust-CUDA

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Language: Rust - Size: 5.99 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 4,345 - Forks: 182

YichengDWu/MoYe.jl

Programming Gemm Kernels on NVIDIA GPUs with Tensor Cores in Julia

Language: Julia - Size: 7.24 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 41 - Forks: 0

Misteri4452y/taskflow

Smart weekly planner with auto-scheduling and Google Calendar integration

Language: Python - Size: 31.3 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

exaloop/codon

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Language: Python - Size: 5.87 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15,645 - Forks: 535

QianMo/GPU-Gems-Book-Source-Code

:cd: CD Content ( Source Code ) Collection of Book <GPU Gems > 1~ 3 | 《GPU精粹》 1~ 3 随书CD(源代码)珍藏

Language: C++ - Size: 1.01 GB - Last synced at: 4 days ago - Pushed at: about 7 years ago - Stars: 1,072 - Forks: 447

plasma-umass/scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language: Python - Size: 13.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,626 - Forks: 405

NVIDIA/cccl

CUDA Core Compute Libraries

Language: C++ - Size: 79.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,621 - Forks: 212

taichi-dev/taichi

Productive, portable, and performant GPU programming in Python.

Language: C++ - Size: 57.4 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 27,057 - Forks: 2,342

johannesugb/VolumetricLinesUnity

Source of the Volumetric Lines Asset from Unity's Asset Store

Language: C# - Size: 1.52 MB - Last synced at: about 4 hours ago - Pushed at: about 3 years ago - Stars: 196 - Forks: 20

software-mansion/TypeGPU

TypeScript library that enhances the WebGPU API, allowing resource management in a type-safe, declarative way.

Language: TypeScript - Size: 56.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 340 - Forks: 8

mikeroyal/GPU-Guide

Graphics Processing Unit (GPU) Architecture Guide

Language: Shell - Size: 815 KB - Last synced at: about 9 hours ago - Pushed at: over 3 years ago - Stars: 203 - Forks: 16

fastflow/fastflow

FastFlow pattern-based parallel programming framework (formerly on sourceforge)

Language: C++ - Size: 136 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 292 - Forks: 70

JuliaGPU/AMDGPU.jl

AMD GPU (ROCm) programming in Julia

Language: Julia - Size: 8.79 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 301 - Forks: 57

shreyansh26/MLSys-Experiments

A collection of scripts on experimenting and implementing MLSys-related stuff

Language: Jupyter Notebook - Size: 78.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

eomii/rules_ll

An Upstream Clang/LLVM-based toolchain for contemporary C++ and heterogeneous programming

Language: Starlark - Size: 3.96 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 89 - Forks: 10

GameWin221/Gemino

⚡High-Performance Vulkan Renderer🌋

Language: C++ - Size: 8.66 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 4 - Forks: 0

Rust-GPU/rust-gpu

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Language: Rust - Size: 291 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,769 - Forks: 51

calebwin/emu

The write-once-run-anywhere GPGPU library for Rust

Language: Rust - Size: 342 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1,606 - Forks: 52

QianMo/GPU-Pro-Books-Source-Code

:cd: Source Code Collection of Book <GPU Pro> 1~ 7 | 《GPU Pro》1~ 7 书本源代码珍藏

Language: GLSL - Size: 2.73 GB - Last synced at: 11 days ago - Pushed at: over 5 years ago - Stars: 680 - Forks: 348

pjyi2147/CUDA_HTN_Workshop

Introduction to Nvidia CUDA workshop repository @ Hack the North 2024

Language: Jupyter Notebook - Size: 8.47 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 5 - Forks: 2

adamnemecek/awesome-metal

A collection of Metal and MetalKit projects and resources. Very much work in progress.

Size: 21.5 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 215 - Forks: 20

Vincent-Therrien/gpu-arena

Compare and test GPU programming frameworks

Language: C++ - Size: 3.52 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 109 - Forks: 8

NLeSC-COMPAS/kmm

KMM: parallel dataflow scheduler and efficient memory management for multi-GPU platforms

Language: C++ - Size: 6.85 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Rontim/GPU-Parallel-Processing-AI

This repository explores the use of GPU parallel processing in the context of Artificial Intelligence (AI), specifically leveraging GPUs for accelerating computations in deep learning tasks.

Language: Jupyter Notebook - Size: 54.9 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

i-Taylo/iUnlockerGL

iUnlocker GLTool is a Magisk module designed to spoof GPU information, allowing users to modify GPU informations for unlocking graphics in games and testing.

Language: Shell - Size: 85.6 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 12 - Forks: 1

kartavyaantani/CUDA_IMAGE_PROCESSING

A CUDA-accelerated image processing project featuring multiple GPU-based filters and enhancement techniques. Implements convolution, edge detection, Non-Local Means (NLM) denoising, K-Nearest Neighbors (KNN), and pixelization. Each operation is optimized using CUDA kernels for real-time performance on large images. The project supports command-line

Language: Jupyter Notebook - Size: 5.4 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 300 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 390 - Forks: 40

AmesingFlank/taichi.js

Modern GPU Compute and Rendering in Javascript

Language: TypeScript - Size: 220 MB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 491 - Forks: 19

palapav/triton-compute-kernels

A collection of Triton compute kernels for common ML operations

Size: 3.91 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

yashkathe/Image-Noise-Reduction-with-CUDA

This project conducts an analysis of image denoising technique - median blur, comparing GPU-accelerated (Numba) and CPU-based (OpenCV) processing speeds.

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 3 - Forks: 0

wmmae/wmma_extension

An extension library of WMMA API (Tensor Core API)

Language: Cuda - Size: 698 KB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 96 - Forks: 15

LLNL/CARE

CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.

Language: C++ - Size: 1.47 MB - Last synced at: 12 days ago - Pushed at: 26 days ago - Stars: 30 - Forks: 4

aryagxr/cuda

100 Days of CUDA!!!

Language: Cuda - Size: 120 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 5 - Forks: 0

andrewmilson/ministark

🏃‍♂️💨 GPU accelerated STARK prover built on @arkworks-rs

Language: Rust - Size: 1.65 MB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 354 - Forks: 36

uber/aresdb

A GPU-powered real-time analytics storage and query engine.

Language: Go - Size: 12.4 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 3,047 - Forks: 232

NVIDIA/optix-dev

OptiX SDK headers, everything needed to build & run OptiX applications. SDK samples not included.

Language: C++ - Size: 186 KB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 18 - Forks: 2

aditiisaxena/CUDA-Accelerated-Box-Filter-for-Texture-Image-Enhancement

Enhances grayscale texture images using a CUDA-based box filter. Built with CUDA, C++14, and OpenCV for high-performance image processing.

Language: Cuda - Size: 65.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bfGraph/STGraph

🌟 Vertex Centric approach for building GNN/TGNNs

Language: Python - Size: 13.7 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 0

eedalong/ECE408

Code base and slides for ECE408:Applied Parallel Programming On GPU.

Language: C++ - Size: 35.6 MB - Last synced at: 22 days ago - Pushed at: almost 4 years ago - Stars: 122 - Forks: 34

lucidrains/triton-transformer

Implementation of a Transformer, but completely in Triton

Language: Python - Size: 34.3 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 263 - Forks: 16

AlfonsoLRz/LiDAR_BRDF

Source code of "Enhancing LiDAR point cloud generation with BRDF-based appearance modelling" (yet to be published).

Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

NielsOuvrard/metal-sand-box

Metal graphics experiments based on 'Metal by Tutorials'

Language: Swift - Size: 16.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

tgautam03/xGeMM

Accelerated General (FP32) Matrix Multiplication from scratch in CUDA

Language: Cuda - Size: 5.8 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 111 - Forks: 6

Nicolas-Ferre/ragna

A Rust library for easily creating GPU-native applications

Language: Rust - Size: 197 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hollance/metal-gpgpu

Collection of notes on how to use Apple’s Metal API for compute tasks

Size: 1000 Bytes - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 103 - Forks: 4

tgautam03/tGeMM

General Matrix Multiplication using NVIDIA Tensor Cores

Language: Cuda - Size: 47.9 KB - Last synced at: 27 days ago - Pushed at: 4 months ago - Stars: 13 - Forks: 3

ysh329/OpenCL-101

Learn OpenCL step by step.

Language: C - Size: 476 KB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 135 - Forks: 29

tgautam03/xFilters

GPU (CUDA) accelerated filters using 2D convolution for high resolution images.

Language: C++ - Size: 58.2 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 1

ankhoa1212/cuda-program

This is a GPU program built with CUDA using parallel reduction

Language: C - Size: 13.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Awrsha/Advanced-CUDA-Programming-GPU-Architecture

This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.

Language: Cuda - Size: 25.2 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mikeroyal/Vulkan-Guide

Vulkan Guide

Language: C++ - Size: 43 KB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 28 - Forks: 2

ProjectPhysX/PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

Language: C++ - Size: 11.7 KB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 50 - Forks: 6

QianMo/Game-Programmer-Study-Notes

:anchor: 我的游戏程序员生涯的读书笔记合辑。你可以把它看作一个加强版的Blog。涉及图形学、实时渲染、编程实践、GPU编程、设计模式、软件工程等内容。Keep Reading , Keep Writing , Keep Coding.

Size: 752 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 9,412 - Forks: 1,722

vista-art/fragmentcolor

🦀 Easy GPU programming for Javascript, Python, Swift, and Kotlin.

Language: Rust - Size: 49.2 MB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

unisa-hpc/sycl-bench

SYCL Benchmark Suite

Language: C++ - Size: 24.7 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 64 - Forks: 34

michel-meneses/great-opencl-examples

Collection of easy, well-documented and useful OpenCL examples in C++.

Language: C++ - Size: 1000 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 27

Heteroflow/Heteroflow

Concurrent CPU-GPU Programming using Task Models

Language: C++ - Size: 1.58 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

xframes-project/xframes

GPU-accelerated GUI development for the desktop and the browser

Language: TypeScript - Size: 28.4 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 13 - Forks: 0

coderonion/cuda-beginner-course-cpp-version

bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码

Language: Cuda - Size: 20.5 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 30 - Forks: 4

jayeshthk/Parallel_Computing Fork of ShashankDavalgi/Parallel_Computing

CUDA computing example repo. with complex matrix multiplication.

Language: C - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Elsword016/100days_Triton

Learning triton and GPU acceleration from scratch

Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

coderonion/cuda-beginner-course-rust-version

bilibili视频【CUDA 12.x 并行编程入门(Rust版)】配套代码

Language: Rust - Size: 10.7 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 6 - Forks: 0

ParaGroup/WindFlow

A C++17 Data Stream Processing Parallel Library for Multicores and GPUs

Language: C++ - Size: 48.9 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 81 - Forks: 19

DannyDoesGraphics/DARE

Danny's Awesome Rendering Engine

Language: Rust - Size: 4.35 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

machineko/SwiftCU

SwiftCU is a wrapper for CUDA runtime API's (exposed as cxxCU) with extra utilities for device management, memory ops and kernel execution, along with a robust suite of tests. Repo is tested on newest (v12.5) CUDA runtime API on both Linux and Windows.

Language: Swift - Size: 613 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

gpufit/Gpufit

GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

Language: Cuda - Size: 1.16 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 319 - Forks: 96

ShadyBoukhary/GPU-research-FFT-OpenACC-CUDA

Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.

Language: Cuda - Size: 9.12 MB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 13 - Forks: 3

anselm67/CUDA_mnist

A CUDA implementation of MNIST - for CUDA beginners.

Language: Cuda - Size: 19.5 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AlexJMercer/CUDA-NPP-Assignment

Learning about CUDA and NVIDIA Performance Primitives. Part of Coursera Assignment.

Language: C++ - Size: 9.49 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DanHouseman/AdaptiveSort

C# Extention methods for super efficient sorting using CPU, GPU, and FPGA

Language: C# - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dj-himp/DX11GPUParticles

A fully gpu particle system with Directx 11

Language: C++ - Size: 240 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 1

andi611/Apriori-and-Eclat-Frequent-Itemset-Mining

Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.

Language: Python - Size: 4.05 MB - Last synced at: 28 days ago - Pushed at: over 6 years ago - Stars: 48 - Forks: 19

YaccConstructor/Brahma.FSharp Fork of gsvgit/Brahma.FSharp

F# quotation to OpenCL translator and respective runtime to utilize GPGPUs in F# applications.

Language: F# - Size: 52.1 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 75 - Forks: 17

Shapur1234/Fractl

Fractal renderer written in rust supporting multithreading, gpu compute and wasm

Language: Rust - Size: 43.7 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

abeleinin/Metal-Puzzles

Solve Puzzles. Learn Metal 🤘

Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 505 - Forks: 22

xmartlabs/cuda-calculator Fork of karthikeyann/cuda-calculator

Online CUDA Occupancy Calculator

Language: CoffeeScript - Size: 186 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 74 - Forks: 12

SartajBhuvaji/Cuda

Deloped CUDA kernel functions to load and train a Convolution Neural Network from scratch.

Language: Cuda - Size: 286 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

rbga/A51

An attempt...

Language: Python - Size: 137 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

miEsMar/BsaLib

BsaLib - a Fortran library for the Bispectral Stochastic Analysis

Language: Fortran - Size: 5.47 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

subspecs/Cocaine

Cocaine is a multi-platform C library that can be used to accelerate large workloads/big data/anything really with the power of a GPU with ease. A .NET wrapper is available in the link below.

Language: C - Size: 1.44 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Islam-hady9/deep-cuda

Image Classification with CNN in CUDA C++

Language: Jupyter Notebook - Size: 119 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

acetinkaya/Nvdia-CUDA-Setup

NVIDIA GPU Kurulumu

Size: 49.8 KB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 20 - Forks: 0

benc-uk/webgl-sandbox

Interactive editor & sandbox for creating & running WebGL2 shaders

Language: JavaScript - Size: 4.71 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pclank/GlitterCL

Interoperability between OpenGL and OpenCL made easy. Baed on the popular Glitter repo.

Language: C++ - Size: 165 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lawmurray/gpu-gemm

CUDA kernel for matrix-matrix multiplication on Nvidia GPUs, using a Hilbert curve to improve L2 cache utilization.

Language: Cuda - Size: 34.2 KB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

satyajitghana/GPU-Programming

Contains the contents of GPU Architecture and Programming course done on NPTEL

Language: Cuda - Size: 9.37 MB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 5

mikemag/Mastermind

Playing all games of Mastermind quickly

Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Brian-Jiang/GPUFluidSimulation

The goal of this project is to implement GPU driven fluid simulation and the shading of the water surface in C++ and DirectX 11 using compute shaders.

Language: C++ - Size: 34.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Brian-Jiang/GPUParticles

A GPU driven particle system built in C++ and DirectX 11, utilizing compute shaders with indirect dispatch and indirect draw call.

Language: C++ - Size: 390 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

raghavm1/GPU-PSO

A novel, optimized implementation of the Particle Swarm Optimization on GPUs, built using CUDA C++

Language: Cuda - Size: 94.7 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

light-magician/jakehendersonx.github.io

writing about software I find interesting

Language: HTML - Size: 2.67 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0