Topic: "gpu-programming"
xmartlabs/gpgpu-comparison
Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 2
maya-undefined/gpu-desktop-calculator
Language: Cuda - Size: 48.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0
coderonion/cuda-beginner-course-rust-version
bilibili视频【CUDA 12.x 并行编程入门(Rust版)】配套代码
Language: Rust - Size: 10.7 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0
priteshgohil/CUDA-programming-tutorial
Get started with CUDA programming
Language: Cuda - Size: 3.63 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3
effepivi/gvxr-CMPB
Simulation of X-ray projections on GPU: benchmarking gVirtualXray with clinically realistic phantoms
Language: Jupyter Notebook - Size: 4.2 GB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3
fbasatemur/CUDA-Matrix
2D and 3D Matrix Convolution and Matrix Multiplication with CUDA
Language: C++ - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1
estradjm/Code-Portfolio
Code Portfolio -- Collection of Interesting CS and ECE Projects in different languages (C, C++, Python, CPU & GPU Parallel Paradigms, MATLAB, and VHDL) and target hardware with technical reports, and my Vim Config
Language: C - Size: 146 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 1
vitormeriat/presentations
Slides and notes we've presented out
Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0
harsh-99/Traffic-sign-detection
Traffic sign detection and classification
Language: Jupyter Notebook - Size: 43.8 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 11
renato-yuzup/axis-fem
Hybrid CPU-GPU Finite Element Software for Structural Analysis in Mechanical Engineering
Language: MATLAB - Size: 49.7 MB - Last synced at: 8 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0
abeduplaa/BlindDeconvolutionGPU
Speeding up blind deconvolution of a blurred image by using GPUs
Language: Cuda - Size: 48.4 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 0
vanities/PolarisBiosEditor-1.6.7
AMD GPU Polaris Bios Editor
Language: C# - Size: 139 KB - Last synced at: 7 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 8
akashdeepjassal/GPU-Programming
Language: C - Size: 50.8 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 1
vista-art/fragmentcolor
🦀 Easy GPU programming for Javascript, Python, Swift, and Kotlin.
Language: Rust - Size: 63.2 MB - Last synced at: about 19 hours ago - Pushed at: 1 day ago - Stars: 6 - Forks: 0
Mgepahmge/CuWeaver
A CUDA concurrency library designed to simplify concurrency programming, offering C++-style wrappers for selected CUDA Runtime APIs
Language: Cuda - Size: 1.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0
PawseySC/sc20-gpu-offloading
Materials for "Differences between OpenACC and OpenMP offloading models" tutorial.
Language: C - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 5
jrajan14/CUDA_Programs
Nvidia CUDA Programs. High-performance computing with my collection of CUDA programs, meticulously crafted to harness the immense power of NVIDIA's GPU architecture. From blazingly fast simulations to data-intensive parallel processing, these programs showcase my passion for pushing the boundaries of performance optimization.
Language: Cuda - Size: 30.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 2
pjyi2147/CUDA_HTN_Workshop
Introduction to Nvidia CUDA workshop repository @ Hack the North 2024
Language: Jupyter Notebook - Size: 8.47 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 2
coderonion/cuda-beginner-course-python-version
bilibili视频【CUDA 12.x 并行编程入门(Python版)】配套代码
Language: Python - Size: 3.91 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0
KnowledgePending/Pycuda-Docker
🐳🐍Pycuda Docker Environment for GPU Accelerated Python
Language: Dockerfile - Size: 567 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0
thomasp85/shady
Compile and Execute Shaders from R
Language: C++ - Size: 13.7 KB - Last synced at: 7 months ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 1
benc-uk/webgl-sandbox
Interactive editor & sandbox for creating & running WebGL2 shaders
Language: JavaScript - Size: 4.71 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0
JuliaWGPU/WGPUCompute.jl
Compute shaders interface for WGPU from julia
Language: Julia - Size: 336 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1
GameWin221/Gemino
⚡High-Performance Vulkan Renderer🌋
Language: C++ - Size: 8.66 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0
evanmcclure/hello_gpu
Hello world example for Rust on GPU
Language: Rust - Size: 6.84 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0
kai-kj/microcompute
A small library for gpu computing
Language: C - Size: 486 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0
FernandoSchett/DPCPP_for_dummies
This repository contains code samples in DPC++, an extension of the C++ standard created by Intel for heterogeneous parallel programming.
Language: C++ - Size: 546 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0
Cusymint/cusymint
CUDA symbolic integration
Language: Cuda - Size: 3.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0
Qazalbash/CUDA_Spring2023 Fork of mmmovania/CUDA_Spring2023
The companion git repo for the Spring 2023 CUDA course
Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0
marcoplaitano/counting-sort-cuda
Parallelized version of Counting Sort using CUDA
Language: C - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0
pnikitakis/high-performance-computing
5 problem sets of parallel programming on CPU and GPU. University projects for High Performance Computing Systems (Fall 2016).
Language: Cuda - Size: 1.06 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0
termijn/webgl-volumerendering
WebGL based implementation of 3D volume rendering
Language: JavaScript - Size: 13.4 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4
dereklstinson/hip
go bindings for hip
Language: Go - Size: 87.9 KB - Last synced at: 7 months ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1
itslokesh/Multi-Max-Clique
Multi-Max-Clique, an application that solves Maximum Clique Problem using the parallel branch and bound approach and achieved linear and super-linear speedups in CUDA.
Language: Cuda - Size: 829 KB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0
sivagnanamn/nvidia-gpu-stats
CUDA script to check NVIDIA GPU device properties & memory available
Language: Cuda - Size: 371 KB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 3
yafangshih/GPGPU_Programming_2016S
Perlin Noise, Poisson Image Editing implemented in CUDA. Course assignments of GPU programming at National Taiwan University.
Language: Cuda - Size: 9.59 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 3
DiamondLightSource/fast-feedback-service
GPU based service to provide fast-feedback results
Language: C++ - Size: 1010 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 3
Oabraham1/chronos
Chronos is a time-based GPU partitioning utility that allows multiple users or applications to share a single GPU by creating exclusive time-limited partitions with automatic expiration. Built with OpenCL, it works across platforms including macOS (Apple Silicon & Intel), Linux, and Windows.
Language: C++ - Size: 86.9 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0
ocentra/bitnet.rs
Pure Rust engine for BitNet LLMs — Conversion, Inference, Training and Research. With streaming and GPU/CPU support
Language: Rust - Size: 2.43 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0
yashkathe/Image-Noise-Reduction-with-CUDA
This project conducts an analysis of image denoising technique - median blur, comparing GPU-accelerated (Numba) and CPU-based (OpenCV) processing speeds.
Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0
dipta007/gpu-wait
A package to run commands when GPU resources are available
Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0
predsci/multigpu-test-code
This code mimics the basic MPI+OpenACC tasks of PSI's MAS Solar MHD code, for use with testing multi-GPU multi-node clusters
Language: Fortran - Size: 36.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0
Awrsha/Advanced-CUDA-Programming-GPU-Architecture
This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.
Language: Cuda - Size: 25.2 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0
lawmurray/gpu-gemm
CUDA kernel for matrix-matrix multiplication on Nvidia GPUs, using a Hilbert curve to improve L2 cache utilization.
Language: Cuda - Size: 34.2 KB - Last synced at: 7 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0
veera-adithya-d/Hardware-aware-algorithm
Inference module of Imagenet
Language: C++ - Size: 1.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1
mina1460/GPU_programming_with_CUDA
A repo for all my projects using nVidia CUDA toolkit for programming GPGPUs
Language: C++ - Size: 349 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1
arpankapoor/pycuda-vgg16
vgg16 inference implementation using tensorflow, numpy and pycuda
Language: Python - Size: 222 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0
gurbaaz27/CS433A-Design-Exercises
Solutions of design exercises in CS433A: Parallel Programming, Spring Semester 2021-22
Language: C - Size: 722 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1
leanerr/GPU-Programming-MN-Matrices
Write a program that initializes two M×N matrices and computes the sum of the two matrices on the GPU device. After copying the result back to the host, your program should print
Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0
michael-elkh/cellular_automaton-futhark-cuda-opencl
A small project to evaluate performance between Futhark, Cuda and OpenCL
Language: C - Size: 87.9 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0
TennisGazelle/CUDA-CapsuleNetwork-Methods
A clean, pure C++/CUDA implementation of Capsule Networks, no cuDNN, TF, Keras, or libraries.
Language: C++ - Size: 24.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1
sbreban/mandelbrot-gpu
Language: C++ - Size: 77.1 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0
Mrezadwiprasetiawan/cpp-playground
A collection of C++ experiments and code created as part of exploration and practice
Language: C++ - Size: 21.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 1
shreyansh26/MLSys-Experiments
A collection of scripts on experimenting and implementing MLSys-related stuff
Language: Jupyter Notebook - Size: 83.1 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0
subspecs/Cocaine
Cocaine is a multi-platform C library that can be used to accelerate large workloads/big data/anything really with the power of a GPU with ease. A .NET wrapper is available in the link below.
Language: C - Size: 1.44 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0
mikemag/Mastermind
Playing all games of Mastermind quickly
Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0
SeungjaeLim/CUDA.tutorial
References content from the OLCF CUDA Training Series. (https://github.com/olcf/cuda-training-series)
Language: Cuda - Size: 84 KB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1
DominikLindorfer/SYCL-IntelGPU-Quickstart
Lightweight & simplified approach to SYCL development
Language: C++ - Size: 2.75 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0
GMAP/GSParLib
GSParLib: A Multi-Level Programming Interface Unifying OpenCL and CUDA for Expressing Stream and Data Parallelism
Language: C++ - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
MysteryCoder456/learn_opengl
My OpenGL Journey using Rust
Language: Rust - Size: 1.3 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
kig/glslscript
GLSL as a scripting language. Asynchronous IO runtime for Vulkan compute shaders.
Language: GLSL - Size: 91.8 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
subspecs/CocaineNET
The .NET wrapper of the Cocaine C library. Cocaine is a multi-platform C library that can be used to accelerate large workloads/big data/anything really with the power of a GPU with ease.
Language: C# - Size: 74.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
DhruvSrikanth/CUDANN
A distributed implementation of a deep learning framework in CUDA.
Language: C++ - Size: 186 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
ArmanDavoodi/Parallel-Sorting
Parallel and sequential implementations of different sorting algorithms in C++ using OpenMP and CUDA
Language: C++ - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0
joulook/Parallel-Processing-Spring-2021
In this repository you can find all of my projects for Parallel Processing Course when I was in 2nd semester of my master's at SUT.
Language: Java - Size: 3.27 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0
MehranTaghian/CUDA-OpenMP-samples
Sample codes for parallel programming using OpenMP on CPU and CUDA on GPU
Language: Cuda - Size: 4.97 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0
alexktvsky/raytracer
Raytracer implemented with CPU and GPU using CUDA
Language: C++ - Size: 2.4 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0
DanieleParravicini/FastMST
Project for Advanced Algorithm and Parallel Programming course. Academic Year 2018-2019
Language: Cuda - Size: 13.6 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0
mihi-r/numba_timer
A helper package to easily time Numba CUDA GPU events ⌛
Language: Python - Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0
JuanCasado/CUDA_2048
Implementation of 2048 game with CUDA
Language: Cuda - Size: 94.3 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1
qin-yu/julia-svm-gpu-cuda
2019 [Julia] GPU CUDAnative SVM: a stochastic decomposition implementation of support-vector machine training
Language: Cuda - Size: 20.3 MB - Last synced at: 13 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0
ImperialStranger/Python-GPU_benchmark
Python-GPU_benchmark is a module that provides all informations of your Graphics Card
Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0
barufa/GPU-QuickSort
Implementación del algoritmo GPU-QuickSort en Cuda.
Language: Cuda - Size: 3.64 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1
gholomia/Parallax
Multi-core Programming coursework and assignments under the supervision of Prof. Mahmoud Momtazpour.
Size: 6.75 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0
nmazidi/GaussianBlur-CUDA-MPI
Gaussian blurring in CUDA and MPI.
Language: C++ - Size: 60.3 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 1
debowin/gpu-parallel-recommender-system
GPGPU Parallel User-User Collaborative Filtering System in CUDA C
Language: C++ - Size: 30.8 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0
estradjm/Parallel-Gaussian-Blurring
LANL Parallel Computing Summer Research Institute 2017 GPU Exercise - C implementation of Gaussian Blurring of .ppm format image
Language: C - Size: 483 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 1
bhattmansi/Implementation-of-Cholesky-Decomposition-in-GPU-using-CUDA
Parallel implementation of Cholesky Decomposition using CUDA APIs
Language: Cuda - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2
rustamzh/cuda-kmeans
A course project of Introduction to Parallel Systems and GPU programming class
Language: Cuda - Size: 76.2 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 1
ivantag13/dist-GPU-accelerated-tree-search Fork of Guillaume-Helbecque/GPU-accelerated-tree-search-Chapel
Distributed GPU-accelerated tree search: Investigating a B&B algorithm based on a MPI+X (X=OpenMP, MPI, CUDA, HIP, etc) implementation
Language: C - Size: 664 KB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0
Misteri4452y/taskflow
Smart weekly planner with auto-scheduling and Google Calendar integration
Language: Python - Size: 31.3 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0
simar-rekhi/triton
LLM-assisted compiler pass generation with Triton & CUDA
Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0
Mantissagithub/edge_detection_gpu
GPU-accelerated Canny edge detector in CUDA C++. Parallelizes Gaussian filtering, gradient computation, non-maximum suppression, and hysteresis thresholding for real-time edge detection performance
Language: Cuda - Size: 4.49 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0
nbathreya/CUDA-Signal-Processor
GPU-Accelerated Signal Processing
Language: Python - Size: 17.6 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0
Young-TW/hippp
Write GPU program with RAII
Language: C++ - Size: 85.9 KB - Last synced at: 13 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0
AIComputing101/reinforcement-learning-101
An opinionated, end‑to‑end tutorial project for learning Reinforcement Learning (RL) from first principles to deployment. No notebooks. Everything is an explicit, inspectable Python script you can diff, profile, containerize, and ship.
Language: Python - Size: 222 KB - Last synced at: 20 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0
sudoDeVinci/skyDeVisionImager
Advanced environmental monitoring platform combining computer vision and geospatial analysis. Low-compute cloud detection, 3D terrain visualization from GeoTIFF data, multi-camera calibration, and statistical validation. scalable architecture with Flask web interface and SQLite backend.
Language: Python - Size: 20.8 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0
AmanSwar/KernelLab
collection of high-performance CUDA implementations, ranging from naive to highly optimized versions.
Language: Cuda - Size: 6.68 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
DannyDoesGraphics/DARE
Danny's Awesome Rendering Engine
Language: Rust - Size: 4.48 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
cybersecurity-dev/awesome-gpu-programming
Awesome GPU Programming
Size: 11.7 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
LLAA178/LeetGPU-Guidebook
一步步通关GPU编程
Size: 76.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0
elymsyr/auv_control_model
This repository implements an imitation learning pipeline for AUV control. It uses the "FossenNet" neural network to mimic an optimal NL-MPC policy and includes tools for data generation, training, and real-time C++ inference on GPUs.
Language: Jupyter Notebook - Size: 43.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0
RosaStack/blackmetal
Apple's Metal, everywhere!
Language: Rust - Size: 109 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
jaredhoberock/ubu
Language: C++ - Size: 1.97 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
Aelstraz/Unity-GPU-Compute
GPU Compute provides an easy way to setup & execute GPU compute shaders asynchronously in Unity. Reduces the amount of code and complexity to execute a compute shader. Create, edit and read buffers easily (buffer strides & lengths are calculated automatically). Automatically calculate optimal GPU thread group sizes for your workload. Plus more!
Language: C# - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
romitjain/learning-gpu-programming
Learnings and experimentation with GPU programming
Language: Cuda - Size: 398 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
jeffasante/metal-raymarch-rs
A basic 3D raymarcher built with Rust and Apple's Metal API. A learning project exploring SDF rendering.
Language: Rust - Size: 1020 KB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
rbga/A51-Realtime-AI-Object-Detection-with-Pyglet-Powered-UI
Real-time object detection app using YOLOv5/YOLOv8 with custom UI built from scratch using Pyglet & OpenGL. UI animations made in Adobe After Effects, rendered as GIFs, and integrated via uxElements.py. Multi-core processing enables live capture, detection, and display with low latency. Uses Open Images v7 dataset. Train mode is WIP.
Language: Python - Size: 137 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0
kartavyaantani/CUDA_IMAGE_PROCESSING
A CUDA-accelerated image processing project featuring multiple GPU-based filters and enhancement techniques. Implements convolution, edge detection, Non-Local Means (NLM) denoising, K-Nearest Neighbors (KNN), and pixelization. Each operation is optimized using CUDA kernels for real-time performance on large images. The project supports command-line
Language: Jupyter Notebook - Size: 5.4 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
machineko/SwiftCU
SwiftCU is a wrapper for CUDA runtime API's (exposed as cxxCU) with extra utilities for device management, memory ops and kernel execution, along with a robust suite of tests. Repo is tested on newest (v12.5) CUDA runtime API on both Linux and Windows.
Language: Swift - Size: 613 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0