An open API service providing repository metadata for many open source software ecosystems.

Topic: "gpu-programming"

xmartlabs/gpgpu-comparison

Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 2

maya-undefined/gpu-desktop-calculator

Language: Cuda - Size: 48.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

coderonion/cuda-beginner-course-rust-version

bilibili视频【CUDA 12.x 并行编程入门(Rust版)】配套代码

Language: Rust - Size: 10.7 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

priteshgohil/CUDA-programming-tutorial

Get started with CUDA programming

Language: Cuda - Size: 3.63 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

effepivi/gvxr-CMPB

Simulation of X-ray projections on GPU: benchmarking gVirtualXray with clinically realistic phantoms

Language: Jupyter Notebook - Size: 4.2 GB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

fbasatemur/CUDA-Matrix

2D and 3D Matrix Convolution and Matrix Multiplication with CUDA

Language: C++ - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

estradjm/Code-Portfolio

Code Portfolio -- Collection of Interesting CS and ECE Projects in different languages (C, C++, Python, CPU & GPU Parallel Paradigms, MATLAB, and VHDL) and target hardware with technical reports, and my Vim Config

Language: C - Size: 146 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 1

vitormeriat/presentations

Slides and notes we've presented out

Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0

harsh-99/Traffic-sign-detection

Traffic sign detection and classification

Language: Jupyter Notebook - Size: 43.8 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 11

renato-yuzup/axis-fem

Hybrid CPU-GPU Finite Element Software for Structural Analysis in Mechanical Engineering

Language: MATLAB - Size: 49.7 MB - Last synced at: 8 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0

abeduplaa/BlindDeconvolutionGPU

Speeding up blind deconvolution of a blurred image by using GPUs

Language: Cuda - Size: 48.4 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 0

vanities/PolarisBiosEditor-1.6.7

AMD GPU Polaris Bios Editor

Language: C# - Size: 139 KB - Last synced at: 7 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 8

akashdeepjassal/GPU-Programming

Language: C - Size: 50.8 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 1

vista-art/fragmentcolor

🦀 Easy GPU programming for Javascript, Python, Swift, and Kotlin.

Language: Rust - Size: 63.2 MB - Last synced at: about 19 hours ago - Pushed at: 1 day ago - Stars: 6 - Forks: 0

Mgepahmge/CuWeaver

A CUDA concurrency library designed to simplify concurrency programming, offering C++-style wrappers for selected CUDA Runtime APIs

Language: Cuda - Size: 1.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0

PawseySC/sc20-gpu-offloading

Materials for "Differences between OpenACC and OpenMP offloading models" tutorial.

Language: C - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 5

jrajan14/CUDA_Programs

Nvidia CUDA Programs. High-performance computing with my collection of CUDA programs, meticulously crafted to harness the immense power of NVIDIA's GPU architecture. From blazingly fast simulations to data-intensive parallel processing, these programs showcase my passion for pushing the boundaries of performance optimization.

Language: Cuda - Size: 30.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 2

pjyi2147/CUDA_HTN_Workshop

Introduction to Nvidia CUDA workshop repository @ Hack the North 2024

Language: Jupyter Notebook - Size: 8.47 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 2

coderonion/cuda-beginner-course-python-version

bilibili视频【CUDA 12.x 并行编程入门(Python版)】配套代码

Language: Python - Size: 3.91 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

KnowledgePending/Pycuda-Docker

🐳🐍Pycuda Docker Environment for GPU Accelerated Python

Language: Dockerfile - Size: 567 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

thomasp85/shady

Compile and Execute Shaders from R

Language: C++ - Size: 13.7 KB - Last synced at: 7 months ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 1

benc-uk/webgl-sandbox

Interactive editor & sandbox for creating & running WebGL2 shaders

Language: JavaScript - Size: 4.71 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

JuliaWGPU/WGPUCompute.jl

Compute shaders interface for WGPU from julia

Language: Julia - Size: 336 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

GameWin221/Gemino

⚡High-Performance Vulkan Renderer🌋

Language: C++ - Size: 8.66 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

evanmcclure/hello_gpu

Hello world example for Rust on GPU

Language: Rust - Size: 6.84 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

kai-kj/microcompute

A small library for gpu computing

Language: C - Size: 486 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

FernandoSchett/DPCPP_for_dummies

This repository contains code samples in DPC++, an extension of the C++ standard created by Intel for heterogeneous parallel programming.

Language: C++ - Size: 546 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

Cusymint/cusymint

CUDA symbolic integration

Language: Cuda - Size: 3.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

Qazalbash/CUDA_Spring2023 Fork of mmmovania/CUDA_Spring2023

The companion git repo for the Spring 2023 CUDA course

Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

marcoplaitano/counting-sort-cuda

Parallelized version of Counting Sort using CUDA

Language: C - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

pnikitakis/high-performance-computing

5 problem sets of parallel programming on CPU and GPU. University projects for High Performance Computing Systems (Fall 2016).

Language: Cuda - Size: 1.06 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

termijn/webgl-volumerendering

WebGL based implementation of 3D volume rendering

Language: JavaScript - Size: 13.4 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4

dereklstinson/hip

go bindings for hip

Language: Go - Size: 87.9 KB - Last synced at: 7 months ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

itslokesh/Multi-Max-Clique

Multi-Max-Clique, an application that solves Maximum Clique Problem using the parallel branch and bound approach and achieved linear and super-linear speedups in CUDA.

Language: Cuda - Size: 829 KB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

sivagnanamn/nvidia-gpu-stats

CUDA script to check NVIDIA GPU device properties & memory available

Language: Cuda - Size: 371 KB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 3

yafangshih/GPGPU_Programming_2016S

Perlin Noise, Poisson Image Editing implemented in CUDA. Course assignments of GPU programming at National Taiwan University.

Language: Cuda - Size: 9.59 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 3

DiamondLightSource/fast-feedback-service

GPU based service to provide fast-feedback results

Language: C++ - Size: 1010 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 3

Oabraham1/chronos

Chronos is a time-based GPU partitioning utility that allows multiple users or applications to share a single GPU by creating exclusive time-limited partitions with automatic expiration. Built with OpenCL, it works across platforms including macOS (Apple Silicon & Intel), Linux, and Windows.

Language: C++ - Size: 86.9 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

ocentra/bitnet.rs

Pure Rust engine for BitNet LLMs — Conversion, Inference, Training and Research. With streaming and GPU/CPU support

Language: Rust - Size: 2.43 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

yashkathe/Image-Noise-Reduction-with-CUDA

This project conducts an analysis of image denoising technique - median blur, comparing GPU-accelerated (Numba) and CPU-based (OpenCV) processing speeds.

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

dipta007/gpu-wait

A package to run commands when GPU resources are available

Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

predsci/multigpu-test-code

This code mimics the basic MPI+OpenACC tasks of PSI's MAS Solar MHD code, for use with testing multi-GPU multi-node clusters

Language: Fortran - Size: 36.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

Awrsha/Advanced-CUDA-Programming-GPU-Architecture

This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.

Language: Cuda - Size: 25.2 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

lawmurray/gpu-gemm

CUDA kernel for matrix-matrix multiplication on Nvidia GPUs, using a Hilbert curve to improve L2 cache utilization.

Language: Cuda - Size: 34.2 KB - Last synced at: 7 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

veera-adithya-d/Hardware-aware-algorithm

Inference module of Imagenet

Language: C++ - Size: 1.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

mina1460/GPU_programming_with_CUDA

A repo for all my projects using nVidia CUDA toolkit for programming GPGPUs

Language: C++ - Size: 349 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

arpankapoor/pycuda-vgg16

vgg16 inference implementation using tensorflow, numpy and pycuda

Language: Python - Size: 222 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

gurbaaz27/CS433A-Design-Exercises

Solutions of design exercises in CS433A: Parallel Programming, Spring Semester 2021-22

Language: C - Size: 722 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

leanerr/GPU-Programming-MN-Matrices

Write a program that initializes two M×N matrices and computes the sum of the two matrices on the GPU device. After copying the result back to the host, your program should print

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

michael-elkh/cellular_automaton-futhark-cuda-opencl

A small project to evaluate performance between Futhark, Cuda and OpenCL

Language: C - Size: 87.9 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

TennisGazelle/CUDA-CapsuleNetwork-Methods

A clean, pure C++/CUDA implementation of Capsule Networks, no cuDNN, TF, Keras, or libraries.

Language: C++ - Size: 24.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

sbreban/mandelbrot-gpu

Language: C++ - Size: 77.1 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

Mrezadwiprasetiawan/cpp-playground

A collection of C++ experiments and code created as part of exploration and practice

Language: C++ - Size: 21.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 1

shreyansh26/MLSys-Experiments

A collection of scripts on experimenting and implementing MLSys-related stuff

Language: Jupyter Notebook - Size: 83.1 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

subspecs/Cocaine

Cocaine is a multi-platform C library that can be used to accelerate large workloads/big data/anything really with the power of a GPU with ease. A .NET wrapper is available in the link below.

Language: C - Size: 1.44 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

mikemag/Mastermind

Playing all games of Mastermind quickly

Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

SeungjaeLim/CUDA.tutorial

References content from the OLCF CUDA Training Series. (https://github.com/olcf/cuda-training-series)

Language: Cuda - Size: 84 KB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

DominikLindorfer/SYCL-IntelGPU-Quickstart

Lightweight & simplified approach to SYCL development

Language: C++ - Size: 2.75 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

GMAP/GSParLib

GSParLib: A Multi-Level Programming Interface Unifying OpenCL and CUDA for Expressing Stream and Data Parallelism

Language: C++ - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

MysteryCoder456/learn_opengl

My OpenGL Journey using Rust

Language: Rust - Size: 1.3 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

kig/glslscript

GLSL as a scripting language. Asynchronous IO runtime for Vulkan compute shaders.

Language: GLSL - Size: 91.8 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

subspecs/CocaineNET

The .NET wrapper of the Cocaine C library. Cocaine is a multi-platform C library that can be used to accelerate large workloads/big data/anything really with the power of a GPU with ease.

Language: C# - Size: 74.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

DhruvSrikanth/CUDANN

A distributed implementation of a deep learning framework in CUDA.

Language: C++ - Size: 186 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

ArmanDavoodi/Parallel-Sorting

Parallel and sequential implementations of different sorting algorithms in C++ using OpenMP and CUDA

Language: C++ - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

joulook/Parallel-Processing-Spring-2021

In this repository you can find all of my projects for Parallel Processing Course when I was in 2nd semester of my master's at SUT.

Language: Java - Size: 3.27 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

MehranTaghian/CUDA-OpenMP-samples

Sample codes for parallel programming using OpenMP on CPU and CUDA on GPU

Language: Cuda - Size: 4.97 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

alexktvsky/raytracer

Raytracer implemented with CPU and GPU using CUDA

Language: C++ - Size: 2.4 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

DanieleParravicini/FastMST

Project for Advanced Algorithm and Parallel Programming course. Academic Year 2018-2019

Language: Cuda - Size: 13.6 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

mihi-r/numba_timer

A helper package to easily time Numba CUDA GPU events ⌛

Language: Python - Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

JuanCasado/CUDA_2048

Implementation of 2048 game with CUDA

Language: Cuda - Size: 94.3 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

qin-yu/julia-svm-gpu-cuda

2019 [Julia] GPU CUDAnative SVM: a stochastic decomposition implementation of support-vector machine training

Language: Cuda - Size: 20.3 MB - Last synced at: 13 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

ImperialStranger/Python-GPU_benchmark

Python-GPU_benchmark is a module that provides all informations of your Graphics Card

Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

barufa/GPU-QuickSort

Implementación del algoritmo GPU-QuickSort en Cuda.

Language: Cuda - Size: 3.64 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

gholomia/Parallax

Multi-core Programming coursework and assignments under the supervision of Prof. Mahmoud Momtazpour.

Size: 6.75 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

nmazidi/GaussianBlur-CUDA-MPI

Gaussian blurring in CUDA and MPI.

Language: C++ - Size: 60.3 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 1

debowin/gpu-parallel-recommender-system

GPGPU Parallel User-User Collaborative Filtering System in CUDA C

Language: C++ - Size: 30.8 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

estradjm/Parallel-Gaussian-Blurring

LANL Parallel Computing Summer Research Institute 2017 GPU Exercise - C implementation of Gaussian Blurring of .ppm format image

Language: C - Size: 483 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 1

bhattmansi/Implementation-of-Cholesky-Decomposition-in-GPU-using-CUDA

Parallel implementation of Cholesky Decomposition using CUDA APIs

Language: Cuda - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2

rustamzh/cuda-kmeans

A course project of Introduction to Parallel Systems and GPU programming class

Language: Cuda - Size: 76.2 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 1

ivantag13/dist-GPU-accelerated-tree-search Fork of Guillaume-Helbecque/GPU-accelerated-tree-search-Chapel

Distributed GPU-accelerated tree search: Investigating a B&B algorithm based on a MPI+X (X=OpenMP, MPI, CUDA, HIP, etc) implementation

Language: C - Size: 664 KB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0

Misteri4452y/taskflow

Smart weekly planner with auto-scheduling and Google Calendar integration

Language: Python - Size: 31.3 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

simar-rekhi/triton

LLM-assisted compiler pass generation with Triton & CUDA

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

Mantissagithub/edge_detection_gpu

GPU-accelerated Canny edge detector in CUDA C++. Parallelizes Gaussian filtering, gradient computation, non-maximum suppression, and hysteresis thresholding for real-time edge detection performance

Language: Cuda - Size: 4.49 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

nbathreya/CUDA-Signal-Processor

GPU-Accelerated Signal Processing

Language: Python - Size: 17.6 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

Young-TW/hippp

Write GPU program with RAII

Language: C++ - Size: 85.9 KB - Last synced at: 13 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

AIComputing101/reinforcement-learning-101

An opinionated, end‑to‑end tutorial project for learning Reinforcement Learning (RL) from first principles to deployment. No notebooks. Everything is an explicit, inspectable Python script you can diff, profile, containerize, and ship.

Language: Python - Size: 222 KB - Last synced at: 20 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

sudoDeVinci/skyDeVisionImager

Advanced environmental monitoring platform combining computer vision and geospatial analysis. Low-compute cloud detection, 3D terrain visualization from GeoTIFF data, multi-camera calibration, and statistical validation. scalable architecture with Flask web interface and SQLite backend.

Language: Python - Size: 20.8 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

AmanSwar/KernelLab

collection of high-performance CUDA implementations, ranging from naive to highly optimized versions.

Language: Cuda - Size: 6.68 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

DannyDoesGraphics/DARE

Danny's Awesome Rendering Engine

Language: Rust - Size: 4.48 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

cybersecurity-dev/awesome-gpu-programming

Awesome GPU Programming

Size: 11.7 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

LLAA178/LeetGPU-Guidebook

一步步通关GPU编程

Size: 76.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

elymsyr/auv_control_model

This repository implements an imitation learning pipeline for AUV control. It uses the "FossenNet" neural network to mimic an optimal NL-MPC policy and includes tools for data generation, training, and real-time C++ inference on GPUs.

Language: Jupyter Notebook - Size: 43.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

RosaStack/blackmetal

Apple's Metal, everywhere!

Language: Rust - Size: 109 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

jaredhoberock/ubu

Language: C++ - Size: 1.97 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Aelstraz/Unity-GPU-Compute

GPU Compute provides an easy way to setup & execute GPU compute shaders asynchronously in Unity. Reduces the amount of code and complexity to execute a compute shader. Create, edit and read buffers easily (buffer strides & lengths are calculated automatically). Automatically calculate optimal GPU thread group sizes for your workload. Plus more!

Language: C# - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

romitjain/learning-gpu-programming

Learnings and experimentation with GPU programming

Language: Cuda - Size: 398 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

jeffasante/metal-raymarch-rs

A basic 3D raymarcher built with Rust and Apple's Metal API. A learning project exploring SDF rendering.

Language: Rust - Size: 1020 KB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

rbga/A51-Realtime-AI-Object-Detection-with-Pyglet-Powered-UI

Real-time object detection app using YOLOv5/YOLOv8 with custom UI built from scratch using Pyglet & OpenGL. UI animations made in Adobe After Effects, rendered as GIFs, and integrated via uxElements.py. Multi-core processing enables live capture, detection, and display with low latency. Uses Open Images v7 dataset. Train mode is WIP.

Language: Python - Size: 137 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

kartavyaantani/CUDA_IMAGE_PROCESSING

A CUDA-accelerated image processing project featuring multiple GPU-based filters and enhancement techniques. Implements convolution, edge detection, Non-Local Means (NLM) denoising, K-Nearest Neighbors (KNN), and pixelization. Each operation is optimized using CUDA kernels for real-time performance on large images. The project supports command-line

Language: Jupyter Notebook - Size: 5.4 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

machineko/SwiftCU

SwiftCU is a wrapper for CUDA runtime API's (exposed as cxxCU) with extra utilities for device management, memory ops and kernel execution, along with a robust suite of tests. Repo is tested on newest (v12.5) CUDA runtime API on both Linux and Windows.

Language: Swift - Size: 613 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0