An open API service providing repository metadata for many open source software ecosystems.

Topic: "gpu-computing"

catboost/catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

Language: C++ - Size: 1.51 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,665 - Forks: 1,247

gyroflow/gyroflow

Video stabilization using gyroscope data

Language: Rust - Size: 83.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7,933 - Forks: 366

google/tf-quant-finance

High-performance TensorFlow library for quantitative finance.

Language: Python - Size: 16.9 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 5,030 - Forks: 647

NVIDIA/thrust 📦

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

Language: C++ - Size: 17 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 4,984 - Forks: 763

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

Language: C++ - Size: 21.4 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 4,759 - Forks: 428

tensorflow/lingvo

Lingvo

Language: Python - Size: 142 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 2,851 - Forks: 452

microsoft/pai 📦

Resource scheduling and cluster management for AI

Language: JavaScript - Size: 70.5 MB - Last synced at: 23 days ago - Pushed at: over 1 year ago - Stars: 2,677 - Forks: 549

KomputeProject/kompute

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

Language: C++ - Size: 25.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,344 - Forks: 177

jbush001/NyuziProcessor

GPGPU microprocessor architecture

Language: C - Size: 31.4 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 2,082 - Forks: 360

NVIDIA/cccl

CUDA Core Compute Libraries

Language: C++ - Size: 295 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,022 - Forks: 289

inducer/pycuda

CUDA integration for Python, plus shiny features

Language: Python - Size: 2.87 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 2,001 - Forks: 297

SciML/SciMLBook

Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)

Language: HTML - Size: 128 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 1,943 - Forks: 357

chelsea0x3b/dfdx

Deep learning in Rust, with shape checked tensors and neural networks

Language: Rust - Size: 2.6 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1,859 - Forks: 105

mikbry/awesome-webgpu

😎 Curated list of awesome things around WebGPU ecosystem.

Size: 126 KB - Last synced at: about 13 hours ago - Pushed at: 12 days ago - Stars: 1,762 - Forks: 76

AdaptiveCpp/AdaptiveCpp

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

Language: C++ - Size: 14.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,731 - Forks: 201

software-mansion/TypeGPU

A modular and open-ended toolkit for WebGPU, with advanced type inference and the ability to write shaders in TypeScript

Language: TypeScript - Size: 261 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 1,708 - Forks: 36

BindsNET/bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

Language: Python - Size: 61.5 MB - Last synced at: 9 days ago - Pushed at: 12 days ago - Stars: 1,636 - Forks: 341

calebwin/emu

The write-once-run-anywhere GPGPU library for Rust

Language: Rust - Size: 342 MB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 1,609 - Forks: 52

mratsim/Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

Language: Nim - Size: 3.81 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1,380 - Forks: 95

NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Language: C++ - Size: 21.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,359 - Forks: 108

beehive-lab/TornadoVM

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

Language: Java - Size: 160 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,344 - Forks: 124

LuxCoreRender/LuxCore

LuxCore source repository

Language: C++ - Size: 156 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,262 - Forks: 156

stotko/stdgpu

stdgpu: Efficient STL-like Data Structures on the GPU

Language: C++ - Size: 5.01 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,234 - Forks: 91

uncomplicate/neanderthal

Fast Clojure Matrix Library

Language: Clojure - Size: 3.96 MB - Last synced at: 21 days ago - Pushed at: 26 days ago - Stars: 1,111 - Forks: 58

AccelerateHS/accelerate

Embedded language for high-performance array computations

Language: Haskell - Size: 15.4 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 940 - Forks: 130

eyalroz/cuda-api-wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

Language: C++ - Size: 2.88 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 860 - Forks: 84

LuxCoreRender/BlendLuxCore

Blender Integration for LuxCore

Language: Python - Size: 341 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 819 - Forks: 99

Langhalsdino/Kubernetes-GPU-Guide

This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.

Language: Shell - Size: 431 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 818 - Forks: 114

zszazi/Deep-learning-in-cloud

List of Deep Learning Cloud Providers

Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 784 - Forks: 94

ComputationalRadiationPhysics/picongpu

Performance-Portable Particle-in-Cell Simulations for the Exascale Era :sparkles:

Language: C++ - Size: 59.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 762 - Forks: 225

iot-salzburg/gpu-jupyter

GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 743 - Forks: 237

googlefonts/compute-shader-101

Sample code for compute shader 101 training

Language: Rust - Size: 284 KB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 596 - Forks: 35

huiscliu/Tutorials

Parallel programming tutorials

Language: C - Size: 55 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 560 - Forks: 191

ginkgo-project/ginkgo

Numerical linear algebra software package

Language: C++ - Size: 158 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 523 - Forks: 99

AmesingFlank/taichi.js

Modern GPU Compute and Rendering in Javascript

Language: TypeScript - Size: 220 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 515 - Forks: 20

FAST-Imaging/FAST

A framework for high-performance medical image processing, neural network inference and visualization

Language: C++ - Size: 20.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 488 - Forks: 108

ccsb-scripps/AutoDock-GPU

AutoDock for GPUs and other accelerators

Language: C++ - Size: 44.4 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 479 - Forks: 123

JuliaGPU/KernelAbstractions.jl

Heterogeneous programming in Julia

Language: Julia - Size: 4.73 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 466 - Forks: 80

tumaer/JAXFLUIDS

Differentiable Fluid Dynamics Package

Language: Python - Size: 12.6 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 466 - Forks: 85

triSYCL/triSYCL

Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group

Language: C++ - Size: 382 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 443 - Forks: 98

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 405 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 442 - Forks: 43

kpet/clvk

Implementation of OpenCL 3.0 on Vulkan

Language: C++ - Size: 1.74 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 413 - Forks: 46

RRZE-HPC/gpu-benches

collection of benchmarks to measure basic GPU capabilities

Language: C++ - Size: 1.78 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 386 - Forks: 55

KernelTuner/kernel_tuner

Kernel Tuner

Language: Python - Size: 41.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 372 - Forks: 59

andrewmilson/ministark

🏃‍♂️💨 GPU accelerated STARK prover built on @arkworks-rs

Language: Rust - Size: 1.65 MB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 365 - Forks: 36

uncomplicate/bayadera

High-performance Bayesian Data Analysis on the GPU in Clojure

Language: Clojure - Size: 1020 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 365 - Forks: 23

Zydak/Vulkan-Path-Tracer

Vulkan Path Tracer. Physically based path tracer made in Vulkan with Ray Tracing Pipeline.

Language: C++ - Size: 462 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 361 - Forks: 12

Glavnokoman/vuh

Vulkan compute for people

Language: C++ - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 340 - Forks: 34

gpufit/Gpufit

GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

Language: Cuda - Size: 1.14 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 332 - Forks: 99

brandondube/prysm

physical optics: integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing...

Language: Python - Size: 12.2 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 315 - Forks: 53

favreau/Sol-R Fork of cyrillefavreau/Sol-R

Open-Source CUDA/OpenCL Speed Of Light Ray-tracer

Language: C++ - Size: 22 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 306 - Forks: 12

fastflow/fastflow

FastFlow pattern-based parallel programming framework (formerly on sourceforge)

Language: C++ - Size: 178 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 296 - Forks: 72

baggepinnen/MonteCarloMeasurements.jl

Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.

Language: Julia - Size: 5.25 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 285 - Forks: 18

uncomplicate/clojurecl

ClojureCL is a Clojure library for parallel computations with OpenCL.

Language: Clojure - Size: 910 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 280 - Forks: 18

CodedK/CUDA-by-Example-source-code-for-the-book-s-examples-

CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through working examples.

Language: C - Size: 1.07 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 272 - Forks: 108

mfem/PyMFEM

Python wrapper for MFEM

Language: SWIG - Size: 26 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 262 - Forks: 63

ProjectPhysX/OpenCL-Benchmark

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Language: C++ - Size: 294 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 259 - Forks: 34

ROCm/Tensile

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Language: Python - Size: 98.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 254 - Forks: 166

CaNS-World/CaNS

A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows

Language: Fortran - Size: 1.13 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 253 - Forks: 85

niessner/Opt

Opt DSL

Language: Terra - Size: 22.8 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 252 - Forks: 68

denosaurs/netsaur

Powerful Powerful Machine Learning library with GPU, CPU and WASM backends

Language: Rust - Size: 146 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 250 - Forks: 5

mikeroyal/GPU-Guide

Graphics Processing Unit (GPU) Architecture Guide

Language: Shell - Size: 815 KB - Last synced at: about 6 hours ago - Pushed at: almost 4 years ago - Stars: 248 - Forks: 20

cdeterman/gpuR

R interface to use GPU's

Language: R - Size: 12 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 244 - Forks: 26

BasBuller/PySNN

Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration

Language: Python - Size: 12.8 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 225 - Forks: 27

rsnemmen/OpenCL-examples

Simple OpenCL examples for exploiting GPU computing

Language: Objective-C++ - Size: 3.46 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 213 - Forks: 73

penn-graphics-research/claymore

Language: Cuda - Size: 30.7 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 209 - Forks: 31

shiinamiyuki/akari_render

High Performance CPU/GPU Physically Based Renderer in Rust

Language: Rust - Size: 150 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 208 - Forks: 10

lnstadrum/beatmup

Beatmup: image and signal processing library

Language: C++ - Size: 11.8 MB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 205 - Forks: 15

preda/gpuowl

GPU Mersenne primality test.

Language: C++ - Size: 13.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 197 - Forks: 49

uncomplicate/clojurecuda

Clojure library for CUDA development

Language: Clojure - Size: 563 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 191 - Forks: 10

zeam-vm/pelemay

Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.

Language: Elixir - Size: 410 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 189 - Forks: 13

NumPower/numpower

PHP extension for efficient scientific computing and array manipulation with GPU support

Language: PHP - Size: 526 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 172 - Forks: 4

EMI-Group/evomo

EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)

Language: Python - Size: 1000 KB - Last synced at: about 15 hours ago - Pushed at: about 1 month ago - Stars: 171 - Forks: 21

nixonyh/GPUClothSimulationInUnity 📦

Trying to replicate what this legend did: https://youtu.be/kCGHXlLR3l8

Language: C# - Size: 201 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 170 - Forks: 16

artyom-beilis/dlprimitives

Deep Learning Primitives and Mini-Framework for OpenCL

Language: C++ - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 169 - Forks: 16

AccelerateHS/accelerate-llvm

LLVM backend for Accelerate

Language: Haskell - Size: 3.95 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 167 - Forks: 60

Ricks-Lab/gpu-utils

A set of utilities for monitoring and customizing GPU performance

Language: Python - Size: 3.98 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 24

exospherehost/exospherehost

Infra for scalable and reliable AI agents

Language: Python - Size: 34.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 155 - Forks: 38

SamGinzburg/VectorVisor

VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly program in parallel using GPUs

Language: WebAssembly - Size: 216 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 150 - Forks: 4

lachlan2k/phatcrack

Modern web-based distributed hashcracking solution, built on hashcat

Language: Go - Size: 11.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 146 - Forks: 12

GooFit/GooFit

Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP

Language: Cuda - Size: 98 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 141 - Forks: 41

houkensjtu/taichi-fluid

A collection of CFD related resources for Taichi developers.

Size: 5.84 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 139 - Forks: 6

ComputationalRadiationPhysics/cuda_memtest

Fork of CUDA GPU memtest :eyeglasses:

Language: C++ - Size: 275 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 134 - Forks: 32

AnicetNgrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

Language: Rust - Size: 17.5 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 133 - Forks: 3

IntelPython/dpctl

Python SYCL bindings and SYCL-based Python Array API library

Language: C++ - Size: 223 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 117 - Forks: 31

PyOCL/OpenCLGA

A Python Library for Genetic Algorithm on OpenCL

Language: Python - Size: 17.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 117 - Forks: 32

tensordiffeq/TensorDiffEq

Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of Tensorflow for multi-worker distributed computing

Language: Python - Size: 1.28 MB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 116 - Forks: 43

ROCm/hipBLASLt

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Language: Assembly - Size: 1.59 GB - Last synced at: about 21 hours ago - Pushed at: about 23 hours ago - Stars: 114 - Forks: 146

barbagroup/PetIBM

PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures

Language: C++ - Size: 14.9 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 111 - Forks: 52

ashvardanian/ParallelReductionsBenchmark

Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!

Language: C++ - Size: 17.4 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 109 - Forks: 10

wmmae/wmma_extension

An extension library of WMMA API (Tensor Core API)

Language: Cuda - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 16

radiantone/entangle

A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.

Language: Python - Size: 2.33 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 104 - Forks: 7

slai-labs/get-beam

Run GPU inference and training jobs on serverless infrastructure that scales with you.

Language: Shell - Size: 5.96 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 23

DeepMLNet/DeepNet

Deep.Net machine learning framework for F#

Language: F# - Size: 230 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 102 - Forks: 9

Heteroflow/Heteroflow

Concurrent CPU-GPU Programming using Task Models

Language: C++ - Size: 1.58 MB - Last synced at: 8 months ago - Pushed at: almost 6 years ago - Stars: 101 - Forks: 13

getlilac/lilac

Lilac is an open-source tool that ensures your data scientists always have enough gpus for their work. We seamlessly connect compute from any source, on-prem or cloud.

Language: TypeScript - Size: 43.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 100 - Forks: 11

RedBlight/RaytrAMP

Shooting and bouncing rays method for radar cross-section calculations, accelerated with BVH algorithm running on GPU (C++ AMP).

Language: C++ - Size: 51 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 98 - Forks: 31

etaler/Etaler

A flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.

Language: C++ - Size: 73.8 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 95 - Forks: 15

larsgeb/m1-gpu-cpp

Metal Shading Language on Apple M1's GPU for scientific C++.

Language: C++ - Size: 10.9 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 91 - Forks: 18

coldfunction/qCUDA

qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization

Language: C - Size: 89.9 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 91 - Forks: 31