Topic: "gpu-computing"
catboost/catboost
A fast, scalable, high-performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Language: C++ - Size: 1.51 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,665 - Forks: 1,247
gyroflow/gyroflow
Video stabilization using gyroscope data
Language: Rust - Size: 83.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7,933 - Forks: 366
google/tf-quant-finance
High-performance TensorFlow library for quantitative finance.
Language: Python - Size: 16.9 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 5,030 - Forks: 647
NVIDIA/thrust 📦
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Language: C++ - Size: 17 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 4,984 - Forks: 763
ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
Language: C++ - Size: 21.4 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 4,759 - Forks: 428
tensorflow/lingvo
Lingvo
Language: Python - Size: 142 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 2,851 - Forks: 452
microsoft/pai 📦
Resource scheduling and cluster management for AI
Language: JavaScript - Size: 70.5 MB - Last synced at: 23 days ago - Pushed at: over 1 year ago - Stars: 2,677 - Forks: 549
KomputeProject/kompute
General-purpose GPU compute framework built on Vulkan to support 1000s of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing use cases. Backed by the Linux Foundation.
Language: C++ - Size: 25.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,344 - Forks: 177
jbush001/NyuziProcessor
GPGPU microprocessor architecture
Language: C - Size: 31.4 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 2,082 - Forks: 360
NVIDIA/cccl
CUDA Core Compute Libraries
Language: C++ - Size: 295 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,022 - Forks: 289
inducer/pycuda
CUDA integration for Python, plus shiny features
Language: Python - Size: 2.87 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 2,001 - Forks: 297
SciML/SciMLBook
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
Language: HTML - Size: 128 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 1,943 - Forks: 357
chelsea0x3b/dfdx
Deep learning in Rust, with shape checked tensors and neural networks
Language: Rust - Size: 2.6 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1,859 - Forks: 105
mikbry/awesome-webgpu
😎 Curated list of awesome things around WebGPU ecosystem.
Size: 126 KB - Last synced at: about 13 hours ago - Pushed at: 12 days ago - Stars: 1,762 - Forks: 76
AdaptiveCpp/AdaptiveCpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Language: C++ - Size: 14.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,731 - Forks: 201
software-mansion/TypeGPU
A modular and open-ended toolkit for WebGPU, with advanced type inference and the ability to write shaders in TypeScript
Language: TypeScript - Size: 261 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 1,708 - Forks: 36
BindsNET/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
Language: Python - Size: 61.5 MB - Last synced at: 9 days ago - Pushed at: 12 days ago - Stars: 1,636 - Forks: 341
calebwin/emu
The write-once-run-anywhere GPGPU library for Rust
Language: Rust - Size: 342 MB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 1,609 - Forks: 52
mratsim/Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Language: Nim - Size: 3.81 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1,380 - Forks: 95
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Language: C++ - Size: 21.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,359 - Forks: 108
beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Language: Java - Size: 160 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,344 - Forks: 124
LuxCoreRender/LuxCore
LuxCore source repository
Language: C++ - Size: 156 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1,262 - Forks: 156
stotko/stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Language: C++ - Size: 5.01 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,234 - Forks: 91
uncomplicate/neanderthal
Fast Clojure Matrix Library
Language: Clojure - Size: 3.96 MB - Last synced at: 21 days ago - Pushed at: 26 days ago - Stars: 1,111 - Forks: 58
AccelerateHS/accelerate
Embedded language for high-performance array computations
Language: Haskell - Size: 15.4 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 940 - Forks: 130
eyalroz/cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
Language: C++ - Size: 2.88 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 860 - Forks: 84
LuxCoreRender/BlendLuxCore
Blender Integration for LuxCore
Language: Python - Size: 341 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 819 - Forks: 99
Langhalsdino/Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists to easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
Language: Shell - Size: 431 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 818 - Forks: 114
zszazi/Deep-learning-in-cloud
List of Deep Learning Cloud Providers
Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 784 - Forks: 94
ComputationalRadiationPhysics/picongpu
Performance-Portable Particle-in-Cell Simulations for the Exascale Era ✨
Language: C++ - Size: 59.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 762 - Forks: 225
iot-salzburg/gpu-jupyter
GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 743 - Forks: 237
googlefonts/compute-shader-101
Sample code for compute shader 101 training
Language: Rust - Size: 284 KB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 596 - Forks: 35
huiscliu/Tutorials
Parallel programming tutorials
Language: C - Size: 55 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 560 - Forks: 191
ginkgo-project/ginkgo
Numerical linear algebra software package
Language: C++ - Size: 158 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 523 - Forks: 99
AmesingFlank/taichi.js
Modern GPU Compute and Rendering in JavaScript
Language: TypeScript - Size: 220 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 515 - Forks: 20
FAST-Imaging/FAST
A framework for high-performance medical image processing, neural network inference and visualization
Language: C++ - Size: 20.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 488 - Forks: 108
ccsb-scripps/AutoDock-GPU
AutoDock for GPUs and other accelerators
Language: C++ - Size: 44.4 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 479 - Forks: 123
JuliaGPU/KernelAbstractions.jl
Heterogeneous programming in Julia
Language: Julia - Size: 4.73 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 466 - Forks: 80
tumaer/JAXFLUIDS
Differentiable Fluid Dynamics Package
Language: Python - Size: 12.6 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 466 - Forks: 85
triSYCL/triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Language: C++ - Size: 382 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 443 - Forks: 98
ProjectPhysX/OpenCL-Wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Language: C++ - Size: 405 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 442 - Forks: 43
kpet/clvk
Implementation of OpenCL 3.0 on Vulkan
Language: C++ - Size: 1.74 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 413 - Forks: 46
RRZE-HPC/gpu-benches
collection of benchmarks to measure basic GPU capabilities
Language: C++ - Size: 1.78 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 386 - Forks: 55
KernelTuner/kernel_tuner
Kernel Tuner
Language: Python - Size: 41.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 372 - Forks: 59
andrewmilson/ministark
🏃♂️💨 GPU accelerated STARK prover built on @arkworks-rs
Language: Rust - Size: 1.65 MB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 365 - Forks: 36
uncomplicate/bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Language: Clojure - Size: 1020 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 365 - Forks: 23
Zydak/Vulkan-Path-Tracer
Vulkan Path Tracer. Physically based path tracer made in Vulkan with Ray Tracing Pipeline.
Language: C++ - Size: 462 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 361 - Forks: 12
Glavnokoman/vuh
Vulkan compute for people
Language: C++ - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 340 - Forks: 34
gpufit/Gpufit
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Language: Cuda - Size: 1.14 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 332 - Forks: 99
brandondube/prysm
physical optics: integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing...
Language: Python - Size: 12.2 MB - Last synced at: 24 days ago - Pushed at: 11 months ago - Stars: 315 - Forks: 53
favreau/Sol-R (fork of cyrillefavreau/Sol-R)
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
Language: C++ - Size: 22 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 306 - Forks: 12
fastflow/fastflow
FastFlow pattern-based parallel programming framework (formerly on sourceforge)
Language: C++ - Size: 178 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 296 - Forks: 72
baggepinnen/MonteCarloMeasurements.jl
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Language: Julia - Size: 5.25 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 285 - Forks: 18
uncomplicate/clojurecl
ClojureCL is a Clojure library for parallel computations with OpenCL.
Language: Clojure - Size: 910 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 280 - Forks: 18
CodedK/CUDA-by-Example-source-code-for-the-book-s-examples-
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through working examples.
Language: C - Size: 1.07 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 272 - Forks: 108
mfem/PyMFEM
Python wrapper for MFEM
Language: SWIG - Size: 26 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 262 - Forks: 63
ProjectPhysX/OpenCL-Benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
Language: C++ - Size: 294 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 259 - Forks: 34
ROCm/Tensile
[DEPRECATED] Moved to ROCm/rocm-libraries repo
Language: Python - Size: 98.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 254 - Forks: 166
CaNS-World/CaNS
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
Language: Fortran - Size: 1.13 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 253 - Forks: 85
niessner/Opt
Opt DSL
Language: Terra - Size: 22.8 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 252 - Forks: 68
denosaurs/netsaur
Powerful Machine Learning library with GPU, CPU and WASM backends
Language: Rust - Size: 146 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 250 - Forks: 5
mikeroyal/GPU-Guide
Graphics Processing Unit (GPU) Architecture Guide
Language: Shell - Size: 815 KB - Last synced at: about 6 hours ago - Pushed at: almost 4 years ago - Stars: 248 - Forks: 20
cdeterman/gpuR
R interface to use GPUs
Language: R - Size: 12 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 244 - Forks: 26
BasBuller/PySNN
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Language: Python - Size: 12.8 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 225 - Forks: 27
rsnemmen/OpenCL-examples
Simple OpenCL examples for exploiting GPU computing
Language: Objective-C++ - Size: 3.46 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 213 - Forks: 73
penn-graphics-research/claymore
Language: Cuda - Size: 30.7 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 209 - Forks: 31
shiinamiyuki/akari_render
High Performance CPU/GPU Physically Based Renderer in Rust
Language: Rust - Size: 150 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 208 - Forks: 10
lnstadrum/beatmup
Beatmup: image and signal processing library
Language: C++ - Size: 11.8 MB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 205 - Forks: 15
preda/gpuowl
GPU Mersenne primality test.
Language: C++ - Size: 13.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 197 - Forks: 49
uncomplicate/clojurecuda
Clojure library for CUDA development
Language: Clojure - Size: 563 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 191 - Forks: 10
zeam-vm/pelemay
Pelemay is a native compiler for Elixir, which generates SIMD instructions. GPU code generation is planned.
Language: Elixir - Size: 410 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 189 - Forks: 13
NumPower/numpower
PHP extension for efficient scientific computing and array manipulation with GPU support
Language: PHP - Size: 526 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 172 - Forks: 4
EMI-Group/evomo
EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)
Language: Python - Size: 1000 KB - Last synced at: about 15 hours ago - Pushed at: about 1 month ago - Stars: 171 - Forks: 21
nixonyh/GPUClothSimulationInUnity 📦
Trying to replicate what this legend did: https://youtu.be/kCGHXlLR3l8
Language: C# - Size: 201 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 170 - Forks: 16
artyom-beilis/dlprimitives
Deep Learning Primitives and Mini-Framework for OpenCL
Language: C++ - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 169 - Forks: 16
AccelerateHS/accelerate-llvm
LLVM backend for Accelerate
Language: Haskell - Size: 3.95 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 167 - Forks: 60
Ricks-Lab/gpu-utils
A set of utilities for monitoring and customizing GPU performance
Language: Python - Size: 3.98 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 24
exospherehost/exospherehost
Infra for scalable and reliable AI agents
Language: Python - Size: 34.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 155 - Forks: 38
SamGinzburg/VectorVisor
VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly program in parallel using GPUs
Language: WebAssembly - Size: 216 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 150 - Forks: 4
lachlan2k/phatcrack
Modern web-based distributed hashcracking solution, built on hashcat
Language: Go - Size: 11.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 146 - Forks: 12
GooFit/GooFit
Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
Language: Cuda - Size: 98 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 141 - Forks: 41
houkensjtu/taichi-fluid
A collection of CFD related resources for Taichi developers.
Size: 5.84 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 139 - Forks: 6
ComputationalRadiationPhysics/cuda_memtest
Fork of CUDA GPU memtest 👓
Language: C++ - Size: 275 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 134 - Forks: 32
AnicetNgrt/jiro-nn
A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.
Language: Rust - Size: 17.5 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 133 - Forks: 3
IntelPython/dpctl
Python SYCL bindings and SYCL-based Python Array API library
Language: C++ - Size: 223 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 117 - Forks: 31
PyOCL/OpenCLGA
A Python Library for Genetic Algorithm on OpenCL
Language: Python - Size: 17.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 117 - Forks: 32
tensordiffeq/TensorDiffEq
Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of Tensorflow for multi-worker distributed computing
Language: Python - Size: 1.28 MB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 116 - Forks: 43
ROCm/hipBLASLt
[DEPRECATED] Moved to ROCm/rocm-libraries repo
Language: Assembly - Size: 1.59 GB - Last synced at: about 21 hours ago - Pushed at: about 23 hours ago - Stars: 114 - Forks: 146
barbagroup/PetIBM
PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
Language: C++ - Size: 14.9 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 111 - Forks: 52
ashvardanian/ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
Language: C++ - Size: 17.4 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 109 - Forks: 10
wmmae/wmma_extension
An extension library of WMMA API (Tensor Core API)
Language: Cuda - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 16
radiantone/entangle
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
Language: Python - Size: 2.33 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 104 - Forks: 7
slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
Language: Shell - Size: 5.96 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 23
DeepMLNet/DeepNet
Deep.Net machine learning framework for F#
Language: F# - Size: 230 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 102 - Forks: 9
Heteroflow/Heteroflow
Concurrent CPU-GPU Programming using Task Models
Language: C++ - Size: 1.58 MB - Last synced at: 8 months ago - Pushed at: almost 6 years ago - Stars: 101 - Forks: 13
getlilac/lilac
Lilac is an open-source tool that ensures your data scientists always have enough gpus for their work. We seamlessly connect compute from any source, on-prem or cloud.
Language: TypeScript - Size: 43.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 100 - Forks: 11
RedBlight/RaytrAMP
Shooting and bouncing rays method for radar cross-section calculations, accelerated with BVH algorithm running on GPU (C++ AMP).
Language: C++ - Size: 51 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 98 - Forks: 31
etaler/Etaler
A flexible HTM (Hierarchical Temporal Memory) framework with full GPU support.
Language: C++ - Size: 73.8 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 95 - Forks: 15
larsgeb/m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
Language: C++ - Size: 10.9 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 91 - Forks: 18
coldfunction/qCUDA
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
Language: C - Size: 89.9 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 91 - Forks: 31