Topic: "opencl"
hashcat/hashcat
World's fastest and most advanced password recovery utility
Language: C - Size: 75.1 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 22,580 - Forks: 3,058

apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language: Python - Size: 107 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,254 - Forks: 3,577

openwall/john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Language: C - Size: 126 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 11,287 - Forks: 2,224

aidlearning/AidLearning-FrameWork
🔥🔥🔥AidLearning is a powerful AIoT development platform. It builds a Linux environment supporting a GUI, deep learning, and a visual IDE on Android...Aid now supports CPU+GPU+NPU inference with high-performance acceleration...Linux on Android or HarmonyOS
Language: Python - Size: 76.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 5,667 - Forks: 709

LWJGL/lwjgl3
LWJGL is a Java library that enables cross-platform access to popular native APIs useful in the development of graphics (OpenGL, Vulkan, bgfx), audio (OpenAL, Opus), parallel computing (OpenCL, CUDA) and XR (OpenVR, LibOVR, OpenXR) applications.
Language: Java - Size: 121 MB - Last synced at: 4 days ago - Pushed at: 10 days ago - Stars: 5,003 - Forks: 655

XiaoMi/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Language: C++ - Size: 30.3 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 4,927 - Forks: 817

arrayfire/arrayfire
ArrayFire: a general purpose GPU library.
Language: C++ - Size: 18.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,692 - Forks: 543

dotnet/Silk.NET
The high-speed OpenGL, OpenCL, OpenAL, OpenXR, GLFW, SDL, Vulkan, Assimp, WebGPU, and DirectX bindings library your mother warned you about.
Language: C# - Size: 1.34 GB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 4,496 - Forks: 428

ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
Language: C++ - Size: 21 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4,400 - Forks: 380

opentk/opentk
The Open Toolkit library is a fast, low-level C# wrapper for OpenGL, OpenAL & OpenCL. It also includes windowing, mouse, keyboard and joystick input and a robust and fast math library, giving you everything you need to write your own renderer or game engine. OpenTK can be used standalone or inside a GUI on Windows, Linux, Mac.
Language: C# - Size: 153 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 3,370 - Forks: 642

ARM-software/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Language: C++ - Size: 834 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2,958 - Forks: 792

diku-dk/futhark
💥💻💥 A data-parallel functional programming language
Language: Haskell - Size: 49.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,507 - Forks: 174

dmlc/nnvm 📦
Language: C++ - Size: 1.13 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 1,658 - Forks: 280

DTolm/VkFFT
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
Language: C++ - Size: 38 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1,612 - Forks: 104

boostorg/compute
A C++ GPU Computing Library for OpenCL
Language: C++ - Size: 8.32 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1,601 - Forks: 337

wangzhaode/mnn-llm
LLM deployment project based on MNN. This project has been merged into MNN.
Language: C++ - Size: 11.6 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 1,574 - Forks: 172

m4rs-mt/ILGPU
ILGPU JIT Compiler for high-performance .Net GPU programs
Language: C# - Size: 11.1 MB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 1,534 - Forks: 129

mratsim/Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Language: Nim - Size: 3.8 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 1,364 - Forks: 96

doonny/PipeCNN
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
Language: C - Size: 3.7 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 1,252 - Forks: 369

beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Language: Java - Size: 152 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,236 - Forks: 120

intel/compute-runtime
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Language: C++ - Size: 139 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,231 - Forks: 250

LuxCoreRender/LuxCore
LuxCore source repository
Language: C++ - Size: 152 MB - Last synced at: 4 days ago - Pushed at: 15 days ago - Stars: 1,215 - Forks: 149

fff-rs/juice
The Hacker's Machine Learning Engine
Language: Rust - Size: 37.2 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 1,119 - Forks: 75

inducer/pyopencl
OpenCL integration for Python, plus shiny features
Language: Python - Size: 5.59 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,096 - Forks: 246

CNugteren/CLBlast
Tuned OpenCL BLAS
Language: C++ - Size: 6.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1,096 - Forks: 204

uncomplicate/neanderthal
Fast Clojure Matrix Library
Language: Clojure - Size: 3.91 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,092 - Forks: 57

pocl/pocl
pocl - Portable Computing Language
Language: C - Size: 60.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 979 - Forks: 265

mirkosertic/Bytecoder
Framework to interpret and transpile JVM bytecode to JavaScript, OpenCL or WebAssembly.
Language: Java - Size: 2.21 GB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 897 - Forks: 58

e-ago/bitcracker
BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Language: C - Size: 203 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 872 - Forks: 192

keyvank/femtoGPT
Pure Rust implementation of a minimal Generative Pretrained Transformer
Language: Rust - Size: 670 KB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 867 - Forks: 60

hughperkins/coriander
Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices
Language: LLVM - Size: 7.78 MB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 859 - Forks: 88

arrayfire/arrayfire-rust
Rust wrapper for ArrayFire
Language: Rust - Size: 18.4 MB - Last synced at: about 18 hours ago - Pushed at: over 1 year ago - Stars: 827 - Forks: 59

githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
Language: Python - Size: 1010 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 825 - Forks: 183

DeadSix27/waifu2x-converter-cpp Fork of tanakamura/waifu2x-converter-cpp 📦
Improved fork of Waifu2X C++ using OpenCL and OpenCV
Language: C++ - Size: 64.4 MB - Last synced at: 1 day ago - Pushed at: about 3 years ago - Stars: 794 - Forks: 86

hughperkins/tf-coriander
OpenCL 1.2 implementation for Tensorflow
Language: C++ - Size: 91.6 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 791 - Forks: 91

ddemidov/amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Language: C++ - Size: 7.87 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 784 - Forks: 124

LuxCoreRender/BlendLuxCore
Blender Integration for LuxCore
Language: Python - Size: 341 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 782 - Forks: 92

doe300/VC4CL
OpenCL implementation running on the VideoCore IV GPU of the Raspberry Pi models
Language: C++ - Size: 1010 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 736 - Forks: 81

ddemidov/vexcl
VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Language: C++ - Size: 22.8 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 710 - Forks: 82

KhronosGroup/OpenCL-Headers
Khronos OpenCL-Headers
Language: C - Size: 800 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 705 - Forks: 251

google/clspv
Clspv is a compiler for OpenCL C to Vulkan compute shaders
Language: LLVM - Size: 10.9 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 667 - Forks: 92

inducer/loopy
A code generator for array-based code on CPUs and GPUs
Language: Python - Size: 12.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 602 - Forks: 74

gchudov/cuetools.net
CD image processing suite with optimized lossless encoders in C#
Language: C# - Size: 41.5 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 546 - Forks: 53

pypr/pysph
A framework for Smoothed Particle Hydrodynamics in Python
Language: Python - Size: 7.05 MB - Last synced at: 3 days ago - Pushed at: 23 days ago - Stars: 483 - Forks: 139

inviwo/inviwo
Inviwo - Interactive Visualization Workshop
Language: C++ - Size: 807 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 480 - Forks: 150

gfx-rs/rspirv
Rust implementation of SPIR-V module processing functionalities
Language: Rust - Size: 1.57 MB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 469 - Forks: 61

smistad/FAST
A framework for high-performance medical image processing, neural network inference and visualization
Language: C++ - Size: 19.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 465 - Forks: 106

Syncleus/aparapi
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Language: Java - Size: 68.9 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 462 - Forks: 59

ccsb-scripps/AutoDock-GPU
AutoDock for GPUs and other accelerators
Language: C++ - Size: 44.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 458 - Forks: 122

petercunha/Pine
🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Language: Python - Size: 83.4 MB - Last synced at: 2 days ago - Pushed at: almost 4 years ago - Stars: 445 - Forks: 75

libtangle/qcgpu
High Performance Tools for Quantum Computing
Language: Python - Size: 20.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 443 - Forks: 52

triSYCL/triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Language: C++ - Size: 382 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 441 - Forks: 98

Polytonic/Chlorine
Dead Simple OpenCL
Language: C++ - Size: 960 KB - Last synced at: 12 days ago - Pushed at: about 9 years ago - Stars: 430 - Forks: 24

ParRes/Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
Language: C - Size: 23.6 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 427 - Forks: 109

xmrig/xmrig-amd
Monero AMD (OpenCL) miner
Language: C++ - Size: 1.54 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 423 - Forks: 227

arrayfire/arrayfire-python
Python bindings for ArrayFire: A general purpose GPU library.
Language: Python - Size: 1.59 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 419 - Forks: 64

libocca/occa
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
Language: C++ - Size: 17.7 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 416 - Forks: 87

ekondis/mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
Language: C++ - Size: 351 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 392 - Forks: 70

kpet/clvk
Implementation of OpenCL 3.0 on Vulkan
Language: C++ - Size: 1.97 MB - Last synced at: 2 days ago - Pushed at: 20 days ago - Stars: 390 - Forks: 42

ProjectPhysX/OpenCL-Wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Language: C++ - Size: 300 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 390 - Forks: 40

vlang/vsl
V library to develop Artificial Intelligence and High-Performance Scientific Computations
Language: V - Size: 11.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 368 - Forks: 46

uncomplicate/bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Language: Clojure - Size: 1020 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 365 - Forks: 23

Xilinx/SDAccel_Examples
SDAccel Examples
Language: C++ - Size: 366 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 356 - Forks: 209

arunsivaramanneo/GPU-Viewer
A front-end to glxinfo, vulkaninfo, clinfo and es2_info - Linux
Language: Python - Size: 23.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 355 - Forks: 23

jrprice/Oclgrind
An OpenCL device simulator and debugger
Language: C++ - Size: 2.59 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 355 - Forks: 62

Oblomov/clinfo
Print all known information about all available OpenCL platforms and devices in the system
Language: C - Size: 736 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 343 - Forks: 81

UoB-HPC/BabelStream
STREAM, for lots of devices written in many programming models
Language: C++ - Size: 2.36 MB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 333 - Forks: 118

KernelTuner/kernel_tuner
Kernel Tuner
Language: Python - Size: 41 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 331 - Forks: 53

a2flo/floor
A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Language: C++ - Size: 13.6 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 328 - Forks: 22

intel/opencl-intercept-layer
Intercept Layer for Debugging and Analyzing OpenCL Applications
Language: C++ - Size: 2.49 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 328 - Forks: 83

codeplaysoftware/computecpp-sdk 📦
Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation
Language: C - Size: 1.14 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 320 - Forks: 90

AlexanderVeselov/RayTracing
Realtime GPU Path tracer based on OpenCL and OpenGL
Language: C++ - Size: 141 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 315 - Forks: 33

favreau/Sol-R Fork of cyrillefavreau/Sol-R
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
Language: C++ - Size: 22 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 306 - Forks: 12

tonyrog/cl
OpenCL binding for Erlang
Language: C - Size: 503 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 289 - Forks: 48

uncomplicate/clojurecl
ClojureCL is a Clojure library for parallel computations with OpenCL.
Language: Clojure - Size: 874 KB - Last synced at: 29 days ago - Pushed at: 12 months ago - Stars: 280 - Forks: 18

JuliaGPU/OpenCL.jl
OpenCL Julia bindings
Language: Julia - Size: 8.78 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 272 - Forks: 40

CHIP-SPV/chipStar
chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.
Language: C++ - Size: 27.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 269 - Forks: 36

artyom-beilis/pytorch_dlprim
DLPrimitives/OpenCL out of tree backend for pytorch
Language: C++ - Size: 1.36 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 263 - Forks: 17

GPUOpen-Tools/gpu_performance_api
GPU Performance API for AMD GPUs
Language: C++ - Size: 72.7 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 259 - Forks: 48

harujoh/KelpNet
Pure C# machine learning framework
Language: C# - Size: 17.3 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 244 - Forks: 29

ROCm/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
Language: Python - Size: 95 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 237 - Forks: 159

shapelets/khiva
An open-source library of algorithms to analyse time series in GPU and CPU.
Language: C++ - Size: 2.24 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 236 - Forks: 31

PRiME-project/PRiMEStereoMatch
A heterogeneous and fully parallel stereo matching algorithm for depth estimation, implementing a local adaptive support weight (ADSW) Guided Image Filter (GIF) cost aggregation stage. Developed in both C++ and OpenCL.
Language: C++ - Size: 27.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 220 - Forks: 64

bh107/bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Language: C++ - Size: 32.4 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 220 - Forks: 31

ePi5131/patch.aul
Fixes bugs in AviUtl, speeds it up, and adds features
Language: C++ - Size: 936 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 216 - Forks: 15

angeluriot/Galaxy_simulation
An n-body type simulation using GPU acceleration to simulate galaxies, galaxy collisions and expanding universes.
Language: C++ - Size: 84.7 MB - Last synced at: 28 days ago - Pushed at: 12 months ago - Stars: 201 - Forks: 20

ThoughtWorksInc/Compute.scala
Scientific computing with N-dimensional arrays
Language: Scala - Size: 3.54 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 200 - Forks: 19

spcl/hls_tutorial_examples
Examples shown as part of the tutorial "Productive parallel programming on FPGA with high-level synthesis".
Language: C++ - Size: 1.27 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 199 - Forks: 46

rsnemmen/OpenCL-examples
Simple OpenCL examples for exploiting GPU computing
Language: Objective-C++ - Size: 3.46 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 197 - Forks: 72

ProjectPhysX/OpenCL-Benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
Language: C++ - Size: 309 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 195 - Forks: 27

bernardladenthin/BitcoinAddressFinder
A high performance bitcoin address finder.
Language: C - Size: 896 KB - Last synced at: about 14 hours ago - Pushed at: 3 days ago - Stars: 194 - Forks: 53

dividiti/ck-caffe
Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Language: CMake - Size: 3.39 MB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 194 - Forks: 40

ROCm/MIVisionX
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
Language: C++ - Size: 154 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 193 - Forks: 76

sowson/darknet
Darknet on OpenCL Convolutional Neural Networks on OpenCL on Intel & NVidia & AMD & Mali GPUs for macOS & GNU/Linux & Windows & FreeBSD
Language: C - Size: 32.1 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 192 - Forks: 33

unitaryfoundation/qrack
Comprehensive, GPU accelerated framework for developing universal virtual quantum processors
Language: C++ - Size: 20.6 MB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 190 - Forks: 39

merrymercy/tvm-mali 📦
Optimizing Mobile Deep Learning on ARM GPU with TVM
Language: C - Size: 337 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 181 - Forks: 27

MegEngine/mperf
mperf is an operator performance tuning toolbox for mobile/embedded platforms
Language: C++ - Size: 794 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 180 - Forks: 32

deepakkumar1984/Amplifier.NET
Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Language: C# - Size: 3.65 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 21

yuhc/gpu-rodinia 📦
Rodinia benchmark
Language: C - Size: 33.5 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 179 - Forks: 95

CNugteren/CLTune
CLTune: An automatic OpenCL & CUDA kernel tuner
Language: C++ - Size: 1.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 177 - Forks: 36
