An open API service providing repository metadata for many open source software ecosystems.

Topic: "opencl"

hashcat/hashcat

World's fastest and most advanced password recovery utility

Language: C - Size: 75.1 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 22,580 - Forks: 3,058

apache/tvm

Open deep learning compiler stack for CPUs, GPUs and specialized accelerators

Language: Python - Size: 107 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,254 - Forks: 3,577

openwall/john

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

Language: C - Size: 126 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 11,287 - Forks: 2,224

aidlearning/AidLearning-FrameWork

🔥🔥🔥 AidLearning is a powerful AIoT development platform. It builds a Linux environment on Android with GUI, deep learning, and visual IDE support. Aid now supports CPU+GPU+NPU inference with high-performance acceleration. Linux on Android or HarmonyOS.

Language: Python - Size: 76.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 5,667 - Forks: 709

LWJGL/lwjgl3

LWJGL is a Java library that enables cross-platform access to popular native APIs useful in the development of graphics (OpenGL, Vulkan, bgfx), audio (OpenAL, Opus), parallel computing (OpenCL, CUDA) and XR (OpenVR, LibOVR, OpenXR) applications.

Language: Java - Size: 121 MB - Last synced at: 4 days ago - Pushed at: 10 days ago - Stars: 5,003 - Forks: 655

XiaoMi/mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

Language: C++ - Size: 30.3 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 4,927 - Forks: 817

arrayfire/arrayfire

ArrayFire: a general purpose GPU library.

Language: C++ - Size: 18.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,692 - Forks: 543

dotnet/Silk.NET

The high-speed OpenGL, OpenCL, OpenAL, OpenXR, GLFW, SDL, Vulkan, Assimp, WebGPU, and DirectX bindings library your mother warned you about.

Language: C# - Size: 1.34 GB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 4,496 - Forks: 428

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

Language: C++ - Size: 21 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4,400 - Forks: 380

opentk/opentk

The Open Toolkit library is a fast, low-level C# wrapper for OpenGL, OpenAL & OpenCL. It also includes windowing, mouse, keyboard and joystick input and a robust and fast math library, giving you everything you need to write your own renderer or game engine. OpenTK can be used standalone or inside a GUI on Windows, Linux, Mac.

Language: C# - Size: 153 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 3,370 - Forks: 642

ARM-software/ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

Language: C++ - Size: 834 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2,958 - Forks: 792

diku-dk/futhark

💥💻💥 A data-parallel functional programming language

Language: Haskell - Size: 49.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,507 - Forks: 174

dmlc/nnvm 📦

Language: C++ - Size: 1.13 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 1,658 - Forks: 280

DTolm/VkFFT

Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

Language: C++ - Size: 38 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1,612 - Forks: 104

boostorg/compute

A C++ GPU Computing Library for OpenCL

Language: C++ - Size: 8.32 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1,601 - Forks: 337

wangzhaode/mnn-llm

LLM deployment project based on MNN. This project has been merged into MNN.

Language: C++ - Size: 11.6 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 1,574 - Forks: 172

m4rs-mt/ILGPU

ILGPU JIT Compiler for high-performance .NET GPU programs

Language: C# - Size: 11.1 MB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 1,534 - Forks: 129

mratsim/Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends

Language: Nim - Size: 3.8 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 1,364 - Forks: 96

doonny/PipeCNN

An OpenCL-based FPGA Accelerator for Convolutional Neural Networks

Language: C - Size: 3.7 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 1,252 - Forks: 369

beehive-lab/TornadoVM

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

Language: Java - Size: 152 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,236 - Forks: 120

intel/compute-runtime

Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

Language: C++ - Size: 139 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,231 - Forks: 250

LuxCoreRender/LuxCore

LuxCore source repository

Language: C++ - Size: 152 MB - Last synced at: 4 days ago - Pushed at: 15 days ago - Stars: 1,215 - Forks: 149

fff-rs/juice

The Hacker's Machine Learning Engine

Language: Rust - Size: 37.2 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 1,119 - Forks: 75

inducer/pyopencl

OpenCL integration for Python, plus shiny features

Language: Python - Size: 5.59 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,096 - Forks: 246

CNugteren/CLBlast

Tuned OpenCL BLAS

Language: C++ - Size: 6.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1,096 - Forks: 204

uncomplicate/neanderthal

Fast Clojure Matrix Library

Language: Clojure - Size: 3.91 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,092 - Forks: 57

pocl/pocl

pocl - Portable Computing Language

Language: C - Size: 60.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 979 - Forks: 265

mirkosertic/Bytecoder

Framework to interpret and transpile JVM bytecode to JavaScript, OpenCL or WebAssembly.

Language: Java - Size: 2.21 GB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 897 - Forks: 58

e-ago/bitcracker

BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker

Language: C - Size: 203 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 872 - Forks: 192

keyvank/femtoGPT

Pure Rust implementation of a minimal Generative Pretrained Transformer

Language: Rust - Size: 670 KB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 867 - Forks: 60

hughperkins/coriander

Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices

Language: LLVM - Size: 7.78 MB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 859 - Forks: 88

arrayfire/arrayfire-rust

Rust wrapper for ArrayFire

Language: Rust - Size: 18.4 MB - Last synced at: about 18 hours ago - Pushed at: over 1 year ago - Stars: 827 - Forks: 59

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Language: Python - Size: 1010 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 825 - Forks: 183

DeadSix27/waifu2x-converter-cpp Fork of tanakamura/waifu2x-converter-cpp 📦

Improved fork of Waifu2X C++ using OpenCL and OpenCV

Language: C++ - Size: 64.4 MB - Last synced at: 1 day ago - Pushed at: about 3 years ago - Stars: 794 - Forks: 86

hughperkins/tf-coriander

OpenCL 1.2 implementation for Tensorflow

Language: C++ - Size: 91.6 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 791 - Forks: 91

ddemidov/amgcl

C++ library for solving large sparse linear systems with algebraic multigrid method

Language: C++ - Size: 7.87 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 784 - Forks: 124

LuxCoreRender/BlendLuxCore

Blender Integration for LuxCore

Language: Python - Size: 341 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 782 - Forks: 92

doe300/VC4CL

OpenCL implementation running on the VideoCore IV GPU of the Raspberry Pi models

Language: C++ - Size: 1010 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 736 - Forks: 81

ddemidov/vexcl

VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP

Language: C++ - Size: 22.8 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 710 - Forks: 82

KhronosGroup/OpenCL-Headers

Khronos OpenCL-Headers

Language: C - Size: 800 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 705 - Forks: 251

google/clspv

Clspv is a compiler for OpenCL C to Vulkan compute shaders

Language: LLVM - Size: 10.9 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 667 - Forks: 92

inducer/loopy

A code generator for array-based code on CPUs and GPUs

Language: Python - Size: 12.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 602 - Forks: 74

gchudov/cuetools.net

CD image processing suite with optimized lossless encoders in C#

Language: C# - Size: 41.5 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 546 - Forks: 53

pypr/pysph

A framework for Smoothed Particle Hydrodynamics in Python

Language: Python - Size: 7.05 MB - Last synced at: 3 days ago - Pushed at: 23 days ago - Stars: 483 - Forks: 139

inviwo/inviwo

Inviwo - Interactive Visualization Workshop

Language: C++ - Size: 807 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 480 - Forks: 150

gfx-rs/rspirv

Rust implementation of SPIR-V module processing functionalities

Language: Rust - Size: 1.57 MB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 469 - Forks: 61

smistad/FAST

A framework for high-performance medical image processing, neural network inference and visualization

Language: C++ - Size: 19.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 465 - Forks: 106

Syncleus/aparapi

The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.

Language: Java - Size: 68.9 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 462 - Forks: 59

ccsb-scripps/AutoDock-GPU

AutoDock for GPUs and other accelerators

Language: C++ - Size: 44.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 458 - Forks: 122

petercunha/Pine

🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.

Language: Python - Size: 83.4 MB - Last synced at: 2 days ago - Pushed at: almost 4 years ago - Stars: 445 - Forks: 75

libtangle/qcgpu

High Performance Tools for Quantum Computing

Language: Python - Size: 20.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 443 - Forks: 52

triSYCL/triSYCL

Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group

Language: C++ - Size: 382 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 441 - Forks: 98

Polytonic/Chlorine

Dead Simple OpenCL

Language: C++ - Size: 960 KB - Last synced at: 12 days ago - Pushed at: about 9 years ago - Stars: 430 - Forks: 24

ParRes/Kernels

This is a set of simple programs that can be used to explore the features of a parallel platform.

Language: C - Size: 23.6 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 427 - Forks: 109

xmrig/xmrig-amd

Monero AMD (OpenCL) miner

Language: C++ - Size: 1.54 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 423 - Forks: 227

arrayfire/arrayfire-python

Python bindings for ArrayFire: A general purpose GPU library.

Language: Python - Size: 1.59 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 419 - Forks: 64

libocca/occa

Portable and vendor neutral framework for parallel programming on heterogeneous platforms.

Language: C++ - Size: 17.7 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 416 - Forks: 87

ekondis/mixbench

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

Language: C++ - Size: 351 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 392 - Forks: 70

kpet/clvk

Implementation of OpenCL 3.0 on Vulkan

Language: C++ - Size: 1.97 MB - Last synced at: 2 days ago - Pushed at: 20 days ago - Stars: 390 - Forks: 42

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 300 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 390 - Forks: 40

vlang/vsl

V library to develop Artificial Intelligence and High-Performance Scientific Computations

Language: V - Size: 11.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 368 - Forks: 46

uncomplicate/bayadera

High-performance Bayesian Data Analysis on the GPU in Clojure

Language: Clojure - Size: 1020 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 365 - Forks: 23

Xilinx/SDAccel_Examples

SDAccel Examples

Language: C++ - Size: 366 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 356 - Forks: 209

arunsivaramanneo/GPU-Viewer

A front-end to glxinfo, vulkaninfo, clinfo and es2_info - Linux

Language: Python - Size: 23.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 355 - Forks: 23

jrprice/Oclgrind

An OpenCL device simulator and debugger

Language: C++ - Size: 2.59 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 355 - Forks: 62

Oblomov/clinfo

Print all known information about all available OpenCL platforms and devices in the system

Language: C - Size: 736 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 343 - Forks: 81

UoB-HPC/BabelStream

STREAM, for lots of devices written in many programming models

Language: C++ - Size: 2.36 MB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 333 - Forks: 118

KernelTuner/kernel_tuner

Kernel Tuner

Language: Python - Size: 41 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 331 - Forks: 53

a2flo/floor

A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.

Language: C++ - Size: 13.6 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 328 - Forks: 22

intel/opencl-intercept-layer

Intercept Layer for Debugging and Analyzing OpenCL Applications

Language: C++ - Size: 2.49 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 328 - Forks: 83

codeplaysoftware/computecpp-sdk 📦

Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation

Language: C - Size: 1.14 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 320 - Forks: 90

AlexanderVeselov/RayTracing

Realtime GPU Path tracer based on OpenCL and OpenGL

Language: C++ - Size: 141 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 315 - Forks: 33

favreau/Sol-R Fork of cyrillefavreau/Sol-R

Open-Source CUDA/OpenCL Speed Of Light Ray-tracer

Language: C++ - Size: 22 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 306 - Forks: 12

tonyrog/cl

OpenCL binding for Erlang

Language: C - Size: 503 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 289 - Forks: 48

uncomplicate/clojurecl

ClojureCL is a Clojure library for parallel computations with OpenCL.

Language: Clojure - Size: 874 KB - Last synced at: 29 days ago - Pushed at: 12 months ago - Stars: 280 - Forks: 18

JuliaGPU/OpenCL.jl

OpenCL Julia bindings

Language: Julia - Size: 8.78 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 272 - Forks: 40

CHIP-SPV/chipStar

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

Language: C++ - Size: 27.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 269 - Forks: 36

artyom-beilis/pytorch_dlprim

DLPrimitives/OpenCL out of tree backend for pytorch

Language: C++ - Size: 1.36 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 263 - Forks: 17

GPUOpen-Tools/gpu_performance_api

GPU Performance API for AMD GPUs

Language: C++ - Size: 72.7 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 259 - Forks: 48

harujoh/KelpNet

Pure C# machine learning framework

Language: C# - Size: 17.3 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 244 - Forks: 29

ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Language: Python - Size: 95 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 237 - Forks: 159

shapelets/khiva

An open-source library of algorithms to analyse time series on GPU and CPU.

Language: C++ - Size: 2.24 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 236 - Forks: 31

PRiME-project/PRiMEStereoMatch

A heterogeneous and fully parallel stereo matching algorithm for depth estimation, implementing a local adaptive support weight (ADSW) Guided Image Filter (GIF) cost aggregation stage. Developed in both C++ and OpenCL.

Language: C++ - Size: 27.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 220 - Forks: 64

bh107/bohrium

Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX

Language: C++ - Size: 32.4 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 220 - Forks: 31

ePi5131/patch.aul

Fixes bugs in AviUtl, speeds it up, and adds features

Language: C++ - Size: 936 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 216 - Forks: 15

angeluriot/Galaxy_simulation

An n-body type simulation using GPU acceleration to simulate galaxies, galaxy collisions and expanding universes.

Language: C++ - Size: 84.7 MB - Last synced at: 28 days ago - Pushed at: 12 months ago - Stars: 201 - Forks: 20

ThoughtWorksInc/Compute.scala

Scientific computing with N-dimensional arrays

Language: Scala - Size: 3.54 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 200 - Forks: 19

spcl/hls_tutorial_examples

Examples shown as part of the tutorial "Productive parallel programming on FPGA with high-level synthesis".

Language: C++ - Size: 1.27 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 199 - Forks: 46

rsnemmen/OpenCL-examples

Simple OpenCL examples for exploiting GPU computing

Language: Objective-C++ - Size: 3.46 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 197 - Forks: 72

ProjectPhysX/OpenCL-Benchmark

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Language: C++ - Size: 309 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 195 - Forks: 27

bernardladenthin/BitcoinAddressFinder

A high performance bitcoin address finder.

Language: C - Size: 896 KB - Last synced at: about 14 hours ago - Pushed at: 3 days ago - Stars: 194 - Forks: 53

dividiti/ck-caffe

Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs).

Language: CMake - Size: 3.39 MB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 194 - Forks: 40

ROCm/MIVisionX

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

Language: C++ - Size: 154 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 193 - Forks: 76

sowson/darknet

Darknet on OpenCL: convolutional neural networks on OpenCL for Intel, NVIDIA, AMD, and Mali GPUs on macOS, GNU/Linux, Windows, and FreeBSD

Language: C - Size: 32.1 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 192 - Forks: 33

unitaryfoundation/qrack

Comprehensive, GPU accelerated framework for developing universal virtual quantum processors

Language: C++ - Size: 20.6 MB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 190 - Forks: 39

merrymercy/tvm-mali 📦

Optimizing Mobile Deep Learning on ARM GPU with TVM

Language: C - Size: 337 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 181 - Forks: 27

MegEngine/mperf

mperf is an operator performance tuning toolbox for mobile and embedded platforms

Language: C++ - Size: 794 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 180 - Forks: 32

deepakkumar1984/Amplifier.NET

Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.

Language: C# - Size: 3.65 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 21

yuhc/gpu-rodinia 📦

Rodinia benchmark

Language: C - Size: 33.5 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 179 - Forks: 95

CNugteren/CLTune

CLTune: An automatic OpenCL & CUDA kernel tuner

Language: C++ - Size: 1.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 177 - Forks: 36