Topic: "opencl"
primitiv/primitiv
A Neural Network Toolkit.
Language: C++ - Size: 2.93 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 174 - Forks: 27

artyom-beilis/dlprimitives
Deep Learning Primitives and Mini-Framework for OpenCL
Language: C++ - Size: 58.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 169 - Forks: 16

preda/gpuowl
GPU Mersenne primality test.
Language: C++ - Size: 13.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 164 - Forks: 39

rust-nvml/nvml-wrapper
Safe Rust wrapper for the NVIDIA Management Library
Language: Rust - Size: 966 KB - Last synced at: 3 days ago - Pushed at: 26 days ago - Stars: 161 - Forks: 39

decred/gominer Fork of Dirbaio/gominer
Go (golang) based GPU miner for Decred.
Language: C - Size: 534 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 159 - Forks: 77

Bkmz21/CompactCNNCascade
A binary library for very fast face detection using compact CNNs.
Language: C++ - Size: 16.7 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 158 - Forks: 49

jtomori/vft 📦
:boom::snowflake::hammer: VFX Fractal Toolkit
Language: C - Size: 14.2 MB - Last synced at: 20 days ago - Pushed at: almost 6 years ago - Stars: 155 - Forks: 15

fixstars/clpy Fork of cupy/cupy
OpenCL backend for CuPy
Language: Python - Size: 14.3 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 153 - Forks: 13

githubharald/DeslantImg
The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.
Language: C++ - Size: 591 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 150 - Forks: 38

dizcza/docker-hashcat
Latest hashcat docker for CUDA, OpenCL, and POCL. Deployed on Vast.ai
Language: Dockerfile - Size: 51.8 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 148 - Forks: 42

csiro-robotics/ohm
An efficient, extensible occupancy map supporting probabilistic occupancy, normal distribution transforms in CPU and GPU.
Language: C++ - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 147 - Forks: 18

GPUOpen-ProfessionalCompute-Libraries/amdovx-core 📦
AMD OpenVX Core -- a sub-module of amdovx-modules:
Language: C++ - Size: 1.15 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 147 - Forks: 53

rAzoR8/SpvGenTwo
SpvGenTwo is a SPIR-V building and parsing library written in plain C++17 without any dependencies. No STL or other 3rd-Party library needed.
Language: C++ - Size: 1.93 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 146 - Forks: 13

TF2-Engine/TF2
An Open Source Deep Learning Inference Engine Based on FPGA
Language: Python - Size: 110 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 146 - Forks: 61

mathiasbourgoin/SPOC
Stream Processing with OCaml
Language: OCaml - Size: 83.3 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 140 - Forks: 10

libmir/dcompute
DCompute: Native execution of D on GPUs and other Accelerators
Language: D - Size: 158 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 138 - Forks: 26

SamGinzburg/VectorVisor
VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly program in parallel using GPUs
Language: WebAssembly - Size: 216 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 137 - Forks: 3

ysh329/OpenCL-101
Learn OpenCL step by step.
Language: C - Size: 476 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 135 - Forks: 29

AICAN-Research/FAST-Pathology
⚡ Open-source software for deep learning-based digital pathology
Language: C++ - Size: 81.2 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 134 - Forks: 27

thi-ng/raymarchcl
Experimental OpenCL voxel rendering/raymarching via Clojure REPL (from 2013)
Language: C - Size: 2.58 MB - Last synced at: about 2 months ago - Pushed at: almost 10 years ago - Stars: 134 - Forks: 5

AnicetNgrt/jiro-nn
A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.
Language: Rust - Size: 17.5 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 131 - Forks: 3

openwall/john-packages
Community packages of John the Ripper, the auditing tool and advanced offline password cracker (Docker images, Windows PortableApp, Mac OS, Flatpak, and Ubuntu SNAP packages)
Language: Shell - Size: 5.86 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 123 - Forks: 23

cvjena/cn24
Convolutional (Patch) Networks for Semantic Segmentation
Language: C++ - Size: 8.41 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 123 - Forks: 44

yui0/slibs
Single file libraries for C/C++
Language: C - Size: 12.9 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 121 - Forks: 11

can-lehmann/exprgrad
An experimental deep learning framework for Nim based on a differentiable array programming language
Language: Nim - Size: 303 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 121 - Forks: 1

ihhub/penguinV
Computer vision library with focus on heterogeneous systems
Language: C++ - Size: 3.89 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 119 - Forks: 90

KhronosGroup/libclcxx
OpenCL specific C++ libraries implemented in C++ for OpenCL kernel language published in releases of OpenCL-Docs
Size: 151 MB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 119 - Forks: 32

dicecco1/fpga_caffe
Language: C++ - Size: 94.6 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 119 - Forks: 51

doe300/VC4C
Compiler for the VC4CL OpenCL implementation
Language: C - Size: 25.9 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 118 - Forks: 37

PyOCL/OpenCLGA
A Python Library for Genetic Algorithm on OpenCL
Language: Python - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 117 - Forks: 32

codeplaysoftware/portDNN
portDNN is a library implementing neural network algorithms written using SYCL
Language: C++ - Size: 55.2 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 111 - Forks: 22

nickgildea/leven
Complete source for my experimental voxel engine
Language: C++ - Size: 9.02 MB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 109 - Forks: 15

nunofachada/cf4ocl 📦
C Framework for OpenCL
Language: C - Size: 3.45 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 108 - Forks: 19

WincerChan/SolVanityCL
GPU vanity address generator for Solana
Language: C - Size: 195 KB - Last synced at: about 17 hours ago - Pushed at: 12 days ago - Stars: 106 - Forks: 39

intel/compute-samples
Intel® GPU Compute Samples
Language: C++ - Size: 14.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 106 - Forks: 18

ChrisCummins/clgen
Deep learning program generator
Language: Python - Size: 8.75 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 30

adda-team/adda
ADDA - light scattering simulator based on the discrete dipole approximation
Language: C - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 102 - Forks: 59

arrayfire/arrayfire-ml
ArrayFire's Machine Learning Library.
Language: C++ - Size: 81.1 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 102 - Forks: 23

bashbaug/SimpleOpenCLSamples
Simple OpenCL Samples that Build with Khronos Headers and Libs
Language: C++ - Size: 1.7 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 101 - Forks: 25

Khanattila/KNLMeansCL
An optimized OpenCL implementation of the Non-local means de-noising algorithm
Language: C++ - Size: 926 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 99 - Forks: 20

ashvardanian/ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
Language: C++ - Size: 17.4 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 97 - Forks: 9

ghostlander/nsgminer
NeoScrypt OpenCL GPU Miner
Language: C - Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 97 - Forks: 68

PlasmaPower/nano-vanity
A NANO vanity address generator (supports OpenCL)
Language: C - Size: 171 KB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 93 - Forks: 32

jslee02/awesome-gpgpu
:sunglasses: A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Size: 25.4 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 93 - Forks: 9

sukhmeetbawa/OpenCL-AMD-Fedora 📦
AMD OpenCL userspace drivers for Fedora. Currently not working for fedora 37
Language: Shell - Size: 653 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 92 - Forks: 9

evanmiller/ProjCL
GPU and vector-enabled map projections, geodesic calculations, and image warping 🌎🌍🌏
Language: C - Size: 197 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 91 - Forks: 11

etaler/Etaler
A flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Language: C++ - Size: 73.8 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 14

GPUOpen-Tools/radeon_compute_profiler
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.
Language: C++ - Size: 1.05 MB - Last synced at: 20 days ago - Pushed at: almost 5 years ago - Stars: 87 - Forks: 19

Cibiv/NextGenMap
NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime. This allows analysing large scale datasets even with increased SNP rates or higher error rates (e.g. caused by specialized experimental protocols) and avoids biases caused by highly variable regions in the genome.
Language: C++ - Size: 113 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 87 - Forks: 8

tugrul512bit/Cekirdekler
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Language: C# - Size: 10.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 86 - Forks: 9

inducer/pycparserext
Extensions for Eli Bendersky's pycparser
Language: Python - Size: 127 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 85 - Forks: 30

OpenCL/GEGL-OpenCL
Gimp-GEGL is the first official OpenCL Porting Project of
Language: C - Size: 75.7 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 85 - Forks: 31

fff-rs/coaster 📦
Extendable HPC-Framework for CUDA, OpenCL and common CPU
Language: Rust - Size: 2.97 MB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 85 - Forks: 7

thinkoco/c5soc_opencl
DE1SOC DE10-NANO DE10-Standard OpenCL hardware that support VGA and desktop. And Some applications such as usb camera YUYV to RGB , Sobel and so on.
Language: Verilog - Size: 30.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 84 - Forks: 39

OCL-dev/ocl-icd
OpenCL ICD Loader (free software)
Language: C - Size: 571 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 83 - Forks: 23

ROCm/rocALUTION
Next generation library for iterative sparse solvers for ROCm platform
Language: C++ - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 81 - Forks: 42

rindow/rindow-neuralnetworks
Neural networks library for machine learning on PHP
Language: PHP - Size: 782 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 81 - Forks: 12

CosmicFusion/fedora-amdgpu-pro 📦
A repository that provides the proprietary driver for fedora without having to deal with hassle of getting RHEL repo to work , and it has 32 bit libraries
Language: Shell - Size: 295 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 81 - Forks: 12

ghostop14/gr-clenabled
OpenCL/GPU-enabled common blocks for GNURadio
Language: C++ - Size: 6.13 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 81 - Forks: 16

villekf/OMEGA
Open-source multi-dimensional tomographic reconstruction software (OMEGA)
Language: MATLAB - Size: 77.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 80 - Forks: 18

harskish/fluctus
An interactive OpenCL wavefront path tracer
Language: C++ - Size: 90.7 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 78 - Forks: 16

utwente-fmt/vercors
The VerCors verification toolset for verifying parallel and concurrent software
Language: Scala - Size: 541 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 76 - Forks: 32

jonysy/parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Language: Rust - Size: 843 KB - Last synced at: 10 days ago - Pushed at: almost 7 years ago - Stars: 76 - Forks: 4

YaccConstructor/Brahma.FSharp Fork of gsvgit/Brahma.FSharp
F# quotation to OpenCL translator and respective runtime to utilize GPGPUs in F# applications.
Language: F# - Size: 52.1 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 75 - Forks: 17

byt3n33dl3/PasswordCracker
A Survival Knife (Fantastic) Force Attacks, Incorporating Teeth Cybertooth && John the Ripper, most Advanced Password and Logon Cracker.
Language: C - Size: 42.3 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 75 - Forks: 9

michel-meneses/great-opencl-examples
Collection of easy, well-documented and useful OpenCL examples in C++.
Language: C++ - Size: 1000 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 27

hipacc/hipacc
A domain-specific language and compiler for image processing
Language: C++ - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 75 - Forks: 12

elftausend/custos
A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.
Language: Rust - Size: 3.27 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 73 - Forks: 9

gallickgunner/Yune
GPU based framework for writing Raytracers/Pathtracers. (Pronounced as "Yu-nay")
Language: C++ - Size: 10.8 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 73 - Forks: 6

ianhuang-777/guetzli-cuda-opencl Fork of google/guetzli
Perceptual JPEG encoder, optimized with CUDA&OpenCL, full JPEG format support.
Language: C++ - Size: 271 MB - Last synced at: about 11 hours ago - Pushed at: over 6 years ago - Stars: 73 - Forks: 16

OAID/MXNet-HRT
Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure framework to speed up Deep Learning on Arm-based heterogeneous embedded platform. It also retains all the features of the original MXNet architecture which users deploy their applications seamlessly.
Language: C++ - Size: 26.9 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 72 - Forks: 30

pypr/compyle
Execute a subset of Python on HPC platforms
Language: Python - Size: 615 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 69 - Forks: 14

ngirot/BruteForce
A simple brute forcer written in GO for SHA1, SHA256, SHA512, MD5 and bcrypt
Language: Go - Size: 141 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 69 - Forks: 17

lepoco/CUDAfy.NET 📦
CUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Language: C# - Size: 3.46 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 69 - Forks: 14

ksachdeva/opencv-mtcnn
An implementation of MTCNN Face detector using OpenCV's DNN module
Language: C++ - Size: 2.1 MB - Last synced at: 19 days ago - Pushed at: almost 5 years ago - Stars: 69 - Forks: 22

gcp/Leela
Leela - a Go program combining Monte Carlo simulations and Neural Networks.
Language: C++ - Size: 250 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 68 - Forks: 17

srohit0/trafficVision
MIVisionX toolkit is a comprehensive computer vision and machine intelligence libraries, utilities and applications bundled into a single toolkit.
Language: C++ - Size: 15.2 MB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 67 - Forks: 22

inducer/boxtree
Quad/octree building for FMMs in Python and OpenCL
Language: Python - Size: 1.99 MB - Last synced at: 3 days ago - Pushed at: 21 days ago - Stars: 65 - Forks: 20

unisa-hpc/sycl-bench
SYCL Benchmark Suite
Language: C++ - Size: 24.7 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 64 - Forks: 34

ROCm/rpp
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
Language: C++ - Size: 119 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 62 - Forks: 43

varunnagpaal/Digital-Hardware-Modelling
Digital Hardware Modelling using VHDL, Verilog, SystemVerilog, SystemC, HLS(C++, OpenCL)
Language: VHDL - Size: 45.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 62 - Forks: 13

JuliaGPU/CLArrays.jl 📦
OpenCL-backed GPU Arrays
Language: Julia - Size: 65.4 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 61 - Forks: 12

shibatch/rectdetect
Realtime rectangle detector with GPGPU
Language: C - Size: 60.5 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 61 - Forks: 20

mz24cn/clnet
OpenCL for Nets - A Deep Learning Framework based on OpenCL, written by C++. Supports popular MLP, RNN(LSTM), CNN(ResNet). Friendly debugger. Transparent data. No library dependencies. 基于OpenCL的深度学习计算框架,C++开发,支持多层感知器,长短时记忆模型,卷积神经网络,残差网络。调试方便,数据透明。无外部依赖。
Language: C - Size: 979 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 61 - Forks: 13

cjmcv/hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Language: C++ - Size: 2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 60 - Forks: 8

HiPerCoRe/KTT
Kernel Tuning Toolkit
Language: C++ - Size: 286 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 59 - Forks: 12

Photosounder/rouziclib
This is my personal library of code that is common to my different projects (Photosounder, SplineEQ, Spiral and others)
Language: C - Size: 7.07 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 59 - Forks: 9

xdeyyan/Tron-Profanity
🚀波场TRX靓号生成器,代码开源,利用 gpu 进行加速,安全可靠--TRON-TRX account generator, open source code, using GPU for acceleration, safe and reliable
Language: C - Size: 9.98 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 57 - Forks: 28

rbaygildin/learn-gpgpu
Algorithms implemented in CUDA + resources about GPGPU
Language: Cuda - Size: 226 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 56 - Forks: 14

internaut/mastersthesis-mobile-gpgpu
Prototypes for GPGPU on Android, using OpenCL, OpenGL ES 2.0 shaders, or RenderScript.
Language: C - Size: 27.7 MB - Last synced at: about 2 years ago - Pushed at: over 10 years ago - Stars: 56 - Forks: 23

ProteusMRIgHIFU/BabelViscoFDTD
Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends
Language: Jupyter Notebook - Size: 115 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 55 - Forks: 10

geggo/gpyfft
python wrapper for the OpenCL FFT library clFFT
Language: Python - Size: 1.09 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 54 - Forks: 21

rg2/xreg
Library and executables for modeling and registration applications in medical image analysis. Particular emphasis on intraoperative fluoroscopic (X-ray) navigation via 2D/3D registration.
Language: C++ - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 13

NUCAR-DEV/Hetero-Mark
A Benchmark Suite for Heterogeneous System Computation
Language: Jupyter Notebook - Size: 184 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 53 - Forks: 15

tirumalnaidu/opencl-hls-cnn-accelerator
OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.
Language: C - Size: 49.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 53 - Forks: 10

ctuning/ctuning-programs
Collective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Language: C - Size: 10.9 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 12

bashbaug/OpenCLPapers
A Collection of Articles and other OpenCL Papers
Size: 90.8 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 52 - Forks: 8

Par4All/par4all
Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran sequential programs
Language: C - Size: 671 MB - Last synced at: about 1 month ago - Pushed at: about 10 years ago - Stars: 52 - Forks: 11

ProjectPhysX/PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
Language: C++ - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 50 - Forks: 6

s-ol/gpWFC
openCL-accelerated python implementation of the Wave Function Collapse procgen algorithm
Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 50 - Forks: 1
