An open API service providing repository metadata for many open source software ecosystems.

Topic: "opencl"

primitiv/primitiv

A Neural Network Toolkit.

Language: C++ - Size: 2.93 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 174 - Forks: 27

artyom-beilis/dlprimitives

Deep Learning Primitives and Mini-Framework for OpenCL

Language: C++ - Size: 58.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 169 - Forks: 16

preda/gpuowl

GPU Mersenne primality test.

Language: C++ - Size: 13.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 164 - Forks: 39

rust-nvml/nvml-wrapper

Safe Rust wrapper for the NVIDIA Management Library

Language: Rust - Size: 966 KB - Last synced at: 3 days ago - Pushed at: 26 days ago - Stars: 161 - Forks: 39

decred/gominer Fork of Dirbaio/gominer

Go (golang) based GPU miner for Decred.

Language: C - Size: 534 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 159 - Forks: 77

Bkmz21/CompactCNNCascade

A binary library for very fast face detection using compact CNNs.

Language: C++ - Size: 16.7 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 158 - Forks: 49

jtomori/vft 📦

💥❄️🔨 VFX Fractal Toolkit

Language: C - Size: 14.2 MB - Last synced at: 20 days ago - Pushed at: almost 6 years ago - Stars: 155 - Forks: 15

fixstars/clpy Fork of cupy/cupy

OpenCL backend for CuPy

Language: Python - Size: 14.3 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 153 - Forks: 13

githubharald/DeslantImg

The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.

Language: C++ - Size: 591 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 150 - Forks: 38

dizcza/docker-hashcat

Latest hashcat docker for CUDA, OpenCL, and POCL. Deployed on Vast.ai

Language: Dockerfile - Size: 51.8 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 148 - Forks: 42

csiro-robotics/ohm

An efficient, extensible occupancy map supporting probabilistic occupancy and normal distribution transforms on CPU and GPU.

Language: C++ - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 147 - Forks: 18

GPUOpen-ProfessionalCompute-Libraries/amdovx-core 📦

AMD OpenVX Core -- a sub-module of amdovx-modules:

Language: C++ - Size: 1.15 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 147 - Forks: 53

rAzoR8/SpvGenTwo

SpvGenTwo is a SPIR-V building and parsing library written in plain C++17 without any dependencies. No STL or other 3rd-Party library needed.

Language: C++ - Size: 1.93 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 146 - Forks: 13

TF2-Engine/TF2

An Open Source Deep Learning Inference Engine Based on FPGA

Language: Python - Size: 110 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 146 - Forks: 61

mathiasbourgoin/SPOC

Stream Processing with OCaml

Language: OCaml - Size: 83.3 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 140 - Forks: 10

libmir/dcompute

DCompute: Native execution of D on GPUs and other Accelerators

Language: D - Size: 158 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 138 - Forks: 26

SamGinzburg/VectorVisor

VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly program in parallel using GPUs

Language: WebAssembly - Size: 216 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 137 - Forks: 3

ysh329/OpenCL-101

Learn OpenCL step by step.

Language: C - Size: 476 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 135 - Forks: 29

AICAN-Research/FAST-Pathology

⚡ Open-source software for deep learning-based digital pathology

Language: C++ - Size: 81.2 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 134 - Forks: 27

thi-ng/raymarchcl

Experimental OpenCL voxel rendering/raymarching via Clojure REPL (from 2013)

Language: C - Size: 2.58 MB - Last synced at: about 2 months ago - Pushed at: almost 10 years ago - Stars: 134 - Forks: 5

AnicetNgrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

Language: Rust - Size: 17.5 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 131 - Forks: 3

openwall/john-packages

Community packages of John the Ripper, the auditing tool and advanced offline password cracker (Docker images, Windows PortableApp, Mac OS, Flatpak, and Ubuntu SNAP packages)

Language: Shell - Size: 5.86 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 123 - Forks: 23

cvjena/cn24

Convolutional (Patch) Networks for Semantic Segmentation

Language: C++ - Size: 8.41 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 123 - Forks: 44

yui0/slibs

Single file libraries for C/C++

Language: C - Size: 12.9 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 121 - Forks: 11

can-lehmann/exprgrad

An experimental deep learning framework for Nim based on a differentiable array programming language

Language: Nim - Size: 303 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 121 - Forks: 1

ihhub/penguinV

Computer vision library with focus on heterogeneous systems

Language: C++ - Size: 3.89 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 119 - Forks: 90

KhronosGroup/libclcxx

OpenCL-specific C++ libraries implemented in the C++ for OpenCL kernel language, as published in releases of OpenCL-Docs

Size: 151 MB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 119 - Forks: 32

dicecco1/fpga_caffe

Language: C++ - Size: 94.6 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 119 - Forks: 51

doe300/VC4C

Compiler for the VC4CL OpenCL implementation

Language: C - Size: 25.9 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 118 - Forks: 37

PyOCL/OpenCLGA

A Python Library for Genetic Algorithm on OpenCL

Language: Python - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 117 - Forks: 32

codeplaysoftware/portDNN

portDNN is a library implementing neural network algorithms written using SYCL

Language: C++ - Size: 55.2 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 111 - Forks: 22

nickgildea/leven

Complete source for my experimental voxel engine

Language: C++ - Size: 9.02 MB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 109 - Forks: 15

nunofachada/cf4ocl 📦

C Framework for OpenCL

Language: C - Size: 3.45 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 108 - Forks: 19

WincerChan/SolVanityCL

GPU vanity address generator for Solana

Language: C - Size: 195 KB - Last synced at: about 17 hours ago - Pushed at: 12 days ago - Stars: 106 - Forks: 39

intel/compute-samples

Intel® GPU Compute Samples

Language: C++ - Size: 14.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 106 - Forks: 18

ChrisCummins/clgen

Deep learning program generator

Language: Python - Size: 8.75 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 30

adda-team/adda

ADDA - light scattering simulator based on the discrete dipole approximation

Language: C - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 102 - Forks: 59

arrayfire/arrayfire-ml

ArrayFire's Machine Learning Library.

Language: C++ - Size: 81.1 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 102 - Forks: 23

bashbaug/SimpleOpenCLSamples

Simple OpenCL Samples that Build with Khronos Headers and Libs

Language: C++ - Size: 1.7 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 101 - Forks: 25

Khanattila/KNLMeansCL

An optimized OpenCL implementation of the Non-local means de-noising algorithm

Language: C++ - Size: 926 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 99 - Forks: 20

ashvardanian/ParallelReductionsBenchmark

Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!

Language: C++ - Size: 17.4 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 97 - Forks: 9

ghostlander/nsgminer

NeoScrypt OpenCL GPU Miner

Language: C - Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 97 - Forks: 68

PlasmaPower/nano-vanity

A NANO vanity address generator (supports OpenCL)

Language: C - Size: 171 KB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 93 - Forks: 32

jslee02/awesome-gpgpu

😎 A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources

Size: 25.4 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 93 - Forks: 9

sukhmeetbawa/OpenCL-AMD-Fedora 📦

AMD OpenCL userspace drivers for Fedora. Currently not working on Fedora 37.

Language: Shell - Size: 653 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 92 - Forks: 9

evanmiller/ProjCL

GPU and vector-enabled map projections, geodesic calculations, and image warping 🌎🌍🌏

Language: C - Size: 197 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 91 - Forks: 11

etaler/Etaler

A flexible HTM (Hierarchical Temporal Memory) framework with full GPU support.

Language: C++ - Size: 73.8 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 14

GPUOpen-Tools/radeon_compute_profiler

The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.

Language: C++ - Size: 1.05 MB - Last synced at: 20 days ago - Pushed at: almost 5 years ago - Stars: 87 - Forks: 19

Cibiv/NextGenMap

NextGenMap is a flexible, highly sensitive short-read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime. This allows analysing large-scale datasets even with increased SNP rates or higher error rates (e.g. caused by specialized experimental protocols) and avoids biases caused by highly variable regions in the genome.

Language: C++ - Size: 113 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 87 - Forks: 8

tugrul512bit/Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated quickly while using the same kernel on all devices (for simplicity).

Language: C# - Size: 10.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 86 - Forks: 9

inducer/pycparserext

Extensions for Eli Bendersky's pycparser

Language: Python - Size: 127 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 85 - Forks: 30

OpenCL/GEGL-OpenCL

Gimp-GEGL is the first official OpenCL Porting Project of

Language: C - Size: 75.7 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 85 - Forks: 31

fff-rs/coaster 📦

Extendable HPC-Framework for CUDA, OpenCL and common CPU

Language: Rust - Size: 2.97 MB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 85 - Forks: 7

thinkoco/c5soc_opencl

DE1SOC, DE10-NANO, and DE10-Standard OpenCL hardware that supports VGA and desktop, along with some applications such as USB camera YUYV-to-RGB conversion, Sobel, and so on.

Language: Verilog - Size: 30.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 84 - Forks: 39

OCL-dev/ocl-icd

OpenCL ICD Loader (free software)

Language: C - Size: 571 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 83 - Forks: 23

ROCm/rocALUTION

Next generation library for iterative sparse solvers for ROCm platform

Language: C++ - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 81 - Forks: 42

rindow/rindow-neuralnetworks

Neural networks library for machine learning on PHP

Language: PHP - Size: 782 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 81 - Forks: 12

CosmicFusion/fedora-amdgpu-pro 📦

A repository that provides the proprietary driver for Fedora without the hassle of getting the RHEL repo to work, and it includes 32-bit libraries.

Language: Shell - Size: 295 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 81 - Forks: 12

ghostop14/gr-clenabled

OpenCL/GPU-enabled common blocks for GNURadio

Language: C++ - Size: 6.13 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 81 - Forks: 16

villekf/OMEGA

Open-source multi-dimensional tomographic reconstruction software (OMEGA)

Language: MATLAB - Size: 77.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 80 - Forks: 18

harskish/fluctus

An interactive OpenCL wavefront path tracer

Language: C++ - Size: 90.7 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 78 - Forks: 16

utwente-fmt/vercors

The VerCors verification toolset for verifying parallel and concurrent software

Language: Scala - Size: 541 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 76 - Forks: 32

jonysy/parenchyma

An extensible HPC framework for CUDA, OpenCL and native CPU.

Language: Rust - Size: 843 KB - Last synced at: 10 days ago - Pushed at: almost 7 years ago - Stars: 76 - Forks: 4

YaccConstructor/Brahma.FSharp Fork of gsvgit/Brahma.FSharp

F# quotation to OpenCL translator and respective runtime to utilize GPGPUs in F# applications.

Language: F# - Size: 52.1 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 75 - Forks: 17

byt3n33dl3/PasswordCracker

A Survival Knife (Fantastic) Force Attacks, Incorporating Teeth Cybertooth && John the Ripper, most Advanced Password and Logon Cracker.

Language: C - Size: 42.3 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 75 - Forks: 9

michel-meneses/great-opencl-examples

Collection of easy, well-documented and useful OpenCL examples in C++.

Language: C++ - Size: 1000 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 27

hipacc/hipacc

A domain-specific language and compiler for image processing

Language: C++ - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 75 - Forks: 12

elftausend/custos

A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.

Language: Rust - Size: 3.27 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 73 - Forks: 9

gallickgunner/Yune

GPU based framework for writing Raytracers/Pathtracers. (Pronounced as "Yu-nay")

Language: C++ - Size: 10.8 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 73 - Forks: 6

ianhuang-777/guetzli-cuda-opencl Fork of google/guetzli

Perceptual JPEG encoder, optimized with CUDA and OpenCL, with full JPEG format support.

Language: C++ - Size: 271 MB - Last synced at: about 11 hours ago - Pushed at: over 6 years ago - Stars: 73 - Forks: 16

OAID/MXNet-HRT

Heterogeneous Run Time version of MXNet. It adds heterogeneous capabilities to MXNet and uses a heterogeneous computing infrastructure framework to speed up deep learning on Arm-based heterogeneous embedded platforms. It also retains all the features of the original MXNet architecture, so users can deploy their applications seamlessly.

Language: C++ - Size: 26.9 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 72 - Forks: 30

pypr/compyle

Execute a subset of Python on HPC platforms

Language: Python - Size: 615 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 69 - Forks: 14

ngirot/BruteForce

A simple brute forcer written in Go for SHA1, SHA256, SHA512, MD5 and bcrypt

Language: Go - Size: 141 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 69 - Forks: 17

lepoco/CUDAfy.NET 📦

CUDAfy .NET allows easy development of high-performance GPGPU applications entirely from .NET. It is developed in C#.

Language: C# - Size: 3.46 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 69 - Forks: 14

ksachdeva/opencv-mtcnn

An implementation of MTCNN Face detector using OpenCV's DNN module

Language: C++ - Size: 2.1 MB - Last synced at: 19 days ago - Pushed at: almost 5 years ago - Stars: 69 - Forks: 22

gcp/Leela

Leela - a Go program combining Monte Carlo simulations and Neural Networks.

Language: C++ - Size: 250 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 68 - Forks: 17

srohit0/trafficVision

MIVisionX toolkit is a comprehensive set of computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit.

Language: C++ - Size: 15.2 MB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 67 - Forks: 22

inducer/boxtree

Quad/octree building for FMMs in Python and OpenCL

Language: Python - Size: 1.99 MB - Last synced at: 3 days ago - Pushed at: 21 days ago - Stars: 65 - Forks: 20

unisa-hpc/sycl-bench

SYCL Benchmark Suite

Language: C++ - Size: 24.7 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 64 - Forks: 34

ROCm/rpp

AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

Language: C++ - Size: 119 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 62 - Forks: 43

varunnagpaal/Digital-Hardware-Modelling

Digital Hardware Modelling using VHDL, Verilog, SystemVerilog, SystemC, HLS(C++, OpenCL)

Language: VHDL - Size: 45.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 62 - Forks: 13

JuliaGPU/CLArrays.jl 📦

OpenCL-backed GPU Arrays

Language: Julia - Size: 65.4 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 61 - Forks: 12

shibatch/rectdetect

Realtime rectangle detector with GPGPU

Language: C - Size: 60.5 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 61 - Forks: 20

mz24cn/clnet

OpenCL for Nets - A Deep Learning Framework based on OpenCL, written in C++. Supports popular MLP, RNN (LSTM), and CNN (ResNet) models. Friendly debugger. Transparent data. No library dependencies.

Language: C - Size: 979 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 61 - Forks: 13

cjmcv/hpc

Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)

Language: C++ - Size: 2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 60 - Forks: 8

HiPerCoRe/KTT

Kernel Tuning Toolkit

Language: C++ - Size: 286 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 59 - Forks: 12

Photosounder/rouziclib

This is my personal library of code that is common to my different projects (Photosounder, SplineEQ, Spiral and others)

Language: C - Size: 7.07 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 59 - Forks: 9

xdeyyan/Tron-Profanity

🚀 TRON TRX vanity account generator: open source code, using GPU for acceleration, safe and reliable

Language: C - Size: 9.98 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 57 - Forks: 28

rbaygildin/learn-gpgpu

Algorithms implemented in CUDA + resources about GPGPU

Language: Cuda - Size: 226 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 56 - Forks: 14

internaut/mastersthesis-mobile-gpgpu

Prototypes for GPGPU on Android, using OpenCL, OpenGL ES 2.0 shaders, or RenderScript.

Language: C - Size: 27.7 MB - Last synced at: about 2 years ago - Pushed at: over 10 years ago - Stars: 56 - Forks: 23

ProteusMRIgHIFU/BabelViscoFDTD

Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends

Language: Jupyter Notebook - Size: 115 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 55 - Forks: 10

geggo/gpyfft

Python wrapper for the OpenCL FFT library clFFT

Language: Python - Size: 1.09 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 54 - Forks: 21

rg2/xreg

Library and executables for modeling and registration applications in medical image analysis. Particular emphasis on intraoperative fluoroscopic (X-ray) navigation via 2D/3D registration.

Language: C++ - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 13

NUCAR-DEV/Hetero-Mark

A Benchmark Suite for Heterogeneous System Computation

Language: Jupyter Notebook - Size: 184 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 53 - Forks: 15

tirumalnaidu/opencl-hls-cnn-accelerator

OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.

Language: C - Size: 49.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 53 - Forks: 10

ctuning/ctuning-programs

Collective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:

Language: C - Size: 10.9 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 12

bashbaug/OpenCLPapers

A Collection of Articles and other OpenCL Papers

Size: 90.8 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 52 - Forks: 8

Par4All/par4all

Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran sequential programs

Language: C - Size: 671 MB - Last synced at: about 1 month ago - Pushed at: about 10 years ago - Stars: 52 - Forks: 11

ProjectPhysX/PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

Language: C++ - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 50 - Forks: 6

s-ol/gpWFC

OpenCL-accelerated Python implementation of the Wave Function Collapse procgen algorithm

Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 50 - Forks: 1