Topic: "gpu-acceleration"
tensorflow/tfjs
A WebGL accelerated JavaScript library for training and deploying ML models.
Language: TypeScript - Size: 165 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 18,768 - Forks: 1,967

NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language: C++ - Size: 130 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 11,497 - Forks: 2,182

tensorflow/tfjs-core 📦
WebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.
Language: TypeScript - Size: 362 MB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 8,481 - Forks: 949

raphamorim/rio
A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.
Language: Rust - Size: 260 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,992 - Forks: 175

cornellius-gp/gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
Language: Python - Size: 29.3 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 3,689 - Forks: 564

NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Language: Python - Size: 69 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 2,981 - Forks: 710

Hedgehog-Computing/hedgehog-lab
Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.
Language: TypeScript - Size: 28.3 MB - Last synced at: 12 days ago - Pushed at: 12 months ago - Stars: 2,369 - Forks: 140

BlazingDB/blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Language: C++ - Size: 41.4 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1,959 - Forks: 184

TianZerL/Anime4KCPP
A high performance anime upscaler
Language: C++ - Size: 7.44 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 1,864 - Forks: 145

coreylowman/dfdx
Deep learning in Rust, with shape checked tensors and neural networks
Language: Rust - Size: 2.6 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 1,799 - Forks: 107

emacs-ng/emacs-ng
A new approach to Emacs - Including TypeScript, Threading, Async I/O, and WebRender.
Language: Emacs Lisp - Size: 416 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 1,742 - Forks: 73

calebwin/emu
The write-once-run-anywhere GPGPU library for Rust
Language: Rust - Size: 342 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1,604 - Forks: 52

NVIDIA/cccl
CUDA Core Compute Libraries
Language: C++ - Size: 79.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,602 - Forks: 208

beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Language: Java - Size: 152 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,229 - Forks: 119

stotko/stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Language: C++ - Size: 4.87 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 1,212 - Forks: 88

Jaysmito101/TerraForge3D
Cross Platform Professional Procedural Terrain Generation & Texturing Tool
Language: C++ - Size: 630 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,047 - Forks: 97

Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language: Cuda - Size: 1.25 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 997 - Forks: 152

NVIDIA-Merlin/HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Language: C++ - Size: 55.7 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 993 - Forks: 203

hughperkins/VeriGPU
OpenSource GPU, in Verilog, loosely based on RISC-V ISA
Language: SystemVerilog - Size: 6.76 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 961 - Forks: 109

dgasmith/opt_einsum
⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
Language: Python - Size: 4.11 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 898 - Forks: 72

eszdman/PhotonCamera
Android Camera that uses Enhanced image processing
Language: Java - Size: 22.7 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 845 - Forks: 79

coreylowman/cudarc
Safe rust wrapper around CUDA toolkit
Language: Rust - Size: 2.79 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 821 - Forks: 97

NVIDIA-Merlin/Merlin
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
Language: Python - Size: 38 MB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 816 - Forks: 123

limbo018/DREAMPlace
Deep learning toolkit-enabled VLSI placement
Language: C++ - Size: 18 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 777 - Forks: 216

NVlabs/sionna
Sionna: An Open-Source Library for Next-Generation Physical Layer Research
Language: Python - Size: 191 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 766 - Forks: 221

ttddee/Cascade 📦
Node-based image editor with GPU-acceleration.
Language: C++ - Size: 7.21 MB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 743 - Forks: 35

iot-salzburg/gpu-jupyter
GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 733 - Forks: 236

Sergio0694/NeuralNetwork.NET
A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Language: C# - Size: 13.1 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 556 - Forks: 88

philferriere/dlwin
GPU-accelerated Deep Learning on Windows 10 native
Language: Python - Size: 2.71 MB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 517 - Forks: 100

DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language: Cuda - Size: 4.58 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 515 - Forks: 95

MegviiRobot/MegBA
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
Language: Cuda - Size: 1.3 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 450 - Forks: 61

EMI-Group/evox
Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.
Language: Python - Size: 37.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 445 - Forks: 71

ProjectPhysX/OpenCL-Wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Language: C++ - Size: 300 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 390 - Forks: 40

uncomplicate/bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
Language: Clojure - Size: 1020 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 365 - Forks: 23

andrewmilson/ministark
🏃♂️💨 GPU accelerated STARK prover built on @arkworks-rs
Language: Rust - Size: 1.65 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 354 - Forks: 36

DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
Language: Python - Size: 11 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 347 - Forks: 47

Glavnokoman/vuh
Vulkan compute for people
Language: C++ - Size: 705 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 340 - Forks: 34

gpufit/Gpufit
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Language: Cuda - Size: 1.16 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 319 - Forks: 96

favreau/Sol-R Fork of cyrillefavreau/Sol-R
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
Language: C++ - Size: 22 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 306 - Forks: 14

quiver-team/torch-quiver
PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.
Language: Python - Size: 4.95 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 299 - Forks: 36

baggepinnen/MonteCarloMeasurements.jl
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Language: Julia - Size: 4.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 274 - Forks: 17

marian-nmt/marian-dev
Fast Neural Machine Translation in C++ - development repository
Language: C++ - Size: 18.7 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 271 - Forks: 129

stitchEm/stitchEm
Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
Language: C++ - Size: 7.26 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 244 - Forks: 62

AdrianAntico/AutoQuant
R package for automation of machine learning, forecasting, model evaluation, and model interpretation
Language: R - Size: 804 MB - Last synced at: about 2 hours ago - Pushed at: 4 months ago - Stars: 243 - Forks: 43

denosaurs/netsaur
Powerful Powerful Machine Learning library with GPU, CPU and WASM backends
Language: Rust - Size: 146 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 243 - Forks: 4

ROCm/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
Language: Python - Size: 95 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 235 - Forks: 158

BasBuller/PySNN
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Language: Python - Size: 12.8 MB - Last synced at: about 15 hours ago - Pushed at: 9 months ago - Stars: 225 - Forks: 27

clEsperanto/pyclesperanto_prototype
GPU-accelerated bio-image analysis focusing on 3D+t microscopy image data
Language: Jupyter Notebook - Size: 221 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 223 - Forks: 48

bh107/bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Language: C++ - Size: 32.4 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 221 - Forks: 31

AudioKit/Waveform
GPU accelerated waveform view
Language: Swift - Size: 4.62 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 205 - Forks: 15

daktronics/cef-mixer
High Performance off-screen rendering (OSR) demo using CEF
Language: C++ - Size: 283 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 205 - Forks: 49

mikeroyal/GPU-Guide
Graphics Processing Unit (GPU) Architecture Guide
Language: Shell - Size: 815 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 202 - Forks: 16

uncomplicate/clojurecuda
Clojure library for CUDA development
Language: Clojure - Size: 508 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 184 - Forks: 10

aliemo/transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 183 - Forks: 25

PeculiarVentures/GammaCV
GammaCV is a WebGL accelerated Computer Vision library for browser
Language: JavaScript - Size: 28.3 MB - Last synced at: 18 days ago - Pushed at: 23 days ago - Stars: 182 - Forks: 24

ertis-research/kafka-ml
Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)
Language: Python - Size: 5.44 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 181 - Forks: 25

yzhao062/pytod
TOD: GPU-accelerated Outlier Detection via Tensor Operations
Language: Python - Size: 13.1 MB - Last synced at: 19 days ago - Pushed at: about 2 years ago - Stars: 180 - Forks: 24

eth-cscs/COSMA
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Language: C++ - Size: 8.35 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 177 - Forks: 26

merzlab/QUICK
QUICK: A GPU-enabled ab intio quantum chemistry software package
Language: C - Size: 162 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 169 - Forks: 47

leoliuf/MRiLab
A Numerical Magnetic Resonance Imaging (MRI) Simulation Platform
Language: MATLAB - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 168 - Forks: 60

ucl-bug/jwave
A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs
Language: Python - Size: 54.8 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 164 - Forks: 21

AI4Finance-Foundation/RLSolver
Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.
Language: Python - Size: 60.9 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 150 - Forks: 34

csiro-robotics/ohm
An efficient, extensible occupancy map supporting probabilistic occupancy, normal distribution transforms in CPU and GPU.
Language: C++ - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 147 - Forks: 18

ysh329/OpenCL-101
Learn OpenCL step by step.
Language: C - Size: 476 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 135 - Forks: 29

JuliaHealth/KomaMRI.jl
Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.
Language: Julia - Size: 541 MB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 132 - Forks: 22

arceryz/raylib-gpu-particles
Raylib 100% GPU particles example in 3D. Uses compute shaders and is fully documented. Millions of particles at 60 fps on a laptop.
Language: C - Size: 23.4 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 123 - Forks: 5

mightycow/Sluggish
Toy CPU and GPU implementations of the Slug rendering algorithm
Language: C - Size: 2.22 MB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 119 - Forks: 13

TianZerL/pyanime4k
An easy way to use anime4k in python
Language: Python - Size: 61.5 KB - Last synced at: 13 days ago - Pushed at: almost 4 years ago - Stars: 118 - Forks: 17

tensordiffeq/TensorDiffEq
Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of Tensorflow for multi-worker distributed computing
Language: Python - Size: 1.28 MB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 113 - Forks: 42

icl-utk-edu/slate
SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy Exascale Computing Project (ECP).
Language: C++ - Size: 22.1 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 110 - Forks: 23

microsoft/Accera
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
Language: C++ - Size: 13.4 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 110 - Forks: 19

IntelPython/dpnp
Data Parallel Extension for NumPy
Language: Python - Size: 697 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 107 - Forks: 21

kohonda/mppi_playground
Model Predictive Path Integral Control (MPPI) with PyTorch
Language: Python - Size: 13.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 103 - Forks: 13

cuhk-eda/Xplace
Xplace 2.0: An Extremely Fast, Extensible and Deterministic Placement Framework with Detailed-Routability Optimization
Language: C++ - Size: 81.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 102 - Forks: 9

slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
Language: Shell - Size: 5.96 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 102 - Forks: 23

DeepMLNet/DeepNet
Deep.Net machine learning framework for F#
Language: F# - Size: 230 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 102 - Forks: 9

arctern-io/arctern
Language: C++ - Size: 66.6 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 102 - Forks: 53

Heteroflow/Heteroflow
Concurrent CPU-GPU Programming using Task Models
Language: C++ - Size: 1.58 MB - Last synced at: 25 days ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

mitmath/JuliaComputation
Repository for Common Ground C25
Language: Julia - Size: 69.7 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 100 - Forks: 14

ashvardanian/ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!
Language: C++ - Size: 17.3 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 96 - Forks: 9

ucbrise/piranha
Piranha: A GPU Platform for Secure Computation
Language: C++ - Size: 71.5 MB - Last synced at: 15 days ago - Pushed at: about 2 years ago - Stars: 95 - Forks: 27

lowrollr/turbozero
fast + parallel AlphaZero in JAX
Language: Python - Size: 28.8 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 94 - Forks: 9

cnvrg/metagpu
K8s device plugin for GPU sharing
Language: Go - Size: 423 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 8

kklmn/xrt
Package xrt (XRayTracer) is a python software library for ray tracing and wave propagation in x-ray regime. It is primarily meant for modeling synchrotron sources, beamlines and beamline elements.
Language: Python - Size: 472 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 91 - Forks: 31

larsgeb/m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
Language: C++ - Size: 10.9 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 91 - Forks: 18

guillaume-chevalier/GloVe-as-a-TensorFlow-Embedding-Layer
Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 27 days ago - Pushed at: over 6 years ago - Stars: 90 - Forks: 19

tugrul512bit/Cekirdekler
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Language: C# - Size: 10.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 86 - Forks: 9

adevaucorbeil/karamelo
An open source parallel C++ package for the material point method (MPM)
Language: C++ - Size: 31.2 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 85 - Forks: 22

JuliaTeachingCTU/Scientific-Programming-in-Julia
Repository for B0M36SPJ
Language: Jupyter Notebook - Size: 66.8 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 85 - Forks: 16

ParaGroup/WindFlow
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
Language: C++ - Size: 48.9 MB - Last synced at: about 23 hours ago - Pushed at: about 2 months ago - Stars: 81 - Forks: 19

FluidNumerics/SELF
Spectral Element Library in Fortran
Language: Fortran - Size: 48.5 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 78 - Forks: 11

aestream/aestream
Efficient streaming of sparse event data supporting files, network I/O, GPU peripherals (via Torch/Jax/Numpy) and neuromorphic protocols
Language: C++ - Size: 30.2 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 78 - Forks: 11

oalieno/asm2vec-pytorch
Unofficial implementation of asm2vec using pytorch ( with GPU acceleration )
Language: Python - Size: 60.5 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 75 - Forks: 21

TheoreticalEcology/s-jSDM
Scalable joint species distribution modeling
Language: R - Size: 51.3 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 70 - Forks: 15

EMI-Group/evorl
EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Reinforcement Learning (ERL), AutoRL, and seamless integration with GPU-optimized simulation environments.
Language: Python - Size: 2.74 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 69 - Forks: 7

wi-re/openMaelstrom
An open source GPU based SPH simulation with support for spatial adaptivity
Language: C++ - Size: 290 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 65 - Forks: 9

SciRuby/rbcuda
CUDA bindings for Ruby
Language: C - Size: 219 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 64 - Forks: 10

PhasicFlow/phasicFlow
Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations.
Language: C++ - Size: 90.6 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 63 - Forks: 35

kunitoki/yup
YUP is an open-source library dedicated to empowering developers with advanced tools for cross-platform application development.
Language: C++ - Size: 20.5 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 63 - Forks: 8

brian-team/brian2cuda
A brian2 extension to simulate spiking neural networks on GPUs
Language: Python - Size: 122 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 63 - Forks: 13
