An open API service providing repository metadata for many open source software ecosystems.

Topic: "gpu-acceleration"

tensorflow/tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Language: TypeScript - Size: 165 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 18,768 - Forks: 1,967

NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++ - Size: 130 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 11,497 - Forks: 2,182

tensorflow/tfjs-core 📦

WebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.

Language: TypeScript - Size: 362 MB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 8,481 - Forks: 949

raphamorim/rio

A hardware-accelerated GPU terminal emulator focused on running in desktops and browsers.

Language: Rust - Size: 260 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,992 - Forks: 175

cornellius-gp/gpytorch

A highly efficient implementation of Gaussian Processes in PyTorch

Language: Python - Size: 29.3 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 3,689 - Forks: 564

NVIDIA/GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Language: Python - Size: 69 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 2,981 - Forks: 710

Hedgehog-Computing/hedgehog-lab

Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript that runs in your browser, with GPU-accelerated matrix operations, TeX support, data visualization and symbolic computation.

Language: TypeScript - Size: 28.3 MB - Last synced at: 12 days ago - Pushed at: 12 months ago - Stars: 2,369 - Forks: 140

BlazingDB/blazingsql

BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.

Language: C++ - Size: 41.4 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1,959 - Forks: 184

TianZerL/Anime4KCPP

A high performance anime upscaler

Language: C++ - Size: 7.44 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 1,864 - Forks: 145

coreylowman/dfdx

Deep learning in Rust, with shape checked tensors and neural networks

Language: Rust - Size: 2.6 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 1,799 - Forks: 107

emacs-ng/emacs-ng

A new approach to Emacs - Including TypeScript, Threading, Async I/O, and WebRender.

Language: Emacs Lisp - Size: 416 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 1,742 - Forks: 73

calebwin/emu

The write-once-run-anywhere GPGPU library for Rust

Language: Rust - Size: 342 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1,604 - Forks: 52

NVIDIA/cccl

CUDA Core Compute Libraries

Language: C++ - Size: 79.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,602 - Forks: 208

beehive-lab/TornadoVM

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

Language: Java - Size: 152 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,229 - Forks: 119

stotko/stdgpu

stdgpu: Efficient STL-like Data Structures on the GPU

Language: C++ - Size: 4.87 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 1,212 - Forks: 88

Jaysmito101/TerraForge3D

Cross Platform Professional Procedural Terrain Generation & Texturing Tool

Language: C++ - Size: 630 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,047 - Forks: 97

Liu-xiandong/How_to_optimize_in_GPU

A series of GPU optimization topics that explains in detail how to optimize CUDA kernels. It covers several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm, among others. The performance of these kernels is at or near the theoretical limit.

Language: Cuda - Size: 1.25 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 997 - Forks: 152

NVIDIA-Merlin/HugeCTR

HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models

Language: C++ - Size: 55.7 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 993 - Forks: 203

hughperkins/VeriGPU

Open-source GPU in Verilog, loosely based on the RISC-V ISA

Language: SystemVerilog - Size: 6.76 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 961 - Forks: 109

dgasmith/opt_einsum

⚡️ Optimizing einsum functions in NumPy, TensorFlow, Dask, and more with contraction order optimization.

Language: Python - Size: 4.11 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 898 - Forks: 72

eszdman/PhotonCamera

Android Camera that uses Enhanced image processing

Language: Java - Size: 22.7 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 845 - Forks: 79

coreylowman/cudarc

Safe Rust wrapper around the CUDA toolkit

Language: Rust - Size: 2.79 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 821 - Forks: 97

NVIDIA-Merlin/Merlin

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

Language: Python - Size: 38 MB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 816 - Forks: 123

limbo018/DREAMPlace

Deep learning toolkit-enabled VLSI placement

Language: C++ - Size: 18 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 777 - Forks: 216

NVlabs/sionna

Sionna: An Open-Source Library for Next-Generation Physical Layer Research

Language: Python - Size: 191 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 766 - Forks: 221

ttddee/Cascade 📦

Node-based image editor with GPU-acceleration.

Language: C++ - Size: 7.21 MB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 743 - Forks: 35

iot-salzburg/gpu-jupyter

GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 733 - Forks: 236

Sergio0694/NeuralNetwork.NET

A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN

Language: C# - Size: 13.1 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 556 - Forks: 88

philferriere/dlwin

GPU-accelerated Deep Learning on Windows 10 native

Language: Python - Size: 2.71 MB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 517 - Forks: 100

DavidDiazGuerra/gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language: Cuda - Size: 4.58 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 515 - Forks: 95

MegviiRobot/MegBA

MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment

Language: Cuda - Size: 1.3 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 450 - Forks: 61

EMI-Group/evox

Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.

Language: Python - Size: 37.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 445 - Forks: 71

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 300 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 390 - Forks: 40

uncomplicate/bayadera

High-performance Bayesian Data Analysis on the GPU in Clojure

Language: Clojure - Size: 1020 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 365 - Forks: 23

andrewmilson/ministark

🏃‍♂️💨 GPU accelerated STARK prover built on @arkworks-rs

Language: Rust - Size: 1.65 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 354 - Forks: 36

DataCanvasIO/HyperGBM

A full pipeline AutoML tool for tabular data

Language: Python - Size: 11 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 347 - Forks: 47

Glavnokoman/vuh

Vulkan compute for people

Language: C++ - Size: 705 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 340 - Forks: 34

gpufit/Gpufit

GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

Language: Cuda - Size: 1.16 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 319 - Forks: 96

favreau/Sol-R (fork of cyrillefavreau/Sol-R)

Open-Source CUDA/OpenCL Speed Of Light Ray-tracer

Language: C++ - Size: 22 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 306 - Forks: 14

quiver-team/torch-quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.

Language: Python - Size: 4.95 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 299 - Forks: 36

baggepinnen/MonteCarloMeasurements.jl

Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.

Language: Julia - Size: 4.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 274 - Forks: 17

marian-nmt/marian-dev

Fast Neural Machine Translation in C++ - development repository

Language: C++ - Size: 18.7 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 271 - Forks: 129

stitchEm/stitchEm

Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production

Language: C++ - Size: 7.26 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 244 - Forks: 62

AdrianAntico/AutoQuant

R package for automation of machine learning, forecasting, model evaluation, and model interpretation

Language: R - Size: 804 MB - Last synced at: about 2 hours ago - Pushed at: 4 months ago - Stars: 243 - Forks: 43

denosaurs/netsaur

Powerful Machine Learning library with GPU, CPU and WASM backends

Language: Rust - Size: 146 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 243 - Forks: 4

ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Language: Python - Size: 95 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 235 - Forks: 158

BasBuller/PySNN

Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration

Language: Python - Size: 12.8 MB - Last synced at: about 15 hours ago - Pushed at: 9 months ago - Stars: 225 - Forks: 27

clEsperanto/pyclesperanto_prototype

GPU-accelerated bio-image analysis focusing on 3D+t microscopy image data

Language: Jupyter Notebook - Size: 221 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 223 - Forks: 48

bh107/bohrium

Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX

Language: C++ - Size: 32.4 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 221 - Forks: 31

AudioKit/Waveform

GPU accelerated waveform view

Language: Swift - Size: 4.62 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 205 - Forks: 15

daktronics/cef-mixer

High Performance off-screen rendering (OSR) demo using CEF

Language: C++ - Size: 283 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 205 - Forks: 49

mikeroyal/GPU-Guide

Graphics Processing Unit (GPU) Architecture Guide

Language: Shell - Size: 815 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 202 - Forks: 16

uncomplicate/clojurecuda

Clojure library for CUDA development

Language: Clojure - Size: 508 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 184 - Forks: 10

aliemo/transfomers-silicon-research

Research and Materials on Hardware implementation of Transformer Model

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 183 - Forks: 25

PeculiarVentures/GammaCV

GammaCV is a WebGL accelerated Computer Vision library for browser

Language: JavaScript - Size: 28.3 MB - Last synced at: 18 days ago - Pushed at: 23 days ago - Stars: 182 - Forks: 24

ertis-research/kafka-ml

Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)

Language: Python - Size: 5.44 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 181 - Forks: 25

yzhao062/pytod

TOD: GPU-accelerated Outlier Detection via Tensor Operations

Language: Python - Size: 13.1 MB - Last synced at: 19 days ago - Pushed at: about 2 years ago - Stars: 180 - Forks: 24

eth-cscs/COSMA

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

Language: C++ - Size: 8.35 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 177 - Forks: 26

merzlab/QUICK

QUICK: A GPU-enabled ab initio quantum chemistry software package

Language: C - Size: 162 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 169 - Forks: 47

leoliuf/MRiLab

A Numerical Magnetic Resonance Imaging (MRI) Simulation Platform

Language: MATLAB - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 168 - Forks: 60

ucl-bug/jwave

A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs

Language: Python - Size: 54.8 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 164 - Forks: 21

AI4Finance-Foundation/RLSolver

Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.

Language: Python - Size: 60.9 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 150 - Forks: 34

csiro-robotics/ohm

An efficient, extensible occupancy map supporting probabilistic occupancy and normal distribution transforms on CPU and GPU.

Language: C++ - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 147 - Forks: 18

ysh329/OpenCL-101

Learn OpenCL step by step.

Language: C - Size: 476 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 135 - Forks: 29

JuliaHealth/KomaMRI.jl

Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.

Language: Julia - Size: 541 MB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 132 - Forks: 22

arceryz/raylib-gpu-particles

Raylib 100% GPU particles example in 3D. Uses compute shaders and is fully documented. Millions of particles at 60 fps on a laptop.

Language: C - Size: 23.4 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 123 - Forks: 5

mightycow/Sluggish

Toy CPU and GPU implementations of the Slug rendering algorithm

Language: C - Size: 2.22 MB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 119 - Forks: 13

TianZerL/pyanime4k

An easy way to use Anime4K in Python

Language: Python - Size: 61.5 KB - Last synced at: 13 days ago - Pushed at: almost 4 years ago - Stars: 118 - Forks: 17

tensordiffeq/TensorDiffEq

Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of TensorFlow for multi-worker distributed computing

Language: Python - Size: 1.28 MB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 113 - Forks: 42

icl-utk-edu/slate

SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy Exascale Computing Project (ECP).

Language: C++ - Size: 22.1 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 110 - Forks: 23

microsoft/Accera

Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research

Language: C++ - Size: 13.4 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 110 - Forks: 19

IntelPython/dpnp

Data Parallel Extension for NumPy

Language: Python - Size: 697 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 107 - Forks: 21

kohonda/mppi_playground

Model Predictive Path Integral Control (MPPI) with PyTorch

Language: Python - Size: 13.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 103 - Forks: 13

cuhk-eda/Xplace

Xplace 2.0: An Extremely Fast, Extensible and Deterministic Placement Framework with Detailed-Routability Optimization

Language: C++ - Size: 81.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 102 - Forks: 9

slai-labs/get-beam

Run GPU inference and training jobs on serverless infrastructure that scales with you.

Language: Shell - Size: 5.96 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 102 - Forks: 23

DeepMLNet/DeepNet

Deep.Net machine learning framework for F#

Language: F# - Size: 230 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 102 - Forks: 9

arctern-io/arctern

Language: C++ - Size: 66.6 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 102 - Forks: 53

Heteroflow/Heteroflow

Concurrent CPU-GPU Programming using Task Models

Language: C++ - Size: 1.58 MB - Last synced at: 25 days ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

mitmath/JuliaComputation

Repository for Common Ground C25

Language: Julia - Size: 69.7 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 100 - Forks: 14

ashvardanian/ParallelReductionsBenchmark

Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!

Language: C++ - Size: 17.3 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 96 - Forks: 9

ucbrise/piranha

Piranha: A GPU Platform for Secure Computation

Language: C++ - Size: 71.5 MB - Last synced at: 15 days ago - Pushed at: about 2 years ago - Stars: 95 - Forks: 27

lowrollr/turbozero

fast + parallel AlphaZero in JAX

Language: Python - Size: 28.8 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 94 - Forks: 9

cnvrg/metagpu

K8s device plugin for GPU sharing

Language: Go - Size: 423 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 8

kklmn/xrt

Package xrt (XRayTracer) is a Python software library for ray tracing and wave propagation in the x-ray regime. It is primarily meant for modeling synchrotron sources, beamlines and beamline elements.

Language: Python - Size: 472 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 91 - Forks: 31

larsgeb/m1-gpu-cpp

Metal Shading Language on Apple M1's GPU for scientific C++.

Language: C++ - Size: 10.9 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 91 - Forks: 18

guillaume-chevalier/GloVe-as-a-TensorFlow-Embedding-Layer

Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.

Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 27 days ago - Pushed at: over 6 years ago - Stars: 90 - Forks: 19

tugrul512bit/Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated fast while using the same kernel on all devices (for simplicity).

Language: C# - Size: 10.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 86 - Forks: 9

adevaucorbeil/karamelo

An open source parallel C++ package for the material point method (MPM)

Language: C++ - Size: 31.2 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 85 - Forks: 22

JuliaTeachingCTU/Scientific-Programming-in-Julia

Repository for B0M36SPJ

Language: Jupyter Notebook - Size: 66.8 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 85 - Forks: 16

ParaGroup/WindFlow

A C++17 Data Stream Processing Parallel Library for Multicores and GPUs

Language: C++ - Size: 48.9 MB - Last synced at: about 23 hours ago - Pushed at: about 2 months ago - Stars: 81 - Forks: 19

FluidNumerics/SELF

Spectral Element Library in Fortran

Language: Fortran - Size: 48.5 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 78 - Forks: 11

aestream/aestream

Efficient streaming of sparse event data supporting files, network I/O, GPU peripherals (via Torch/Jax/Numpy) and neuromorphic protocols

Language: C++ - Size: 30.2 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 78 - Forks: 11

oalieno/asm2vec-pytorch

Unofficial implementation of asm2vec using PyTorch (with GPU acceleration)

Language: Python - Size: 60.5 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 75 - Forks: 21

TheoreticalEcology/s-jSDM

Scalable joint species distribution modeling

Language: R - Size: 51.3 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 70 - Forks: 15

EMI-Group/evorl

EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Reinforcement Learning (ERL), AutoRL, and seamless integration with GPU-optimized simulation environments.

Language: Python - Size: 2.74 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 69 - Forks: 7

wi-re/openMaelstrom

An open source GPU based SPH simulation with support for spatial adaptivity

Language: C++ - Size: 290 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 65 - Forks: 9

SciRuby/rbcuda

CUDA bindings for Ruby

Language: C - Size: 219 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 64 - Forks: 10

PhasicFlow/phasicFlow

Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations.

Language: C++ - Size: 90.6 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 63 - Forks: 35

kunitoki/yup

YUP is an open-source library dedicated to empowering developers with advanced tools for cross-platform application development.

Language: C++ - Size: 20.5 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 63 - Forks: 8

brian-team/brian2cuda

A brian2 extension to simulate spiking neural networks on GPUs

Language: Python - Size: 122 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 63 - Forks: 13