Topic: "continuous-control"
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python - Size: 5.91 MB - Last synced at: 30 days ago - Pushed at: almost 3 years ago - Stars: 3,729 - Forks: 835

opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Language: Python - Size: 115 MB - Last synced at: about 9 hours ago - Pushed at: 5 days ago - Stars: 1,370 - Forks: 154

rl-tools/rl-tools
The Fastest Deep Reinforcement Learning Library
Language: C++ - Size: 7.25 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 794 - Forks: 30

ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language: Jupyter Notebook - Size: 4.17 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 597 - Forks: 62

Omegastick/pytorch-cpp-rl 📦
PyTorch C++ Reinforcement Learning
Language: C++ - Size: 540 KB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 523 - Forks: 88

ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language: Python - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 440 - Forks: 90

denisyarats/pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 426 - Forks: 90

chingyaoc/pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
Language: Python - Size: 330 KB - Last synced at: 6 days ago - Pushed at: about 8 years ago - Stars: 266 - Forks: 50

openai/EPG
Code for the paper "Evolved Policy Gradients"
Language: Python - Size: 457 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 250 - Forks: 55

andrewliao11/gail-tf
TensorFlow implementation of generative adversarial imitation learning
Language: Python - Size: 2.42 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 200 - Forks: 46

m5823779/motion-planner-reinforcement-learning
End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG) in Gazebo
Language: Python - Size: 28.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 142 - Forks: 38

andrewliao11/pytorch-a3c-mujoco
A3C implementation for MuJoCo Gym environments
Language: Python - Size: 230 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 73 - Forks: 20

zhihanyang2022/off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
Language: Python - Size: 137 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 72 - Forks: 10

fshamshirdar/pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 44 - Forks: 21

Scitator/catalyst-rl-framework
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Language: Python - Size: 9.77 KB - Last synced at: 4 days ago - Pushed at: about 6 years ago - Stars: 39 - Forks: 3

simionsoft/SimionZoo
A workbench for online model-free Reinforcement Learning on continuous control problems
Language: C++ - Size: 248 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 25

LQNew/Continuous_Control_Benchmark
Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.
Language: Python - Size: 73.6 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 24 - Forks: 0

BY571/Normalized-Advantage-Function-NAF-
PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method
Language: Jupyter Notebook - Size: 3.86 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 13

onlytailei/pytorch-rl Fork of jingweiz/pytorch-rl
Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)
Language: Python - Size: 12.1 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 23 - Forks: 5

hcnoh/rl-collection-pytorch
A collection of Reinforcement Learning implementations with PyTorch
Language: Python - Size: 5.84 MB - Last synced at: 13 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

mknbv/neuralode-rl
Neural Ordinary Differential Equations for Reinforcement Learning
Language: Python - Size: 365 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 3

alirezakazemipour/Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
Language: Python - Size: 14.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 2

LQNew/LWDRLC
Lightweight deep RL library for continuous control.
Language: Python - Size: 46 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 0

kinwo/deeprl-continuous-control
Learning Continuous Control in Deep Reinforcement Learning
Language: HTML - Size: 1.58 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 8

BackpropTools/BackpropTools
A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Language: C++ - Size: 1.94 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 0

Akella17/Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Language: Python - Size: 3.44 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 7

MahanFathi/HJxB
Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)
Language: Python - Size: 142 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

sparisi/tensorl
Simple and self-contained TensorFlow implementation of reinforcement learning algorithms for continuous control, integrated with OpenAI Gym and other physics engines.
Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 11 - Forks: 3

angel-ayala/webots-fire-scene 📦
A first approach to a Webots world with fire and smoke simulation, used to control a drone by manipulating its angles and thrust.
Language: Python - Size: 13.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

rupertbg/YES3
Whitelist intentionally-public buckets, block everything else
Language: Python - Size: 6.84 KB - Last synced at: 28 days ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 1

zhengfeiwang/NeurIPS2018-AIforProsthetics
Reinforcement learning with musculoskeletal models
Language: Python - Size: 4.93 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 1

lgvaz/rlbox
RLbox: Solving OpenAI Gym with TensorFlow
Language: Python - Size: 2.62 MB - Last synced at: about 9 hours ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 6

nalbert9/Deep_Reinforcement_Learning
Train agents to walk and drive by themselves using Unity ML-Agents, OpenAI Gym and PyTorch
Language: Python - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

SSubhnil/BAC-DAC-gym
Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.
Language: Python - Size: 1.8 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

LinghengMeng/4_Room_World_Environment
This repository provides a simulation of 4-Room-World environment.
Language: Python - Size: 4.89 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 3

YufengYuan/DRL
Implementation of Deep RL algorithms for continuous-control tasks
Language: Python - Size: 92.8 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 1

RoboticsDesignLab/jitterbug
A Jitterbug dm_control Reinforcement Learning domain
Language: Jupyter Notebook - Size: 44.4 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

MahanFathi/DeepDPG-TensorFlow
TensorFlow Implementation of Deep Deterministic Policy Gradients for Continuous OpenAI Gym Environments
Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

telmo-correa/DRLND-project-2
Implementation of project 2 for Udacity's Deep Reinforcement Learning Nanodegree
Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 2 - Forks: 1

Sardhendu/DeepRL
Deep Reinforcement Learning Projects
Language: Jupyter Notebook - Size: 78 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

jacobyxu/Crawler_using_PPO
Implement PPO to solve Crawler problem in Unity
Language: Python - Size: 61.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

nslyubaykin/mbrl_multitasking
Model-Based RL Multi-Tasking with ReLAx
Language: Jupyter Notebook - Size: 11.1 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

dstuemk/simple-ppo-continuous
Implementation (TF2) of proximal policy optimization (PPO) for a continuous control task
Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

koulanurag/variable-td3
Learning n-step actions for control tasks
Language: Python - Size: 1.16 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

santhisenan/DDPG_Gym
An implementation of DDPG algorithm on OpenAI Gym Pendulum-v0 environment.
Language: Python - Size: 153 KB - Last synced at: 28 days ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

fdasilva59/Udacity-DRL-ContinuousControl
My solution to Udacity Deep Reinforcement Learning Nanodegree's Project 2 - Continuous Control
Language: Jupyter Notebook - Size: 5.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 3

marcelloaborges/Tennis-Collaboration-Continuous-Control
Udacity Deep Reinforcement Learning Nanodegree Program - Collaboration Continuous Control
Language: ASP - Size: 19.2 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

partha746/DRLND_P2_Reacher_EnV
Continuous Control using DDPG Algorithm
Language: HTML - Size: 1.13 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

angel-ayala/gym-webots-fire 📦
Gym environment for Webots Fire Scene
Language: Python - Size: 11.7 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nslyubaykin/relax_mbpo_example
Example MBPO implementation with ReLAx
Language: Jupyter Notebook - Size: 86 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nslyubaykin/parallel_ppo
Speeding Up PPO with Parallel Sampling
Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

tyranitar/continuous-control
A2C and D4PG implementations for the continuous control challenge. Part of the coursework for Udacity's Deep RL Nanodegree.
Language: Python - Size: 24.3 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

exajobs/ci-cd-collection
An ongoing curated list of awesome frameworks, important books, articles, talks, libraries, learning tutorials, best practices and technical resources about Continuous Integration & Continuous Delivery.
Size: 461 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

LQNew/ValueEstimationRL
Compute Q-Value Estimation for RL in MuJoCo environment.
Language: Python - Size: 46.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

BY571/TD3-and-Extensions
PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradient (TD3), including additional extensions to improve the algorithm's performance.
Language: Python - Size: 1.73 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

tim-vorona/Wolf-Rabbit
The tracking problem: competition between two recurrent neural networks on the sphere.
Language: Python - Size: 1000 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

surajitsaikia27/DRL_Continiuos_Control
Deep Deterministic Policy Gradients (DDPG) for controlling a robotic hand to grasp a ball
Language: Jupyter Notebook - Size: 83 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

ZSoumia/Continous-control-Agent
This project was the second project of my Deep Reinforcement Learning Nanodegree
Language: HTML - Size: 6.55 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

silviomori/udacity-deep-reinforcement-learning-p2-continuous-control
Create and train a double-jointed arm agent that is able to maintain its hand in contact with a moving target
Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 2

MasterYexl/Mick
Continuous event handling: dealing with the continuous click event
Language: JavaScript - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

marcelloaborges/Reacher-Continuous-Control
Udacity Deep Reinforcement Learning Nanodegree Program - Continuous Control
Language: ASP - Size: 18 MB - Last synced at: 10 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 5

legalaspro/rl-odyssey
RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.
Language: Jupyter Notebook - Size: 66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

aidanscannell/dcmpc
Official PyTorch implementation of "DC-MPC: Discrete Codebook Model Predictive Control"
Language: Jupyter Notebook - Size: 30.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

KitStandart/rl_lib
A research library for reinforcement learning.
Language: Python - Size: 502 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Payam-Mousavi/RL-Continuous-Control
Using RL to control a double-jointed robot to reach target locations
Language: ASP - Size: 87.7 MB - Last synced at: about 11 hours ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_frwr_example
Example FRWR (PDDM) implementation with ReLAx
Language: Jupyter Notebook - Size: 10 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_cem_example
Example CEM implementation with ReLAx
Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_random_shooting_example
Example Random Shooting implementation with ReLAx
Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_ppo_example
Example PPO implementation with ReLAx
Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_trpo_example
Example TRPO implementation with ReLAx
Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_a2c_example
Example A2C implementation with ReLAx
Language: Jupyter Notebook - Size: 331 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_vpg_example
Example VPG implementation with ReLAx
Language: Jupyter Notebook - Size: 384 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_dyna_q_example
Example DYNA-Q implementation with ReLAx
Language: Jupyter Notebook - Size: 93.5 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/nstep_td3
Multistep TD3 for locomotion
Language: Jupyter Notebook - Size: 42.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/trpo_schedule_kl
Scheduling TRPO's KL Divergence Constraint
Language: Jupyter Notebook - Size: 212 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

harwiltz/sac
Simple implementation of SAC with PyTorch.
Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

kHarshit/udacity-drlnd-projects 📦
Udacity's DRLND projects
Language: Jupyter Notebook - Size: 7.44 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

aaronsnoswell/line-world
A simple multi-modal continuous control RL environment
Language: Python - Size: 82 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

being-aerys/Reinforcement-Learning-Self-Projects
Fun with Reinforcement Learning in my spare time
Language: Jupyter Notebook - Size: 2.77 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

phate09/drl_continuous_control
Udacity deep reinforcement learning continuous control project
Language: Python - Size: 3.27 MB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

SIakovlev/Continuous-Control
Language: Jupyter Notebook - Size: 5.76 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

nithindd/ContinuousControl
Continuous Control using DDPG Algorithm
Language: Jupyter Notebook - Size: 43.7 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

marioyc/learning-to-run
Learning to Run NIPS 2017 Competition
Language: Python - Size: 65.4 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

TomorrowIsAnOtherDay/Deep-RL-Collection
Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
