An open API service providing repository metadata for many open source software ecosystems.

Topic: "continuous-control"

ikostrikov/pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python - Size: 5.91 MB - Last synced at: 30 days ago - Pushed at: almost 3 years ago - Stars: 3,729 - Forks: 835

opendilab/LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Language: Python - Size: 115 MB - Last synced at: about 9 hours ago - Pushed at: 5 days ago - Stars: 1,370 - Forks: 154

rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

Language: C++ - Size: 7.25 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 794 - Forks: 30

ikostrikov/jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language: Jupyter Notebook - Size: 4.17 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 597 - Forks: 62

Omegastick/pytorch-cpp-rl 📦

PyTorch C++ Reinforcement Learning

Language: C++ - Size: 540 KB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 523 - Forks: 88

ikostrikov/pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language: Python - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 440 - Forks: 90

denisyarats/pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 426 - Forks: 90

chingyaoc/pytorch-REINFORCE

PyTorch Implementation of REINFORCE for both discrete & continuous control

Language: Python - Size: 330 KB - Last synced at: 6 days ago - Pushed at: about 8 years ago - Stars: 266 - Forks: 50

openai/EPG

Code for the paper "Evolved Policy Gradients"

Language: Python - Size: 457 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 250 - Forks: 55

andrewliao11/gail-tf

Tensorflow implementation of generative adversarial imitation learning

Language: Python - Size: 2.42 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 200 - Forks: 46

m5823779/motion-planner-reinforcement-learning

End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG) in Gazebo

Language: Python - Size: 28.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 142 - Forks: 38

andrewliao11/pytorch-a3c-mujoco

Implement A3C for Mujoco gym envs

Language: Python - Size: 230 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 73 - Forks: 20

zhihanyang2022/off-policy-continuous-control

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Language: Python - Size: 137 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 72 - Forks: 10

fshamshirdar/pytorch-rdpg

PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 44 - Forks: 21

Scitator/catalyst-rl-framework

Catalyst.RL: A Distributed Framework for Reproducible RL Research

Language: Python - Size: 9.77 KB - Last synced at: 4 days ago - Pushed at: about 6 years ago - Stars: 39 - Forks: 3

simionsoft/SimionZoo

A workbench for online model-free Reinforcement Learning on continuous control problems

Language: C++ - Size: 248 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 25

LQNew/Continuous_Control_Benchmark

Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.

Language: Python - Size: 73.6 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 24 - Forks: 0

BY571/Normalized-Advantage-Function-NAF-

PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method

Language: Jupyter Notebook - Size: 3.86 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 13

onlytailei/pytorch-rl (fork of jingweiz/pytorch-rl)

Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)

Language: Python - Size: 12.1 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 23 - Forks: 5

hcnoh/rl-collection-pytorch

A collection of Reinforcement Learning implementations with PyTorch

Language: Python - Size: 5.84 MB - Last synced at: 13 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

mknbv/neuralode-rl

Neural Ordinary Differential Equations for Reinforcement Learning

Language: Python - Size: 365 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 3

alirezakazemipour/Continuous-PPO

Proximal Policy Optimization (Continuous Version) in PyTorch.

Language: Python - Size: 14.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 2

LQNew/LWDRLC

Lightweight deep RL library for continuous control.

Language: Python - Size: 46 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 0

kinwo/deeprl-continuous-control

Learning Continuous Control in Deep Reinforcement Learning

Language: HTML - Size: 1.58 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 8

BackpropTools/BackpropTools

A Fast, Portable Deep Reinforcement Learning Library for Continuous Control

Language: C++ - Size: 1.94 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 0

Akella17/Deep-Bayesian-Quadrature-Policy-Optimization

Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.

Language: Python - Size: 3.44 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 7

MahanFathi/HJxB

Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)

Language: Python - Size: 142 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

sparisi/tensorl

Simple and self-contained TensorFlow implementation of reinforcement learning algorithms for continuous control, integrated with OpenAI Gym and other physics engines.

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 11 - Forks: 3

angel-ayala/webots-fire-scene 📦

A first approach for fire and smoke simulation Webots world to control a drone, manipulating its angles and thrust.

Language: Python - Size: 13.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

rupertbg/YES3

Whitelist intentionally-public buckets, block everything else

Language: Python - Size: 6.84 KB - Last synced at: 28 days ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 1

zhengfeiwang/NeurIPS2018-AIforProsthetics

Reinforcement learning with musculoskeletal models

Language: Python - Size: 4.93 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 1

lgvaz/rlbox

RLbox: Solving OpenAI Gym with TensorFlow

Language: Python - Size: 2.62 MB - Last synced at: about 9 hours ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 6

nalbert9/Deep_Reinforcement_Learning

Train agents to walk and drive by themselves using Unity ML-Agents, OpenAI Gym and PyTorch

Language: Python - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

SSubhnil/BAC-DAC-gym

Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.

Language: Python - Size: 1.8 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

LinghengMeng/4_Room_World_Environment

This repository provides a simulation of 4-Room-World environment.

Language: Python - Size: 4.89 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 3

YufengYuan/DRL

Implementation of Deep RL algorithms for continuous-control tasks

Language: Python - Size: 92.8 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 1

RoboticsDesignLab/jitterbug

A Jitterbug dm_control Reinforcement Learning domain

Language: Jupyter Notebook - Size: 44.4 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

MahanFathi/DeepDPG-TensorFlow

TensorFlow Implementation of Deep Deterministic Policy Gradients for Continuous OpenAI Gym Environments

Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

telmo-correa/DRLND-project-2

Implementation of project 2 for Udacity's Deep Reinforcement Learning Nanodegree

Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 2 - Forks: 1

Sardhendu/DeepRL

Deep Reinforcement Learning Projects

Language: Jupyter Notebook - Size: 78 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

jacobyxu/Crawler_using_PPO

Implement PPO to solve Crawler problem in Unity

Language: Python - Size: 61.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

nslyubaykin/mbrl_multitasking

Model-Based RL Multi-Tasking with ReLAx

Language: Jupyter Notebook - Size: 11.1 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

dstuemk/simple-ppo-continuous

Implementation (TF2) of proximal policy optimization (PPO) for a continuous control task

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

koulanurag/variable-td3

Learning n-step actions for control tasks

Language: Python - Size: 1.16 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

santhisenan/DDPG_Gym

An implementation of DDPG algorithm on OpenAI Gym Pendulum-v0 environment.

Language: Python - Size: 153 KB - Last synced at: 28 days ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

fdasilva59/Udacity-DRL-ContinuousControl

My solution to Udacity Deep Reinforcement Learning Nanodegree's Project 2 - Continuous Control

Language: Jupyter Notebook - Size: 5.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 3

marcelloaborges/Tennis-Collaboration-Continuous-Control

Udacity Deep Reinforcement Learning Nanodegree Program - Collaboration Continuous Control

Language: ASP - Size: 19.2 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

partha746/DRLND_P2_Reacher_EnV

Continuous Control using DDPG Algorithm

Language: HTML - Size: 1.13 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

angel-ayala/gym-webots-fire 📦

Gym environment for Webots Fire Scene

Language: Python - Size: 11.7 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nslyubaykin/relax_mbpo_example

Example MBPO implementation with ReLAx

Language: Jupyter Notebook - Size: 86 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nslyubaykin/parallel_ppo

Speeding Up PPO with Parallel Sampling

Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

tyranitar/continuous-control

A2C and D4PG implementations for the continuous control challenge. Part of the coursework for Udacity's Deep RL Nanodegree.

Language: Python - Size: 24.3 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

exajobs/ci-cd-collection

An ongoing curated list of awesome frameworks, important books, articles, talks, libraries, learning tutorials, best practices and technical resources about Continuous Integration & Continuous Delivery.

Size: 461 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

LQNew/ValueEstimationRL

Compute Q-Value Estimation for RL in MuJoCo environment.

Language: Python - Size: 46.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

BY571/TD3-and-Extensions

PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradient (TD3), including additional extensions to improve the algorithm's performance.

Language: Python - Size: 1.73 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

tim-vorona/Wolf-Rabbit

The tracking problem: competition between two recurrent neural networks at the sphere.

Language: Python - Size: 1000 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

surajitsaikia27/DRL_Continiuos_Control

Deep Deterministic Policy Gradients (DDPG) for controlling a robotic hand to grasp a ball

Language: Jupyter Notebook - Size: 83 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

ZSoumia/Continous-control-Agent

This project was the second project of my deep reinforcement learning nanodegree

Language: HTML - Size: 6.55 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

silviomori/udacity-deep-reinforcement-learning-p2-continuous-control

Create and train a double-jointed arm agent that is able to maintain its hand in contact with a moving target

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 2

MasterYexl/Mick

Continuous event handling: Dealing with the Continuous click Event

Language: JavaScript - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

marcelloaborges/Reacher-Continuous-Control

Udacity Deep Reinforcement Learning Nanodegree Program - Continuous Control

Language: ASP - Size: 18 MB - Last synced at: 10 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 5

legalaspro/rl-odyssey

RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.

Language: Jupyter Notebook - Size: 66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

aidanscannell/dcmpc

Official PyTorch implementation of "DC-MPC: Discrete Codebook Model Predictive Control"

Language: Jupyter Notebook - Size: 30.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

KitStandart/rl_lib

A research library for reinforcement learning.

Language: Python - Size: 502 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Payam-Mousavi/RL-Continuous-Control

Using RL to control a double-jointed robot to reach target locations

Language: ASP - Size: 87.7 MB - Last synced at: about 11 hours ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_frwr_example

Example FRWR (PDDM) implementation with ReLAx

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_cem_example

Example CEM implementation with ReLAx

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_random_shooting_example

Example Random Shooting implementation with ReLAx

Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_ppo_example

Example PPO implementation with ReLAx

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_trpo_example

Example TRPO implementation with ReLAx

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_a2c_example

Example A2C implementation with ReLAx

Language: Jupyter Notebook - Size: 331 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_vpg_example

Example VPG implementation with ReLAx

Language: Jupyter Notebook - Size: 384 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_dyna_q_example

Example DYNA-Q implementation with ReLAx

Language: Jupyter Notebook - Size: 93.5 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/nstep_td3

Multistep TD3 for locomotion

Language: Jupyter Notebook - Size: 42.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

Language: Jupyter Notebook - Size: 212 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

harwiltz/sac

Simple implementation of SAC with PyTorch.

Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

kHarshit/udacity-drlnd-projects 📦

Udacity's DRLND projects

Language: Jupyter Notebook - Size: 7.44 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

aaronsnoswell/line-world

A simple multi-modal continuous control RL environment

Language: Python - Size: 82 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

being-aerys/Reinforcement-Learning-Self-Projects

Fun with Reinforcement Learning in my spare time

Language: Jupyter Notebook - Size: 2.77 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

phate09/drl_continuous_control

Udacity deep reinforcement learning continuous control project

Language: Python - Size: 3.27 MB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

SIakovlev/Continuous-Control

Language: Jupyter Notebook - Size: 5.76 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

nithindd/ContinuousControl

Continuous Control using DDPG Algorithm

Language: Jupyter Notebook - Size: 43.7 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

marioyc/learning-to-run

Learning to Run NIPS 2017 Competition

Language: Python - Size: 65.4 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

TomorrowIsAnOtherDay/Deep-RL-Collection

Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning 56 deep-reinforcement-learning 32 pytorch 24 ddpg 18 reinforcement-learning-algorithms 18 deep-learning 17 mujoco 15 policy-gradient 14 ppo 13 actor-critic 10 proximal-policy-optimization 10 machine-learning 9 model-based-reinforcement-learning 8 openai-gym 8 tensorflow 8 trpo 8 td3 8 gym 7 sac 6 udacity-nanodegree 6 ddpg-algorithm 6 trust-region-policy-optimization 5 advantage-actor-critic 4 atari 4 discrete-control 4 soft-actor-critic 4 dqn 4 navigation 4 model-predictive-control 4 gae 4 unity-environment 4 roboschool 3 generalized-advantage-estimation 3 deep-deterministic-policy-gradient 3 model-free-rl 3 cpp 3 python3 3 udacity 3 a3c 3 a2c 3 fire 2 drone 2 rdpg 2 simulation 2 reinforce 2 continuous-deployment 2 webots 2 python 2 actor-critic-methods 2 model-based-rl 2 model-based-acceleration 2 cross-entropy-method 2 reinforcement-learning-agent 2 benchmark 2 q-learning 2 robotics 2 tinyrl 2 prioritized-experience-replay 2 n-step-bootstrapping 2 d4pg 2 dm-control 2 flax 2 jax 2 ddpg-agent 2 framework 1 hessian 1 kfac 1 kronecker-factored-approximation 1 natural-gradients 1 second-order 1 tictactoe 1 vanilla-policy-gradient 1 cem 1 stochastic-muzero 1 multi-task-reinforcement-learning 1 multi-task-rl 1 dyna-q 1 self-play 1 sampled-muzero 1 humanoid-robot 1 parallel-computing 1 muzero 1 libtorch 1 dqn-pytorch 1 naf 1 normalized-advantage-functions 1 windows 1 linux 1 distributed-systems 1 double-dqn 1 cntk 1 q-learning-algorithm 1 d2rl 1 munchausen-reinforcement-learning 1 nstep-bootstrapping 1 batch-reinforcement-learning 1 behavioral-cloning 1 offline-reinforcement-learning 1 acktr 1 reacher-environment 1