An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: policy-gradient

qlan3/Jaxplorer

Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.

Language: Python - Size: 91.8 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 0

nslyubaykin/relax_trpo_example

Example TRPO implementation with ReLAx

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_vpg_example

Example VPG implementation with ReLAx

Language: Jupyter Notebook - Size: 384 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_td3_example

Example TD3 implementation with ReLAx

Language: Jupyter Notebook - Size: 7.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_ppo_example

Example PPO implementation with ReLAx

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/ppo_with_dqn_critic

Training PPO with DQN as a critic

Language: Jupyter Notebook - Size: 16.9 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_a2c_example

Example A2C implementation with ReLAx

Language: Jupyter Notebook - Size: 331 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_sac_example

Example SAC implementation with ReLAx

Language: Jupyter Notebook - Size: 7.23 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

Language: Jupyter Notebook - Size: 212 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/rnns_for_pomdp

Recurrent Policies for Handling Partially Observable Environments

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

nslyubaykin/parallel_ppo

Speeding Up PPO with Parallel Sampling

Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nslyubaykin/relax_ddpg_example

Example DDPG implementation with ReLAx

Language: Jupyter Notebook - Size: 3.19 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/nstep_td3

Multistep TD3 for locomotion

Language: Jupyter Notebook - Size: 42.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

KJLdefeated/Trajectory-Transformer-for-Quatitative-Trading

NYCU Intro2AI Final Project

Language: Python - Size: 378 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 3

notjedi/tuxkart-ai

[WIP] RL agent for the SuperTuxKart game.

Language: Python - Size: 82.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

bit-baker/Actor-Critic

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jizekki/reinforcement-learning

My work during the TP sessions on reinforcement learning

Language: Python - Size: 11.7 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

chengxi600/RLStuff

A collection of reinforcement learning algorithm implementations

Language: Jupyter Notebook - Size: 926 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 54 - Forks: 35

oliverc1623/DRIVE-Sim

A PyTorch-based framework to conduct deep reinforcement learning research in multiple autonomous vehicle simulators

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

Allenpandas/Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

Size: 608 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 246 - Forks: 30

markhliu/AlphaGoSimplified

Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.

Language: Jupyter Notebook - Size: 27.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Scitator/rl-course-experiments

Language: Jupyter Notebook - Size: 2 MB - Last synced at: 1 day ago - Pushed at: almost 8 years ago - Stars: 77 - Forks: 23

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

destagia/pgfd

Policy Gradient reinforcement learning with Finite-Difference

Language: Python - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/policy-gradient

A policy gradient approach to a multi-armed bandit problem

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/multi-armed-bandit-with-policy-gradient

A multi armed bandit Reinforcement learning problem using Policy Gradient.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 0

victor-iyi/deep-RL

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a turely autonomous agent.

Language: Python - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

yaserkl/RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

Language: Python - Size: 4.17 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 761 - Forks: 161

waynemystir/deep-RL-bootcamp

My solutions to the labs from this bootcamp:

Language: Jupyter Notebook - Size: 110 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

falcondai/mnist

deep models for small image classification datasets

Language: Python - Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

paulorocosta/learning-2opt-drl

Learning 2-opt Heuristics for the TSP via Deep Reinforcement Learning

Language: Python - Size: 82.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 41 - Forks: 18

koulanurag/pfa

Policy Fusion Architecture (PFA): We investigate policy gradient approaches for reward decomposition in reinforcement Learning

Language: Python - Size: 482 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

abaheti95/LoL-RL

Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients

Language: Python - Size: 3.98 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 5

MehdiShahbazi/REINFORCE-Cart-Pole-Gymnasium

This repo implements the REINFORCE algorithm for solving the Cart Pole V1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1.

Language: Python - Size: 636 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

choo8/openai-cartpole

Experimentation with the Cartpole environment in OpenAI's gym

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

brandhaug/board-games-reinforcement-learning

Board games with Reinforcement Learning. Peg Solitaire with an Actor Critic agent. NIM, Ledge (aka Gold Rush), and Hex with a Monte Carlo Tree Search agent.

Language: Scala - Size: 153 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

BY571/Deep-Reinforcement-Learning-Algorithm-Collection

Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.

Language: Jupyter Notebook - Size: 25.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 68 - Forks: 14

georgesung/deep_rl_acrobot

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Language: Python - Size: 368 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 35 - Forks: 10

CS486-RL-Poker-Agent/bismuth

An RL agent using policy gradient to learn no-limit Texas hold'em.

Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shubham1965/Reinforcement-learning

Implementation of state-of-the-art reinforcement learning algorithms

Language: Jupyter Notebook - Size: 570 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-

Language: Jupyter Notebook - Size: 209 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

enfff/robot-learning-labs

Labs for the Robot Learning class, focusing on robotics and Reinforcement Learning. Each lab focuses on a different topic, had mandatory tasks and eventually extensions. All the results have been discussed in the reports.

Language: Python - Size: 0 Bytes - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

avoroshilov/rl-selfplay

Simple reinforcement learning framework for selfplay experiments

Language: Python - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

FumaNet/PolicyGradientWithNN

A commented, Colab-compatible version of an implementation of a Neural Network Policy to solve OpenAI gym's CartPole-v1 using Policy Gradient.

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

jviquerat/pbo

Policy-based optimization : single-step policy gradient seen as an evolution strategy

Language: Python - Size: 44.6 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 5

ShaharShc/DeepReinforcementLearningCourse

Ben Gurion University "Deep Reinforcement Learning (372.2.5910)" course assignments & solutions

Language: Python - Size: 3.65 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

spirosbax/HittingTheGym

Implementing reinforcement learning algorithms using TensorFlow and Keras in OpenAI Gym

Language: Python - Size: 463 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

MG2033/A2C

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Language: Python - Size: 886 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 181 - Forks: 37

Kytabyte/rl-playground

Implementation and experiments of reinforcement learning algorithms in CS885 @ UW

Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

jcwleo/Reinforcement_Learning

강화학습에 대한 기본적인 알고리즘 구현

Language: Python - Size: 5.56 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 115 - Forks: 32

philip-jordan/decentralized-byzantine-RL

Experiments code for AAMAS'24 paper on "Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence"

Language: Python - Size: 1.95 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

maik97/wacky-rl

Custom Reinforcement Learning Agents

Language: Python - Size: 232 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

navuboy/rl_gym

Solving several OpenAI Gym and custom gazebo environments using reinforcement learning techniques.

Language: Python - Size: 4.38 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 15

mtrazzi/spinning-up-a-Pong-AI-with-deep-RL

Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.

Language: Jupyter Notebook - Size: 4.41 MB - Last synced at: 28 days ago - Pushed at: over 6 years ago - Stars: 54 - Forks: 14

Panjete/mujocoagents

Exploring Imitation Learning (DAGGER), RL (Policy Gradients and Soft Actor-Critic) and Imitation-Seeded RL for training MuJoCo Environments in OpenAI's Gym

Language: Python - Size: 22.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

rajak7/RL_CVD

Reinforcement Learning Agent for Autonomous Synthesis of pure phase MoS2 with minimum defect conc. by CVD

Language: Jupyter Notebook - Size: 3.02 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

RedLeader962/LectureDirigeDRLimplementation

Directed reading on Deep Reinforcement Learning

Language: Python - Size: 62.4 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

johnlime/UnitNeurons

C++ neuron-based neural network library

Language: C++ - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 1

bay3s/ppo-parallel

Parallelized implementation of Proximal Policy Optimization (PPO).

Language: Python - Size: 105 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

lorenzomancini1/DeepRL

Implementations of some of the most well known Deep Reinforcement Learning algorithms

Language: Python - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dksifoua/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

mynkpl1998/upgraded-octo-lamp

An autonomous driving simulator for modelling Vehicle to Infrastructure (V2I) conditions.

Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

aryamaansaha/cvi-rl-games

This repository contains code for reinforcement learning algorithms in Pytorch.

Language: Jupyter Notebook - Size: 15 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

luke-davidson/ReinforcementLearning

Programming assignments completed for my Reinforcement Learning course: Topics include Bandit Algorithms, Dynamic Programming, policy iteration, Monte-Carlo methods, SARSA, Q-Learning, Dyna-Q/Dyna-Q+, gradient control methods, state aggregation methods, and Deep Q-Learning Networks (DQNs).

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

BardOfCodes/DRL_in_CV

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

Language: HTML - Size: 26.4 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 62 - Forks: 12

mew-two-github/CS6700-Project

Implementation of REINFORCE for open ai env acrobot, epsilon greedy Q-Learning for open ai env taxi & TD(0) for custom gameshow env KBC.

Language: Python - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

hvishal512/CS6700-Reinforcement-Learning

Artificial Intelligence series

Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 4

rafonsor/unRL

unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.

Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aayushg55/cs285_hw1_dagger

Deep RL Assignments

Language: Jupyter Notebook - Size: 81.7 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

germain-hug/Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Language: Python - Size: 2.84 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 496 - Forks: 148

Rintarooo/VRP_DRL_MHA

"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver

Language: Python - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 138 - Forks: 32

tsenghungchen/show-adapt-and-tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Language: Python - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 148 - Forks: 40

SlipknotTN/pytorch_carracing_rl

QLearning and Policy Gradient PyTorch implementation for OpenAI gym CarRacing

Language: Python - Size: 3.84 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

KitStandart/rl_lib

Исследовательская библиотека обучения с подкреплением.

Language: Python - Size: 502 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RITCHIEHuang/DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Language: Python - Size: 8.22 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 276 - Forks: 71

Phoenix-Shen/ReinforcementLearning

强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy basd)的代码,代码都经过调试并可以运行

Language: Python - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 8

omerbsezer/PolicyGradient_PongGame 📦

Pong Game problem solving using RL - Policy Gradient with OpenAI Gym Framework and Tensorflow

Language: Python - Size: 12.9 MB - Last synced at: 12 days ago - Pushed at: about 7 years ago - Stars: 14 - Forks: 1

cedrickchee/baselines Fork of openai/baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python - Size: 4.59 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

RayYoh/BasicRL

BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

Language: Python - Size: 3.28 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 0

Kyziridis/BipedalWalker-v2

Solving openAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning

Language: Python - Size: 2.43 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 26 - Forks: 7

hzm2016/RL_project

Compare different policy gradients algorithms in 7_Policy_gradient_softmax.

Language: Python - Size: 4.94 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

John-CYHui/Reinforcement-Learning-REINFORCE

This repo constains the implementation of REINFORCE and REINFORCE-Baseline algorithm on Mountain car problem.

Language: Python - Size: 2.77 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ITE-5th/image-captioning-gan

Language: Python - Size: 24.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 6

LQNew/LWDRLC

Lightweight deep RL Libraray for continuous control.

Language: Python - Size: 46 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 0

CynthiaKoopman/Reinforcement-Learning

Implementation of selected Reinforcement Learning Algorithms

Language: Jupyter Notebook - Size: 512 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

withai/Policy-Gradients-Contextual-Bandit-Problem

The contextual Bandit problem is the intermediate between Simple Bandit problem and the full RL problem. In this experiment we are going to find optimal policy to obtain maximum rewards.

Language: Jupyter Notebook - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 0

withai/Policy-Gradients-Full-RL-CartPole

This experiment learns the optimal policies by the method of Policy-Gradients in the Full Reinforcement Learning problem in the environment "CartPole" from OpenAI Gym.

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

withai/Policy-Gradients-Mulit-armed-Bandit-Problem

With the concept of Policy Gradients in Reinforcement Learning we are going find optimal policy for obtaining maximum reward in Multi-armed Bandit Problem

Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

CherryPieSexy/imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Language: Python - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 118 - Forks: 14

erfanMhi/Deep-Reinforcement-Learning-CS285-Pytorch

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Language: Python - Size: 34.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 124 - Forks: 11

pat-coady/trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 355 - Forks: 105

carlosgoe/snake-rl

A reinforcement learning implementation of the video game "Snake".

Language: Python - Size: 30.3 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

PieroMacaluso/Deep-RL-Autonomous-Systems

Designing a control system to exploit model-free deep reinforcement learning algorithms to solve a real-world autonomous driving task of a small robot.

Language: Python - Size: 319 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 5

sparisi/mips

Minimal Policy Search Toolbox

Language: MATLAB - Size: 5.95 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 22 - Forks: 9

JasonYao81000/MLDS2018SPRING

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Language: Python - Size: 653 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 136 - Forks: 46

holmen1/robots

Exploring reinforcement learning

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jw1401/PPO-Tensorflow-2.0

Proximal Policy Optimization with Tensorflow 2.0

Language: Python - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 27 - Forks: 8

sukiboo/policy_entropy

Analyzing policy entropy of reinforcement learning agents

Language: Python - Size: 21.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

yueying-teng/pong_with_policy_gradients

Language: Python - Size: 7.13 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

galinator9000/tf_A3C_BipedalWalker

BipedalWalker environment from gym, solved with Asynchronous Advantage Actor Critic algorithm using Tensorflow.

Language: Python - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

Related Keywords
policy-gradient 399 reinforcement-learning 302 deep-reinforcement-learning 110 pytorch 94 actor-critic 85 ppo 65 deep-learning 62 dqn 60 q-learning 55 reinforcement-learning-algorithms 54 tensorflow 52 openai-gym 49 ddpg 41 machine-learning 40 proximal-policy-optimization 35 reinforce 33 python 27 a2c 26 sarsa 22 a3c 22 rl 21 trpo 21 gym 19 monte-carlo 19 keras 19 td3 19 sac 17 artificial-intelligence 17 deep-q-network 16 neural-network 16 policy-iteration 15 python3 14 deep-q-learning 14 continuous-control 14 pong 13 mujoco 12 markov-decision-processes 12 cartpole 11 soft-actor-critic 11 actor-critic-algorithm 10 value-iteration 10 qlearning 10 tensorflow2 10 dynamic-programming 9 deep-deterministic-policy-gradient 9 temporal-differencing-learning 9 deep-neural-networks 9 double-dqn 9 sarsa-learning 8 cartpole-v0 8 ai 8 imitation-learning 8 pytorch-implementation 8 atari 8 advantage-actor-critic 8 gae 8 dueling-dqn 7 lstm 7 monte-carlo-tree-search 7 trust-region-policy-optimization 7 reinforcement-learning-environments 6 pytorch-rl 6 actor-critic-methods 6 drl 6 ppo-pytorch 6 multi-agent-reinforcement-learning 6 generalized-advantage-estimation 6 neural-networks 6 policy 6 gym-environment 6 openai 6 ddqn 6 robotics 6 reinforcement-learning-agent 5 policy-optimization 5 generative-adversarial-network 5 tutorial 5 discrete-control 5 gan 5 seq2seq 5 recurrent-neural-networks 5 temporal-difference 5 algorithms 5 hierarchical-reinforcement-learning 5 evolution-strategies 5 td-learning 5 policy-evaluation 5 model-based-rl 5 nlp 5 openai-gym-environments 5 jax 5 multiagent-reinforcement-learning 4 natural-language-processing 4 atari2600 4 deep-rl 4 rl-algorithms 4 grid-world 4 meta-learning 4 policy-based-method 4 reinforcement 4