GitHub topics: policy-gradient

Repositories

qlan3/Jaxplorer

Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.

Language: Python - Size: 91.8 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 0

nslyubaykin/relax_trpo_example

Example TRPO implementation with ReLAx

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_vpg_example

Example VPG implementation with ReLAx

Language: Jupyter Notebook - Size: 384 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_td3_example

Example TD3 implementation with ReLAx

Language: Jupyter Notebook - Size: 7.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_ppo_example

Example PPO implementation with ReLAx

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/ppo_with_dqn_critic

Training PPO with DQN as a critic

Language: Jupyter Notebook - Size: 16.9 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_a2c_example

Example A2C implementation with ReLAx

Language: Jupyter Notebook - Size: 331 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_sac_example

Example SAC implementation with ReLAx

Language: Jupyter Notebook - Size: 7.23 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

Language: Jupyter Notebook - Size: 212 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/rnns_for_pomdp

Recurrent Policies for Handling Partially Observable Environments

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

nslyubaykin/parallel_ppo

Speeding Up PPO with Parallel Sampling

Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nslyubaykin/relax_ddpg_example

Example DDPG implementation with ReLAx

Language: Jupyter Notebook - Size: 3.19 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/nstep_td3

Multistep TD3 for locomotion

Language: Jupyter Notebook - Size: 42.3 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

KJLdefeated/Trajectory-Transformer-for-Quatitative-Trading

NYCU Intro2AI Final Project

Language: Python - Size: 378 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 3

notjedi/tuxkart-ai

[WIP] RL agent for the SuperTuxKart game.

Language: Python - Size: 82.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

bit-baker/Actor-Critic

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jizekki/reinforcement-learning

My work during the TP sessions on reinforcement learning

Language: Python - Size: 11.7 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

chengxi600/RLStuff

A collection of reinforcement learning algorithm implementations

Language: Jupyter Notebook - Size: 926 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 54 - Forks: 35

oliverc1623/DRIVE-Sim

A PyTorch-based framework to conduct deep reinforcement learning research in multiple autonomous vehicle simulators

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

Allenpandas/Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL)，including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

Size: 608 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 246 - Forks: 30

Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.

Language: Jupyter Notebook - Size: 27.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Scitator/rl-course-experiments

Language: Jupyter Notebook - Size: 2 MB - Last synced at: 1 day ago - Pushed at: almost 8 years ago - Stars: 77 - Forks: 23

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

destagia/pgfd

Policy Gradient reinforcement learning with Finite-Difference

Language: Python - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/policy-gradient

A policy gradient approach to a multi-armed bandit problem

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/multi-armed-bandit-with-policy-gradient

A multi armed bandit Reinforcement learning problem using Policy Gradient.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 0

victor-iyi/deep-RL

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a turely autonomous agent.

Language: Python - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

yaserkl/RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

Language: Python - Size: 4.17 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 761 - Forks: 161

waynemystir/deep-RL-bootcamp

My solutions to the labs from this bootcamp:

Language: Jupyter Notebook - Size: 110 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

falcondai/mnist

deep models for small image classification datasets

Language: Python - Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

paulorocosta/learning-2opt-drl

Learning 2-opt Heuristics for the TSP via Deep Reinforcement Learning

Language: Python - Size: 82.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 41 - Forks: 18

koulanurag/pfa

Policy Fusion Architecture (PFA): We investigate policy gradient approaches for reward decomposition in reinforcement Learning

Language: Python - Size: 482 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

abaheti95/LoL-RL

Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients

Language: Python - Size: 3.98 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 5

MehdiShahbazi/REINFORCE-Cart-Pole-Gymnasium

This repo implements the REINFORCE algorithm for solving the Cart Pole V1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1.

Language: Python - Size: 636 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

choo8/openai-cartpole

Experimentation with the Cartpole environment in OpenAI's gym

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

brandhaug/board-games-reinforcement-learning

Board games with Reinforcement Learning. Peg Solitaire with an Actor Critic agent. NIM, Ledge (aka Gold Rush), and Hex with a Monte Carlo Tree Search agent.

Language: Scala - Size: 153 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

BY571/Deep-Reinforcement-Learning-Algorithm-Collection

Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.

Language: Jupyter Notebook - Size: 25.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 68 - Forks: 14

georgesung/deep_rl_acrobot

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Language: Python - Size: 368 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 35 - Forks: 10

CS486-RL-Poker-Agent/bismuth

An RL agent using policy gradient to learn no-limit Texas hold'em.

Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shubham1965/Reinforcement-learning

Implementation of state-of-the-art reinforcement learning algorithms

Language: Jupyter Notebook - Size: 570 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-

Language: Jupyter Notebook - Size: 209 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

enfff/robot-learning-labs

Labs for the Robot Learning class, focusing on robotics and Reinforcement Learning. Each lab focuses on a different topic, had mandatory tasks and eventually extensions. All the results have been discussed in the reports.

Language: Python - Size: 0 Bytes - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

avoroshilov/rl-selfplay

Simple reinforcement learning framework for selfplay experiments

Language: Python - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

FumaNet/PolicyGradientWithNN

A commented, Colab-compatible version of an implementation of a Neural Network Policy to solve OpenAI gym's CartPole-v1 using Policy Gradient.

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

jviquerat/pbo

Policy-based optimization : single-step policy gradient seen as an evolution strategy

Language: Python - Size: 44.6 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 5

ShaharShc/DeepReinforcementLearningCourse

Ben Gurion University "Deep Reinforcement Learning (372.2.5910)" course assignments & solutions

Language: Python - Size: 3.65 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

spirosbax/HittingTheGym

Implementing reinforcement learning algorithms using TensorFlow and Keras in OpenAI Gym

Language: Python - Size: 463 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

MG2033/A2C

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Language: Python - Size: 886 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 181 - Forks: 37

Kytabyte/rl-playground

Implementation and experiments of reinforcement learning algorithms in CS885 @ UW

Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

jcwleo/Reinforcement_Learning

강화학습에 대한 기본적인 알고리즘 구현

Language: Python - Size: 5.56 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 115 - Forks: 32

philip-jordan/decentralized-byzantine-RL

Experiments code for AAMAS'24 paper on "Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence"

Language: Python - Size: 1.95 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

maik97/wacky-rl

Custom Reinforcement Learning Agents

Language: Python - Size: 232 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

navuboy/rl_gym

Solving several OpenAI Gym and custom gazebo environments using reinforcement learning techniques.

Language: Python - Size: 4.38 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 15

mtrazzi/spinning-up-a-Pong-AI-with-deep-RL

Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.

Language: Jupyter Notebook - Size: 4.41 MB - Last synced at: 28 days ago - Pushed at: over 6 years ago - Stars: 54 - Forks: 14

Panjete/mujocoagents

Exploring Imitation Learning (DAGGER), RL (Policy Gradients and Soft Actor-Critic) and Imitation-Seeded RL for training MuJoCo Environments in OpenAI's Gym

Language: Python - Size: 22.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

rajak7/RL_CVD

Reinforcement Learning Agent for Autonomous Synthesis of pure phase MoS2 with minimum defect conc. by CVD

Language: Jupyter Notebook - Size: 3.02 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

RedLeader962/LectureDirigeDRLimplementation

Directed reading on Deep Reinforcement Learning

Language: Python - Size: 62.4 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

johnlime/UnitNeurons

C++ neuron-based neural network library

Language: C++ - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 1

bay3s/ppo-parallel

Parallelized implementation of Proximal Policy Optimization (PPO).

Language: Python - Size: 105 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

lorenzomancini1/DeepRL

Implementations of some of the most well known Deep Reinforcement Learning algorithms

Language: Python - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dksifoua/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

mynkpl1998/upgraded-octo-lamp

An autonomous driving simulator for modelling Vehicle to Infrastructure (V2I) conditions.

Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

aryamaansaha/cvi-rl-games

This repository contains code for reinforcement learning algorithms in Pytorch.

Language: Jupyter Notebook - Size: 15 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

luke-davidson/ReinforcementLearning

Programming assignments completed for my Reinforcement Learning course: Topics include Bandit Algorithms, Dynamic Programming, policy iteration, Monte-Carlo methods, SARSA, Q-Learning, Dyna-Q/Dyna-Q+, gradient control methods, state aggregation methods, and Deep Q-Learning Networks (DQNs).

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

BardOfCodes/DRL_in_CV

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

Language: HTML - Size: 26.4 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 62 - Forks: 12

mew-two-github/CS6700-Project

Implementation of REINFORCE for open ai env acrobot, epsilon greedy Q-Learning for open ai env taxi & TD(0) for custom gameshow env KBC.

Language: Python - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

hvishal512/CS6700-Reinforcement-Learning

Artificial Intelligence series

Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 4

rafonsor/unRL

unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.

Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aayushg55/cs285_hw1_dagger

Deep RL Assignments

Language: Jupyter Notebook - Size: 81.7 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

germain-hug/Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Language: Python - Size: 2.84 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 496 - Forks: 148

Rintarooo/VRP_DRL_MHA

"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver

Language: Python - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 138 - Forks: 32

tsenghungchen/show-adapt-and-tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Language: Python - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 148 - Forks: 40

SlipknotTN/pytorch_carracing_rl

QLearning and Policy Gradient PyTorch implementation for OpenAI gym CarRacing

Language: Python - Size: 3.84 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

KitStandart/rl_lib

Исследовательская библиотека обучения с подкреплением.

Language: Python - Size: 502 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RITCHIEHuang/DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Language: Python - Size: 8.22 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 276 - Forks: 71