GitHub topics: policy-gradient

Repositories

datawhalechina/easy-rl

A reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄). Read online at: https://datawhalechina.github.io/easy-rl/

Language: Jupyter Notebook - Size: 516 MB - Last synced at: about 16 hours ago - Pushed at: 8 days ago - Stars: 11,201 - Forks: 2,026

WorldEditor50/snakeAI

Testing MLP, DQN, PPO, SAC, and policy gradient with snakeAI

Language: C++ - Size: 3.34 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 4

Automated bot that utilizes gradient networks to optimize and enhance machine learning algorithms. Designed to streamline the model training process and improve accuracy of predictive analytics tasks.

Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 1

NegarMov/DRL_Algorithms

A collection of DRL algorithms on Gymnasium environments for hands-on learning and experimentation.

Language: Python - Size: 34.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

Language: Python - Size: 46.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8,449 - Forks: 1,150

Cybernetic1/reinforcement-learning-experiments

Reinforcement learning experiments with Tic Tac Toe, especially with "logical representations"

Language: Python - Size: 15.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 0

niupuhua1234/GFN-PG

Code for the ICML 2024 paper 'GFlowNet Training by Policy Gradients'

Language: Python - Size: 5.41 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

VachanVY/Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction" by Sutton and Barto, along with various RL research papers.

Language: Python - Size: 38.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 84 - Forks: 3

RainyEarth/Trade_Predictor_Project

An AI-powered trade prediction system using machine learning, technical analysis, and time series models. Built with FastAPI, React, and Tailwind CSS.

Language: Python - Size: 846 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3 - Forks: 0

EMI-Group/evorl

EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Reinforcement Learning (ERL), AutoRL, and seamless integration with GPU-optimized simulation environments.

Language: Python - Size: 2.55 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 75 - Forks: 8

sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python - Size: 42.1 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 4,267 - Forks: 876

nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 12.1 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 2,005 - Forks: 378

MorvanZhou/Reinforcement-learning-with-tensorflow

Simple reinforcement learning tutorials; 莫烦Python AI tutorials in Chinese

Language: Python - Size: 428 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 9,166 - Forks: 5,024

bmarroc/reinforcement-learning

Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow

Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 1

pyrddlgym-project/pyRDDLGym-jax

JAX compilation of RDDL description files, and a differentiable planner in JAX.

Language: Python - Size: 12.1 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 1

Khrylx/PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language: Python - Size: 30.5 MB - Last synced at: 10 days ago - Pushed at: about 4 years ago - Stars: 1,197 - Forks: 190

medipixel/rl_algorithms

Structural implementation of RL key algorithms

Language: Python - Size: 2.6 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 511 - Forks: 64

sshkhr/Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 25

qlan3/Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Language: Python - Size: 914 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 92 - Forks: 14

flint-xf-fan/Byzantine-Federated-RL

[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Policy Gradient with Byzantine Resilience framework for improving sample efficiency of RL agents.

Language: Python - Size: 446 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 92 - Forks: 12

benedekrozemberczki/awesome-monte-carlo-tree-search-papers

A curated list of Monte Carlo tree search papers with implementations.

Language: Python - Size: 238 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 674 - Forks: 74

navneet-nmk/pytorch-rl

This repository contains model-free deep reinforcement learning algorithms implemented in PyTorch

Language: Python - Size: 97.1 MB - Last synced at: 6 days ago - Pushed at: almost 6 years ago - Stars: 444 - Forks: 55

rlcode/reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Language: Python - Size: 60.2 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 3,506 - Forks: 738

gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

Language: Python - Size: 13.1 MB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 154 - Forks: 33

nikhilbarhate99/Actor-Critic-PyTorch

Policy Gradient Actor-Critic PyTorch | Lunar Lander v2

Language: Python - Size: 2.1 MB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 73 - Forks: 26

CodeName-Detective/Prompt-to-Song-Generation-using-Large-Language-Models

This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.

Language: Jupyter Notebook - Size: 57.6 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 13 - Forks: 0

MarcoMeter/episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

Language: Python - Size: 23.9 MB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 172 - Forks: 22

keon/policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Language: Python - Size: 3.54 MB - Last synced at: 29 days ago - Pushed at: over 5 years ago - Stars: 160 - Forks: 43

WilliamZhang20/AI-Moonlander

Using reinforcement learning to learn to land on the moon!

Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

zuoxingdong/lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook - Size: 95.9 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 375 - Forks: 30

kengz/openai_lab

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

Language: Python - Size: 8.47 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 326 - Forks: 68

sudharsan13296/Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 409 - Forks: 136

kengz/SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Language: Python - Size: 4.08 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,277 - Forks: 274

salesforce/MultiHopKG

Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout

Language: Jupyter Notebook - Size: 24.2 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 309 - Forks: 80

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

pythonlessons/Reinforcement_Learning

Reinforcement learning tutorials

Language: Python - Size: 87 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 374 - Forks: 155

ramanakshay/eligibility-traces

Continual credit-assignment for deep networks using eligibility traces

Language: Python - Size: 664 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rlcode/reinforcement-learning-kr

Examples for the book [Reinforcement Learning with Python and Keras]

Language: Python - Size: 140 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 380 - Forks: 232

NeymarL/Pacman-RL

Implement some reinforcement learning algorithms, test and visualize on Pacman.

Language: Python - Size: 7.26 MB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 2

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Language: Jupyter Notebook - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 751 - Forks: 174

kkm24132/ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

Language: Jupyter Notebook - Size: 7.55 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

bentrevett/pytorch-rl 📦

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Language: Jupyter Notebook - Size: 55.7 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 277 - Forks: 78

MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 138 - Forks: 18

Urinx/ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

Language: Python - Size: 118 MB - Last synced at: 30 days ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

activatedgeek/torchrl

Highly Modular and Scalable Reinforcement Learning

Language: Python - Size: 324 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 113 - Forks: 9

yukezhu/tensorflow-reinforce 📦

Implementations of Reinforcement Learning Models in Tensorflow

Language: Python - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 483 - Forks: 135

VinF/deer

DEEp Reinforcement learning framework

Language: Python - Size: 12.6 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 484 - Forks: 123

legalaspro/rl-odyssey

RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.

Language: Jupyter Notebook - Size: 66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sukhitashvili/pong

A reimplementation of Andrej Karpathy's repository for an RL self-learning AI agent that learns to play Pong through trial and error, using PyTorch

Language: Python - Size: 9.03 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mKabouri/RL-algorithms

Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

LiamConnell/deep-algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Language: Jupyter Notebook - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 236 - Forks: 75

Allenpandas/Tutorial4RL

Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial.

Size: 4.17 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 140 - Forks: 12

suragnair/seqGAN

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Language: Python - Size: 4.14 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 645 - Forks: 149

Wadaboa/cpr-appropriation

Solutions to the Harvest CPR appropriation problem with policy gradient methods and social learning, for Autonomous and Adaptive Systems class at UNIBO

Language: Python - Size: 24.5 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 4

keon/CodeGAN

[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks

Language: Python - Size: 5.62 MB - Last synced at: 29 days ago - Pushed at: over 8 years ago - Stars: 74 - Forks: 26

epignatelli/discovering-reinforcement-learning-algorithms

A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.

Language: Python - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 5

Kismuz/btgym

Scalable, event-driven, deep-learning-friendly backtesting library

Language: Python - Size: 124 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 990 - Forks: 260

shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects

Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡

Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

DeNA/HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Language: Python - Size: 612 KB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 287 - Forks: 43

QasimWani/policy-value-methods

Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.

Language: Python - Size: 389 MB - Last synced at: 28 days ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 5

jihoonerd/rl-maze

Simple maze solver by reinforcement learning

Language: Python - Size: 1.54 MB - Last synced at: 27 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

HuichuanLI/play_with_deep_reinforcement_learning

Having fun with deep reinforcement learning

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

deepbiolab/drl

Implementation of deep reinforcement learning

Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

MarioFiorino/Tutorial-Reinforcement-Learning-ITA-Python

This repository contains a collection of tutorials on the basics of Reinforcement Learning, developed in Python and written entirely in Italian.

Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 2

AgentMaker/Paddle-RLBooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Language: Python - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 110 - Forks: 13

simoninithomas/Policy_gradients_CartPole

A Policy Gradient Learning with CartPole-v0 for Siraj Raval's challenge

Language: Jupyter Notebook - Size: 518 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

nikhil-kotecha/Emotional_Dialogue

A Deep Reinforcement Learning Approach (LSTM + policy gradient) to create a chatbot that produces coherent, emotional dialogue.

Language: Python - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 27 - Forks: 11

SwamiKannan/Reinforcement-Learning-Specialization

Programming Assignments for Reinforcement Learning Specialization

Language: Jupyter Notebook - Size: 2 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

prakHr/Reinforcement-Learning-Book

[Book] Andrea Lonza, Reinforcement Learning Algorithms with Python: Learn, understand, and develop smart algorithms for addressing AI challenges. Packt Publishing (2019)

Language: Python - Size: 20.9 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 4

fukuta0614/chainer-SeqGAN

implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Language: Jupyter Notebook - Size: 2.09 MB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 33 - Forks: 14

mklblm/VU-Multi-agent-Systems

From-scratch implementations of Monte Carlo Tree Search and the Actor-Critic Advantage Framework for the Multi-Agent Systems course at Vrije Universiteit Amsterdam in 2024-2025

Language: Jupyter Notebook - Size: 9.09 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fzhu0628/Fast-FedPG---Towards-Fast-Rates-for-Federated-and-Multi-Task-Reinforcement-Learning

This work is a conference paper published at IEEE CDC 2024. The paper is dedicated to finding a policy that maximizes the average of long-term cumulative rewards across environments. Included in the repository are a brief introduction of our work, the poster for the AI Symposium at NCSU, and the slides for the CDC talk.

Size: 2.19 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

alok/rl_implementations

Language: Jupyter Notebook - Size: 792 KB - Last synced at: 4 days ago - Pushed at: almost 6 years ago - Stars: 43 - Forks: 5

hcnoh/rl-collection-pytorch

A collection of Reinforcement Learning implementations with PyTorch

Language: Python - Size: 5.84 MB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

goktug97/PEPG-ES

Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy

Language: Python - Size: 11.2 MB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 0

vrsreeganesh/StyleTransferApproachToAppearanceAgnosticAgents

A domain randomization approach to robust Sim2Real autonomous vehicles

Language: Python - Size: 32.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

keishihara/policy-gradients-pytorch

Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.

Language: Python - Size: 148 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Fer14/raice

Car racing RL agents in actual F1 tracks

Language: Jupyter Notebook - Size: 145 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 12 - Forks: 0

erickTornero/rl-baselines

Implementations of Reinforcement Learning algorithms as baselines in Pytorch

Language: Python - Size: 380 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

aminkhani/Deep-RL

You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.

Language: Jupyter Notebook - Size: 21.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 0

SapanaChaudhary/PyTorch-CPO

PyTorch implementation of Constrained Policy Optimization

Language: Python - Size: 8.91 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 49 - Forks: 10

IVproger/RL_ShakeGame_project

This project, developed as part of the Innopolis University's Reinforcement Learning course (2024), emulates the classic Snake game and applies 3-5 different RL algorithms to optimize the agent's performance.

Language: Jupyter Notebook - Size: 20.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

theamrzaki/text_summurization_abstractive_methods

Multiple implementations of abstractive text summarization, using Google Colab

Language: Jupyter Notebook - Size: 4.3 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 526 - Forks: 219

liziniu/ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Language: Python - Size: 1.76 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 151 - Forks: 13

ethanmclark1/rl_toolkit

Implementation of core reinforcement learning algorithms with PyTorch

Language: Python - Size: 233 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LucasWaelti/RL_Webots

Webots project to show how to use Deep Reinforcement Learning with Webots in C++.

Language: C++ - Size: 35.2 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 39 - Forks: 7

hartikainen/easy21 📦

Reinforcement learning agents and environment for Easy21, a modified version of Blackjack

Language: Python - Size: 2.77 MB - Last synced at: about 2 months ago - Pushed at: about 8 years ago - Stars: 14 - Forks: 3

hsvgbkhgbv/SQDDPG

This is a framework for research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".

Language: Python - Size: 142 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 113 - Forks: 42

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

goktug97/nes-torch

Minimal PyTorch Library for Natural Evolution Strategies

Language: Python - Size: 755 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 3

saqib1707/RL-PPO-PyTorch

Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 22.5 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

nabla0001/ram

Implementation of the location-guided deep recurrent attention model (LG-DRAM) I developed for my MSc thesis at UCL (2017)

Language: Python - Size: 17.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

tsangwpx/ml2048

Yet another 2048 in reinforcement learning

Language: Jupyter Notebook - Size: 5.52 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

bhanuvikasr/Deep-RL-TORCS

Autonomous Navigation using Deep Reinforcement Learning

Language: Python - Size: 14.4 MB - Last synced at: 26 days ago - Pushed at: almost 8 years ago - Stars: 24 - Forks: 12

fredrikmagnus/RL-for-Inverted-Pendulum

Reinforcement Learning for the inverted pendulum problem using a custom simulation. Implements and evaluates DQN, REINFORCE, and DDPG algorithms to learn control strategies for balancing a pendulum on a moving cart.

Language: Python - Size: 99.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

arnomoonens/yarll

Combining deep learning and reinforcement learning.

Language: Python - Size: 2.83 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 81 - Forks: 28

HridayM25/ReinforcementLearning

Some Reinforcement Learning algorithms that I implemented, following "Reinforcement Learning: An Introduction" by Richard Sutton and Andrew Barto.

Language: Jupyter Notebook - Size: 538 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

trunghng/deep_rl_zoo

Language: Python - Size: 55.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

hishamcse/DRL-Renegades-Game-Bots

A collection of my implemented RL agents for games like Pacman, Pong, SpaceInvaders, Frozenlake, Taxi, Pixelcopter, Pyramids and a lot more by implementing various DRL algorithms using gym, unity-ml, pygame, sb3, rl-zoo and pandagym libraries. To see more advanced & interesting agents, please visit below link:

Language: Jupyter Notebook - Size: 120 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

sheeerio/rl-implementation

PyTorch implementations of deep and classical RL algorithms

Language: Python - Size: 99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Related Keywords

policy-gradient (399), reinforcement-learning (302), deep-reinforcement-learning (110), pytorch (94), actor-critic (85), ppo (65), deep-learning (62), dqn (60), q-learning (55), reinforcement-learning-algorithms (54), tensorflow (52), openai-gym (49), ddpg (41), machine-learning (40), proximal-policy-optimization (35), reinforce (33), python (27), a2c (26), a3c (22), sarsa (22), trpo (21), rl (21), keras (19), td3 (19), gym (19), monte-carlo (19), sac (17), artificial-intelligence (17), neural-network (16), deep-q-network (16), policy-iteration (15), python3 (14), continuous-control (14), deep-q-learning (14), pong (13), mujoco (12), markov-decision-processes (12), soft-actor-critic (11), cartpole (11), value-iteration (10), qlearning (10), tensorflow2 (10), actor-critic-algorithm (10), dynamic-programming (9), double-dqn (9), deep-deterministic-policy-gradient (9), deep-neural-networks (9), temporal-differencing-learning (9), pytorch-implementation (8), imitation-learning (8), advantage-actor-critic (8), atari (8), ai (8), gae (8), cartpole-v0 (8), sarsa-learning (8), trust-region-policy-optimization (7), dueling-dqn (7), monte-carlo-tree-search (7), lstm (7), reinforcement-learning-environments (6), ppo-pytorch (6), neural-networks (6), actor-critic-methods (6), pytorch-rl (6), gym-environment (6), multi-agent-reinforcement-learning (6), generalized-advantage-estimation (6), openai (6), policy (6), ddqn (6), robotics (6), drl (6), evolution-strategies (5), tutorial (5), openai-gym-environments (5), recurrent-neural-networks (5), model-based-rl (5), gan (5), nlp (5), discrete-control (5), policy-optimization (5), algorithms (5), td-learning (5), reinforcement-learning-agent (5), temporal-difference (5), jax (5), policy-evaluation (5), seq2seq (5), hierarchical-reinforcement-learning (5), generative-adversarial-network (5), ddpg-algorithm (4), rl-algorithms (4), natural-language-processing (4), bandit-algorithms (4), monte-carlo-simulation (4), grid-world (4), convolutional-neural-networks (4), natural-policy-gradient (4), meta-learning (4)