GitHub topics: policy-gradient
datawhalechina/easy-rl
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Language: Jupyter Notebook - Size: 516 MB - Last synced at: about 16 hours ago - Pushed at: 8 days ago - Stars: 11,201 - Forks: 2,026

WorldEditor50/snakeAI
testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI
Language: C++ - Size: 3.34 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 4

23091v/Gradient-Network-Bot
Automated bot that utilizes gradient networks to optimize and enhance machine learning algorithms. Designed to streamline the model training process and improve accuracy of predictive analytics tasks.
Size: 5.86 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 1

NegarMov/DRL_Algorithms
A collection of DRL algorithms on Gymnasium environments for hands-on learning and experimentation.
Language: Python - Size: 34.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language: Python - Size: 46.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8,449 - Forks: 1,150

Cybernetic1/reinforcement-learning-experiments
Reinforcement learning experiments with Tic Tac Toe, especially with "logical representations"
Language: Python - Size: 15.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 0

niupuhua1234/GFN-PG
Code for the ICML 2024 paper 'GFlowNet Training by Policy Gradients'
Language: Python - Size: 5.41 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

VachanVY/Reinforcement-Learning
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.
Language: Python - Size: 38.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 84 - Forks: 3

RainyEarth/Trade_Predictor_Project
An AI-powered trade prediction system using machine learning, technical analysis, and time series models. Built with FastAPI, React, and Tailwind CSS.
Language: Python - Size: 846 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3 - Forks: 0

EMI-Group/evorl
EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learning (RL), Evolutionary Computation (EC), Evolution-guided Reinforcement Learning (ERL), AutoRL, and seamless integration with GPU-optimized simulation environments.
Language: Python - Size: 2.55 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 75 - Forks: 8

sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language: Python - Size: 42.1 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 4,267 - Forks: 876

nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python - Size: 12.1 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 2,005 - Forks: 378

MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Language: Python - Size: 428 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 9,166 - Forks: 5,024

bmarroc/reinforcement-learning
Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow
Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 1

pyrddlgym-project/pyRDDLGym-jax
JAX compilation of RDDL description files, and a differentiable planner in JAX.
Language: Python - Size: 12.1 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 1

Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python - Size: 30.5 MB - Last synced at: 10 days ago - Pushed at: about 4 years ago - Stars: 1,197 - Forks: 190

medipixel/rl_algorithms
Structural implementation of RL key algorithms
Language: Python - Size: 2.6 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 511 - Forks: 64

sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 25

qlan3/Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Language: Python - Size: 914 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 92 - Forks: 14

flint-xf-fan/Byzantine-Federated-RL
[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Policy Gradient with Byzantine Resilience framework for improving sample efficiency of RL agents.
Language: Python - Size: 446 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 92 - Forks: 12

benedekrozemberczki/awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
Language: Python - Size: 238 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 674 - Forks: 74

navneet-nmk/pytorch-rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Language: Python - Size: 97.1 MB - Last synced at: 6 days ago - Pushed at: almost 6 years ago - Stars: 444 - Forks: 55

rlcode/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
Language: Python - Size: 60.2 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 3,506 - Forks: 738

gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Language: Python - Size: 13.1 MB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 154 - Forks: 33

nikhilbarhate99/Actor-Critic-PyTorch
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
Language: Python - Size: 2.1 MB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 73 - Forks: 26

CodeName-Detective/Prompt-to-Song-Generation-using-Large-Language-Models
This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.
Language: Jupyter Notebook - Size: 57.6 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 13 - Forks: 0

MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
Language: Python - Size: 23.9 MB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 172 - Forks: 22

keon/policy-gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Language: Python - Size: 3.54 MB - Last synced at: 29 days ago - Pushed at: over 5 years ago - Stars: 160 - Forks: 43

WilliamZhang20/AI-Moonlander
Using reinforcement learning to learn to land on the moon!
Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

zuoxingdong/lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Language: Jupyter Notebook - Size: 95.9 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 375 - Forks: 30

kengz/openai_lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Language: Python - Size: 8.47 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 326 - Forks: 68

sudharsan13296/Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 409 - Forks: 136

kengz/SLM-Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Language: Python - Size: 4.08 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,277 - Forks: 274

salesforce/MultiHopKG
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Language: Jupyter Notebook - Size: 24.2 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 309 - Forks: 80

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

pythonlessons/Reinforcement_Learning
Reinforcement learning tutorials
Language: Python - Size: 87 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 374 - Forks: 155

ramanakshay/eligibility-traces
Continual credit-assignment for deep networks using eligibility traces
Language: Python - Size: 664 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rlcode/reinforcement-learning-kr
[파이썬과 케라스로 배우는 강화학습] 예제
Language: Python - Size: 140 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 380 - Forks: 232

NeymarL/Pacman-RL
Implement some reinforcement learning algorithms, test and visualize on Pacman.
Language: Python - Size: 7.26 MB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 2

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Language: Jupyter Notebook - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 751 - Forks: 174

kkm24132/ReinforcementLearning
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
Language: Jupyter Notebook - Size: 7.55 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

bentrevett/pytorch-rl 📦
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Language: Jupyter Notebook - Size: 55.7 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 277 - Forks: 78

MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 138 - Forks: 18

Urinx/ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
Language: Python - Size: 118 MB - Last synced at: 30 days ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

activatedgeek/torchrl
Highly Modular and Scalable Reinforcement Learning
Language: Python - Size: 324 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 113 - Forks: 9

yukezhu/tensorflow-reinforce 📦
Implementations of Reinforcement Learning Models in Tensorflow
Language: Python - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 483 - Forks: 135

VinF/deer
DEEp Reinforcement learning framework
Language: Python - Size: 12.6 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 484 - Forks: 123

legalaspro/rl-odyssey
RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.
Language: Jupyter Notebook - Size: 66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sukhitashvili/pong
A reimplementation of Andrej Karpathy's repository for an RL self-learning AI agent that learns to play Pong through trial and error, using PyTorch
Language: Python - Size: 9.03 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mKabouri/RL-algorithms
Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

LiamConnell/deep-algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Language: Jupyter Notebook - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 236 - Forks: 75

Allenpandas/Tutorial4RL
Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
Size: 4.17 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 140 - Forks: 12

suragnair/seqGAN
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Language: Python - Size: 4.14 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 645 - Forks: 149

Wadaboa/cpr-appropriation
Solutions to the Harvest CPR appropriation problem with policy gradient methods and social learning, for Autonomous and Adaptive Systems class at UNIBO
Language: Python - Size: 24.5 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 4

keon/CodeGAN
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks :octocat:
Language: Python - Size: 5.62 MB - Last synced at: 29 days ago - Pushed at: over 8 years ago - Stars: 74 - Forks: 26

epignatelli/discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.
Language: Python - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 5

Kismuz/btgym
Scalable, event-driven, deep-learning-friendly backtesting library
Language: Python - Size: 124 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 990 - Forks: 260

shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects
Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡
Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

DeNA/HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Language: Python - Size: 612 KB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 287 - Forks: 43

QasimWani/policy-value-methods
Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.
Language: Python - Size: 389 MB - Last synced at: 28 days ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 5

jihoonerd/rl-maze
Simple maze solver by reinforcement learning
Language: Python - Size: 1.54 MB - Last synced at: 27 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

HuichuanLI/play_with_deep_reinforcement_learning
玩转深度强化学习
Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

deepbiolab/drl
Implementation of deep reinforcement learning
Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

MarioFiorino/Tutorial-Reinforcement-Learning-ITA-Python
In questa repository una collezione di tutorial sulle basi del Reinforcement Learning, sviluppati in Python, interamente in italiano.
Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 2

AgentMaker/Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Language: Python - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 110 - Forks: 13

simoninithomas/Policy_gradients_CartPole
A Policy Gradient Learning with CartPole-v0 for Siraj Raval's challenge
Language: Jupyter Notebook - Size: 518 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

nikhil-kotecha/Emotional_Dialogue
A Deep Reinforcement Learning Approach (LSTM + policy gradient) to create a chatbot that produces coherent, emotional dialogue.
Language: Python - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 27 - Forks: 11

SwamiKannan/Reinforcement-Learning-Specialization
Programming Assignments for Reinforcement Learning Specialization
Language: Jupyter Notebook - Size: 2 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

prakHr/Reinforcement-Learning-Book
[Book] :- Andrea Lonza - Reinforcement Learning Algorithms with Python_ Learn, understand, and develop smart algorithms for addressing AI challenges-Packt Publishing (2019)
Language: Python - Size: 20.9 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 4

fukuta0614/chainer-SeqGAN
implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Language: Jupyter Notebook - Size: 2.09 MB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 33 - Forks: 14

mklblm/VU-Multi-agent-Systems
From-scratch implementations of Monte Carlo Tree Search and the Actor-Critic Advantage Framework for the Multi-Agent Systems course at Vrije Universiteit Amsterdam in 2024-2025
Language: Jupyter Notebook - Size: 9.09 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fzhu0628/Fast-FedPG---Towards-Fast-Rates-for-Federated-and-Multi-Task-Reinforcement-Learning
This work is a conference paper published at IEEE CDC 2024. The paper is dedicated to finding a policy that maximizes the average of long-term cumulative rewards across environments. Included in the repository are a brief introduction of our work, the poster for the AI Symposium at NCSU, and the slides for the CDC talk.
Size: 2.19 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

alok/rl_implementations
Language: Jupyter Notebook - Size: 792 KB - Last synced at: 4 days ago - Pushed at: almost 6 years ago - Stars: 43 - Forks: 5

hcnoh/rl-collection-pytorch
A collection of Reinforcement Learning implementations with PyTorch
Language: Python - Size: 5.84 MB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

goktug97/PEPG-ES
Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy
Language: Python - Size: 11.2 MB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 0

vrsreeganesh/StyleTransferApproachToAppearanceAgnosticAgents
A domain randomization approach to robust Sim2Real autonomous vehicles
Language: Python - Size: 32.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

keishihara/policy-gradients-pytorch
Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.
Language: Python - Size: 148 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Fer14/raice
Car racing RL agents in actual F1 tracks
Language: Jupyter Notebook - Size: 145 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 12 - Forks: 0

erickTornero/rl-baselines
Implementations of Reinforcement Learning algorithms as baselines in Pytorch
Language: Python - Size: 380 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

aminkhani/Deep-RL
You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.
Language: Jupyter Notebook - Size: 21.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 0

SapanaChaudhary/PyTorch-CPO
PyTorch implementation of Constrained Policy Optimization
Language: Python - Size: 8.91 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 49 - Forks: 10

IVproger/RL_ShakeGame_project
This project, developed as part of the Innopolis University's Reinforcement Learning course (2024), emulates the classic Snake game and applies 3-5 different RL algorithms to optimize the agent's performance.
Language: Jupyter Notebook - Size: 20.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

theamrzaki/text_summurization_abstractive_methods
Multiple implementations for abstractive text summurization , using google colab
Language: Jupyter Notebook - Size: 4.3 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 526 - Forks: 219

liziniu/ReMax
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
Language: Python - Size: 1.76 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 151 - Forks: 13

ethanmclark1/rl_toolkit
Implementation of core reinforcement learning algorithms with PyTorch
Language: Python - Size: 233 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LucasWaelti/RL_Webots
Webots project to show how to use Deep Reinforcement Learning with Webots in C++.
Language: C++ - Size: 35.2 KB - Last synced at: 6 months ago - Pushed at: about 5 years ago - Stars: 39 - Forks: 7

hartikainen/easy21 📦
Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
Language: Python - Size: 2.77 MB - Last synced at: about 2 months ago - Pushed at: about 8 years ago - Stars: 14 - Forks: 3

hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Language: Python - Size: 142 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 113 - Forks: 42

callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

goktug97/nes-torch
Minimal PyTorch Library for Natural Evolution Strategies
Language: Python - Size: 755 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 3

saqib1707/RL-PPO-PyTorch
Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch
Language: Python - Size: 22.5 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

nabla0001/ram
Implementation of the location-guided deep recurrent attention model (LG-DRAM) I developed for my MSc thesis at UCL (2017)
Language: Python - Size: 17.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

tsangwpx/ml2048
Yet another 2048 in reinforcement learning
Language: Jupyter Notebook - Size: 5.52 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

bhanuvikasr/Deep-RL-TORCS
Autonomous Navigation using Deep Reinforcement Learning
Language: Python - Size: 14.4 MB - Last synced at: 26 days ago - Pushed at: almost 8 years ago - Stars: 24 - Forks: 12

fredrikmagnus/RL-for-Inverted-Pendulum
Reinforcement Learning for the inverted pendulum problem using a custom simulation. Implements and evaluates DQN, REINFORCE, and DDPG algorithms to learn control strategies for balancing a pendulum on a moving cart.
Language: Python - Size: 99.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

arnomoonens/yarll
Combining deep learning and reinforcement learning.
Language: Python - Size: 2.83 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 81 - Forks: 28

HridayM25/ReinforcementLearning
Some algorithms of Reinforcement Learning implemented by me, in accordance to "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.
Language: Jupyter Notebook - Size: 538 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

trunghng/deep_rl_zoo
Language: Python - Size: 55.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

hishamcse/DRL-Renegades-Game-Bots
A collection of my implemented RL agents for games like Pacman, Pong, SpaceInvaders, Frozenlake, Taxi, Pixelcopter, Pyramids and a lot more by implementing various DRL algorithms using gym, unity-ml, pygame, sb3, rl-zoo and pandagym libraries. To see more advanced & interesting agents, please visit below link:
Language: Jupyter Notebook - Size: 120 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

sheeerio/rl-implementation
PyTorch implementations for deep and classical rl algorithms
Language: Python - Size: 99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0
