GitHub topics: stable-baselines
Bussler/RL_With_Delayed_Reward
Reinforcement Learning on a rust simulation that only gives delayed rewards.
Language: Python - Size: 3.54 MB - Last synced at: about 11 hours ago - Pushed at: about 17 hours ago - Stars: 4 - Forks: 0

DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python - Size: 3.82 MB - Last synced at: about 17 hours ago - Pushed at: 1 day ago - Stars: 2,497 - Forks: 559

DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python - Size: 4.7 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 11,188 - Forks: 1,891

Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Language: Python - Size: 1.44 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 622 - Forks: 207

Stable-Baselines-Team/stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language: Python - Size: 9.05 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 302 - Forks: 62

alpine-chamois/actor-critic
Deep Reinforcement Learning
Language: Python - Size: 1.33 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Stable-Baselines-Team/rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 228 - Forks: 41

DLR-RM/rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
Language: Python - Size: 1.35 GB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 128 - Forks: 27

araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Language: Jupyter Notebook - Size: 500 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 673 - Forks: 126

stefanbschneider/mobile-env
An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks.
Language: Python - Size: 3.33 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 129 - Forks: 29

araffin/rl-baselines-zoo 📦
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language: Python - Size: 367 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 1,175 - Forks: 209

araffin/learning-to-drive-in-5-minutes Fork of r7vme/learning-to-drive-in-a-day
Implementation of reinforcement learning approach to make a car learn to drive smoothly in minutes
Language: Python - Size: 14.2 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 287 - Forks: 83

rpatrik96/AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Language: Jupyter Notebook - Size: 9.37 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 27 - Forks: 7

HugoTini/GymGodot
Reinforcement learning with Godot
Language: GDScript - Size: 10.1 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 5

rpatrik96/pytorch-a2c
A well-documented A2C written in PyTorch
Language: Python - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 50 - Forks: 10

RealVNF/distributed-drl-coordination
Distributed Online Service Coordination Using Deep Reinforcement Learning
Language: Python - Size: 363 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 6

RsGoksel/Snake-Game_PPO-Solution
Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy to understand code.
Language: Jupyter Notebook - Size: 22.5 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

GioStamoulos/BTC_RL_Trading_Bot
A trading bitcoin agent was created with deep reinforcement learning implementations.
Language: Jupyter Notebook - Size: 53.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 27 - Forks: 6

Tqualizer/Retro-Street-Fighter-reinforcement-learning
Experiments with multiple reinforcement ML algorithms to learn how to beat Street Fighter II
Language: Python - Size: 2.65 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

Derek-TH-Wang/hybrid_gait
RL training for quadruped robot(mit minicheetah) various gaits in different velocity based on MPC controller.
Language: C++ - Size: 16.6 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 2

andrewjc/wolf3d-golang
This project combines the power of reinforcement learning with a custom raycasting game engine written in Golang, reminiscent of the classic Wolf3D game.
Language: Go - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

HumanCompatibleAI/better-adversarial-defenses
Training in bursts for defending against adversarial policies
Language: Python - Size: 47.9 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 3

HannesStark/gnn-reinforcement-learning
Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.
Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 17 - Forks: 5

sergeLabo/balance
Reinforcement learning
Language: Python - Size: 7.07 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KeremCikikci/Apple_Basket
-Reinforcement Learning- (stablebaselines & gym)
Language: Python - Size: 386 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

astrologos/tradinggym
A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto.
Language: Python - Size: 14.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

sadighian/recommendation-gym
MovieLens recommendation system using reinforcement learning (GYM + PPO)
Language: Python - Size: 53.7 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 39 - Forks: 9

hululuzhu/mario-rl
Super Mario Reinforcement Learning (RL) Training Colab notebook with Stable Baseline3 Library
Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 4

iKeepo/stable-baselines-zh
Stable Baselines官方文档中文版
Language: Python - Size: 4.96 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 77 - Forks: 10

7enTropy7/Racer_AI
Developed an highly customizable OpenAI gym environment and trained a stable_baselines3 PPO agent. Used the expert agent for Imitation Learning with DAgger
Language: Python - Size: 1.7 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

andriusbern/stadium
A graphical interface for reinforcement learning and gym-based environments.
Language: Python - Size: 17.8 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 3

CN-UPB/NFVdeep
NFVdeep: Deep Reinforcement Learning for Online Orchestration of Service Function Chains
Language: Python - Size: 420 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 13

prabhasak/masters-thesis
A basic implementation of imitation learning using Stable Baselines 2.10
Language: Python - Size: 58.7 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

manantomar/Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
Language: Python - Size: 44.9 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 3

CUN-bjy/policy-distillation-baselines
Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
Language: Python - Size: 959 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 3

francofgp/Tic-Tac-Toe-Gym
This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 2

Egiob/DiversityIsAllYouNeed-SB3 Fork of DLR-RM/stable-baselines3
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
Language: Python - Size: 162 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 1

wangcongrobot/gym-ur5_gripper
Language: Python - Size: 15.2 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 3

landonbutler/Learning-Connectivity
Novel learned communication protocol for coordinating data distribution within collaborative robot teams
Language: Python - Size: 174 MB - Last synced at: 9 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 2

PierreExeter/rl_reach
RL Reach is a platform for running reproducible reinforcement learning experiments.
Language: Python - Size: 83.1 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 8

SwamiKannan/Breakout-v0-using-Stable-Baselines
Solving the Atari Breakout environment using Stable Baselines
Language: Jupyter Notebook - Size: 161 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

NTUT-SELab/ConsoleGame-ActionMask.Gym
一些簡單的遊戲,用來驗證強化學習中使用動作遮罩的效果及影響
Language: Python - Size: 350 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

USC-CSCI527-Spring2021/VizDoom-Bot
Language: Python - Size: 32.3 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

kollabpr/Reinforcement-Learning-in-Super-Mario-Bros
This repository contains the Project for Machine Learning (CSC-736), done under the guidance of Dr. Siming Liu, at Missouri State University.
Language: Python - Size: 10.6 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

waqasisme/snake-game-rl
A snake game Box environment for Reinforcement Learning, both CNN and Grid based environments included.
Language: Python - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Derek-TH-Wang/OpenRoboRL
An open source robot reinforcement learing plantform using stable-baselines and OpenAI Gym
Language: Python - Size: 9.96 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

rd-tobias-sunderdiek/hyperparameter-tuning-demo
Demo for hyperparameter tuning with tune using an DQN example from Udacity for OpenAI gym
Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

lutakrystal305/Reinforcement_Learning
Language: Jupyter Notebook - Size: 185 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

unknown-yuser/ppo_super-mario-bros
Playing Super Mario Bros using PPO
Language: Jupyter Notebook - Size: 15.3 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Devzard/TicTacToe-AI-bot
A bot to play Tic-tac-toe using deep reinforcement learning. Additional RL agents and n-step lookahead agent.
Language: Jupyter Notebook - Size: 200 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

prabhasak/reproducibility
An implementation of reinforcement learning using Stable Baselines 2.10, with several features and focus on reproducing results
Language: Python - Size: 7.87 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

yashk2000/BolleRenverse
A few things I learned while learning about machine learning.
Language: Jupyter Notebook - Size: 469 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

manantomar/Multi-step-Greedy-Reinforcement-Learning-Algorithms
Multi-step Greedy Reinforcement Learning Algorithms
Language: Python - Size: 94.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

WPI-MMR/learning_experiments
Experiments to try and get the solo 8 up and running
Language: Jupyter Notebook - Size: 213 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

TheTrash/Thesis-Work
My final work for my degree in computer science
Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

felix0901/ReinforceLearning_SB3
Training Open AI Gym with Reinforcement Learning
Language: Python - Size: 31.3 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

nicholasRenninger/GAIL-Formal_Methods
A docker environment and notebooks to experiment with Generative Adversarial Imitation Learning and Formal Methods
Language: Python - Size: 214 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

AvisekNaug/CDC2020
Work for CDC2020
Language: Jupyter Notebook - Size: 63.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

AnthonyDickson/learning2write
Teaching a neural network how to write letters and digits with reinforcement learning.
Language: Python - Size: 1.25 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

nicholasRenninger/NeuralMooreMachine_Experiments
A docker environment and notebooks to experiment with the extraction of moore machines from RNN RL policies
Language: Jupyter Notebook - Size: 427 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

lukaskiss222/agarDQNbot
Trying to implement Agar bot with DQN
Language: Python - Size: 114 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0
