GitHub topics: stable-baselines3
bcorfman/sb3-trial
Stable Baselines 3 Cartpole example configured with Rye as dependency manager.
Language: Makefile - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KohlerHECTOR/primenet
Predicting prime numbers as list of bits.
Language: Jupyter Notebook - Size: 120 KB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

donymorph/Dongfeng_competition
Building autonomous driving system by using CARLA simulator
Language: Python - Size: 294 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

The-FinAI/trials
Our codebase trials provide an implementation of the Select and Trade paper, which proposes a new paradigm for pair trading using hierarchical reinforcement learning. It includes the code for the proposed method and experimental results on real-world stock data to demonstrate its effectiveness.
Language: Python - Size: 5.43 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 90 - Forks: 22

ScheiklP/sofa_zoo
Reinforcement learning scripts for sofa_env environments.
Language: Python - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 5

ivor4/multi_sb3 Fork of DLR-RM/stable-baselines3
Experimental version of Stable Baslines3 which expands SB3 2.2.1 to be able to define a multi algorithm training. Usage will be based on defer actions, observation space and rewards between its inner algorithms (PPO, DQN, SAC...). It is thought for projects which may rely on different strategies for different actions with a focused training
Size: 3.91 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

wayne-turner/load_balancer
Reinforcement learning for optimizing energy distribution, effective resource management through simulation.
Language: Python - Size: 4.28 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lie-flat/smart-car-auto-parking 📦
Monorepo for our CV&RL course project: self parking
Language: C - Size: 306 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

rChimisso/2WSI-RL
Study on the application of reinforcement learning to the management of a traffic light intersection.
Language: Python - Size: 81.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

KohlerHECTOR/TREX-Tree-Reward-EXploration
Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.
Language: Python - Size: 436 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

haldersourav/grid_navigation_RL
A set of simple reinforcement learning examples of navigating a 1D, 2D and 3D grid using Stable-Baselines3
Language: Jupyter Notebook - Size: 5.29 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

inakivazquez/docker_rl
Docker containers for Linux/Windows to experiment with Gymnasium and Stable-Baselines3
Language: Dockerfile - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Enkhai/lucy-utils
A set of utilities for the Rocket League bot Lucy, useful for building effective Reinforcement Learning agents for Rocket League, using RLGym
Language: Python - Size: 21.2 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

datennerd/super-mario-bros
🍄 Reinforcement Learning agent for Super Mario Bros
Language: Python - Size: 98.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

Nish-19/SB3-tutorial
Tutorial for using Stable Baselines 3 for creating custom policies
Language: Python - Size: 3.04 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

koshachya-myata/Data_Center_Simulation
Data Center Environment and Reinforcement Learning (RL) Control
Language: HTML - Size: 1.81 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

sukiboo/personalization_wain21
Source code for the numerical experiments presented in the paper "On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks".
Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Bonifatius94/robot-sf
training environment for the simulation of a robot moving in a pedestrian-filled space
Language: Python - Size: 4.4 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

ChernyakKonstantin/godot_gym_api
Godot Gym API is an Open Source framework for using Godot3 game engine as 3d-environment for training reinforcement learning agents implemented in Python on any data, including images and point clouds.
Language: GDScript - Size: 6.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 1

RsGoksel/Genetic-Algorithm-Solutions
A Guide to Problems and Solutions on Genetic Algorithms
Language: Jupyter Notebook - Size: 537 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

artemchege/Evolution
Prey and predator, cyber evolutionary
Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

RsGoksel/SnakeGame-with-GeneticAlgorithm
Using Neural Network and Genetic Algorithm to play SnakeGame
Language: Jupyter Notebook - Size: 41 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks
Implementing a new environment in Gymnasium for intelligent schduling
Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cisimon7/Behavioral-And-Cognitive-Robotics
My Introduction to Evolutionary and Reinforcement Learning
Language: Python - Size: 2.58 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sukiboo/policy_entropy
Analyzing policy entropy of reinforcement learning agents
Language: Python - Size: 21.5 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

TommasoVandermeer/Hospitalbot-Path-Planning
This repository contains an application using ROS2 Humble, Gazebo, OpenAI Gym and Stable Baselines3 to train reinforcement learning agents for a path planning problem.
Language: Python - Size: 123 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 2

PrithviAbhishetty/rp3
Language: Python - Size: 8.07 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

sergeLabo/furuta
Pilotage d'un pendule de Furuta avec un Raspberry PI
Language: Python - Size: 84.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Jondiii/vrp-rl
Reinforcement Learning for VRP
Language: Python - Size: 4.37 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

USNavalResearchLaboratory/task-scheduling
Python package implementing task generators, traditional and ML-based scheduling algorithms, and assessment tools.
Language: Python - Size: 190 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

tehelka-gamedev/hide_and_seek
Hide and Seek AI test with Deep Q-Learning (stable baselines 3)
Language: Python - Size: 369 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

maximotus/asp23-karting-challenge-rl
This project aims to implement a reinfrcement learning agent using Proximal Policy Optimization (PPO). And given the Unity environment of the "Karting Microgame", it can be used to train a robust agent on multiple tracks which can compete against other implementations.
Language: ASP.NET - Size: 30.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

WilliamSMendes/EnergyResourceAgent
In this project I pass through the principles and concepts of Reinforced Learning and I trained an agent to manage the energy resources
Language: Jupyter Notebook - Size: 18 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

manish-nayak/pybullet-stable-baselines
PyBullet environments to use Reinforcement learning with Stable Baselines 3
Language: Python - Size: 32.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

rey-allan/chimp-ai-nzee
:monkey_face: Recreation of Theory of Mind experiment done on chimpanzees as an RL environment
Language: Python - Size: 330 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

astrologos/tradinggym
A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto.
Language: Python - Size: 14.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

GrahlmanMatthew/CartPole-Reinforcement-Learning
Uses the Stable Baselines 3 and OpenAI Python libraries to train models that attempt to solve the CartPole problem using 3 reinforcement learning algorithms; PPO (Proximal Policy Optimization), A2C (Advantage Actor Critic) and DQN (Deep Q Learning).
Language: Python - Size: 334 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

shashvatshah9/RL-GFootball Fork of aaravpandya/RL-GFootball
Distributed training for RL algo on pytorch
Language: Python - Size: 41 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

SwamiKannan/CartPole-using-Stable-Baselines
CartPole environment using Stable BaseLines library
Language: Jupyter Notebook - Size: 305 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ZikangXiong/MFNLC
[IROS 22'] Model-free Neural Lyapunov Control
Language: Python - Size: 121 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

NickKaparinos/OpenAI-Gym-Projects
OpenAI Gym environment solutions using Deep Reinforcement Learning.
Language: Python - Size: 47.3 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 28 - Forks: 7

glinerosuarez/regen
Regen is an end-to-end application that showcases how to train and deploy reinforcement learning trading agents
Language: Python - Size: 267 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 5

resuldagdanov/pettingzoo-environments
Simple Training and Evaluation of Multi-Agent Environments with Deep Reinforcement Algorithms
Language: Jupyter Notebook - Size: 22 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

koulakis/reacher-deep-reinforcement-learning
This is a solution for the second project of the Udacity deep reinforcement learning course. It includes code for training an agent and for using it in a simulation environment.
Language: Jupyter Notebook - Size: 226 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

Marc-Velay/hydra_optuna_tutorial
Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AhmadrezaHadi/deep-css
Resource Management in Cloud Servers Using end-to-end Deep Reinforcement Learning
Language: Python - Size: 58.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

marco-milanesi/lqr-ppo
Git repository for LQR and Reinforcement Learning labs. Code for modeling human movement and solving optimization using PPO algorithm.
Language: Jupyter Notebook - Size: 4.74 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

EwenCollin/BotMobile
Autonomous 1:10 race car with a reinforcement learning based approach
Language: C - Size: 220 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MichaelFish199/SonicTheHedgehog2-ReinforcmentLearning
This project implements an agent for playing the SonicTheHedgehog2 game from a ROM file using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/SuperMarioBros-ReinforcementLearning
This project implements an agent for playing the SuperMarioBros game using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.
Language: Jupyter Notebook - Size: 1.41 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/LunarLander-ReinforcementLearning
In this project, I created an agent using the PPO algorithm from stable baselines3 to complete a task in the LunarLander environment. The agent was trained using reinforcement learning techniques to maximize its performance in the task. The resulting model was able to achieve a high level of success in the LunarLander environment.
Language: Jupyter Notebook - Size: 199 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/CartPole-ReinforcementLearning
In this project, I created an agent to solve the CartPole task using the stablebaselines3 library. CartPole is a problem from the OpenAI Gym catalog, in which the goal is to maintain balance of a wooden pole using motors attached to its ends. The agent must decide whether to move the pole left or right to maintain balance.
Language: Jupyter Notebook - Size: 117 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/VizDoom-ReinforcementLearning
This project implements an agent for playing the VizDoom game on various levels using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.
Language: Jupyter Notebook - Size: 665 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/BipedalWalker-ReinforcementLearning
In this project I create agent for the BipedalWalker environment using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to navigate the BipedalWalker environment, which is a simulated robot with two legs.
Language: Jupyter Notebook - Size: 144 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/FlappyBird-ReinforcmentLearning
This project implements an agent for playing the FlappyBird game in a browser using the Deep Q-Network (DQN) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to maximize the score.
Language: Jupyter Notebook - Size: 12.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nhieubrian/AirSim-Demo
Up to date progress of Microsoft AirSim from my academic internship. This is a guide on how to get Stable Baseline's reinforcement learning to work alongside the AirSim package with Unreal Engine 4.
Size: 780 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

LukasDrews97/flappy-bird-reinforcement-learning Fork of doravalerie/flappy-bird-gym
Training a Reinforcement Learning Agent to Play Flappy Bird
Language: Python - Size: 64.9 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

TepidJesus/SC2-ProtossBot
[WIP] - My first attempt at a Star Craft 2 bot using Stable baselines 3.
Language: Python - Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Credioni/ML200.ristinolla
Custom TicTacToe openai-gym enviroment for PPO RL-agent
Language: Jupyter Notebook - Size: 548 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

lukefisher03/tic-tac-toe-gym-env
A custom OpenAI gym environment for training Tic-Tac-Toe agents with Stable-Baselines3
Language: Python - Size: 1.46 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

AAnirudh07/stable-baselines
Implementation of RL algorithms using the stable baselines library
Language: Python - Size: 8.17 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Enkhai/rlgym-examples
Training examples for the Rocket League Gym using Stable-Baselines3
Language: Python - Size: 27.3 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

nerdinand/shooty-game
Game/Simulation which is a 2D-analog of Counter Strike as a target for Deep Reinforcement Learning.
Language: Python - Size: 614 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

arora-aditya/wordle-rl
reinforcement learning with wordle and open-ai gym
Language: Python - Size: 24.1 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

unknown-yuser/SAC_KukaDiverseObject
Train Soft Actor-Critic model for KukaDiverseObject environment in pybullet
Language: Python - Size: 3.23 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mohith-sakthivel/sufficient-ppo
Clean and flexible implementation of PPO (built on top of stable-baselines3)
Language: Python - Size: 25.4 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0
