GitHub topics: stable-baselines3

Repositories

bcorfman/sb3-trial

Stable Baselines 3 Cartpole example configured with Rye as dependency manager.

Language: Makefile - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KohlerHECTOR/primenet

Predicting prime numbers as list of bits.

Language: Jupyter Notebook - Size: 120 KB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

donymorph/Dongfeng_competition

Building autonomous driving system by using CARLA simulator

Language: Python - Size: 294 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Our codebase trials provide an implementation of the Select and Trade paper, which proposes a new paradigm for pair trading using hierarchical reinforcement learning. It includes the code for the proposed method and experimental results on real-world stock data to demonstrate its effectiveness.

Language: Python - Size: 5.43 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 90 - Forks: 22

ScheiklP/sofa_zoo

Reinforcement learning scripts for sofa_env environments.

Language: Python - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 5

ivor4/multi_sb3 Fork of DLR-RM/stable-baselines3

Experimental version of Stable Baslines3 which expands SB3 2.2.1 to be able to define a multi algorithm training. Usage will be based on defer actions, observation space and rewards between its inner algorithms (PPO, DQN, SAC...). It is thought for projects which may rely on different strategies for different actions with a focused training

Size: 3.91 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

wayne-turner/load_balancer

Reinforcement learning for optimizing energy distribution, effective resource management through simulation.

Language: Python - Size: 4.28 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lie-flat/smart-car-auto-parking 📦

Monorepo for our CV&RL course project: self parking

Language: C - Size: 306 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

rChimisso/2WSI-RL

Study on the application of reinforcement learning to the management of a traffic light intersection.

Language: Python - Size: 81.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

KohlerHECTOR/TREX-Tree-Reward-EXploration

Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.

Language: Python - Size: 436 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

haldersourav/grid_navigation_RL

A set of simple reinforcement learning examples of navigating a 1D, 2D and 3D grid using Stable-Baselines3

Language: Jupyter Notebook - Size: 5.29 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

inakivazquez/docker_rl

Docker containers for Linux/Windows to experiment with Gymnasium and Stable-Baselines3

Language: Dockerfile - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Enkhai/lucy-utils

A set of utilities for the Rocket League bot Lucy, useful for building effective Reinforcement Learning agents for Rocket League, using RLGym

Language: Python - Size: 21.2 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

datennerd/super-mario-bros

🍄 Reinforcement Learning agent for Super Mario Bros

Language: Python - Size: 98.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

Nish-19/SB3-tutorial

Tutorial for using Stable Baselines 3 for creating custom policies

Language: Python - Size: 3.04 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

koshachya-myata/Data_Center_Simulation

Data Center Environment and Reinforcement Learning (RL) Control

Language: HTML - Size: 1.81 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

sukiboo/personalization_wain21

Source code for the numerical experiments presented in the paper "On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks".

Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Bonifatius94/robot-sf

training environment for the simulation of a robot moving in a pedestrian-filled space

Language: Python - Size: 4.4 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

ChernyakKonstantin/godot_gym_api

Godot Gym API is an Open Source framework for using Godot3 game engine as 3d-environment for training reinforcement learning agents implemented in Python on any data, including images and point clouds.

Language: GDScript - Size: 6.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 1

RsGoksel/Genetic-Algorithm-Solutions

A Guide to Problems and Solutions on Genetic Algorithms

Language: Jupyter Notebook - Size: 537 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

artemchege/Evolution

Prey and predator, cyber evolutionary

Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

RsGoksel/SnakeGame-with-GeneticAlgorithm

Using Neural Network and Genetic Algorithm to play SnakeGame

Language: Jupyter Notebook - Size: 41 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks

Implementing a new environment in Gymnasium for intelligent schduling

Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cisimon7/Behavioral-And-Cognitive-Robotics

My Introduction to Evolutionary and Reinforcement Learning

Language: Python - Size: 2.58 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sukiboo/policy_entropy

Analyzing policy entropy of reinforcement learning agents

Language: Python - Size: 21.5 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

TommasoVandermeer/Hospitalbot-Path-Planning

This repository contains an application using ROS2 Humble, Gazebo, OpenAI Gym and Stable Baselines3 to train reinforcement learning agents for a path planning problem.

Language: Python - Size: 123 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 2

PrithviAbhishetty/rp3

Language: Python - Size: 8.07 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

sergeLabo/furuta

Pilotage d'un pendule de Furuta avec un Raspberry PI

Language: Python - Size: 84.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Jondiii/vrp-rl

Reinforcement Learning for VRP

Language: Python - Size: 4.37 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

USNavalResearchLaboratory/task-scheduling

Python package implementing task generators, traditional and ML-based scheduling algorithms, and assessment tools.

Language: Python - Size: 190 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

tehelka-gamedev/hide_and_seek

Hide and Seek AI test with Deep Q-Learning (stable baselines 3)

Language: Python - Size: 369 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

maximotus/asp23-karting-challenge-rl

This project aims to implement a reinfrcement learning agent using Proximal Policy Optimization (PPO). And given the Unity environment of the "Karting Microgame", it can be used to train a robust agent on multiple tracks which can compete against other implementations.

Language: ASP.NET - Size: 30.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

WilliamSMendes/EnergyResourceAgent

In this project I pass through the principles and concepts of Reinforced Learning and I trained an agent to manage the energy resources

Language: Jupyter Notebook - Size: 18 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

manish-nayak/pybullet-stable-baselines

PyBullet environments to use Reinforcement learning with Stable Baselines 3

Language: Python - Size: 32.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

rey-allan/chimp-ai-nzee

:monkey_face: Recreation of Theory of Mind experiment done on chimpanzees as an RL environment

Language: Python - Size: 330 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

astrologos/tradinggym

A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto.

Language: Python - Size: 14.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

GrahlmanMatthew/CartPole-Reinforcement-Learning

Uses the Stable Baselines 3 and OpenAI Python libraries to train models that attempt to solve the CartPole problem using 3 reinforcement learning algorithms; PPO (Proximal Policy Optimization), A2C (Advantage Actor Critic) and DQN (Deep Q Learning).

Language: Python - Size: 334 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

shashvatshah9/RL-GFootball Fork of aaravpandya/RL-GFootball

Distributed training for RL algo on pytorch

Language: Python - Size: 41 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

SwamiKannan/CartPole-using-Stable-Baselines

CartPole environment using Stable BaseLines library

Language: Jupyter Notebook - Size: 305 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ZikangXiong/MFNLC

[IROS 22'] Model-free Neural Lyapunov Control

Language: Python - Size: 121 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

NickKaparinos/OpenAI-Gym-Projects

OpenAI Gym environment solutions using Deep Reinforcement Learning.

Language: Python - Size: 47.3 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 28 - Forks: 7

glinerosuarez/regen

Regen is an end-to-end application that showcases how to train and deploy reinforcement learning trading agents

Language: Python - Size: 267 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 5

resuldagdanov/pettingzoo-environments

Simple Training and Evaluation of Multi-Agent Environments with Deep Reinforcement Algorithms

Language: Jupyter Notebook - Size: 22 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

koulakis/reacher-deep-reinforcement-learning

This is a solution for the second project of the Udacity deep reinforcement learning course. It includes code for training an agent and for using it in a simulation environment.

Language: Jupyter Notebook - Size: 226 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

Marc-Velay/hydra_optuna_tutorial

Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AhmadrezaHadi/deep-css

Resource Management in Cloud Servers Using end-to-end Deep Reinforcement Learning

Language: Python - Size: 58.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

marco-milanesi/lqr-ppo

Git repository for LQR and Reinforcement Learning labs. Code for modeling human movement and solving optimization using PPO algorithm.

Language: Jupyter Notebook - Size: 4.74 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

EwenCollin/BotMobile

Autonomous 1:10 race car with a reinforcement learning based approach

Language: C - Size: 220 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MichaelFish199/SonicTheHedgehog2-ReinforcmentLearning

This project implements an agent for playing the SonicTheHedgehog2 game from a ROM file using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/SuperMarioBros-ReinforcementLearning

This project implements an agent for playing the SuperMarioBros game using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.

Language: Jupyter Notebook - Size: 1.41 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/LunarLander-ReinforcementLearning

In this project, I created an agent using the PPO algorithm from stable baselines3 to complete a task in the LunarLander environment. The agent was trained using reinforcement learning techniques to maximize its performance in the task. The resulting model was able to achieve a high level of success in the LunarLander environment.

Language: Jupyter Notebook - Size: 199 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/CartPole-ReinforcementLearning

In this project, I created an agent to solve the CartPole task using the stablebaselines3 library. CartPole is a problem from the OpenAI Gym catalog, in which the goal is to maintain balance of a wooden pole using motors attached to its ends. The agent must decide whether to move the pole left or right to maintain balance.

Language: Jupyter Notebook - Size: 117 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/VizDoom-ReinforcementLearning

This project implements an agent for playing the VizDoom game on various levels using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.

Language: Jupyter Notebook - Size: 665 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/BipedalWalker-ReinforcementLearning

In this project I create agent for the BipedalWalker environment using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to navigate the BipedalWalker environment, which is a simulated robot with two legs.

Language: Jupyter Notebook - Size: 144 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MichaelFish199/FlappyBird-ReinforcmentLearning

This project implements an agent for playing the FlappyBird game in a browser using the Deep Q-Network (DQN) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to maximize the score.

Language: Jupyter Notebook - Size: 12.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0