Topic: "ppo-pytorch"
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python - Size: 12.1 MB - Last synced at: 25 days ago - Pushed at: 12 months ago - Stars: 2,047 - Forks: 378

Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Language: Python - Size: 3.37 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 656 - Forks: 124

taherfattahi/ppo-rocket-landing
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
Language: Python - Size: 675 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 226 - Forks: 50

reiniscimurs/DRL-robot-navigation-IR-SIM
Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.
Language: Python - Size: 12.8 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 174 - Forks: 20

CherryPieSexy/imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Language: Python - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 118 - Forks: 14

philtabor/ProtoRL
A Torch Based RL Framework for Rapid Prototyping of Research Papers
Language: Python - Size: 200 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 67 - Forks: 5

dvalenciar/ReinforceUI-Studio
ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC
Language: Python - Size: 21.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 65 - Forks: 3

paulchen2713/RIS-MISO-HWI-DRL
Implementation of the IEEE WCNC 2025 'Worst-Case MSE Minimization for RIS-Assisted mmWave MU-MISO Systems With Hardware Impairments and Imperfect CSI' paper
Language: Python - Size: 289 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 18 - Forks: 5

LittleWebCat/DRL-Base-EMS
DRL-Base-EMS for HEVs
Language: HTML - Size: 4.92 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 1

Solrikk/CriptoWhisper
TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.
Language: Python - Size: 4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 4

faildeny/Multi_Agent_PPO
Multi agent PPO implementation in Pytorch for Unity ML Agents environments.
Language: Python - Size: 3.48 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 2

davide97l/PPO-GAIL-cartpole
GAIL learning to imitate PPO playing CartPole.
Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 4

rvdweerd/simmodel
Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs
Language: Python - Size: 364 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

Git-123-Hub/reinforcement-learning-algorithm
implementation of reinforcement learning algorithm that is easy to read and understand
Language: Python - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

francofgp/Tic-Tac-Toe-Gym
This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 2

jatinarora2702/gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 4

SchweizerischeBundesbahnen/flatland-torchrl Fork of RoboEden/flatland-marl
An adaption of the Flatland environment for TorchRL.
Language: Python - Size: 148 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

houssameehsain/CutnFill_DeepRL
Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.
Language: Python - Size: 44.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

wegfawefgawefg/wegs-drl-baselines
Minimum viable reinforcement learning algorithms for your educational convenience.
Language: Python - Size: 1.45 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

imoneoi/xrl-ppo
Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving
Language: Python - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 1

Nikunj-Gupta/HAMMER
HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)
Language: Python - Size: 4.02 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

alirezakazemipour/Mario-PPO
Language: Python - Size: 696 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

c2d08y/LearningBot
A deep reinforcement learning Bot for https://kana.byha.top:444/
Language: Python - Size: 126 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Rashadows/DRL-for-ORAN-Resource-Allocation
Performance evaluation of several DRL algorithms in a discrete action-space for resource allocation in Open RAN
Language: Python - Size: 69.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

saqib1707/RL-PPO-PyTorch
Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch
Language: Python - Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

hhxc-0/RL_DaVinciCode
A reinforcement learning model for the Da Vinci code game
Language: Python - Size: 138 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

steph-koopmanschap/PyLife2
The Improved version of PyLife (now with AI)
Language: Python - Size: 115 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

alex-nooj/champion_league
Language: Python - Size: 406 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tomasspangelo/proximal-policy-optimization
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

leonjovanovic/drl-ppo-bipedal-walker
PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker- PPO
Language: Python - Size: 347 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

faildeny/PPO_pytorch_implementation
Proximal Policy Optimization method in Pytorch
Language: Python - Size: 2.9 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

anshdavid/pytorch-driving-torcs
self driving car using Torcs-1.3.7 simulator with server-patch
Language: Python - Size: 70.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

rshnn/battleship
Agent trained to play battleship using reinforcement learning (PPO) and openAI gym
Language: Jupyter Notebook - Size: 29.9 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

akashe/DeepReinforcementLearning
Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.
Language: Python - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

roeey777/Splendor-AI
AI agents for the boardgame Splendor
Language: Python - Size: 103 MB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0

JRaposo151/Evolutionary-Robotics-with-GE
Evolutionary Robotics with Gramatical Evolution for Dissertation
Language: Python - Size: 10.2 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

naidezhujimo/Proximal-Policy-Optimization-PPO-for-BipedalWalker-v3
his repository contains an implementation of the Proximal Policy Optimization (PPO) algorithm to solve the BipedalWalker-v3 environment from the Gymnasium library. This project uses a combination of policy and value networks to learn a policy for controlling a bipedal walker.
Language: Python - Size: 8.29 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

CAI23sbP/Hybrid-Action-PPO
Hybrid Action PPO in stable-baselines3
Language: Python - Size: 20.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mmbajo/AgentGrad
Collection of Reinforcement Learning Algorithm implementations.
Language: Python - Size: 294 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Achronus/rl_atari_games
An exploration of the effects of Intrinsic Motivation methods on RL algorithms using Atari games.
Language: Python - Size: 40.8 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ialexmp/DRL-Generalization
Exploring Generalization in Reinforcement Learning algorithms for different tasks using PPO, Gymnasium-Robotics and MuJoCo
Language: Python - Size: 78.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bantu-4879/Atari_Games-Deep_Reinforcement_Learning
This repository hosts Jupyter notebooks showcasing the training of Atari games using a variety of Deep Reinforcement Learning (RL) algorithms such as Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), Deep Q-Networks (DQN), Advantage Actor-Critic (A2C), and more.
Language: Jupyter Notebook - Size: 364 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Icyfiremario/PPO-Jumpstart
Basic PPO based AI template
Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DataRohit/AI-Mario-Game
This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.
Language: Jupyter Notebook - Size: 261 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nkoorty/rl_parking
Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.
Language: Python - Size: 2.67 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tedhuang96/ppo-pytorch
PPO in pytorch version.
Language: Python - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

dodoseung/ppo-proximal-policy-optimization-pytorch
The pytorch implementation of ppo
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

leonjovanovic/drl-ml-agents-3dball
PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball
Language: Python - Size: 836 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Nikunj-Gupta/IMRL Fork of lcswillems/rl-starter-files
Informationally Mosaic Reinforcement Learning (Preprint: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=nargncAAAAAJ&citation_for_view=nargncAAAAAJ:W7OEmFMy1HYC)
Language: Python - Size: 27.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

nkarasovd/HSE_Production_Stories
:ram: Materials and homework assignments for HSE production stories course
Language: Jupyter Notebook - Size: 47.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

jiseongHAN/reinforcement
My Little Reinforcement Learning
Language: Python - Size: 3.32 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

emanuelegreco29/Toy_Model_RL
Toy model implementing various architectures to teach a generic point of mass to reach a static and/or dynamic target in a 3D space.
Language: Python - Size: 438 KB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 0 - Forks: 0

nonkloq/mazeharvest
A Grid Based RL Environment & Implementaions of few Deep-RL Algorithms.
Language: Python - Size: 33.2 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

RuvenGuna94/Dialogue-Summary-remove-toxic-text-PPO
Fine-tuning FLAN-T5 with PPO and PEFT to generate less toxic text summaries. This notebook leverages Meta AI's hate speech reward model and utilizes RLHF techniques for improved safety.
Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mrunalmania/Transformer-based-Stock-Prediction
In this project I am focuses on determining the direction of trend for the specific stocks (e.g. MAMAA META, APPL, MSFT, AMZN, ALPHABET)
Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Ladun/PPO
Minimal implementation of Proximal Policy Optimization (PPO) in PyTorch
Language: Python - Size: 19.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

msmmb/noob_ppo
PPO for Bipedal Walker
Language: Python - Size: 1.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mominalix/Humanoid-Robot-Reinforcement-Learning-PPO
This repository contains a project that leverages reinforcement learning to make a humanoid robot walk in a PyBullet simulation. It uses a custom Gym environment, a Proximal Policy Optimization (PPO) agent, and a provided URDF file for the robot model. The training process prints rewards per generation and visualizes the robot's behavior.
Language: Python - Size: 7.74 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

asdfGuest/Simple-PPO
Simple implementation of PPO algorithm
Language: Jupyter Notebook - Size: 29.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jakemaz66/Reinforcement_Learning_Algorithms
Custom implementations of RL algorithms that can solve complex tasks like Atari games
Language: Python - Size: 215 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dfoshidero/RLModels-Doom_CartPole
Collection of Reinforcement Learning models for training in the VizDoom and CartPole environments. Developed by Group 2 for the University of Bath's CM50270 module, exploring advanced strategies for models navigating complex situations.
Language: Jupyter Notebook - Size: 504 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

EnriqManComp/smart-disks-PPO
This project aims to find a possible solution to a search problem in a given environment with two players using Proximal Policy Optimization as AI algorithm.
Language: Python - Size: 5.55 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

arya-ebrahimi/rl-playground
tabular and deep rl algorithms
Language: Jupyter Notebook - Size: 64.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GerTheMessiah/Snake-AI
Short own implementation of the game snake. In this project I'am using the ray library together with ray tune and a custom PPO model.
Language: Python - Size: 11.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

muno-video-conferencing/muno
Muno server for bandwidth estimation in video conferencing
Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bay3s/ppo-parallel
Parallelized implementation of Proximal Policy Optimization (PPO).
Language: Python - Size: 105 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

harikris001/Super-Mario-Reinforcement_Learning
Reinforcement Learning in Super Mario using Pytorch and PPO
Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

escribano89/unir_tfm_reinforcement_learning
Repositorio para el contenido relativo al trabajo de fin de máster desarrollado en el Máster de Inteligencia Artificial de la Universidad Internacional de La Rioja (UNIR).
Language: Jupyter Notebook - Size: 1.09 GB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

tohsin/Reinforcement_learning_projects
Here i write basic optimisation algorithm using rl algroithms on AI gyms
Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

iamvigneshwars/ai-walkers-ppo-pytorch
AI agent learns to walk, run, hop and crawl with out any given data using proximal policy optimisation.
Language: Python - Size: 152 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

chpi7/ppo
An easily understandable implementation of Proximal Policy Optimization with PyTorch
Language: Python - Size: 68.4 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KiUngSong/RL
Repository of Various Test & Implementation of RL
Language: Jupyter Notebook - Size: 8.73 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

nkarasovd/HSE_Reinforcement_Learning
:robot: Materials and homework assignments for HSE reinforcement learning course
Language: Python - Size: 915 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

sprakashdash/RL.Fun.Do
A repository for easy understanding of codes in Deep Reinforcement Learning
Language: Python - Size: 217 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0
