ppo-pytorch | Topic | Ecosyste.ms: Repos

Topic: "ppo-pytorch"

nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 12.1 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2,047 - Forks: 378

Lizhi-sjtu/DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Language: Python - Size: 3.37 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 656 - Forks: 124

taherfattahi/ppo-rocket-landing

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

Language: Python - Size: 675 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 226 - Forks: 50

reiniscimurs/DRL-robot-navigation-IR-SIM

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

Language: Python - Size: 12.8 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 176 - Forks: 20

CherryPieSexy/imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Language: Python - Size: 34.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 118 - Forks: 14

philtabor/ProtoRL

A Torch Based RL Framework for Rapid Prototyping of Research Papers

Language: Python - Size: 208 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 68 - Forks: 5

dvalenciar/ReinforceUI-Studio

ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC

Language: Python - Size: 21.2 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 65 - Forks: 3

paulchen2713/RIS-MISO-HWI-DRL

Implementation of the IEEE WCNC 2025 'Worst-Case MSE Minimization for RIS-Assisted mmWave MU-MISO Systems With Hardware Impairments and Imperfect CSI' paper

Language: Python - Size: 939 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 18 - Forks: 5

LittleWebCat/DRL-Base-EMS

DRL-Base-EMS for HEVs

Language: HTML - Size: 4.92 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 1

Solrikk/CriptoWhisper

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

Language: Python - Size: 4 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 14 - Forks: 4

faildeny/Multi_Agent_PPO

Multi agent PPO implementation in Pytorch for Unity ML Agents environments.

Language: Python - Size: 3.48 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 2

davide97l/PPO-GAIL-cartpole

GAIL learning to imitate PPO playing CartPole.

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 4

rvdweerd/simmodel

Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs

Language: Python - Size: 364 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 1

Git-123-Hub/reinforcement-learning-algorithm

implementation of reinforcement learning algorithm that is easy to read and understand

Language: Python - Size: 7.13 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

francofgp/Tic-Tac-Toe-Gym

This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning

Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 2

jatinarora2702/gail-pytorch

PyTorch implementation of GAIL and PPO reinforcement learning algorithms

Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 4

SchweizerischeBundesbahnen/flatland-torchrl Fork of RoboEden/flatland-marl

An adaption of the Flatland environment for TorchRL.

Language: Python - Size: 148 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

houssameehsain/CutnFill_DeepRL

Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.

Language: Python - Size: 44.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

wegfawefgawefg/wegs-drl-baselines

Minimum viable reinforcement learning algorithms for your educational convenience.

Language: Python - Size: 1.45 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

imoneoi/xrl-ppo

Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving

Language: Python - Size: 32.2 KB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 1

Nikunj-Gupta/HAMMER

HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)

Language: Python - Size: 4.02 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

alirezakazemipour/Mario-PPO

Language: Python - Size: 696 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

c2d08y/LearningBot

A deep reinforcement learning Bot for https://kana.byha.top:444/

Language: Python - Size: 126 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

JRaposo151/Evolutionary-Robotics-with-GE

Evolutionary Robotics with Gramatical Evolution for Dissertation

Language: Python - Size: 11.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

Rashadows/DRL-for-ORAN-Resource-Allocation

Performance evaluation of several DRL algorithms in a discrete action-space for resource allocation in Open RAN

Language: Python - Size: 69.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

saqib1707/RL-PPO-PyTorch

Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 22.5 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

hhxc-0/RL_DaVinciCode

A reinforcement learning model for the Da Vinci code game

Language: Python - Size: 138 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

steph-koopmanschap/PyLife2

The Improved version of PyLife (now with AI)

Language: Python - Size: 115 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

alex-nooj/champion_league

Language: Python - Size: 406 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tomasspangelo/proximal-policy-optimization

An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

leonjovanovic/drl-ppo-bipedal-walker

PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker- PPO

Language: Python - Size: 347 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

faildeny/PPO_pytorch_implementation

Proximal Policy Optimization method in Pytorch

Language: Python - Size: 2.9 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

anshdavid/pytorch-driving-torcs

self driving car using Torcs-1.3.7 simulator with server-patch

Language: Python - Size: 70.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

rshnn/battleship

Agent trained to play battleship using reinforcement learning (PPO) and openAI gym

Language: Jupyter Notebook - Size: 29.9 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

akashe/DeepReinforcementLearning

Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.

Language: Python - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

mmbajo/AgentGrad

Collection of Reinforcement Learning Algorithm implementations.

Language: Python - Size: 295 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

EricChen0104/ppo-icm-maze-exploration

A curiosity-driven PPO + ICM reinforcement learning agent for autonomous maze exploration and victim rescue — built to evolve into a full SLAM-based search and rescue system.

Language: Python - Size: 26.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

roeey777/Splendor-AI

AI agents for the boardgame Splendor

Language: Python - Size: 103 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

naidezhujimo/Proximal-Policy-Optimization-PPO-for-BipedalWalker-v3

his repository contains an implementation of the Proximal Policy Optimization (PPO) algorithm to solve the BipedalWalker-v3 environment from the Gymnasium library. This project uses a combination of policy and value networks to learn a policy for controlling a bipedal walker.

Language: Python - Size: 8.29 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

CAI23sbP/Hybrid-Action-PPO

Hybrid Action PPO in stable-baselines3

Language: Python - Size: 20.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Achronus/rl_atari_games

An exploration of the effects of Intrinsic Motivation methods on RL algorithms using Atari games.

Language: Python - Size: 40.8 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

ialexmp/DRL-Generalization

Exploring Generalization in Reinforcement Learning algorithms for different tasks using PPO, Gymnasium-Robotics and MuJoCo

Language: Python - Size: 78.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bantu-4879/Atari_Games-Deep_Reinforcement_Learning

This repository hosts Jupyter notebooks showcasing the training of Atari games using a variety of Deep Reinforcement Learning (RL) algorithms such as Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), Deep Q-Networks (DQN), Advantage Actor-Critic (A2C), and more.

Language: Jupyter Notebook - Size: 364 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Icyfiremario/PPO-Jumpstart

Basic PPO based AI template

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DataRohit/AI-Mario-Game

This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.

Language: Jupyter Notebook - Size: 261 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nkoorty/rl_parking

Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.

Language: Python - Size: 2.67 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

tedhuang96/ppo-pytorch

PPO in pytorch version.

Language: Python - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

dodoseung/ppo-proximal-policy-optimization-pytorch

The pytorch implementation of ppo

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

leonjovanovic/drl-ml-agents-3dball

PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball

Language: Python - Size: 836 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Nikunj-Gupta/IMRL Fork of lcswillems/rl-starter-files

Informationally Mosaic Reinforcement Learning (Preprint: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=nargncAAAAAJ&citation_for_view=nargncAAAAAJ:W7OEmFMy1HYC)

Language: Python - Size: 27.5 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

nkarasovd/HSE_Production_Stories

:ram: Materials and homework assignments for HSE production stories course

Language: Jupyter Notebook - Size: 47.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

jiseongHAN/reinforcement

My Little Reinforcement Learning

Language: Python - Size: 3.32 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

fenixxdev/ppo-icm-maze-exploration

Train a PPO agent with an Intrinsic Curiosity Module to explore mazes and rescue victims. Discover efficient navigation strategies. 🐱👤🚀

Language: Python - Size: 26.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

emanuelegreco29/Toy_Model_RL

Toy model implementing various architectures to teach a generic point of mass to reach a static and/or dynamic target in a 3D space.

Language: Python - Size: 4.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

nonkloq/mazeharvest

A Grid Based RL Environment & Implementaions of few Deep-RL Algorithms.

Language: Python - Size: 33.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

RuvenGuna94/Dialogue-Summary-remove-toxic-text-PPO

Fine-tuning FLAN-T5 with PPO and PEFT to generate less toxic text summaries. This notebook leverages Meta AI's hate speech reward model and utilizes RLHF techniques for improved safety.

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

mrunalmania/Transformer-based-Stock-Prediction

In this project I am focuses on determining the direction of trend for the specific stocks (e.g. MAMAA META, APPL, MSFT, AMZN, ALPHABET)

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Ladun/PPO

Minimal implementation of Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 19.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

msmmb/noob_ppo

PPO for Bipedal Walker

Language: Python - Size: 1.2 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

mominalix/Humanoid-Robot-Reinforcement-Learning-PPO

This repository contains a project that leverages reinforcement learning to make a humanoid robot walk in a PyBullet simulation. It uses a custom Gym environment, a Proximal Policy Optimization (PPO) agent, and a provided URDF file for the robot model. The training process prints rewards per generation and visualizes the robot's behavior.

Language: Python - Size: 7.74 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0