An open API service providing repository metadata for many open source software ecosystems.

Topic: "ppo-pytorch"

nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 12.1 MB - Last synced at: 25 days ago - Pushed at: 12 months ago - Stars: 2,047 - Forks: 378

Lizhi-sjtu/DRL-code-pytorch

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.

Language: Python - Size: 3.37 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 656 - Forks: 124

taherfattahi/ppo-rocket-landing

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

Language: Python - Size: 675 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 226 - Forks: 50

reiniscimurs/DRL-robot-navigation-IR-SIM

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

Language: Python - Size: 12.8 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 174 - Forks: 20

CherryPieSexy/imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Language: Python - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 118 - Forks: 14

philtabor/ProtoRL

A Torch Based RL Framework for Rapid Prototyping of Research Papers

Language: Python - Size: 200 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 67 - Forks: 5

dvalenciar/ReinforceUI-Studio

ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC

Language: Python - Size: 21.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 65 - Forks: 3

paulchen2713/RIS-MISO-HWI-DRL

Implementation of the IEEE WCNC 2025 'Worst-Case MSE Minimization for RIS-Assisted mmWave MU-MISO Systems With Hardware Impairments and Imperfect CSI' paper

Language: Python - Size: 289 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 18 - Forks: 5

LittleWebCat/DRL-Base-EMS

DRL-Base-EMS for HEVs

Language: HTML - Size: 4.92 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 1

Solrikk/CriptoWhisper

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

Language: Python - Size: 4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 4

faildeny/Multi_Agent_PPO

Multi agent PPO implementation in Pytorch for Unity ML Agents environments.

Language: Python - Size: 3.48 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 2

davide97l/PPO-GAIL-cartpole

GAIL learning to imitate PPO playing CartPole.

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 4

rvdweerd/simmodel

Solving pursuit-evasion problems on graphs using Reinforcement Learning and GNNs

Language: Python - Size: 364 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

Git-123-Hub/reinforcement-learning-algorithm

Implementation of reinforcement learning algorithms that is easy to read and understand

Language: Python - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

francofgp/Tic-Tac-Toe-Gym

This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning

Language: Python - Size: 1.86 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 2

jatinarora2702/gail-pytorch

PyTorch implementation of GAIL and PPO reinforcement learning algorithms

Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 4

SchweizerischeBundesbahnen/flatland-torchrl Fork of RoboEden/flatland-marl

An adaption of the Flatland environment for TorchRL.

Language: Python - Size: 148 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

houssameehsain/CutnFill_DeepRL

Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.

Language: Python - Size: 44.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

wegfawefgawefg/wegs-drl-baselines

Minimum viable reinforcement learning algorithms for your educational convenience.

Language: Python - Size: 1.45 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

imoneoi/xrl-ppo

Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving

Language: Python - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 1

Nikunj-Gupta/HAMMER

HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)

Language: Python - Size: 4.02 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

alirezakazemipour/Mario-PPO

Language: Python - Size: 696 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

c2d08y/LearningBot

A deep reinforcement learning Bot for https://kana.byha.top:444/

Language: Python - Size: 126 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Rashadows/DRL-for-ORAN-Resource-Allocation

Performance evaluation of several DRL algorithms in a discrete action-space for resource allocation in Open RAN

Language: Python - Size: 69.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

saqib1707/RL-PPO-PyTorch

Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

hhxc-0/RL_DaVinciCode

A reinforcement learning model for the Da Vinci code game

Language: Python - Size: 138 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

steph-koopmanschap/PyLife2

The Improved version of PyLife (now with AI)

Language: Python - Size: 115 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

alex-nooj/champion_league

Language: Python - Size: 406 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tomasspangelo/proximal-policy-optimization

An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

leonjovanovic/drl-ppo-bipedal-walker

PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker - PPO

Language: Python - Size: 347 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

faildeny/PPO_pytorch_implementation

Proximal Policy Optimization method in Pytorch

Language: Python - Size: 2.9 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

anshdavid/pytorch-driving-torcs

self driving car using Torcs-1.3.7 simulator with server-patch

Language: Python - Size: 70.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

rshnn/battleship

Agent trained to play battleship using reinforcement learning (PPO) and openAI gym

Language: Jupyter Notebook - Size: 29.9 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

akashe/DeepReinforcementLearning

Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.

Language: Python - Size: 650 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

roeey777/Splendor-AI

AI agents for the boardgame Splendor

Language: Python - Size: 103 MB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0

JRaposo151/Evolutionary-Robotics-with-GE

Evolutionary Robotics with Grammatical Evolution for Dissertation

Language: Python - Size: 10.2 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

naidezhujimo/Proximal-Policy-Optimization-PPO-for-BipedalWalker-v3

This repository contains an implementation of the Proximal Policy Optimization (PPO) algorithm to solve the BipedalWalker-v3 environment from the Gymnasium library. This project uses a combination of policy and value networks to learn a policy for controlling a bipedal walker.

Language: Python - Size: 8.29 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

CAI23sbP/Hybrid-Action-PPO

Hybrid Action PPO in stable-baselines3

Language: Python - Size: 20.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mmbajo/AgentGrad

Collection of Reinforcement Learning Algorithm implementations.

Language: Python - Size: 294 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Achronus/rl_atari_games

An exploration of the effects of Intrinsic Motivation methods on RL algorithms using Atari games.

Language: Python - Size: 40.8 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ialexmp/DRL-Generalization

Exploring Generalization in Reinforcement Learning algorithms for different tasks using PPO, Gymnasium-Robotics and MuJoCo

Language: Python - Size: 78.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bantu-4879/Atari_Games-Deep_Reinforcement_Learning

This repository hosts Jupyter notebooks showcasing the training of Atari games using a variety of Deep Reinforcement Learning (RL) algorithms such as Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), Deep Q-Networks (DQN), Advantage Actor-Critic (A2C), and more.

Language: Jupyter Notebook - Size: 364 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Icyfiremario/PPO-Jumpstart

Basic PPO based AI template

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DataRohit/AI-Mario-Game

This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.

Language: Jupyter Notebook - Size: 261 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nkoorty/rl_parking

Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.

Language: Python - Size: 2.67 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tedhuang96/ppo-pytorch

PPO in pytorch version.

Language: Python - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

dodoseung/ppo-proximal-policy-optimization-pytorch

The pytorch implementation of ppo

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

leonjovanovic/drl-ml-agents-3dball

PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball

Language: Python - Size: 836 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Nikunj-Gupta/IMRL Fork of lcswillems/rl-starter-files

Informationally Mosaic Reinforcement Learning (Preprint: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=nargncAAAAAJ&citation_for_view=nargncAAAAAJ:W7OEmFMy1HYC)

Language: Python - Size: 27.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

nkarasovd/HSE_Production_Stories

🐏 Materials and homework assignments for HSE production stories course

Language: Jupyter Notebook - Size: 47.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

jiseongHAN/reinforcement

My Little Reinforcement Learning

Language: Python - Size: 3.32 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

emanuelegreco29/Toy_Model_RL

Toy model implementing various architectures to teach a generic point of mass to reach a static and/or dynamic target in a 3D space.

Language: Python - Size: 438 KB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 0 - Forks: 0

nonkloq/mazeharvest

A Grid Based RL Environment & Implementations of a few Deep-RL Algorithms.

Language: Python - Size: 33.2 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

RuvenGuna94/Dialogue-Summary-remove-toxic-text-PPO

Fine-tuning FLAN-T5 with PPO and PEFT to generate less toxic text summaries. This notebook leverages Meta AI's hate speech reward model and utilizes RLHF techniques for improved safety.

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mrunalmania/Transformer-based-Stock-Prediction

This project focuses on determining the direction of the trend for specific stocks (e.g. MAMAA: META, APPL, MSFT, AMZN, ALPHABET)

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Ladun/PPO

Minimal implementation of Proximal Policy Optimization (PPO) in PyTorch

Language: Python - Size: 19.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

msmmb/noob_ppo

PPO for Bipedal Walker

Language: Python - Size: 1.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mominalix/Humanoid-Robot-Reinforcement-Learning-PPO

This repository contains a project that leverages reinforcement learning to make a humanoid robot walk in a PyBullet simulation. It uses a custom Gym environment, a Proximal Policy Optimization (PPO) agent, and a provided URDF file for the robot model. The training process prints rewards per generation and visualizes the robot's behavior.

Language: Python - Size: 7.74 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

asdfGuest/Simple-PPO

Simple implementation of PPO algorithm

Language: Jupyter Notebook - Size: 29.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jakemaz66/Reinforcement_Learning_Algorithms

Custom implementations of RL algorithms that can solve complex tasks like Atari games

Language: Python - Size: 215 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dfoshidero/RLModels-Doom_CartPole

Collection of Reinforcement Learning models for training in the VizDoom and CartPole environments. Developed by Group 2 for the University of Bath's CM50270 module, exploring advanced strategies for models navigating complex situations.

Language: Jupyter Notebook - Size: 504 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

EnriqManComp/smart-disks-PPO

This project aims to find a possible solution to a search problem in a given environment with two players using Proximal Policy Optimization as AI algorithm.

Language: Python - Size: 5.55 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

arya-ebrahimi/rl-playground

tabular and deep rl algorithms

Language: Jupyter Notebook - Size: 64.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GerTheMessiah/Snake-AI

A short custom implementation of the game Snake. This project uses the Ray library together with Ray Tune and a custom PPO model.

Language: Python - Size: 11.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

muno-video-conferencing/muno

Muno server for bandwidth estimation in video conferencing

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bay3s/ppo-parallel

Parallelized implementation of Proximal Policy Optimization (PPO).

Language: Python - Size: 105 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

harikris001/Super-Mario-Reinforcement_Learning

Reinforcement Learning in Super Mario using Pytorch and PPO

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

escribano89/unir_tfm_reinforcement_learning

Repository for the content related to the master's thesis developed in the Master's in Artificial Intelligence at the Universidad Internacional de La Rioja (UNIR).

Language: Jupyter Notebook - Size: 1.09 GB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

tohsin/Reinforcement_learning_projects

Here I write basic optimisation algorithms using RL algorithms on AI gyms

Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

iamvigneshwars/ai-walkers-ppo-pytorch

AI agent learns to walk, run, hop and crawl without any given data using proximal policy optimisation.

Language: Python - Size: 152 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

chpi7/ppo

An easily understandable implementation of Proximal Policy Optimization with PyTorch

Language: Python - Size: 68.4 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KiUngSong/RL

Repository of Various Test & Implementation of RL

Language: Jupyter Notebook - Size: 8.73 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

nkarasovd/HSE_Reinforcement_Learning

🤖 Materials and homework assignments for HSE reinforcement learning course

Language: Python - Size: 915 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

sprakashdash/RL.Fun.Do

A repository for easy understanding of codes in Deep Reinforcement Learning

Language: Python - Size: 217 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (52), ppo (34), pytorch (33), deep-reinforcement-learning (24), deep-learning (14), dqn-pytorch (12), python (11), machine-learning (11), proximal-policy-optimization (11), ddpg-pytorch (10), ddpg (7), td3 (7), reinforcement-learning-algorithms (7), gymnasium (7), dqn (6), policy-gradient (6), td3-pytorch (6), actor-critic (5), ppo-agent (5), sac (5), ai (5), sac-pytorch (5), stable-baselines3 (4), rl (4), artificial-intelligence (4), gail (4), ppo2 (4), dueling-dqn (3), reinforcement-learning-agent (3), gym-environment (3), openai-gym (3), a2c (3), ppo-algo (3), mujoco (3), soft-actor-critic (3), multi-agent-reinforcement-learning (3), ddqn (3), rainbow-dqn (3), drl-pytorch (2), gae (2), pytorch-implementation (2), stablebaselines3 (2), neural-networks (2), reinforce (2), prioritized-experience-replay (2), reinforcement-learning-environments (2), recurrent-neural-networks (2), neural-network (2), deep-neural-networks (2), gym (2), bipedalwalker (2), cartpole-v0 (2), imitation-learning (2), drl (2), gail-ppo (2), recurrent-ppo (2), ppo-lstm (2), torchrl (1), dueling-dqn-pytorch (1), dueling-network-architecture (1), twin-delayed-policy-gradient (1), rl-algorithms-pytorch (1), atari-games (1), dm-control (1), automated-machine-learning (1), autonomous-driving (1), autonomous-vehicles (1), board-game (1), gymnasium-environment (1), generative-adversarial-imitation-learning (1), energy-management-strategies (1), structured-dynamic-gramatical-evolution (1), hybrid-electrical-vehicle (1), detoxification (1), dialogue-summarization (1), ddpg-agent (1), gym-super-mario-bros (1), openai (1), python38 (1), gru (1), memory-augmented-neural-networks (1), rl-environment (1), robotics (1), bipedalwalker-v3 (1), mario-game (1), tensorboard (1), ecosystem-simulation (1), simulation (1), stock-price-prediction (1), transformer (1), machine-learning-algorithms (1), model (1), a2c-algorithm (1), humanoid-robot (1), pybullet (1), tensorflow (1), dueling-ddqn (1), tradingapi (1), trading-algorithms (1), trading (1)