policy-optimization | Topic | Ecosyste.ms: Repos

Topic: "policy-optimization"

chauncygu/Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language: Python - Size: 8.48 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 165 - Forks: 27

cxxgtxy/POP3D

Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization

Language: Python - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 42 - Forks: 2

elsheikh21/car-racing-ppo

Implementation of a Deep Reinforcement Learning algorithm, Proximal Policy Optimization (SOTA), on a continuous action space openai gym (Box2D/Car Racing v0)

Language: Python - Size: 21.4 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 35 - Forks: 5

MahanFathi/Model-Based-RL

Model-based Policy Gradients

Language: Python - Size: 1.89 MB - Last synced at: 9 months ago - Pushed at: about 5 years ago - Stars: 29 - Forks: 4

manantomar/Mirror-Descent-Policy-Optimization

Mirror Descent Policy Optimization

Language: Python - Size: 44.9 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 3

CLAIRE-Labo/no-representation-no-trust

Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.

Language: Jupyter Notebook - Size: 8.28 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1

sarmueller/gibo

This repository contains the code for the paper "Local policy search with Bayesian optimization".

Language: Jupyter Notebook - Size: 87 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 6

shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects

Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡

Size: 24.4 KB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0