Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: policy-optimization
CLAIRE-Labo/no-representation-no-trust
Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.
Language: Jupyter Notebook - Size: 4.66 MB - Last synced: 11 days ago - Pushed: about 1 month ago - Stars: 10 - Forks: 1
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
Language: Python - Size: 8.48 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 121 - Forks: 20
MehdiShahbazi/REINFORCE-Cart-Pole-Gymnasium
This repo implements the REINFORCE algorithm for solving the Cart Pole V1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1.
Language: Python - Size: 636 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
ConnorWatts/kpo
Policy optimization algorithm with trust regions based on the Maximum Mean Discrepancy (MMD) metric. Investigates the efficiency and effectiveness of the approach as well as exploring the different techniques used to approximate the policy update.
Size: 1.95 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
cxxgtxy/POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
Language: Python - Size: 2.36 MB - Last synced: 8 months ago - Pushed: over 5 years ago - Stars: 42 - Forks: 2
sarmueller/gibo
This repository contains the code for the paper "Local policy search with Bayesian optimization".
Language: Jupyter Notebook - Size: 87 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 6
MahanFathi/Model-Based-RL
Model-based Policy Gradients
Language: Python - Size: 1.89 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 27 - Forks: 4
bmaxdk/OpenAI-Gym-PongDeterministic-v4-PPO
Language: Jupyter Notebook - Size: 1.77 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0
elsheikh21/car-racing-ppo
Implementation of a Deep Reinforcement Learning algorithm, Proximal Policy Optimization (SOTA), on a continuous action space openai gym (Box2D/Car Racing v0)
Language: Python - Size: 21.4 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 35 - Forks: 5
manantomar/Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
Language: Python - Size: 44.9 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 25 - Forks: 3
grassking100/reinforcement_learning
An implementation of the reinforcement learning for CartPole-v0 by policy optimization
Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0
proceduralia/randomist
Code for Policy Optimization as Online Learning with Mediator Feedback
Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0