Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: policy-optimization

Repositories

CLAIRE-Labo/no-representation-no-trust

Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.

Language: Jupyter Notebook - Size: 4.66 MB - Last synced: 11 days ago - Pushed: about 1 month ago - Stars: 10 - Forks: 1

chauncygu/Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language: Python - Size: 8.48 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 121 - Forks: 20

MehdiShahbazi/REINFORCE-Cart-Pole-Gymnasium

This repo implements the REINFORCE algorithm for solving the Cart Pole V1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1.

Language: Python - Size: 636 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

ConnorWatts/kpo

Policy optimization algorithm with trust regions based on the Maximum Mean Discrepancy (MMD) metric. Investigates the efficiency and effectiveness of the approach as well as exploring the different techniques used to approximate the policy update.

Size: 1.95 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

cxxgtxy/POP3D

Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization

Language: Python - Size: 2.36 MB - Last synced: 8 months ago - Pushed: over 5 years ago - Stars: 42 - Forks: 2

sarmueller/gibo

This repository contains the code for the paper "Local policy search with Bayesian optimization".

Language: Jupyter Notebook - Size: 87 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 6

MahanFathi/Model-Based-RL

Model-based Policy Gradients

Language: Python - Size: 1.89 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 27 - Forks: 4

bmaxdk/OpenAI-Gym-PongDeterministic-v4-PPO

Language: Jupyter Notebook - Size: 1.77 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

elsheikh21/car-racing-ppo

Implementation of a Deep Reinforcement Learning algorithm, Proximal Policy Optimization (SOTA), on a continuous action space openai gym (Box2D/Car Racing v0)

Language: Python - Size: 21.4 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 35 - Forks: 5

manantomar/Mirror-Descent-Policy-Optimization

Mirror Descent Policy Optimization

Language: Python - Size: 44.9 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 25 - Forks: 3

grassking100/reinforcement_learning

An implementation of the reinforcement learning for CartPole-v0 by policy optimization

Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0

proceduralia/randomist

Code for Policy Optimization as Online Learning with Mediator Feedback

Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

Related Keywords

policy-optimization 12 reinforcement-learning 9 pytorch 6 deep-learning 5 policy-gradient 4 deep-reinforcement-learning 4 gym 3 openai-gym 3 proximal-policy-optimization 3 ppo 3 mujoco 2 multi-agent-reinforcement-learning 1 ilqg 1 ilqg-mujoco 1 ilqr 1 model-based 1 mujoco-dynamics 1 mujoco-py 1 policy-gradients 1 atari-pong 1 deep-learning-ai 1 thompson-sampling 1 deep-learning-algorithms 1 multi-armed-bandits 1 deep-rl 1 mdpo 1 mirror-descent 1 model-free-rl 1 sac 1 stable-baselines 1 mcmc 1 trpo 1 cartpole-v0 1 exploration 1 safe-reinforcement-learning 1 cart 1 cart-pole 1 cart-pole-balancing 1 cart-pole-v1 1 drl 1 drl-pytorch 1 gymnasium 1 pendulum 1 policy 1 policy-based 1 python 1 reinforce 1 reinforce-algorithm 1 gym-environment 1 kernel-methods 1 active-learning 1 bayesian-optimization 1 gradient-descent 1 backpropagation 1 computation-graph 1 computational-graphs 1 direct-policy-search 1 finite-difference 1