An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: trust-region-policy-optimization

niupuhua1234/GFN-PG

Code for the ICML 2024 paper 'GFlowNet Training by Policy Gradients'

Language: Python - Size: 5.35 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ikostrikov/pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language: Python - Size: 9.77 KB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 440 - Forks: 90

legalaspro/rl-odyssey

RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.

Language: Jupyter Notebook - Size: 66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hcnoh/rl-collection-pytorch

A collection of Reinforcement Learning implementations with PyTorch

Language: Python - Size: 5.84 MB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

pompetzki/nes-npg

Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies

Language: Python - Size: 12.9 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 0

TianhongDai/reinforcement-learning-algorithms

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

Language: Python - Size: 3.94 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 666 - Forks: 110

nslyubaykin/trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

Language: Jupyter Notebook - Size: 212 KB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

funnydman/BFGS-NelderMead-TrustRegion

Python implementation of some numerical (optimization) methods

Language: Python - Size: 16.6 KB - Last synced at: 18 days ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 3

waynemystir/deep-RL-bootcamp

My solutions to the labs from this bootcamp:

Language: Jupyter Notebook - Size: 110 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

GioStamoulos/BTC_RL_Trading_Bot

A trading bitcoin agent was created with deep reinforcement learning implementations.

Language: Jupyter Notebook - Size: 53.9 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 6

LihangLiu/CS395T-Numerical-Optimization

Course projects of CS395T Numerical Optimization, UT Austin

Language: Python - Size: 21.8 MB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 2

Akella17/Deep-Bayesian-Quadrature-Policy-Optimization

Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.

Language: Python - Size: 3.44 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 7

kparnis3/Final-Year-Project

Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning''

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

YixiongRen/Dynamics

works about solving nonlinear dynamic systems

Language: Matlab - Size: 308 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 2

MahanFathi/TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Language: Python - Size: 55.7 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 17 - Forks: 8

RLOpensource/spinning_up_kr

Language: Python - Size: 1.95 MB - Last synced at: 6 days ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 3