An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: generalized-advantage-estimation

bentrevett/pytorch-rl 📦

Tutorials on reinforcement learning in PyTorch and Gym that implement a few of the popular algorithms. [IN PROGRESS]

Language: Jupyter Notebook - Size: 55.7 MB - Last synced at: 28 days ago - Pushed at: over 4 years ago - Stars: 277 - Forks: 78

hcnoh/rl-collection-pytorch

A collection of Reinforcement Learning implementations with PyTorch

Language: Python - Size: 5.84 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 1

adik993/ppo-pytorch

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

Language: Python - Size: 1.4 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 133 - Forks: 27

nslyubaykin/relax_trpo_example

Example TRPO implementation with ReLAx

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_ppo_example

Example PPO implementation with ReLAx

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/rnns_for_pomdp

Recurrent Policies for Handling Partially Observable Environments

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tomasspangelo/proximal-policy-optimization

An implementation of Proximal Policy Optimization (PPO), from the state-of-the-art family of reinforcement learning algorithms, using normalized Generalized Advantage Estimation and optional batch-mode training. The loss function incorporates an entropy bonus.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
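
For readers unfamiliar with the terms in the description above, here is a minimal pure-Python sketch of normalized Generalized Advantage Estimation and a clipped PPO loss with an entropy bonus. It is not taken from the listed repository: the function names, the default values for `gamma`, `lam`, `clip_eps`, and `entropy_coef`, and the per-step list layout are all illustrative assumptions.

```python
import math
from statistics import fmean, pstdev


def gae_advantages(rewards, values, last_value, dones, gamma=0.99, lam=0.95):
    """Compute GAE advantages for one rollout (hypothetical helper).

    rewards, values, dones are per-step lists of equal length;
    last_value is the value estimate for the state after the final step.
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    # Walk the rollout backwards, accumulating discounted TD residuals.
    for t in reversed(range(len(rewards))):
        not_done = 0.0 if dones[t] else 1.0
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        running = delta + gamma * lam * not_done * running
        advantages[t] = running
        next_value = values[t]
    return advantages


def normalize(advantages, eps=1e-8):
    """Shift and scale advantages to zero mean and (roughly) unit variance."""
    mean = fmean(advantages)
    std = pstdev(advantages)
    return [(a - mean) / (std + eps) for a in advantages]


def ppo_loss(new_logp, old_logp, advantages, entropies,
             clip_eps=0.2, entropy_coef=0.01):
    """Clipped surrogate loss minus an entropy bonus (to be minimized)."""
    terms = []
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)
        clipped = max(min(ratio, 1.0 + clip_eps), 1.0 - clip_eps)
        terms.append(min(ratio * adv, clipped * adv))
    policy_loss = -fmean(terms)
    # Subtracting the entropy term rewards more exploratory policies.
    return policy_loss - entropy_coef * fmean(entropies)


if __name__ == "__main__":
    adv = gae_advantages([1, 1, 1], [0.0, 0.0, 0.0], 0.0,
                         [False, False, True], gamma=1.0, lam=1.0)
    print(adv)
    print(normalize(adv))
```

With `gamma=1.0` and `lam=1.0` the estimator reduces to the undiscounted return minus the value baseline, which makes the sketch easy to check by hand. A real implementation would typically operate on tensors and backpropagate through the loss.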

leaderj1001/Phasic-Policy-Gradient

Phasic-Policy-Gradient

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0