proximal-policy-optimization

An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tomasspangelo%2Fproximal-policy-optimization

Stars: 2
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 23.4 KB
Dependencies parsed at: Pending

Created at: over 2 years ago
Updated at: over 1 year ago
Pushed at: over 2 years ago
Last synced at: over 1 year ago

Topics: actor-critic, deep-learning, entropy, gae, generalized-advantage-estimation, machine-learning, neural-network, open-ai, open-ai-gym, optimization, ppo, ppo-pytorch, proximal-policy-optimization, python, pytorch, reinforcement-learning, rl

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / tomasspangelo / proximal-policy-optimization