An open API service providing repository metadata for many open source software ecosystems.

Topic: "on-policy"

MarcoMeter/episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

Language: Python - Size: 23.9 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 172 - Forks: 22

MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 138 - Forks: 18

wisnunugroho21/reinforcement_learning_v_mpo

Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)

Language: Python - Size: 15.6 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 1

wisnunugroho21/reinforcement_learning_truly_ppo

Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch

Language: Python - Size: 27.3 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 1

OpenRL-Lab/RL_Tutorial

Reinforcement Learning Tutorial (强化学习教程)

Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

BY571/pytorch-vmpo

PyTorch implementation of V-MPO

Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

nima-siboni/simplest-world-Actor-Critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

Language: Python - Size: 698 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

TheUnsolvedDev/ReinforcementLearning

Repository containing basic algorithm applied in python.

Language: Jupyter Notebook - Size: 121 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

amirhosein-mesbah/Reinforcement_learning

This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

fardinabbasi/Tabulated_RL

Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili

Language: Jupyter Notebook - Size: 4.96 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

mabirck/CS294-DeepRL

My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

srefsland/deep-rl-mcts

On-policy MCTS combined with deep learning to train an actor-critic neural network that plays Hex (Con-tac-tix).

Language: Python - Size: 5.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SPozder3/RLFinanceProject

Stock Portfolio Management using tabular and deep Q-learning methods - extension of FinRL repo

Language: Jupyter Notebook - Size: 6.91 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0