Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: actor-critic-algorithm

amirhosseinh77/NN-Control

Control Methods for Dynamic Systems based on Neural Networks

Language: Jupyter Notebook - Size: 945 KB - Last synced: about 15 hours ago - Pushed: about 17 hours ago - Stars: 2 - Forks: 0

sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python - Size: 42.1 MB - Last synced: 7 days ago - Pushed: about 1 year ago - Stars: 3,672 - Forks: 829

ccnets-team/causal-rl

Causal RL: Reverse-Environment Network Integrated Actor-Critic Algorithm

Language: Python - Size: 9.38 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 26 - Forks: 2

jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions

Chapter notes and exercise solutions for Reinforcement Learning: An Introduction by Sutton and Barto

Language: HTML - Size: 89.7 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 10 - Forks: 3

BY571/Soft-Actor-Critic-and-Extensions

PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.

Language: Python - Size: 5.99 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 247 - Forks: 29

GioStamoulos/BTC_RL_Trading_Bot

A trading bitcoin agent was created with deep reinforcement learning implementations.

Language: Jupyter Notebook - Size: 53.9 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 27 - Forks: 6

kkothari93/ActorCritic-ECE586-Project

Implementation of the actor critic algorithm for MountainCarContinuous-v0 OpenAI gym environment.

Language: Python - Size: 889 KB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 1

BY571/D4PG

PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.

Language: Python - Size: 2.17 MB - Last synced: about 2 months ago - Pushed: about 3 years ago - Stars: 13 - Forks: 4

XuehaiPan/Soft-Actor-Critic

PyTorch Implementation of Soft Actor-Critic Algorithm

Language: Python - Size: 518 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 10 - Forks: 5

HYDesmondLiu/RUBICON

A novel method to incorporate existing policy (Rule-based control) with Reinforcement Learning.

Language: Python - Size: 25.4 KB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

fangvv/UAV-DDPG

Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"

Language: Python - Size: 21.5 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 297 - Forks: 66

Phoenix-Shen/ReinforcementLearning

强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy basd)的代码,代码都经过调试并可以运行

Language: Python - Size: 20.1 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 32 - Forks: 8

RsGoksel/Reinforcement-Learning-PongGame

Reinforcement Learning - PPO (Proximal Policy Optimization) Implementation to Pong Game

Language: Jupyter Notebook - Size: 842 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

erfanMhi/Deep-Reinforcement-Learning-CS285-Pytorch

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Language: Python - Size: 34.7 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 124 - Forks: 11

nitish-kalan/CartPole-v1-Actor-Critic-Keras

Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm

Language: Python - Size: 605 KB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 10 - Forks: 5

philtabor/Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Language: Python - Size: 7.81 KB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 207 - Forks: 62

d-dawg78/MVA_RL

Master MVA - Reinforcement Learning Project

Language: Python - Size: 6 MB - Last synced: 11 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

imraviagrawal/Reinforcement-Learning-Implementation

Implementation of Reinforcement Algorithms from scratch

Language: Python - Size: 23.1 MB - Last synced: 11 months ago - Pushed: over 5 years ago - Stars: 9 - Forks: 3

mkurovski/deep_rl_nanodegree

Project Solutions for my Deep Reinforcement Learning Nanodegree at Udacity

Language: Jupyter Notebook - Size: 3.21 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 4 - Forks: 1

garlicdevs/Fruit-API

A Universal Deep Reinforcement Learning Framework

Language: Python - Size: 7.45 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 62 - Forks: 22

kcg2015/DDPG_numpy_only

Implemenation of DDPG with numpy only (without Tensorflow)

Language: Python - Size: 64.5 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 11 - Forks: 5

SwamiKannan/Reinforcement-Learning-Specialization

Programming Assignments for Reinforcement Learning Specialization

Language: Jupyter Notebook - Size: 2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

Sohaib1424/Reinforcement-Learning-projects

Language: Python - Size: 1.34 MB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

SamYuen101234/chrome_dino_RL

A very detailed project of Chrome Dinosaur in Deep RL for beginners

Language: Python - Size: 129 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

prajwalthakur/Projects

This Repository contains my projects!

Language: Python - Size: 243 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

charlola/autonomous-systems

In this project, it is intended to learn and get familiar with the concepts of Reinforcement Learning (RL) by modeling agents with actor-critic and rainbow-dqn algorithms. Two continuous Unity Machine Learning Agents domains are chosen to realize this project.

Language: ASP.NET - Size: 39.4 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

nima-siboni/simplest-world-Actor-Critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

Language: Python - Size: 698 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 1

rtharungowda/Soft-Actor-Critic-Pytorch

Implement soft actor critic in pytorch to play a game of balancing pendulum in openai gym.

Language: Python - Size: 629 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

vrona/BOT88

Self Driving Racing Car Agent (Deep Deterministic Policy Gradient algorithm)

Language: Jupyter Notebook - Size: 373 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

meet-minimalist/Udacity-Deep-Learning-Project-5-Fly-Quadcopter

Fly Quadcopter using Deep Reinforcement Learning

Language: Jupyter Notebook - Size: 1.66 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

Related Keywords
actor-critic-algorithm 30 reinforcement-learning 23 deep-reinforcement-learning 12 reinforcement-learning-algorithms 10 pytorch 8 policy-gradient 7 deep-learning 6 actor-critic 5 q-learning 4 prioritized-experience-replay 4 actor-critic-methods 3 proximal-policy-optimization 3 td-learning 3 dqn 3 python 3 deep-deterministic-policy-gradient 3 machine-learning 3 sarsa 3 ddpg 2 off-policy 2 notebook 2 tensorflow 2 deep-q-network 2 d2rl 2 munchausen-reinforcement-learning 2 openai-gym 2 pytorch-implementation 2 soft-actor-critic 2 neural-network 2 multi-agent-reinforcement-learning 2 sac 2 reinforce 2 ppo 2 mountain-car 1 cartpole-environment 1 multiplayer-game 1 cartpole 1 policy-gradients 1 adam-optimizer 1 replay-buffer 1 stochastic-gradient-descent 1 blackbox-optimization 1 echolocation 1 target-network 1 alberta-machine-learning-institute 1 amii 1 maddpg 1 q-learning-lambda 1 reinforcement-algorithms 1 sarsa-lambda 1 gym 1 unity 1 gridworld-environment 1 gridworld 1 arcade-learning-environment 1 atari 1 cross-entropy-policy-search 1 environment 1 games 1 cross-entropy 1 human 1 human-in-the-loop 1 multi-agent 1 multi-objective-optimization 1 dqn-variants 1 gpu 1 ros 1 rrt-plan 1 rainbow-dqn 1 unity-ml-agents 1 monte-carlo-simulation 1 on-policy 1 reinforcement-learning-environments 1 pytorch-tutorial 1 agent 1 asymmetric 1 autonomous-agents 1 autonomous-cars 1 autonomous-driving 1 autonomous-vehicles 1 visionprocessing 1 jupyter-notebooks 1 quadcopter 1 udacity-nanodegree 1 capstone-project 1 coursera 1 coursera-machine-learning 1 lunar-lander 1 monte-carlo-sampling 1 sarsa-learning 1 specialization 1 university-of-alberta 1 bellman-equation 1 gym-environments 1 chrome-dinosaur-game 1 convolutional-neural-networks 1 double-q-learning 1 experience-replay 1 policy-based-method 1 rainbow 1