Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: actor-critic-methods

Rmko4/RL-On-Policy-Actor-Critic

Deep Reinforcement Learning: On-Policy Actor Critic methods. An implementation of Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO) on the PyTorch Lightning framework.

Language: Python - Size: 4.89 MB - Last synced: 19 days ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

Subramanyam6/A-Comprehensive-Review-on-RL-DL-in-Multi-Agent-Systems

Size: 619 KB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 2 - Forks: 0

n1ghtf4l1/cautious-octo-disco

Implemented the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment.

Language: Jupyter Notebook - Size: 22.5 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

IDSIA/rtrl-elstm

Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)

Language: Python - Size: 102 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 2

lorenzomancini1/DeepRL

Implementations of some of the most well known Deep Reinforcement Learning algorithms

Language: Python - Size: 17.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

rafonsor/unRL

unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.

Language: Python - Size: 354 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

nitish-kalan/CartPole-v1-Actor-Critic-Keras

Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm

Language: Python - Size: 605 KB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 10 - Forks: 5

philtabor/Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Language: Python - Size: 7.81 KB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 207 - Forks: 62

ustyuzhaninky/OSAR-keras

Objective Stimuli Active Repeater

Language: Jupyter Notebook - Size: 7.88 MB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

gd-zhang/ACKTR

Actor Critic using Kronecker-Factored Trust Region

Language: Python - Size: 77.1 KB - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 18 - Forks: 3

TanushGoel/Atari-Games-RL

A collection of ipython notebooks in which agents learn to play Atari games in Open AI gym environments using different methods of reinforcement learning.

Language: Jupyter Notebook - Size: 2.38 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1

abhilash1910/Deep_Reinforcement_Learning_Trading

Deep Reinforcement Learning for Trading

Language: Jupyter Notebook - Size: 2.92 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 15 - Forks: 3

SwamiKannan/Reinforcement-Learning-Specialization

Programming Assignments for Reinforcement Learning Specialization

Language: Jupyter Notebook - Size: 2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

ken-power/DRLND_ContinuousControl

Deep Reinforcement Learning: Continuous Control. Solve the Unity ML-Agents Reacher Environment.

Language: Jupyter Notebook - Size: 4.82 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

kanji95/Topics-in-Machine-Learning-CS7.502

Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)

Language: Jupyter Notebook - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 1

being-aerys/Reinforcement-Learning-Self-Projects

Fun with Reinforcement Learning in my spare time

Language: Jupyter Notebook - Size: 2.77 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

mynkpl1998/A3C

This repository contains high quality and tested implementation of Asynchronous Actor Critic Algorithm

Language: Dockerfile - Size: 5.04 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

hmisra/Reacher-DDPG

A model to control a double-jointed arm to reach target using Deep Deterministic Policy Gradients

Language: ASP - Size: 59.4 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

Related Keywords
actor-critic-methods 18 reinforcement-learning 11 deep-reinforcement-learning 6 policy-gradient 5 pytorch 4 q-learning 3 actor-critic-algorithm 3 ddpg 3 actor-critic 3 deep-deterministic-policy-gradient 2 dqn 2 openai-gym 2 reinforcement-learning-algorithms 2 ppo 2 a2c 2 deep-learning 2 multi-agent-reinforcement-learning 2 td3 1 university-of-alberta 1 td-learning 1 specialization 1 sarsa-learning 1 monte-carlo-sampling 1 lunar-lander 1 coursera-machine-learning 1 coursera 1 capstone-project 1 amii 1 alberta-machine-learning-institute 1 trpo 1 trading-strategies 1 trading 1 time-series 1 tensorflow2 1 soft 1 sarimax 1 sac 1 prophet-model 1 model-based-reinforcement-learning 1 genetic-algorithm 1 cross-entropy-policy-search 1 continuous-control 1 wrapper-api 1 classic-controller 1 cartpole 1 vanilla-policy-gradient 1 value-iteration 1 a3c 1 td-methods 1 a3c-lstm 1 policy-iteration 1 rl 1 monte-carlo-methods 1 inverse-reinforcement-learning 1 unity 1 reacher-environment 1 deep-q-learning 1 ddpg-agent 1 continuous-action-space 1 actorcritic 1 actor-critic-with-experience-replay 1 multi-armed-bandits 1 kfac 1 jax 1 artifical-intelligence 1 dqn-algorithm 1 torchbeast 1 rtrl 1 recurrent-neural-networks 1 real-time-recurrent-learning 1 procgen 1 deepmind-lab 1 atari 1 tensorflow 1 cartpole-v0 1 neural-network 1 multi-agent-systems 1 evolution-strategies 1 pytorch-lightning 1 gymnasium 1 ppo-agent 1 dueling-dqn 1 dqn-agents 1 double-dqn 1 cnn-lstm-models 1 arima-model 1 a3c-agent 1 temporal-difference 1 state-value-function 1 policy-based-method 1 monte-carlo 1 atari-games 1 natural-gradients 1 recurrent-neural-network 1 keras-tensorflow 1 gym 1 maddpg 1 machine-learning 1 deepreinforcement 1 cartpole-v1 1