Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: actor-critic-methods
Rmko4/RL-On-Policy-Actor-Critic
Deep Reinforcement Learning: On-Policy Actor Critic methods. An implementation of Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO) on the PyTorch Lightning framework.
Language: Python - Size: 4.89 MB - Last synced: 19 days ago - Pushed: 12 months ago - Stars: 0 - Forks: 0
Subramanyam6/A-Comprehensive-Review-on-RL-DL-in-Multi-Agent-Systems
Size: 619 KB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 2 - Forks: 0
n1ghtf4l1/cautious-octo-disco
Implemented the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment.
Language: Jupyter Notebook - Size: 22.5 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
IDSIA/rtrl-elstm
Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)
Language: Python - Size: 102 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 2
lorenzomancini1/DeepRL
Implementations of some of the most well known Deep Reinforcement Learning algorithms
Language: Python - Size: 17.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
rafonsor/unRL
unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.
Language: Python - Size: 354 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
nitish-kalan/CartPole-v1-Actor-Critic-Keras
Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm
Language: Python - Size: 605 KB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 10 - Forks: 5
philtabor/Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
Language: Python - Size: 7.81 KB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 207 - Forks: 62
ustyuzhaninky/OSAR-keras
Objective Stimuli Active Repeater
Language: Jupyter Notebook - Size: 7.88 MB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
gd-zhang/ACKTR
Actor Critic using Kronecker-Factored Trust Region
Language: Python - Size: 77.1 KB - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 18 - Forks: 3
TanushGoel/Atari-Games-RL
A collection of ipython notebooks in which agents learn to play Atari games in Open AI gym environments using different methods of reinforcement learning.
Language: Jupyter Notebook - Size: 2.38 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1
abhilash1910/Deep_Reinforcement_Learning_Trading
Deep Reinforcement Learning for Trading
Language: Jupyter Notebook - Size: 2.92 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 15 - Forks: 3
SwamiKannan/Reinforcement-Learning-Specialization
Programming Assignments for Reinforcement Learning Specialization
Language: Jupyter Notebook - Size: 2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1
ken-power/DRLND_ContinuousControl
Deep Reinforcement Learning: Continuous Control. Solve the Unity ML-Agents Reacher Environment.
Language: Jupyter Notebook - Size: 4.82 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0
kanji95/Topics-in-Machine-Learning-CS7.502
Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)
Language: Jupyter Notebook - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 1
being-aerys/Reinforcement-Learning-Self-Projects
Fun with Reinforcement Learning in my spare time
Language: Jupyter Notebook - Size: 2.77 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
mynkpl1998/A3C
This repository contains high quality and tested implementation of Asynchronous Actor Critic Algorithm
Language: Dockerfile - Size: 5.04 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
hmisra/Reacher-DDPG
A model to control a double-jointed arm to reach target using Deep Deterministic Policy Gradients
Language: ASP - Size: 59.4 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0