GitHub topics: td-learning

Repositories

sshkhr/Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 28 days ago - Pushed at: over 3 years ago - Stars: 54 - Forks: 25

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Language: Jupyter Notebook - Size: 151 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 751 - Forks: 174

SwamiKannan/Reinforcement-Learning-Specialization

Programming Assignments for Reinforcement Learning Specialization

Language: Jupyter Notebook - Size: 2 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

moripiri/Reinforcement-Learning-on-FrozenLake

Reinforcement Learning Algorithms in FrozenLake-v1

Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 22 - Forks: 2

k-karna/reinforcement_learning

Reinforcement Learning Specialization | University of Alberta

Language: Jupyter Notebook - Size: 2 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

makaveli10/reinforcementLearning

Reinforcement Learning - Implementation of Exercises, algorithms from the book Sutton Barto and David silver's RL course in Python, OpenAI Gym.

Language: Jupyter Notebook - Size: 6.84 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 24 - Forks: 4

tirthajyoti/RL_basics

Basic Reinforcement Learning algorithms

Language: Jupyter Notebook - Size: 2.29 MB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 18 - Forks: 13

Aneeshers/Rapid-learning

Rapid learning mechanisms in reinforcement learning, specifically comparing the effectiveness of different neural representations and model configurations in the learning of novel cues

Language: Jupyter Notebook - Size: 70.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

shiivashaakeri/Data-Driven-MPC-Linear-Systems-RL

Z. Sun, Q. Wang, J. Pan and Y. Xia, "Data-Driven MPC for Linear Systems using Reinforcement Learning," 2021 China Automation Congress (CAC), Beijing, China, 2021, pp. 394-399, doi: 10.1109/CAC53003.2021.9728233.

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Breakend/SarsaVsExpectedSarsa

An a bias-variance tradeoff of Sarsa vs. Expected Sarsa with experiments.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 8 - Forks: 4

OneRaynyDay/RLEngine

A simple reinforcement learning simulation engine for OpenAI's gym.

Language: Python - Size: 44.9 KB - Last synced at: 4 days ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 13

mobeets/value-rnn-td

train an RNN to estimate value in a POMDP using TD learning

Language: Python - Size: 324 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

JVP15/td-gammon Fork of dellalibera/td-gammon

TD-Gammon implementation

Language: Python - Size: 1.27 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

sean85914/rl_pnp

Language: C++ - Size: 69 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 4

Sagarnandeshwar/On_Policy_And_Off_Policy_Reinforcement_Learning

Reinforcement Learning (COMP 579) Project

Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

noahzemlin/PyOnitama

Onitama Board Game Simulator with Reinforcement Learning opponents

Language: Python - Size: 22.1 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

imraviagrawal/Reinforcement-Learning-Implementation

Implementation of Reinforcement Algorithms from scratch

Language: Python - Size: 23.1 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 3

szhangml/Average-Reward-TD-Q-Learning

Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."

Language: Python - Size: 5.86 KB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

dellalibera/gym-backgammon

Backgammon OpenAI Gym

Language: Python - Size: 5.67 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 31 - Forks: 11

plopd/on-policy-experiments-td-and-etd

An Empirical Comparison of Temporal-Differences Learning Methods with Emphatic Temporal-Differences Learning Methods in the On-Policy Case.

Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

basselkassem/easy21

This project is an implementation of the game EASY21

Language: Jupyter Notebook - Size: 5.9 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

sizzle0121/2048-Game-and-AI

A 2048 game platform made with Python & the AI of the game trained by reinforcement learning

Language: Python - Size: 79.2 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Anjali001/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

RFLeijenaar/RL-Tabular-Rubikscube

Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.

Language: C - Size: 4.04 MB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

OneUpWallStreet/Reinforcement-Learning

All of my reinforcement learning projects (Some of the projects may contain errors :D )

Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

wjaskowski/mastering-2048

An efficient reinforcement learning algorithm for learning a strategy for game 2048

Language: Java - Size: 48.6 MB - Last synced at: 3 months ago - Pushed at: over 8 years ago - Stars: 9 - Forks: 0

harmanpreet93/reinforcement-learning

Reinforcement Learning algorithms

Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

PierpaoloLucarelli/QLearningMaze

Implementation of Q-Learning using TD error to navigate a maze avoiding obstacles and a moving enemy

Language: Python - Size: 1.98 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 0

Silviatulli/RLhomework

multi-armed bandit, gambler problem, cliff problem and TD learning

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

d-dawg78/MVA_RL

Master MVA - Reinforcement Learning Project

Language: Python - Size: 6 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

AshishSinha5/maze_runner

python implementation of SARSA learning algorithm to solve a maze

Language: Python - Size: 5.96 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

eugene87222/intro-to-AI-group-project

AI of modified version of Othello/Reversi

Language: Python - Size: 357 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

worldofnick/pacman-AI

Implementation of reinforcement learning algorithms to solve pacman game. Part of CS188 AI course from UC Berkeley.

Language: Python - Size: 1.59 MB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 11

MaraTomasek/ML-Notebooks

Some coding stuff from various machine learning books

Language: Jupyter Notebook - Size: 273 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Jonathan-Pearce/brain-inspired-AI

Code and reports from two projects; Boltzmann machine trained on the MNIST data and temporal difference learning model for navigating Morris water-maze task

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Daru13/watermaze-learning-model

TD learning model of a rat who learns how to navigate in a watermaze from the activation of its place cells 🐀

Language: Python - Size: 55.7 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

navreeetkaur/reinforcement-learning-algorithms

Policy Evaluation and Policy Control for modified Blackjack: Assignment 1, COL870 (Reinforcement Learning) @ IIT Delhi

Language: Jupyter Notebook - Size: 26 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

znreza/RL_Best_Presentation

This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.

Size: 4.82 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

dyth/Juno

Tic-Tac-Toe agent trained by Deep Reinforcement Learning

Language: Python - Size: 87.9 KB - Last synced at: 4 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1

smauermann/py_bg

A neural network playing Backgammon.

Language: Python - Size: 572 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Keywords

td-learning 40 reinforcement-learning 30 q-learning 14 sarsa 12 machine-learning 7 sarsa-learning 6 monte-carlo 5 policy-gradient 5 reinforcement-learning-algorithms 5 reinforce 4 artificial-intelligence 4 policy-iteration 3 actor-critic-algorithm 3 dynamic-programming 3 backgammon 3 td-lambda 3 value-iteration 3 temporal-differencing-learning 3 deep-reinforcement-learning 3 rl 2 function-approximation 2 model-free-rl 2 monte-carlo-methods 2 dqn 2 python 2 openai-gym 2 sarsa-lambda 2 sutton-gridworld 2 q-learning-vs-sarsa 2 monte-carlo-sampling 2 gym-backgammon 2 blackjack 2 game 2 pytorch 2 model 2 qlearning 2 actor-critic 2 expected-sarsa 2 nctu 1 minmax-algorithm 1 pathfinding 1 echolocation 1 sequential-decision-making-problems 1 td-error 1 multi-armed-bandit 1 gambler-problem 1 cliff-problem 1 2048-solver 1 n-tuple-networks 1 tkinter 1 epsilon-greedy-exploration 1 exploration-exploitation 1 greedy-algorithm 1 ucb-algorithm 1 epsilon-greedy 1 menace-matchboxes 1 simulated-annealing 1 softmax 1 greedy-algorithms 1 nstep-bootstrapping 1 nstep-sarsa 1 python3 1 reinforcement-learning-environments 1 2048 1 ntuples 1 bandit 1 maze 1 multiagent-reinforcement-learning 1 boltzmann-machines 1 neuroscience 1 unsupervised-learning 1 place-cells 1 model-free-control 1 policy-control 1 policy-evaluation 1 active-learning 1 alphago 1 bandit-algorithm 1 bandit-learning 1 exploitation 1 exploration 1 model-based-rl 1 model-free 1 passive-learning 1 rl-vs-supervised-learning 1 rl-vs-unsupervised-learning 1 value-network 1 numpy 1 undergraduate-course 1 a-star 1 alpha-beta-pruning 1 bayes-network 1 breadth-first-search 1 depth-first-search 1 expectimax 1 minimax-search 1 particle-filter 1 ucs 1 classification 1 cnn 1