GitHub topics: td-learning
sshkhr/Practical_RL
My solutions to the Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow
Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 28 days ago - Pushed at: over 3 years ago - Stars: 54 - Forks: 25

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Language: Jupyter Notebook - Size: 151 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 751 - Forks: 174

SwamiKannan/Reinforcement-Learning-Specialization
Programming Assignments for Reinforcement Learning Specialization
Language: Jupyter Notebook - Size: 2 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

moripiri/Reinforcement-Learning-on-FrozenLake
Reinforcement Learning Algorithms in FrozenLake-v1
Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 22 - Forks: 2

k-karna/reinforcement_learning
Reinforcement Learning Specialization | University of Alberta
Language: Jupyter Notebook - Size: 2 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

makaveli10/reinforcementLearning
Reinforcement Learning - Implementation of exercises and algorithms from the Sutton and Barto book and David Silver's RL course in Python with OpenAI Gym.
Language: Jupyter Notebook - Size: 6.84 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 24 - Forks: 4

tirthajyoti/RL_basics
Basic Reinforcement Learning algorithms
Language: Jupyter Notebook - Size: 2.29 MB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 18 - Forks: 13

Aneeshers/Rapid-learning
Rapid learning mechanisms in reinforcement learning, specifically comparing the effectiveness of different neural representations and model configurations in the learning of novel cues
Language: Jupyter Notebook - Size: 70.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

shiivashaakeri/Data-Driven-MPC-Linear-Systems-RL
Z. Sun, Q. Wang, J. Pan and Y. Xia, "Data-Driven MPC for Linear Systems using Reinforcement Learning," 2021 China Automation Congress (CAC), Beijing, China, 2021, pp. 394-399, doi: 10.1109/CAC53003.2021.9728233.
Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Breakend/SarsaVsExpectedSarsa
An analysis of the bias-variance tradeoff of Sarsa vs. Expected Sarsa, with experiments.
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 8 - Forks: 4

OneRaynyDay/RLEngine
A simple reinforcement learning simulation engine for OpenAI's gym.
Language: Python - Size: 44.9 KB - Last synced at: 4 days ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 13

mobeets/value-rnn-td
Train an RNN to estimate value in a POMDP using TD learning
Language: Python - Size: 324 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

JVP15/td-gammon Fork of dellalibera/td-gammon
TD-Gammon implementation
Language: Python - Size: 1.27 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

sean85914/rl_pnp
Language: C++ - Size: 69 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 4

Sagarnandeshwar/On_Policy_And_Off_Policy_Reinforcement_Learning
Reinforcement Learning (COMP 579) Project
Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

noahzemlin/PyOnitama
Onitama Board Game Simulator with Reinforcement Learning opponents
Language: Python - Size: 22.1 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

imraviagrawal/Reinforcement-Learning-Implementation
Implementation of Reinforcement Algorithms from scratch
Language: Python - Size: 23.1 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 3

szhangml/Average-Reward-TD-Q-Learning
Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
Language: Python - Size: 5.86 KB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

dellalibera/gym-backgammon
Backgammon OpenAI Gym
Language: Python - Size: 5.67 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 31 - Forks: 11

plopd/on-policy-experiments-td-and-etd
An Empirical Comparison of Temporal-Differences Learning Methods with Emphatic Temporal-Differences Learning Methods in the On-Policy Case.
Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

basselkassem/easy21
This project is an implementation of the game EASY21
Language: Jupyter Notebook - Size: 5.9 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

sizzle0121/2048-Game-and-AI
A 2048 game platform made with Python & the AI of the game trained by reinforcement learning
Language: Python - Size: 79.2 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Anjali001/Reinforcement-Learning
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

RFLeijenaar/RL-Tabular-Rubikscube
Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.
Language: C - Size: 4.04 MB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

OneUpWallStreet/Reinforcement-Learning
All of my reinforcement learning projects (Some of the projects may contain errors :D )
Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

wjaskowski/mastering-2048
An efficient reinforcement learning algorithm for learning a strategy for the game 2048
Language: Java - Size: 48.6 MB - Last synced at: 3 months ago - Pushed at: over 8 years ago - Stars: 9 - Forks: 0

harmanpreet93/reinforcement-learning
Reinforcement Learning algorithms
Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

PierpaoloLucarelli/QLearningMaze
Implementation of Q-Learning using TD error to navigate a maze avoiding obstacles and a moving enemy
Language: Python - Size: 1.98 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 0

Silviatulli/RLhomework
Multi-armed bandit, gambler's problem, cliff problem, and TD learning
Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

d-dawg78/MVA_RL
Master MVA - Reinforcement Learning Project
Language: Python - Size: 6 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

AshishSinha5/maze_runner
Python implementation of the SARSA learning algorithm to solve a maze
Language: Python - Size: 5.96 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

eugene87222/intro-to-AI-group-project
AI for a modified version of Othello/Reversi
Language: Python - Size: 357 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

worldofnick/pacman-AI
Implementation of reinforcement learning algorithms to solve the Pacman game. Part of the CS188 AI course from UC Berkeley.
Language: Python - Size: 1.59 MB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 11

MaraTomasek/ML-Notebooks
Some coding stuff from various machine learning books
Language: Jupyter Notebook - Size: 273 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Jonathan-Pearce/brain-inspired-AI
Code and reports from two projects: a Boltzmann machine trained on the MNIST data and a temporal difference learning model for navigating the Morris water-maze task
Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Daru13/watermaze-learning-model
TD learning model of a rat who learns how to navigate in a watermaze from the activation of its place cells 🐀
Language: Python - Size: 55.7 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

navreeetkaur/reinforcement-learning-algorithms
Policy Evaluation and Policy Control for modified Blackjack: Assignment 1, COL870 (Reinforcement Learning) @ IIT Delhi
Language: Jupyter Notebook - Size: 26 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

znreza/RL_Best_Presentation
This presentation contains a precise yet detailed explanation of the concepts of a very interesting topic: Reinforcement Learning.
Size: 4.82 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

dyth/Juno
Tic-Tac-Toe agent trained by Deep Reinforcement Learning
Language: Python - Size: 87.9 KB - Last synced at: 4 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1

smauermann/py_bg
A neural network playing Backgammon.
Language: Python - Size: 572 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0
