GitHub topics: temporal-difference
soheil-mp/Reinforcement-Learning-Algorithms
Step by Step Reinforcement Learning Tutorials.
Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 6

deepbiolab/drl
Implementation of deep reinforcement learning
Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

devspaceship/madepro
A minimal Rust library for solving finite deterministic Markov decision processes
Language: Rust - Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Language: Python - Size: 25.8 MB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 103 - Forks: 31

mweglowski/pathfinding_simulator
🧨 Interactive temporal difference algorithm simulator in which an agent has to find the optimal path to a given destination.
Language: JavaScript - Size: 277 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

victor-iyi/deep-RL
Exploration of deep reinforcement learning and various state-of-the-art techniques to create a truly autonomous agent.
Language: Python - Size: 61.5 KB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

BY571/Medium_Code_Examples
Implementation of fundamental concepts and algorithms for reinforcement learning
Language: Jupyter Notebook - Size: 994 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 12 - Forks: 2

qihongl/demo-td
TD, a model of second/higher order conditioning
Language: Jupyter Notebook - Size: 321 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

dksifoua/Reinforcement-Learning
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 1

thoth-station/adviser
The recommendation engine for Python software stacks and Dependency Monkey in project Thoth.
Language: Python - Size: 13.1 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 13

VEXLife/Accelerated-TD
My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python
Language: Python - Size: 1.66 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

alizindari/Reinforcement-Learning
Implementation of several RL algorithms based on Prof. Sutton's book
Language: Jupyter Notebook - Size: 510 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

dhikshitha29/Playing-the-game-of-twenty-one-and-pontoon
Optimising the blackjack game
Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

TanushGoel/Atari-Games-RL
A collection of IPython notebooks in which agents learn to play Atari games in OpenAI Gym environments using different reinforcement learning methods.
Language: Jupyter Notebook - Size: 2.38 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

sourcecode369/deep-reinforcement-learning
Implementation notebooks and scripts of deep reinforcement learning algorithms in PyTorch and TensorFlow.
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 3

sparisi/td-reg
TD-Regularized Actor-Critic Methods
Language: MATLAB - Size: 101 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 5

steven112163/Deep-Learning-and-Practice
NCTU(NYCU) Deep Learning and Practice Spring 2021
Language: Python - Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

anubhavshrimal/Reinforcement-Learning
Implementation and Notes of different Reinforcement Learning Algorithms
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 3

keeeal/temporal-ut3
Temporal difference learning for ultimate tic-tac-toe.
Language: Python - Size: 21.5 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

WinDerek/reinforce-py
Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)
Language: Jupyter Notebook - Size: 7.09 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

ken-power/DRLND_DeepReinforcementLearning_Examples
Examples and tutorials that implement various algorithms in Deep Reinforcement Learning.
Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

i2a-k/Reinforcement-Learning
Multi-Armed Bandit Simulation, MDP GridWorld Example, Random Walk Problem by TD and MC
Language: Jupyter Notebook - Size: 186 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

KaleabTessera/Monte-Carlo-and-Temporal-Difference
Monte Carlo and Temporal Difference implementations from Chapters 5 and 6 of Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
Language: Python - Size: 8.13 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

rdadrl/DiceUp
DiceUp is a collection of backgammon-playing AIs.
Language: Java - Size: 1.26 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

antonio-f/TD-methods-SARSA
Temporal Difference methods - A simple implementation of SARSA algorithm applied to OpenAI gym's "CliffWalking" environment.
Language: Jupyter Notebook - Size: 248 KB - Last synced at: 24 days ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

PieroMacaluso/reinforcement-learning-stuff
Just a bunch of exercises created during my thesis work on Reinforcement Learning.
Language: Python - Size: 282 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ahlusar1989/CS234ReinforcementLearning
CS234 Coursework
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
