An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: temporal-difference

soheil-mp/Reinforcement-Learning-Algorithms

Step by Step Reinforcement Learning Tutorials.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 6

deepbiolab/drl

Implementation of deep reinforcement learning

Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

devspaceship/madepro

A minimal Rust library for solving finite deterministic Markov decision processes

Language: Rust - Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

Language: Python - Size: 25.8 MB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 103 - Forks: 31

mweglowski/pathfinding_simulator

🧨 Interactive temporal difference algorithm simulator in which an agent has to find the optimal path to a given destination.

Language: JavaScript - Size: 277 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

victor-iyi/deep-RL

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a truly autonomous agent.

Language: Python - Size: 61.5 KB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

BY571/Medium_Code_Examples

Implementation of fundamental concepts and algorithms for reinforcement learning

Language: Jupyter Notebook - Size: 994 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 12 - Forks: 2

qihongl/demo-td

TD, a model of second/higher order conditioning

Language: Jupyter Notebook - Size: 321 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

dksifoua/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 1

thoth-station/adviser

The recommendation engine for Python software stacks and Dependency Monkey in project Thoth.

Language: Python - Size: 13.1 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 13

VEXLife/Accelerated-TD

My implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

Language: Python - Size: 1.66 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

alizindari/Reinforcement-Learning

Implementation of several RL algorithms based on Prof. Sutton's book

Language: Jupyter Notebook - Size: 510 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

dhikshitha29/Playing-the-game-of-twenty-one-and-pontoon

Optimising the blackjack game

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

TanushGoel/Atari-Games-RL

A collection of IPython notebooks in which agents learn to play Atari games in OpenAI Gym environments using different methods of reinforcement learning.

Language: Jupyter Notebook - Size: 2.38 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

sourcecode369/deep-reinforcement-learning

Implementation notebooks and scripts of deep reinforcement learning algorithms in PyTorch and TensorFlow.

Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 3

sparisi/td-reg

TD-Regularized Actor-Critic Methods

Language: MATLAB - Size: 101 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 5

steven112163/Deep-Learning-and-Practice

NCTU(NYCU) Deep Learning and Practice Spring 2021

Language: Python - Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

anubhavshrimal/Reinforcement-Learning

Implementation and Notes of different Reinforcement Learning Algorithms

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 3

keeeal/temporal-ut3

Temporal difference learning for ultimate tic-tac-toe.

Language: Python - Size: 21.5 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

WinDerek/reinforce-py

Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)

Language: Jupyter Notebook - Size: 7.09 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

ken-power/DRLND_DeepReinforcementLearning_Examples

Examples and tutorials that implement various algorithms in Deep Reinforcement Learning.

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

i2a-k/Reinforcement-Learning

Multi-Armed Bandit Simulation, MDP GridWorld Example, Random Walk Problem by TD and MC

Language: Jupyter Notebook - Size: 186 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

KaleabTessera/Monte-Carlo-and-Temporal-Difference

Monte Carlo and temporal-difference implementations from Chapters 5 and 6 of the book Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.

Language: Python - Size: 8.13 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

rdadrl/DiceUp

DiceUp is a collection of backgammon-playing AIs.

Language: Java - Size: 1.26 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

antonio-f/TD-methods-SARSA

Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.

Language: Jupyter Notebook - Size: 248 KB - Last synced at: 24 days ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

PieroMacaluso/reinforcement-learning-stuff

Just a bunch of exercises created during my thesis work on reinforcement learning.

Language: Python - Size: 282 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ahlusar1989/CS234ReinforcementLearning

CS234 Coursework

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords
temporal-difference (27), reinforcement-learning (22), monte-carlo (11)

deep-reinforcement-learning (6), dqn (6), dynamic-programming (6), q-learning (6)

policy-gradient (5), sarsa (5), pytorch (5)

artificial-intelligence (4), openai-gym (4), markov-decision-processes (4), reinforcement-learning-algorithms (4)

ppo (3), monte-carlo-tree-search (3), deep-learning (3), reinforce (3), policy-based-method (3), machine-learning (3)

python3 (2), policy-iteration (2), value-iteration (2), temporal-difference-algorithms (2), td (2), random-walk (2), ai (2), actor-critic (2), proximal-policy-optimization (2), python (2), monte-carlo-methods (2), reinforcement-learning-environments (2), advantage-actor-critic (2), dueling-dqn (2), deep-q-network (2), ddpg (2), alphazero (2)

eegnet (1), resnet (1), reinforcement-learning-exercises (1), deep-convolutional-networks (1), cvae (1), cpp (1), conditional-normalizing-flows (1), conditional-gan (1), trpo (1), spg (1), dpg (1), rl-algorithms (1), pytorch-rl (1), openai-gym-solutions (1), neural-networks (1), td-methods (1), simple (1), sarsa-algorithm (1), gym-environment (1), gym (1), cliffwalking (1), algorithm (1), 101 (1), minimax (1), java (1), backgammon (1), rl (1), n-armed-bandit-problem (1), multi-armed-bandit (1), incremental-monte-carlo (1), gridworld (1), cross-entropy (1), visualization (1), full-stack (1), ultimate-tic-tac-toe (1), temporal (1), self-play (1), neural-network (1), rl-notes (1), double-dqn (1), deep-rl (1), a3c (1), tailwindcss (1), reactjs (1), javascript (1), html (1), css (1), self-driving-car (1), rust (1), mdp (1), value-based-methods (1), tile-coding (1), prioritized-dqn (1), mc-control (1), hill-climbing (1), dueling-ddqn (1), deep-deterministic-policy-gradient (1), cross-entropy-method (1), finance (1), discretization (1), deep-q-learning (1), ml-agents (1), ddqn (1)