GitHub topics: temporal-difference-learning

Repositories

UniBwTAS/CollisionPro

Towards explainable value functions in reinforcement learning. A framework for collision probability distribution estimation via deep temporal difference learning.

Language: Python - Size: 2.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 2

moporgic/TDL2048

The Most Efficient Temporal Difference Learning Framework for 2048

Language: C++ - Size: 1.91 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 11 - Forks: 1

reshalfahsi/swinging-up-acrobot

Swinging Up Acrobot with n-Step Q-Learning

Language: Jupyter Notebook - Size: 1.92 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

PsorTheDoctor/ludo-rl

Q-learning and SARSA playing ludo.

Language: Python - Size: 544 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Q-Learning Implementation for Process Optimization A reinforcement learning project that calculates the shortest route between locations using the Q-Learning algorithm. This code demonstrates how AI can optimize processes in a simulated environment with predefined states and rewards. 🚀

Language: Python - Size: 1.95 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

u84819482/Nano-RL

Tabular TD control in MAZE environment using Q-Learning, SARSA, and Expected SARSA

Language: Jupyter Notebook - Size: 667 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

HridayM25/ReinforcementLearning

Some algorithms of Reinforcement Learning implemented by me, in accordance to "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.

Language: Jupyter Notebook - Size: 538 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

NikolaZubic/AppliedGameTheoryHomeworkSolutions

Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences

Language: Jupyter Notebook - Size: 936 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

K-Winkles/Multi-stage-TDL-2048

This repository contains my undergraduate thesis source code for Multi-stage Temporal Difference Learning with 2048 as an AI testbed. I reimagined my original C++ implementation in Qt for visualisation purposes.

Language: C++ - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

enstit/Racetrack

Temporal Difference Learning for the Racetrack problem

Language: Jupyter Notebook - Size: 8.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tensor-mutator/Symbiosis

A Reinforcement Learning library for solving custom environments

Language: Python - Size: 18 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

VEXLife/Accelerated-TD

My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

Language: Python - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ziap/2048-tdl

Temporal difference learning for 2048

Language: HTML - Size: 281 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

CrosleyZack/cse574

Course work for CSE 574 Planning and Learning Methods in AI

Language: Python - Size: 252 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

qpwoeirut/2048-solver

A set of AIs for the 2048 tile-merging game. Includes an expectimax strategy that reaches 16384 with 34.6% success and an ML model trained with temporal difference learning.

Language: C++ - Size: 1.35 GB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

Develop-Packt/Introduction-to-Temporal-Difference-Learning

This module introduces temporal-difference learning and focuses on how it develops over the ideas of both Monte Carlo methods, and dynamic programming.

Language: Jupyter Notebook - Size: 202 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

ZikangZhou/nim_rl

A reinforcement learning framework for the game of Nim.

Language: C++ - Size: 11.8 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

hadi16/ReAntics-Agents

AI agents for the game ReAntics.

Language: Python - Size: 452 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

Related Keywords

temporal-difference-learning 18 reinforcement-learning 11 q-learning 7 machine-learning 4 python 4 n-tuple-networks 3 artificial-intelligence 3 dynamic-programming 3 2048-ai 3 sarsa 3 temporal-difference-algorithms 2 2048-solver 2 2048 2 markov-decision-process 2 minimax-algorithm 2 monte-carlo-methods 2 alpha-beta-pruning 2 ai 2 markov-decision-processes 2 expectimax 2 deep-learning 2 expected-sarsa 2 reinforcement-learning-algorithms 2 temporal-difference 1 temporal-differencing-learning 1 agent 1 neural-networks 1 td 1 random-walk 1 atd 1 accelerated-td 1 prioritized-experience-replay 1 optical-flow 1 model-free-rl 1 model-based-rl 1 mcts 1 flappy-bird-agent 1 experience-replay 1 enduro-agent 1 deep-q-network 1 alphazero 1 minimax 1 genetic-algorithms 1 value-iteration 1 policy-iteration 1 off-policy-n-step-sarsa 1 off-policy-n-step-expected-sarsa 1 n-step-tree-backup 1 n-step-sarsa 1 n-step-expected-sarsa 1 n-step-bootstrapping 1 dqn 1 double-sarsa 1 double-q-learning 1 double-expected-sarsa 1 tensorflow-2 1 monte-carlo-tree-search 1 emscripten 1 embind 1 partially-observable-markov-model 1 markov-chain 1 hidden-markov-model 1 capture-the-flag 1 reward-systems 1 process-optimization 1 pathfinding-algorithms 1 ai-in-operations 1 pytorch-lightning 1 gymnasium 1 acrobot 1 machine-learning-algorithms 1 framework 1 2048-game 1 xai 1 tensorflow 1 safety-critical 1 safety 1 robotics 1 research 1 explainable-ai 1 end-to-end 1 deep-reinforcement-learning 1 collision-detection 1 autonomous-driving 1 qt 1 tic-tac-toe 1 softmax-policy 1 softmax 1 sarsa-learning 1 multi-armed-bandit 1 instigation-game 1 game-theory 1 evolutionary-game-theory 1 cournot-competition 1 blackjack 1 bellman-ford-algorithm 1 applied-game-theory 1 policy-gradient 1 policy-control 1 monte-carlo 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos