An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: temporal-difference-learning

moporgic/TDL2048

The Most Efficient Temporal Difference Learning Framework for 2048

Language: C++ - Size: 1.91 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 11 - Forks: 1

reshalfahsi/swinging-up-acrobot

Swinging Up Acrobot with n-Step Q-Learning

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

PsorTheDoctor/ludo-rl

Q-learning and SARSA playing ludo.

Language: Python - Size: 544 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ceodaniyal/q_learning

Q-Learning Implementation for Process Optimization A reinforcement learning project that calculates the shortest route between locations using the Q-Learning algorithm. This code demonstrates how AI can optimize processes in a simulated environment with predefined states and rewards. 🚀

Language: Python - Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

u84819482/Nano-RL

Tabular TD control in MAZE environment using Q-Learning, SARSA, and Expected SARSA

Language: Jupyter Notebook - Size: 667 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

UniBwTAS/CollisionPro

A framework for collision probability distribution estimation via deep temporal difference learning

Language: Python - Size: 2.85 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

HridayM25/ReinforcementLearning

Some algorithms of Reinforcement Learning implemented by me, in accordance to "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.

Language: Jupyter Notebook - Size: 538 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

NikolaZubic/AppliedGameTheoryHomeworkSolutions

Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences

Language: Jupyter Notebook - Size: 936 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

K-Winkles/Multi-stage-TDL-2048

This repository contains my undergraduate thesis source code for Multi-stage Temporal Difference Learning with 2048 as an AI testbed. I reimagined my original C++ implementation in Qt for visualisation purposes.

Language: C++ - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

enstit/Racetrack

Temporal Difference Learning for the Racetrack problem

Language: Jupyter Notebook - Size: 8.37 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tensor-mutator/Symbiosis

A Reinforcement Learning library for solving custom environments

Language: Python - Size: 18 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

VEXLife/Accelerated-TD

My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

Language: Python - Size: 1.66 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ziap/2048-tdl

Temporal difference learning for 2048

Language: HTML - Size: 281 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

CrosleyZack/cse574

Course work for CSE 574 Planning and Learning Methods in AI

Language: Python - Size: 252 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

qpwoeirut/2048-solver

A set of AIs for the 2048 tile-merging game. Includes an expectimax strategy that reaches 16384 with 34.6% success and an ML model trained with temporal difference learning.

Language: C++ - Size: 1.35 GB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

Develop-Packt/Introduction-to-Temporal-Difference-Learning

This module introduces temporal-difference learning and focuses on how it develops over the ideas of both Monte Carlo methods, and dynamic programming.

Language: Jupyter Notebook - Size: 202 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

ZikangZhou/nim_rl

A reinforcement learning framework for the game of Nim.

Language: C++ - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

hadi16/ReAntics-Agents

AI agents for the game ReAntics.

Language: Python - Size: 452 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

Related Keywords
temporal-difference-learning 18 reinforcement-learning 11 q-learning 7 machine-learning 4 python 4 dynamic-programming 3 artificial-intelligence 3 sarsa 3 n-tuple-networks 3 2048-ai 3 reinforcement-learning-algorithms 2 markov-decision-processes 2 expected-sarsa 2 minimax-algorithm 2 monte-carlo-methods 2 expectimax 2 ai 2 markov-decision-process 2 alpha-beta-pruning 2 temporal-difference-algorithms 2 2048 2 2048-solver 2 neural-networks 1 markov-chain 1 hidden-markov-model 1 capture-the-flag 1 agent 1 temporal-differencing-learning 1 temporal-difference 1 td 1 random-walk 1 atd 1 accelerated-td 1 prioritized-experience-replay 1 optical-flow 1 model-free-rl 1 model-based-rl 1 mcts 1 flappy-bird-agent 1 experience-replay 1 enduro-agent 1 minimax 1 genetic-algorithms 1 value-iteration 1 policy-iteration 1 off-policy-n-step-sarsa 1 off-policy-n-step-expected-sarsa 1 n-step-tree-backup 1 n-step-sarsa 1 n-step-expected-sarsa 1 n-step-bootstrapping 1 dqn 1 double-sarsa 1 double-q-learning 1 double-expected-sarsa 1 tensorflow-2 1 monte-carlo-tree-search 1 emscripten 1 embind 1 2048-game 1 partially-observable-markov-model 1 monte-carlo 1 acrobot 1 bandit-algorithms 1 xai 1 tensorflow 1 robotics 1 end-to-end 1 deep-reinforcement-learning 1 collision-detection 1 autonomous-driving 1 maze 1 gymnasium 1 state-transition-models 1 shortest-path-algorithm 1 route-optimization 1 reward-systems 1 pytorch-lightning 1 process-optimization 1 pathfinding-algorithms 1 ai-in-operations 1 deep-q-network 1 deep-learning 1 alphazero 1 qt 1 tic-tac-toe 1 softmax-policy 1 softmax 1 sarsa-learning 1 multi-armed-bandit 1 framework 1 machine-learning-algorithms 1 instigation-game 1 game-theory 1 evolutionary-game-theory 1 cournot-competition 1 blackjack 1 bellman-ford-algorithm 1 applied-game-theory 1 policy-gradient 1