Topic: "value-iteration"
kentsommer/pytorch-value-iteration-networks
Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)
Language: Python - Size: 128 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 314 - Forks: 61

pemami4911/POMDPy
POMDPs in Python.
Language: Python - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 238 - Forks: 62

Madhu009/Deep-math-machine-learning.ai
A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Language: Jupyter Notebook - Size: 44.5 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 195 - Forks: 174

AgentMaker/Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Language: Python - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 110 - Forks: 13

chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Language: Python - Size: 25.8 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 103 - Forks: 31

sachinbiradar9/Markov-Decision-Processes
Implementation of value iteration algorithm for calculating an optimal MDP policy
Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 85 - Forks: 43

tanmayshankar/RCNN_MDP
Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
Language: Python - Size: 33.6 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 68 - Forks: 28

iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++ - Size: 20.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 58 - Forks: 9

callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

iamjagdeesh/Artificial-Intelligence-Pac-Man
CSE 571 Artificial Intelligence
Language: Python - Size: 2.29 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 54

PhadonP/Rubiks-Cube-Reinforcement-Learning
Solving a Rubik's Cube and 15 Puzzle using the Deep Reinforcement Learning and Search
Language: Jupyter Notebook - Size: 102 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 33 - Forks: 4

YyzHarry/SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
Language: Python - Size: 1.47 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 31 - Forks: 6

moripiri/Reinforcement-Learning-on-FrozenLake
Reinforcement Learning Algorithms in FrozenLake-v1
Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 2

BertrandBev/controls-js
⚙️ Controls.js is a sandbox showcasing a few modern controls techiques directly in the browser
Language: JavaScript - Size: 4.24 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 0

tirthajyoti/RL_basics
Basic Reinforcement Learning algorithms
Language: Jupyter Notebook - Size: 2.29 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 18 - Forks: 13

neka-nat/vin-keras
This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras
Language: Python - Size: 4.66 MB - Last synced at: 29 days ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 7

alwaysbyx/Optimization-and-Search
Implementation and visualization (some demos) of search and optimization algorithms.
Language: Python - Size: 79.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

aaksham/frozenlake
Value & Policy Iteration for the frozenlake environment of OpenAI
Language: Python - Size: 167 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 15 - Forks: 11

svpino/cs7641-assignment4
CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes
Language: Java - Size: 70.3 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 14

bfaure/AI_Project_4
Language: Python - Size: 442 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 33

joefarrington/mdpax
GPU-accelerated MDP solvers in Python with JAX
Language: Python - Size: 5.07 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 0

mgiannopoulos24/Artificial-Intelligence
Solutions for the Projects of the Artificial Intelligence (CS 188) course of UC Berkeley
Language: Python - Size: 22.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 12 - Forks: 9

MahanFathi/HJxB
Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)
Language: Python - Size: 142 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

rmoehn/piglet_pbvi
Implementation of point-based value iteration (for POMDPs)
Language: Python - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 12 - Forks: 2

antonio-f/Dynamic-Programming
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
Language: Jupyter Notebook - Size: 179 KB - Last synced at: 28 days ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 4

mbodenham/gridworld-value-iteration
Using value iteration to find the optimum policy in a grid world environment.
Language: Python - Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 4

open-spaced-repetition/SSP-MMC-FSRS
Stochastic-Shortest-Path-Minimize-Memorization-Cost for FSRS
Language: Jupyter Notebook - Size: 53.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 3

waqasqammar/MDP-with-Value-Iteration-and-Policy-Iteration
Value Iteration and Policy Iteration to solve MDPs
Language: Jupyter Notebook - Size: 188 KB - Last synced at: 27 days ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 7

shehio/ReinforcementLearning
Reinforcement Learning algorithms with nothing abstracted away
Language: Python - Size: 788 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

auralius/yadpf
A generic implementation of dynamic programming algorithm and value iteration algorithm.
Language: MATLAB - Size: 20.6 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 7 - Forks: 4

sachag678/Reinforcement_learning
Contains baseline implementations of all RL algorithms using tabular and function approximations. Algorithms such as TD(0), MC, SARSA, Q-Learning and Policy Gradient methods.
Language: Jupyter Notebook - Size: 227 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 2

nicoRomeroCuruchet/DynamicProgramming
Policy Iteration for Continuous Dynamics
Language: Python - Size: 58.1 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 0

shehio/Stochastic-Programming
Devising an optimal portfolio choosing strategy based on stochastic programming
Language: Python - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 2

saminheydarian/Interactive_Learning_Course_2021
Interactive Learning Course | Home Works & Quiz | Fall 2021 | Prof. Majid Nili
Language: Jupyter Notebook - Size: 5.5 MB - Last synced at: 11 months ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 1

KHvic/Markov-Decision-Process-Value-Iteration-Policy-Iteration-Visualization
Computing an optimal Markov Decision Process (MDP) policy with Value Iteration and Policy Iteration
Language: Java - Size: 3.59 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 3

ColasGael/RL-flappy-bird
Reinforcement Learning on playable version of Flappy Bird
Language: Python - Size: 168 KB - Last synced at: 28 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

CEDL2017/homework2-MDPs
The homework for Cutting-Edge of Deep Learning, aka CEDL, from NTHU
Language: Jupyter Notebook - Size: 331 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 44

caelan/planning-algorithms
MIT Planning Algorithms Class Implementations
Language: Python - Size: 10.5 MB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 3

ariankhanjani/Frozen-Lake-Openai-Gym
Implementation of RL Algorithms in Openai Gym Frozen-Lake Environment
Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

idigitopia/Distributed-VI
Distributed Implementation of Value Iteration, built on top of Ray Framework.
Language: HTML - Size: 41.7 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

qdevpsi3/randomized-value-iteration
JAX implementation of Generalization and Exploration via Randomized Value Functions (Osband et al., 2016)
Language: Python - Size: 242 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

ZikangZhou/nim_rl
A reinforcement learning framework for the game of Nim.
Language: C++ - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

prakHr/Reinforcement-Learning-Book
[Book] :- Andrea Lonza - Reinforcement Learning Algorithms with Python_ Learn, understand, and develop smart algorithms for addressing AI challenges-Packt Publishing (2019)
Language: Python - Size: 20.9 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 4

masoudslipknot/Reinforcment_Learning_ValueIteration
Reinforcement- Learning project: Value Iteration Implementation.
Language: Java - Size: 35.2 KB - Last synced at: 8 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

Breakend/ValuePolicyIterationVariations
Experiments testing variants of Value and Policy iterations.
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 5 - Forks: 3

Pegah-Ardehkhani/Reinforcement-Learning-Algorithms-from-Scratch
Explore key RL algorithms with detailed explanations and fully commented Python code implementations
Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

parissashahabi/Game-Playing-Intelligent-Agent
Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.
Language: Python - Size: 331 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

PeeteKeesel/Basic-RL-Algorithms
:robot: Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.
Language: Python - Size: 18.8 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

Atul-Acharya-17/Markov-Decision-Process
Solving Markov Decision Process using Value Iteration and Policy Iteration, SARSA, Expected SARSA and Q-Learning
Language: Jupyter Notebook - Size: 9.34 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

OleguerCanal/RL-algorithms
Numpy & Keras based re-implementation of basic RL-algorithms: DP, VI, PI, SARSA, Q-Learning, DQN
Language: Python - Size: 8.17 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 1

ShreeshaN/ReinforcementLearningTutorials
This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient
Language: Python - Size: 4.32 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 6

being-aerys/Distributed_Reinforcement_Learning_with_Intel-DevCloud
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

Chaoukia/Reinforcement-Learning-course
A Reinforcement Learning course with classic examples of agents trained on gym environments.
Language: Python - Size: 1.69 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

auralius/parallel-value-iteration
Finding a shortest path on a binary occupancy map
Language: Python - Size: 634 KB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

lukasmyth96/Piggy
Using Value Iteration and Policy Iteration to discover the optimal solution for the strategic dice game PIG. Ultimately interested in whether the optimal solution can be reached through self-play alone.
Language: Python - Size: 23.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

ljharb/iterate-value
Iterate any iterable JS value. Works robustly in all environments, all versions.
Language: JavaScript - Size: 54.7 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

nicolaloi/Dynamic-Programming-and-Optimal-Control
Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".
Language: MATLAB - Size: 758 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

Kytabyte/rl-playground
Implementation and experiments of reinforcement learning algorithms in CS885 @ UW
Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Prakhar-FF13/Reinforcement-Learning-With-Python
Reinforcement Learning Notebooks
Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

zyxue/rljs
RLjs currently serves as an interactive playground for learning reinforcement learning.
Language: JavaScript - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

akaAlbo/deeprlbootcamp
Solution to the Deep RL Bootcamp labs from UC Berkeley
Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

Stephen-X/Machine-Learning-Algorithms
A repository for hosting some of the popular machine learning algorithm implementations.
Language: Python - Size: 819 KB - Last synced at: 7 days ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 3

victor-iyi/navigating-a-virtual-world-using-dynamic-programming
A reinforcement learning agent navigating the OpenAI's FrozenLake environment
Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

cnheider/gym_solutions
Language: Python - Size: 2.04 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

PranavNyati/Reinforcement_Learning_Term_Project
This repository contains the codes for Term Projects as part of the Reinforcement Learning course (CS600077) that I am taking in the Autumn 2023 semester at IIT Kharagpur
Language: Python - Size: 55.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

SiavashShams/Intelligent-Systems-Projects
Projects for the Intelligent Systems course
Language: Jupyter Notebook - Size: 3.65 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

MohammadAsadolahi/Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-policy-iteration-in-python
solving a simple 4*4 Gridworld almost similar to openAI gym frozenlake using value iteration method Reinforcement Learning
Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

sharma-n/Distributed_MDP_GPU
Distributed MPI based Heterogenous GPU Solver for Markov Decision Processes (MDP)
Language: Cuda - Size: 899 KB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

bhparijat/Parallel-Reinforcement-Learning
Parallel Implementation of RL Algorithms
Language: Jupyter Notebook - Size: 362 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

victor-iyi/simple-Q-network
A Q Learning Reinforcement agent using a simple feed forward neural net.
Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

JasonSloan/RL-Algrithoms-Reimplementation
Reinforcement Learning Implementation Inspired by Bilibili Professor Zhao Shiyu's Lecture at Westlake University
Language: Jupyter Notebook - Size: 22 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

fardinabbasi/Tabulated_RL
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
Language: Jupyter Notebook - Size: 4.96 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

IsmaelMousa/mdp-value-iteration
Implementation of the MDP algorithm for optimal decision-making, focusing on value iteration and policy determination.
Language: Python - Size: 114 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

kuku929/math_puzzle_rl
A 15-puzzle solver using Reinforcement Learning in C++
Language: C++ - Size: 13.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

PeeteKeesel/reinforce-py
🐍 Implementation of the REINFORCEjs library from Kaparthy in Python
Language: Jupyter Notebook - Size: 692 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

kyomangold/ETH-DynamicProgrammingOptimalControl
Repository for the code of the "Dynamic Programming and Optimal Control" (DPOC) lecture at the "Institute for Dynamic Systems and Control" at ETH Zurich.
Language: MATLAB - Size: 1.77 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ALotov2000/gym-frozen-lake-and-taxi-solved-by-reinforcement-learning
This repository belongs to one of my computer assignments for an AI course I attended at the University of Tehran.
Language: HTML - Size: 865 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nowke/rlviz
GridWorld Reinforcement Learning - Policy Iteration, Value Iteration.
Language: Vue - Size: 1.97 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

brozjak2/HSVIforOSPOSGs.jl
Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games
Language: Julia - Size: 2.86 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

meraccos/tictactoe-reinforcement-learning
Using MDP and Value Iteration to train a Tic Tac Toe agent
Language: Python - Size: 34.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

lkmuk/seq_decision_under_uncertainty
jupyter notebooks & notes on sequential decision under uncertainty
Language: Jupyter Notebook - Size: 147 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

peleiden/rl-rubiks
Using Deep Reinforcement Learning and Search for the Rubik's cube
Language: Python - Size: 6.25 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

paramrathour/Intelligent-and-Learning-Agents
My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22
Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

ca-scribner/lrl
lrl: Learn Reinforcement Learning - A package to help people learn basic planning and Reinforcement Learning
Language: Python - Size: 925 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-
Language: Jupyter Notebook - Size: 209 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

alizindari/Reinforcement-Learning
Implementation of several algorithms in RL based on Prof. sutton's book
Language: Jupyter Notebook - Size: 510 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

prtkmishra/pacman
This repository has the code I wrote for Markovian Pacman
Language: Python - Size: 167 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Megha-Bose/Markov-Decision-Process
Computing optimal MDP policy using Value Iteration Algorithm and Linear Programming
Language: Python - Size: 2.04 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

Architjain128/Value-Iteration
This assignment is based on the concept of the Bellman equation on the basis of the value iteration algorithm for solving MDPs.
Language: Python - Size: 1.45 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

MohandHAMADOUCHE/Comparison_of_V-Iter_Vs_P-Iter_Vs_Q-learn
Comparison of Value Iteration, Policy Iteration and Q-Learning for solving Decision-Making problems
Language: MATLAB - Size: 1.18 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

danielakuinchtner/cp-mdp
A CANDECOMP-PARAFAC tensor decomposition method to solve a Markov Decision Process (MDP) gridworld problem.
Language: Python - Size: 463 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

yahsiuhsieh/frozen-lake
Value Iteration, Policy Iteration, and Q-Learning in Frozen lake gym env
Language: Python - Size: 170 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

mett29/Reinforcement-Learning
This repository is dedicated to the reinforcement learning examples. I will also upload some algorithms which are somehow correlated with RL.
Language: Python - Size: 264 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

einstein07/RL-Value-Iteration
Program to find the optimal value (V ∗ ) for each state in a small grid-world, implemented (in C++) with the Value Iteration algorithm.
Language: C++ - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

sayarghoshroy/Markov-Decision-Process
Value Iteration Algorithm for calculating utilities
Language: Python - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

khush3/rl_algorithms
reinforcement learning algorithms implementation. custom opencv based environment to test codes.
Language: Jupyter Notebook - Size: 1.2 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

instance01/osmnx-mdp
Solving routing with uncertainty using Markov decision processes
Language: Python - Size: 47.2 MB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

joneug/dvr-value-iteration 📦
Simulation of Value Iteration for a Dynamic Vehicle Routing Problem
Language: Julia - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

andrecianflone/policy_value_iteration
Policy and Value Iteration with a GridWorld!
Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

sunayana-vempati/Machine-Learning
Developed projects as a part of INF-552 Course at University of Southern California - Fall 2019
Language: Python - Size: 5.15 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0
