An open API service providing repository metadata for many open source software ecosystems.

Topic: "value-iteration"

kentsommer/pytorch-value-iteration-networks

Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Language: Python - Size: 128 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 314 - Forks: 61

pemami4911/POMDPy

POMDPs in Python.

Language: Python - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 238 - Forks: 62

Madhu009/Deep-math-machine-learning.ai

A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.

Language: Jupyter Notebook - Size: 44.5 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 195 - Forks: 174

AgentMaker/Paddle-RLBooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Language: Python - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 110 - Forks: 13

chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

Language: Python - Size: 25.8 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 103 - Forks: 31

sachinbiradar9/Markov-Decision-Processes

Implementation of value iteration algorithm for calculating an optimal MDP policy

Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 85 - Forks: 43

tanmayshankar/RCNN_MDP

Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.

Language: Python - Size: 33.6 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 68 - Forks: 28

iisys-hof/map-matching-2

High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).

Language: C++ - Size: 20.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 58 - Forks: 9

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

iamjagdeesh/Artificial-Intelligence-Pac-Man

CSE 571 Artificial Intelligence

Language: Python - Size: 2.29 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 54

PhadonP/Rubiks-Cube-Reinforcement-Learning

Solving a Rubik's Cube and 15 Puzzle using the Deep Reinforcement Learning and Search

Language: Jupyter Notebook - Size: 102 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 33 - Forks: 4

YyzHarry/SV-RL

[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning

Language: Python - Size: 1.47 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 31 - Forks: 6

moripiri/Reinforcement-Learning-on-FrozenLake

Reinforcement Learning Algorithms in FrozenLake-v1

Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 2

BertrandBev/controls-js

⚙️ Controls.js is a sandbox showcasing a few modern controls techiques directly in the browser

Language: JavaScript - Size: 4.24 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 0

tirthajyoti/RL_basics

Basic Reinforcement Learning algorithms

Language: Jupyter Notebook - Size: 2.29 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 18 - Forks: 13

neka-nat/vin-keras

This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras

Language: Python - Size: 4.66 MB - Last synced at: 29 days ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 7

alwaysbyx/Optimization-and-Search

Implementation and visualization (some demos) of search and optimization algorithms.

Language: Python - Size: 79.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

aaksham/frozenlake

Value & Policy Iteration for the frozenlake environment of OpenAI

Language: Python - Size: 167 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 15 - Forks: 11

svpino/cs7641-assignment4

CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes

Language: Java - Size: 70.3 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 14

bfaure/AI_Project_4

Language: Python - Size: 442 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 33

joefarrington/mdpax

GPU-accelerated MDP solvers in Python with JAX

Language: Python - Size: 5.07 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 0

mgiannopoulos24/Artificial-Intelligence

Solutions for the Projects of the Artificial Intelligence (CS 188) course of UC Berkeley

Language: Python - Size: 22.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 12 - Forks: 9

MahanFathi/HJxB

Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)

Language: Python - Size: 142 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

rmoehn/piglet_pbvi

Implementation of point-based value iteration (for POMDPs)

Language: Python - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 12 - Forks: 2

antonio-f/Dynamic-Programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

Language: Jupyter Notebook - Size: 179 KB - Last synced at: 28 days ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 4

mbodenham/gridworld-value-iteration

Using value iteration to find the optimum policy in a grid world environment.

Language: Python - Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 4

open-spaced-repetition/SSP-MMC-FSRS

Stochastic-Shortest-Path-Minimize-Memorization-Cost for FSRS

Language: Jupyter Notebook - Size: 53.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 3

waqasqammar/MDP-with-Value-Iteration-and-Policy-Iteration

Value Iteration and Policy Iteration to solve MDPs

Language: Jupyter Notebook - Size: 188 KB - Last synced at: 27 days ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 7

shehio/ReinforcementLearning

Reinforcement Learning algorithms with nothing abstracted away

Language: Python - Size: 788 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

auralius/yadpf

A generic implementation of dynamic programming algorithm and value iteration algorithm.

Language: MATLAB - Size: 20.6 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 7 - Forks: 4

sachag678/Reinforcement_learning

Contains baseline implementations of all RL algorithms using tabular and function approximations. Algorithms such as TD(0), MC, SARSA, Q-Learning and Policy Gradient methods.

Language: Jupyter Notebook - Size: 227 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 2

nicoRomeroCuruchet/DynamicProgramming

Policy Iteration for Continuous Dynamics

Language: Python - Size: 58.1 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 0

shehio/Stochastic-Programming

Devising an optimal portfolio choosing strategy based on stochastic programming

Language: Python - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 2

saminheydarian/Interactive_Learning_Course_2021

Interactive Learning Course | Home Works & Quiz | Fall 2021 | Prof. Majid Nili

Language: Jupyter Notebook - Size: 5.5 MB - Last synced at: 11 months ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 1

KHvic/Markov-Decision-Process-Value-Iteration-Policy-Iteration-Visualization

Computing an optimal Markov Decision Process (MDP) policy with Value Iteration and Policy Iteration

Language: Java - Size: 3.59 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 3

ColasGael/RL-flappy-bird

Reinforcement Learning on playable version of Flappy Bird

Language: Python - Size: 168 KB - Last synced at: 28 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

CEDL2017/homework2-MDPs

The homework for Cutting-Edge of Deep Learning, aka CEDL, from NTHU

Language: Jupyter Notebook - Size: 331 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 44

caelan/planning-algorithms

MIT Planning Algorithms Class Implementations

Language: Python - Size: 10.5 MB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 3

ariankhanjani/Frozen-Lake-Openai-Gym

Implementation of RL Algorithms in Openai Gym Frozen-Lake Environment

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

idigitopia/Distributed-VI

Distributed Implementation of Value Iteration, built on top of Ray Framework.

Language: HTML - Size: 41.7 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

qdevpsi3/randomized-value-iteration

JAX implementation of Generalization and Exploration via Randomized Value Functions (Osband et al., 2016)

Language: Python - Size: 242 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

ZikangZhou/nim_rl

A reinforcement learning framework for the game of Nim.

Language: C++ - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

prakHr/Reinforcement-Learning-Book

[Book] :- Andrea Lonza - Reinforcement Learning Algorithms with Python_ Learn, understand, and develop smart algorithms for addressing AI challenges-Packt Publishing (2019)

Language: Python - Size: 20.9 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 4

masoudslipknot/Reinforcment_Learning_ValueIteration

Reinforcement- Learning project: Value Iteration Implementation.

Language: Java - Size: 35.2 KB - Last synced at: 8 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

Breakend/ValuePolicyIterationVariations

Experiments testing variants of Value and Policy iterations.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 5 - Forks: 3

Pegah-Ardehkhani/Reinforcement-Learning-Algorithms-from-Scratch

Explore key RL algorithms with detailed explanations and fully commented Python code implementations

Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

parissashahabi/Game-Playing-Intelligent-Agent

Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.

Language: Python - Size: 331 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

PeeteKeesel/Basic-RL-Algorithms

:robot: Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.

Language: Python - Size: 18.8 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

Atul-Acharya-17/Markov-Decision-Process

Solving Markov Decision Process using Value Iteration and Policy Iteration, SARSA, Expected SARSA and Q-Learning

Language: Jupyter Notebook - Size: 9.34 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

OleguerCanal/RL-algorithms

Numpy & Keras based re-implementation of basic RL-algorithms: DP, VI, PI, SARSA, Q-Learning, DQN

Language: Python - Size: 8.17 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 1

ShreeshaN/ReinforcementLearningTutorials

This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient

Language: Python - Size: 4.32 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 6

being-aerys/Distributed_Reinforcement_Learning_with_Intel-DevCloud

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

Chaoukia/Reinforcement-Learning-course

A Reinforcement Learning course with classic examples of agents trained on gym environments.

Language: Python - Size: 1.69 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

auralius/parallel-value-iteration

Finding a shortest path on a binary occupancy map

Language: Python - Size: 634 KB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

lukasmyth96/Piggy

Using Value Iteration and Policy Iteration to discover the optimal solution for the strategic dice game PIG. Ultimately interested in whether the optimal solution can be reached through self-play alone.

Language: Python - Size: 23.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

ljharb/iterate-value

Iterate any iterable JS value. Works robustly in all environments, all versions.

Language: JavaScript - Size: 54.7 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

nicolaloi/Dynamic-Programming-and-Optimal-Control

Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".

Language: MATLAB - Size: 758 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

Kytabyte/rl-playground

Implementation and experiments of reinforcement learning algorithms in CS885 @ UW

Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

zyxue/rljs

RLjs currently serves as an interactive playground for learning reinforcement learning.

Language: JavaScript - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

akaAlbo/deeprlbootcamp

Solution to the Deep RL Bootcamp labs from UC Berkeley

Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

Stephen-X/Machine-Learning-Algorithms

A repository for hosting some of the popular machine learning algorithm implementations.

Language: Python - Size: 819 KB - Last synced at: 7 days ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 3

victor-iyi/navigating-a-virtual-world-using-dynamic-programming

A reinforcement learning agent navigating the OpenAI's FrozenLake environment

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

cnheider/gym_solutions

Language: Python - Size: 2.04 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

PranavNyati/Reinforcement_Learning_Term_Project

This repository contains the codes for Term Projects as part of the Reinforcement Learning course (CS600077) that I am taking in the Autumn 2023 semester at IIT Kharagpur

Language: Python - Size: 55.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

SiavashShams/Intelligent-Systems-Projects

Projects for the Intelligent Systems course

Language: Jupyter Notebook - Size: 3.65 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

MohammadAsadolahi/Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-policy-iteration-in-python

solving a simple 4*4 Gridworld almost similar to openAI gym frozenlake using value iteration method Reinforcement Learning

Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

sharma-n/Distributed_MDP_GPU

Distributed MPI based Heterogenous GPU Solver for Markov Decision Processes (MDP)

Language: Cuda - Size: 899 KB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

bhparijat/Parallel-Reinforcement-Learning

Parallel Implementation of RL Algorithms

Language: Jupyter Notebook - Size: 362 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

victor-iyi/simple-Q-network

A Q Learning Reinforcement agent using a simple feed forward neural net.

Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

JasonSloan/RL-Algrithoms-Reimplementation

Reinforcement Learning Implementation Inspired by Bilibili Professor Zhao Shiyu's Lecture at Westlake University

Language: Jupyter Notebook - Size: 22 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

fardinabbasi/Tabulated_RL

Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili

Language: Jupyter Notebook - Size: 4.96 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

IsmaelMousa/mdp-value-iteration

Implementation of the MDP algorithm for optimal decision-making, focusing on value iteration and policy determination.

Language: Python - Size: 114 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

kuku929/math_puzzle_rl

A 15-puzzle solver using Reinforcement Learning in C++

Language: C++ - Size: 13.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

PeeteKeesel/reinforce-py

🐍 Implementation of the REINFORCEjs library from Kaparthy in Python

Language: Jupyter Notebook - Size: 692 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

kyomangold/ETH-DynamicProgrammingOptimalControl

Repository for the code of the "Dynamic Programming and Optimal Control" (DPOC) lecture at the "Institute for Dynamic Systems and Control" at ETH Zurich.

Language: MATLAB - Size: 1.77 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ALotov2000/gym-frozen-lake-and-taxi-solved-by-reinforcement-learning

This repository belongs to one of my computer assignments for an AI course I attended at the University of Tehran.

Language: HTML - Size: 865 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nowke/rlviz

GridWorld Reinforcement Learning - Policy Iteration, Value Iteration.

Language: Vue - Size: 1.97 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

brozjak2/HSVIforOSPOSGs.jl

Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games

Language: Julia - Size: 2.86 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

meraccos/tictactoe-reinforcement-learning

Using MDP and Value Iteration to train a Tic Tac Toe agent

Language: Python - Size: 34.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

lkmuk/seq_decision_under_uncertainty

jupyter notebooks & notes on sequential decision under uncertainty

Language: Jupyter Notebook - Size: 147 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

peleiden/rl-rubiks

Using Deep Reinforcement Learning and Search for the Rubik's cube

Language: Python - Size: 6.25 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

paramrathour/Intelligent-and-Learning-Agents

My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22

Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

ca-scribner/lrl

lrl: Learn Reinforcement Learning - A package to help people learn basic planning and Reinforcement Learning

Language: Python - Size: 925 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-

Language: Jupyter Notebook - Size: 209 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

alizindari/Reinforcement-Learning

Implementation of several algorithms in RL based on Prof. sutton's book

Language: Jupyter Notebook - Size: 510 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

prtkmishra/pacman

This repository has the code I wrote for Markovian Pacman

Language: Python - Size: 167 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Megha-Bose/Markov-Decision-Process

Computing optimal MDP policy using Value Iteration Algorithm and Linear Programming

Language: Python - Size: 2.04 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

Architjain128/Value-Iteration

This assignment is based on the concept of the Bellman equation on the basis of the value iteration algorithm for solving MDPs.

Language: Python - Size: 1.45 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

MohandHAMADOUCHE/Comparison_of_V-Iter_Vs_P-Iter_Vs_Q-learn

Comparison of Value Iteration, Policy Iteration and Q-Learning for solving Decision-Making problems

Language: MATLAB - Size: 1.18 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

danielakuinchtner/cp-mdp

A CANDECOMP-PARAFAC tensor decomposition method to solve a Markov Decision Process (MDP) gridworld problem.

Language: Python - Size: 463 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

yahsiuhsieh/frozen-lake

Value Iteration, Policy Iteration, and Q-Learning in Frozen lake gym env

Language: Python - Size: 170 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

mett29/Reinforcement-Learning

This repository is dedicated to the reinforcement learning examples. I will also upload some algorithms which are somehow correlated with RL.

Language: Python - Size: 264 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

einstein07/RL-Value-Iteration

Program to find the optimal value (V ∗ ) for each state in a small grid-world, implemented (in C++) with the Value Iteration algorithm.

Language: C++ - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

sayarghoshroy/Markov-Decision-Process

Value Iteration Algorithm for calculating utilities

Language: Python - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

khush3/rl_algorithms

reinforcement learning algorithms implementation. custom opencv based environment to test codes.

Language: Jupyter Notebook - Size: 1.2 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

instance01/osmnx-mdp

Solving routing with uncertainty using Markov decision processes

Language: Python - Size: 47.2 MB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

joneug/dvr-value-iteration 📦

Simulation of Value Iteration for a Dynamic Vehicle Routing Problem

Language: Julia - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

andrecianflone/policy_value_iteration

Policy and Value Iteration with a GridWorld!

Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

sunayana-vempati/Machine-Learning

Developed projects as a part of INF-552 Course at University of Southern California - Fall 2019

Language: Python - Size: 5.15 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Related Topics
reinforcement-learning 109 policy-iteration 104 q-learning 61 markov-decision-processes 52 dynamic-programming 32 mdp 23 artificial-intelligence 22 reinforcement-learning-algorithms 22 sarsa 21 python 18 machine-learning 17 deep-reinforcement-learning 15 policy-evaluation 13 dqn 12 epsilon-greedy 12 bellman-equation 11 deep-q-learning 10 monte-carlo 10 policy-gradient 10 openai-gym 9 temporal-differencing-learning 8 ai 8 qlearning 8 gridworld 7 deep-learning 7 linear-programming 7 reinforce 7 astar-algorithm 7 rl 6 actor-critic 6 pytorch 6 policy-improvement 6 gym 6 monte-carlo-methods 6 sarsa-learning 6 grid-world 6 pacman 6 neural-networks 5 frozenlake 5 alpha-beta-pruning 5 ddpg 5 markov-decision-process 5 decision-trees 5 machine-learning-algorithms 5 algorithms 5 qlearning-algorithm 5 gridworld-environment 5 optimal-control 5 backpropagation 4 python3 4 java 4 value-iteration-algorithm 4 expectimax 4 neural-network 4 minimax 4 julia 3 gymnasium 3 td3 3 q-learning-vs-sarsa 3 tensorflow 3 perceptron 3 jax 3 frozen-lake 3 td-learning 3 algorithm 3 intelligent-agent 3 optimal-policy 3 sarsa-lambda 3 agent 3 a-star 3 sac 3 optimization 3 dfs 3 bfs 3 model-based-rl 3 frozenlake-v0 3 heuristics 3 reinforcement-learning-agent 2 optimistic-inital-values 2 dijkstra 2 graph 2 iterative-policy-evaluation 2 keras 2 linear-regression 2 pomdp 2 word2vec 2 multiagent-systems 2 reinforcement-learning-environments 2 pacman-agent 2 hidden-markov-models 2 dyna-q 2 td-lambda 2 thompson-sampling 2 ucb1 2 cartpole-v1 2 hidden-markov-model 2 shortest-path-algorithm 2 cross-entropy-method 2 simulation 2 game-theory 2