An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: markov-decision-processes

paramrathour/Intelligent-and-Learning-Agents

My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22

Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

sankalprane/Artificial_Intelligence

Implemented Search Algorithms

Language: C++ - Size: 253 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 0

814CK5N0W/zuverlaessigkeitsmodelle

⬜️ Публичный репозиторий работ по курсу

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bhaveshachhada/mdp-path-finder

Repository consists a project for path finding using Markov Decision Process (MDP). MDP is a Reinforcement Learning Algorithm.

Language: Python - Size: 37.1 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

wlxiong/PyABM

Markov decision process simulation model for household activity-travel behavior

Language: Python - Size: 172 KB - Last synced at: about 2 years ago - Pushed at: almost 10 years ago - Stars: 6 - Forks: 2

wlxiong/PyMarkovActv

A Markov Decision Process (MDP) model for activity-based travel demand model

Language: Python - Size: 1.16 MB - Last synced at: about 2 years ago - Pushed at: over 12 years ago - Stars: 6 - Forks: 1

amflorio/dvrp-stochastic-requests

Online algorithms for solving large-scale dynamic vehicle routing problems with stochastic requests

Language: Makefile - Size: 14.1 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 42 - Forks: 10

dannbuckley/rust-gridworld

Gridworld MDP Example implemented in Rust

Language: Rust - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

meraccos/tictactoe-reinforcement-learning

Using MDP and Value Iteration to train a Tic Tac Toe agent

Language: Python - Size: 34.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

florist-notes/CS228_PGM

🌀 Stanford CS 228 - Probabilistic Graphical Models

Language: Python - Size: 45.9 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 63 - Forks: 22

parissashahabi/reinforcement_learning

Markov decision process and q-learning implementation on a nondeterministic grid environment with doors and keys, diagonal moves, and other aggravating circumstances.

Language: Python - Size: 186 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

miguelTavora/Artifical-Intelligence

Project with reactive architecture, state space search, markov decision process and reinforcement learning

Language: Java - Size: 3.13 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mabirck/Deep_RL_Bootcamp

Solutions for the labs in Deep RL Bootcamp.

Language: Jupyter Notebook - Size: 5.73 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

Nandarhline/FAD_pomdp_integration

Modelling of complex deterioration models and failure criteria in DBNs and POMDPs

Language: MATLAB - Size: 29.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shr2020/Artificial-Intelligence-Algorithms

Implementing various AI algorithms

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

shehio/Everything-Financial-Engineering

Links for the most relevant topics

Size: 29.3 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 2

SELab-unimi/mdp-generator

MDP Domain specific language and code generation

Language: Xtend - Size: 1.44 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

justachetan/spa

Code written as a part of MTH371 Stochastic Processes and its Applications taught my Dr. Monika Arora at IIIT Delhi in Monsoon 2018

Language: R - Size: 786 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

vamsi-bulusu/AI-Projects

CS520 AI Projects

Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

svpino/cs7641-assignment4

CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes

Language: Java - Size: 70.3 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 14

jerry-shijieli/Feature_Selection_For_Reinforcement_Learning

Feature selection for reinforcement learning in educational policy development

Language: Jupyter Notebook - Size: 7.63 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

danielakuinchtner/cp-mdp

A CANDECOMP-PARAFAC tensor decomposition method to solve a Markov Decision Process (MDP) gridworld problem.

Language: Python - Size: 463 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

MichaelFish199/GameFrozenLake-in-CSharp-with-QLearningAgent

This project is a C# implementation of the popular game "Frozen Lake" and an AI agent that can play the game using the Q-learning algorithm. The game consists of a grid of tiles, some of which are safe to walk on, while others will cause the player to receive damage.

Language: C# - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

quawood/RLearning

Playing around with reinforcement learning

Language: Python - Size: 218 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

iRaneem/AI-fundmental-CCAI221

This is my work at 2021/2022 include : lab , assignment and project solutions

Language: Prolog - Size: 8.53 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

rssalessio/BestPolicyIdentificationMDP

Python implementation of algorithms for Best Policy Identification in Markov Decision Processes

Language: Python - Size: 55.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

aalexren/iu-rl

[Innopolis University] Reinforcement Learning Course 2022.

Language: Jupyter Notebook - Size: 100 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

alisatodorova/Briscola-Project

A semester-long project about AI vs. Human version of the Italian card game Briscola using Machine Learning, Reinforcement Learning, Adversarial Search, Markov Decision Process, Minimax algorithm and Monte Carlo Tree Search.

Language: Java - Size: 153 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ShivamChourey/MDP_Path_Planning

This repository contains the MATLAB code to devise an optimal policy for the motion of the robot given the obstacles and world boundaries. This file contains implementation to a specific environment wiht known parameters and obstacles, but can easily be modified or generalized for any environment. The code was linked to the V-Rep simulation environment and tested.

Language: MATLAB - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 8

KryeKuzhinieri/Solving-Markov-Chains-in-Python

Mean Passage Times, Steady State Probabilities, N-step probabilities, Markov Chains

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

KryeKuzhinieri/Solving-Markov-Chains-in-R

Markov Chains are solved using R programming

Language: R - Size: 252 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Awni00/decentralized-MARL-general-cts-spaces

This repository studies and implements multi-agent reinforcement learning algorithms.

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Akshit-Panapuzha/Self-Driving-Car-Simulation

A simulation that allows user to create a path using their mouse, in which a car with 3 sensors will path its way through, learning through a Markov Decision Process and keeps adjusting weights as it hits positive and negative rewards.

Language: Python - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

joulook/Performance-Evaluation-of-Computer-Systems-Fall-2021

In this repository you can find all of my projects for Performance Evaluation of Computer Systems Course when I was in 3rd semester of my master's at SUT.

Language: Java - Size: 3.23 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

simerplaha/reinforcement-learning

Reinforcement learning

Language: Scala - Size: 174 KB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

cnheider/gym_solutions

Language: Python - Size: 2.04 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

iamvigneshwars/rock-paper-scissors-ai

The AI based on markov chain model predicts your next move based on your previous move. The AI gets better as you play more rounds (it learns your patterns as you play more rounds).

Language: JavaScript - Size: 7.16 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

tomasort/MDP_Solver

Simple program to solve Markov Decision Processes using policy iteration and value iteration.

Language: Python - Size: 1.32 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ezerilli/Machine_Learning

Georgia Tech - OMSCS - CS7641 - Machine Learning Repository

Language: Python - Size: 34.2 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 7

CyberLemonade/Markov-Decision-Process-WIP

Implementation of MDP in Java using Deep Q-Learning

Language: Java - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mz-zarei/RL-project

Language: Jupyter Notebook - Size: 868 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ar8372/Image-Feature-extraction-using-Reinforcement-Learning

In this project we use Reinforcement Learning to extract features from an image.

Language: Jupyter Notebook - Size: 3.43 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

StuartTruax/markov_decision_processes

Implementations of several solution methods for Markov decision processes (MDPs).

Language: Jupyter Notebook - Size: 75.2 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

salviosage/AI-playground

Playground for Artificial intelligence projects and exercises I come across.

Language: Python - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

ondrejbiza/mdp_abstraction

Algorithms for minimization of Markov Decision Processes.

Language: Python - Size: 10.8 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

somu15/Disf_Hazard

This repo consists of the codes used for a paper titled "DISFUNCTIONALITY HAZARD: A RISK-BASED TOOL TO SUPPORT THE RESILIENT DESIGN OF SYSTEMS SUBJECTED TO SINGLE HAZARDS AND MULTIHAZARDS."

Language: MATLAB - Size: 388 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

pablo-tech/ReinforcementLearning

Grids, mountains, and mysterious problems. Solved with Partially-Observable Markov Decision Procesees. Created at Stanford University, by Pablo Rodriguez Bertorello

Language: Julia - Size: 8.79 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

nicolasantero/Markov_Decision_Process-MDP--Reinforcement_Learning

Markov Decision Process to find optimal path.

Language: Jupyter Notebook - Size: 897 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

krishnaw14/CS747-assignments

Foundations of Intelligent and Learning Agenet

Language: Python - Size: 992 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

jackyan540/cs181-homework6

Spring 2021 Machine Learning (CS 181) Homework 6

Language: Python - Size: 1.16 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

jajokine/Text-Based-Game-with-Markov-Decision-Process

MITx - MicroMasters Program on Statistics and Data Science - Machine Learning with Python - Fifth Project

Language: Python - Size: 103 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

JunhongXu/probablistic-robotics

Exercise solutions and algorithm implementations in Python and C++ for the book Probabilistic Robotics

Language: Python - Size: 1.95 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

pradyumnameena/COL333-Artificial-Intelligence

Collection of assignments given by Prof. Mausam in the COL333 course

Language: C++ - Size: 8.89 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

dizys/nyu-ai-lab-3

NYU Artificial Intelligence Course Lab 3: A generic Markov process solver.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

kvignesh1420/AI

Artificial Intelligence

Language: Python - Size: 17.8 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yxu1183/Machine-Learning

Machine Learning Algorithms - Naive Bayes Classifier, Bayesian Estimation, Linear Regression, Neural Network, Decision Trees, K-means Clustering, Markov Decision Process

Language: Python - Size: 21.2 MB - Last synced at: 8 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

imimali/reinforcement-learning-specialization

Reinforcement Learning Specialization courses solutions

Language: Jupyter Notebook - Size: 74.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

iamvigneshwars/ai-plays-frozen-lake

Simple AI agent built using MDP to cross a frozen lake without falling into the hole.

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

brendenehlers/FakeTweetGenerator

Language: Python - Size: 208 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

shehio/ReinforcementLearning

Reinforcement Learning algorithms with nothing abstracted away

Language: Python - Size: 788 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

Heewon-Hailey/AI-reinforcement-learning

implement reinforcement learning algorithms in Pacman

Language: Python - Size: 347 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

SELab-unimi/mbt-module

MBT module

Language: Java - Size: 63.5 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

steve303/AI_gym-MarkovDecisionProcess

Objective: Using the AI_gym environment design an algorithm which will instruct an agent to learn and succeed at different tasks

Language: Jupyter Notebook - Size: 26.9 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

prs98/Vehicle-Performance-Optimization

Designed a greedy algorithm based on Markov sequential decision-making process in MATLAB/Python to optimize using Gurobi solver, the wheel size, gear shifting sequence by modeling drivetrain constraints to achieve maximum laps in a race with a 2-hour time window.

Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

primaryobjects/qlearning

A game using Q-Learning artificial intelligence.

Language: JavaScript - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 4

allenwest24/MDPs-and-Q-learning-On-Ice

Using Markov Decision Processes and Q-Learning on a variation of the Wumpus World problem.

Language: Jupyter Notebook - Size: 42 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

roman1e2f5p8s/rlapseingym

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation (RLAPSE)

Language: Jupyter Notebook - Size: 443 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

makarbaderko/grid_world_rl

MDP and Monte Carlo solution for maze solving

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

CEDL2017/homework2-MDPs

The homework for Cutting-Edge of Deep Learning, aka CEDL, from NTHU

Language: Jupyter Notebook - Size: 331 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 44

Tole-Git/MarkovStoryMaker

Artificial Intelligence story maker using methods such as the markov chain, bigram & trigram models.

Language: Java - Size: 215 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

shashankiit/FILA

Assignments for CS 747 course offered at IIT Bombay

Language: Python - Size: 12.8 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

prtkmishra/pacman

This repository has the code I wrote for Markovian Pacman

Language: Python - Size: 167 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

jonathanloganmoran/EDAN95-Applied-Machine-Learning

This is the repository for the EDAN95 - Tillämpad maskininlärning (Applied Machine Learning) course given at Lunds Tekniska Högskola (LTH) during the Fall 2019 term.

Language: Jupyter Notebook - Size: 3.97 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

milosgajdos/udacity-ai-nanodegree

Udacity AI Nanodegree projects

Language: Jupyter Notebook - Size: 148 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

deutranium/Markov-Decision-Processes

Implementing Markov Decision Process from scratch in Python

Language: Jupyter Notebook - Size: 3.41 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

i2a-k/Reinforcement-Learning

Multi-Armed Bandit Simulation, MDP GridWorld Example, Random Walk Problem by TD and MC

Language: Jupyter Notebook - Size: 186 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

harmim/vut-sav-project

Static Analysis and Verification - Project - PRISM

Language: TeX - Size: 1.88 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

DevanshuSave/Pacman-and-Ghostbusters

Pacman and Ghost Agent | Python | Artificial Intelligence | Search-based Algorithms | Learning-based Algorithms

Language: Python - Size: 571 KB - Last synced at: 23 days ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 4

Naharul98/Pacman-AI-agent-for-stochastic-environment

A Markov Decision Process (MDP) based implementation of a Pacman agent, to survive and battle through a handicapped stochastic environment.

Language: Python - Size: 342 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

callmespring/TestMDP Fork of RunzheStat/TestMDP

Implementation of "Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making”(ICML 2020) in Python

Language: Python - Size: 983 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

AnsSUN/Reinf.-Learn.-gridworld-project

Creation of grid world environment through pygame package and optimizing the motion of agent through modified q-learning process. Video can be found here: https://www.youtube.com/watch?v=-nXH8k9gRLM

Language: Python - Size: 8.89 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

KaleabTessera/Gridworld-Markov-Decision-Process

Implementing a gridworld from scratch and configuring it as a Markov decision process.

Language: Jupyter Notebook - Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Mogady/AI-Algorithms-with-R

Implementation for some AI algorithms with R for my college assignments

Language: R - Size: 197 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

shamo0/ValueIteration

Program solves an MDP using value iteration

Language: Python - Size: 90.8 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

JoicePaz/computacional-simulation-studies

Repo to keep track of my learning process in this fantastic subject.

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

JoicePaz/tcc

Computational Simulation of water consumption using Markov chains.

Language: Python - Size: 2.86 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rssalessio/PrivacyStochasticSystems

Code used in "Minimizing Information Leakage of Abrupt Changes in Stochastic Systems" , by Alessio Russo, [email protected] .

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

berberto/GraphMDP

Kullback-Leibler regularized shortest path on a random graph from RNA velocity data

Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

omriattal/Intro-to-AI

The programming assignments of the course Introduction to Artificial Intelligence in Ben Gurion University, Israel

Language: Python - Size: 108 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

wlxiong/MxMarkovActv

A simplified Markov decion process (MDP) model of activity-travel schdueling (using MATLAB)

Language: Matlab - Size: 160 KB - Last synced at: about 2 years ago - Pushed at: about 12 years ago - Stars: 2 - Forks: 1

yakuana/MDP

Markov Decision Process - Value Iteration Exploration

Language: Java - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

vlfom/StepLearn

Applying Markov Decision Processes and Q-Learning to a robot movement model

Language: Java - Size: 350 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 3

robotenique/AI-programming

:bulb: Classical AI algorithms, and a bit of Reinforcement Learning >:)

Language: Python - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

shivchander/reinforcement-learning-snakes-ladders

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

batra98/MDP-Basics

Using MDP based models (Value Iteration and Policy Iteration) on toy environments.

Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

michellesri/cs188

UC Berkeley CS188: Artificial Intelligence

Language: Python - Size: 1.42 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 5

K-Frash/ThePointRunner-MDP

An rougelike board game where you create the dungeon + environmental conditions and an AI agent will determine the optimal policy to traverse the dungeon to the treasure.

Language: C# - Size: 463 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rohitdavas/Reinforcement-Learning

RL models from base.

Language: Jupyter Notebook - Size: 130 MB - Last synced at: 2 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

saikumarkaleru/Analyse-User-Behaviour-Optimise-the-User-Workflow-Using-a-Machine-Learning-Algorithm

Through this project, based on the characteristics, we will come up with another set of guided navigation, which can guide the user in case the user misses steps in a workflow on a consistent manner.

Language: Jupyter Notebook - Size: 688 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

SentientOrange/Rubiks-Cube

Reinforcement Learning program that looks to be able to quickly learn to solve a Rubik's Cube

Language: Python - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 1

Related Keywords
markov-decision-processes 336 reinforcement-learning 147 value-iteration 52 q-learning 50 artificial-intelligence 50 mdp 45 python 44 machine-learning 43 markov-chain 34 policy-iteration 31 dynamic-programming 21 reinforcement-learning-algorithms 19 ai 18 markov-model 15 sarsa 14 deep-reinforcement-learning 14 monte-carlo 13 markov 13 qlearning 12 policy-gradient 12 deep-learning 11 decision-making 10 gridworld 9 openai-gym 9 mdps 8 optimization 7 temporal-differencing-learning 7 bellman-equation 7 python3 7 pomdps 7 neural-network 7 deep-q-network 7 rl 6 planning 6 julia 6 stochastic-processes 6 astar-algorithm 6 constraint-satisfaction-problem 5 model-checking 5 jupyter-notebook 5 pytorch 5 tensorflow 5 hidden-markov-model 5 pomdp 5 neural-networks 5 random-walk 5 policy-evaluation 5 multi-armed-bandit 5 sarsa-lambda 5 pygame 5 value-iteration-algorithm 4 monte-carlo-tree-search 4 qlearning-algorithm 4 markov-decision-process 4 probabilistic-graphical-models 4 optimal-control 4 dqn 4 simulation 4 control-theory 4 numpy 4 probabilistic-models 4 adversarial-search 4 alpha-beta-pruning 4 linear-programming 4 temporal-difference 4 javascript 4 deep-q-learning 4 multi-agent-systems 4 grid-world 4 rust 4 reinforcement-learning-agent 4 algorithm 4 proximal-policy-optimization 4 bfs 4 gym 4 reinforce 3 multi-armed-bandits 3 solver 3 monte-carlo-methods 3 economics 3 operations-research 3 r 3 statistics 3 robotics 3 search 3 monte-carlo-simulation 3 probabilistic-programming 3 agent-based-modeling 3 expectimax 3 minimax-algorithm 3 heuristic-search-algorithms 3 planning-algorithms 3 game-theory 3 game-development 3 agent 3 minimax 3 travel-demand-modelling 3 csharp 3 dec-pomdp 3 nlp 3