An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: markov-decision-processes

robervz22/Optimal-Play-Pig

Replication and reproduction of the results in the article "Optimal Play of the Dice Game Pig" by Neller and Presser (2004)

Language: Python - Size: 71.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

VincentPinet/421-solver

Computing optimal strategy for the dice game 421

Language: C++ - Size: 26.4 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

JuliaPOMDP/CompressedBeliefMDPs.jl

Compressed belief-state MDPs in Julia for reinforcement learning and sequential decision making. Part of the POMDPs.jl community.

Language: Julia - Size: 643 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5 - Forks: 0

JuliaPOMDP/POMDPs.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Language: Julia - Size: 10.2 MB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 710 - Forks: 104

ds4dm/ecole

Extensible Combinatorial Optimization Learning Environments

Language: C++ - Size: 2.29 MB - Last synced at: about 11 hours ago - Pushed at: 16 days ago - Stars: 339 - Forks: 72

bmarroc/reinforcement-learning

Jupyter notebooks implementing Reinforcement Learning algorithms in NumPy and TensorFlow

Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 1

sshkhr/Practical_RL

My solutions to the Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow

Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 25

Limmen/csle

A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.

Language: Python - Size: 140 MB - Last synced at: about 13 hours ago - Pushed at: about 2 months ago - Stars: 126 - Forks: 21

TolgaOk/jaxdp

A Dynamic Programming package for discrete MDPs implemented in JAX

Language: Python - Size: 549 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 5 - Forks: 1

h2r/pomdp-py

A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/

Language: Python - Size: 6.85 MB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 245 - Forks: 53

thiagopbueno/awesome-probabilistic-planning

A curated list of online resources for probabilistic planning: papers, software and research groups around the world!

Size: 18.6 KB - Last synced at: about 23 hours ago - Pushed at: about 7 years ago - Stars: 62 - Forks: 12

odow/SDDP.jl

A JuMP extension for Stochastic Dual Dynamic Programming

Language: Julia - Size: 24.9 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 326 - Forks: 66

Matheussoranco/Decision-Making-Via-Markov-chains

A decision-making model based on Markov chains, opening the way for a kind of reasoning

Language: Python - Size: 5.86 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

DES-Lab/AALpy

An Automata Learning Library Written in Python

Language: Python - Size: 25.6 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 179 - Forks: 28

madupite/madupite

A High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDPs) relying on Inexact Policy Iteration; for Python and C++

Language: C++ - Size: 36.5 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 1

mhahsler/pomdp

R package for Partially Observable Markov Decision Processes

Language: R - Size: 2.86 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 6

ImanRHT/QECO

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.

Language: Python - Size: 17.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 204 - Forks: 37

vladimirhristovski/Agent-basedSystems

Agent-based Systems Exercises

Language: Python - Size: 106 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

who-else-but-arjun/Course_Project_DA221M

This project was made as the course project for DA221M - Artificial Intelligence, with an emphasis on the limitations of AI in language understanding.

Language: Python - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

soheil-mp/Reinforcement-Learning-Algorithms

Step by Step Reinforcement Learning Tutorials.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 6

IBM/IBM-Extended-Markov-Ratio-Decision-Process

This repo includes code referenced in the paper "A Rigorous Risk-aware Linear Approach to Extended Markov Ratio Decision Processes with Embedded Learning" by Alexander Zadorojniy, Takayuki Osogami, and Orit Davidovich, to appear in IJCAI 2023.

Language: Jupyter Notebook - Size: 905 KB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

sameysimon/MoralPlanner

Probabilistic Moral Planner based on heuristic Dynamic Programming AO* and Machine Ethics Hypothetical Retrospection argumentation. Works with conflicting moral theories and non-moral costs/goals.

Language: C++ - Size: 10.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

afshinea/stanford-cs-221-artificial-intelligence

VIP cheatsheets for Stanford's CS 221 Artificial Intelligence

Size: 10.1 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 2,676 - Forks: 507

florianvazelle/unity-rl

Markov Decision Process and Temporal Difference algorithms

Language: C# - Size: 291 KB - Last synced at: 27 days ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 0

iisys-hof/map-matching-2

High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).

Language: C++ - Size: 20.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 58 - Forks: 9

eleurent/finite-mdp

Gym environment for MDPs with finite state and action spaces

Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 4

zafarali/emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

Language: Python - Size: 82 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 49 - Forks: 14

upupming/Lab3-markov-decision-process

Language: HTML - Size: 1.2 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 3

cipryyyy/Markov

Text generator with a Markov chain

Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

harmim/vut-mba-projects

Model-Based Analysis of Systems - Projects

Language: TeX - Size: 2.61 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

OpenSourceEconomics/respy

Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.

Language: Python - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 77 - Forks: 32

OMB227/RL_Collaborative-Practicals

This repo is dedicated to the RL_Collaborative work

Language: Jupyter Notebook - Size: 8.1 MB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

LEAP-HI-ClimACT/Coastal-Infrastructure-Planning

Climate change-related risk mitigation for infrastructure systems often requires adaptation. A computational framework for optimal decision-making under uncertainty based on dynamically changing conditions observed in time is developed in response.

Language: MATLAB - Size: 4.56 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

rennaMAhcuS/Hands-on-RL

Hands-on-RL exploration and development.

Language: Python - Size: 29.4 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects

Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡

Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

colinskow/move37

Coding Demos from the School of AI's Move37 Course

Language: Python - Size: 59.6 KB - Last synced at: 17 days ago - Pushed at: over 6 years ago - Stars: 184 - Forks: 118

ComprisedAxis/Leveraging-Reinforcement-Learning-for-Cost-Effective-Medical-Diagnostics

The project focuses on dynamic diagnosis policies that can reduce costs while maintaining or improving diagnostic accuracy. Specifically, RL methods have been employed to balance the trade-off between medical testing budgets and prediction accuracy by identifying Pareto-optimal policies.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

thiagodsd/echecs-par-renforcement

Studies on MDP and reinforcement learning in chess, focusing on position representation & encoding.

Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JuliaPOMDP/quickpomdps

Interface for defining discrete and continuous-space MDPs and POMDPs in Python. Compatible with the POMDPs.jl ecosystem.

Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 4

liAmirali/UIAI-MDP (fork of InFluX-M/UIAI-MDP)

Cliff Walking Project: An implementation of classic MDP algorithms (Policy Iteration, Value Iteration)

Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Chaoukia/branches

The Branches algorithm, fast Dynamic Programming and Branch and Bound search for seeking optimal Decision Trees

Language: Python - Size: 2.28 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

mhahsler/pomdpSolve

Provides Cassandra's pomdp-solve program.

Language: C - Size: 830 KB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

necrashter/PowerRAFT

PowerRAFT: Power Restoration Application with Field Teams. Implemented in Rust.

Language: Rust - Size: 4.08 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

bd2720/ML-CPP

A simple machine learning library for C++

Language: C++ - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

meetps/CS-747

Assignment codes for CS747 Intelligent and Learning Agents

Language: Python - Size: 34.4 MB - Last synced at: 3 days ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 1

Rapfff/jajapy

Baum-Welch for all kinds of Markov models

Language: Python - Size: 8.23 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 21 - Forks: 2

ossef/MDP_Battery

MDP Battery decision-making framework, 2024-2025.

Language: C - Size: 17 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

thiagopbueno/mdp-problog

MDP-ProbLog is a framework to represent and solve (infinite-horizon) MDPs specified by probabilistic logic programming.

Language: Python - Size: 634 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 4

masouduut94/MCTS-agent-python

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in the MCTS field.

Language: Python - Size: 695 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 9

JuliaPOMDP/QuickPOMDPs.jl

Concise and friendly interfaces for defining MDP and POMDP models for use with POMDPs.jl solvers

Language: Julia - Size: 435 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 28 - Forks: 7

kmock930/Mahjong-Strategy-Simulation

Simulating agents in a Cantonese-style Mahjong game as a Multi-agent system.

Language: Jupyter Notebook - Size: 9.07 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

eya-methnani/Assignment-1---Deep-Reinforcement-Learning-Course

This notebook is part of the first assignment for the Deep Reinforcement Learning (DRL) course. It implements a simplified grid-world environment modeled as a deterministic Markov Decision Process (MDP). The purpose of the notebook is to practice key reinforcement learning concepts, including state transitions, rewards, and termination conditions.

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

MaxNaeg/ncmdp

Code for the paper "Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning".

Language: Jupyter Notebook - Size: 2.02 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

dsietz/test-data-generation

Test Data Generation

Language: Rust - Size: 2.83 MB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 37 - Forks: 3

changkun/ws-18-19-deep-learning-tutorial

Deep Learning and Artificial Intelligence Tutorial @ LMU WS 2018/19

Language: Jupyter Notebook - Size: 24.3 MB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 2

florentdelgrange/vae_mdp

Implementation of Variational Markov Decision Processes, a framework that makes it possible to (i) distill policies learned through (deep) reinforcement learning and (ii) learn discrete abstractions of continuous environments, both with bisimulation guarantees.

Language: Jupyter Notebook - Size: 236 MB - Last synced at: about 17 hours ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

lsunsi/markovjs

Reinforcement Learning in JavaScript

Language: JavaScript - Size: 47.9 KB - Last synced at: 11 days ago - Pushed at: over 8 years ago - Stars: 76 - Forks: 4

rllab-snu/tsallis_actor_critic_mujoco

Implementation of Tsallis Actor Critic method

Language: Jupyter Notebook - Size: 810 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 61 - Forks: 9

madhura711/LENOVO---Stochastic-Optimization-and-Predictive-Modeling

Language: R - Size: 6.09 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 1

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

Mahmood-Anaam/stochastic-dynamic-programming

This repository provides solutions and implementations for Stochastic Dynamic Programming (SDP) problems. It includes theoretical insights, practical coding examples, and detailed explanations for addressing various challenges in decision-making under uncertainty and stochastic processes.

Language: Jupyter Notebook - Size: 97.7 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

Networks-Learning/counterfactual-continuous-mdp

Code for "Finding Counterfactually Optimal Action Sequences in Continuous State Spaces", NeurIPS 2023.

Language: Python - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

raachelssss/pacman

Implemented a new variation of Pac-Man using AI

Language: Python - Size: 19.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

devspaceship/madepro

A minimal Rust library for solving finite deterministic Markov decision processes

Language: Rust - Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

HridayM25/ReinforcementLearning

Some Reinforcement Learning algorithms implemented by me, following "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.

Language: Jupyter Notebook - Size: 538 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

DariMe20/GoGameProject

Go AI Reinforcement Learning Project - This repository is dedicated to exploring and comparing two reinforcement learning methods—gradient descent and Q-value learning—in developing intelligent agents for the board game Go. The goal is to observe the model’s evolution after generating thousands of self-played games and compare agents’ results.

Language: Python - Size: 511 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

krichelj/AI_BGU_2021

Artificial Intelligence course, Computer Science M.Sc., Ben Gurion University of the Negev, 2021

Language: Python - Size: 463 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

fardinabbasi/Tabulated_RL

Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili

Language: Jupyter Notebook - Size: 4.96 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

franciscoengenheiro/ai-autonomous-agents

Project for developing autonomous agents with AI, using both reactive and deliberative architectures

Language: TeX - Size: 13.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

dibyendu/Reinforcement-Learning

A playground for reinforcement learning algorithms

Language: Jupyter Notebook - Size: 75.9 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

narenakash/Machine-Data-and-Learning

TLDR: Genetic Algorithms, Decision Trees, Value Iteration, POMDPs, Bias-Variance. Data preprocessing using statistical techniques and visualization is crucial for understanding and analyzing the data before using it to train a machine learning model. Several fundamental preprocessing techniques are presented here.

Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

masouduut94/MCTS-agent-cythonized

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in the MCTS field.

Language: Python - Size: 230 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

MaxNaeg/ZXreinforce

Code for "Optimizing ZX-Diagrams with Deep Reinforcement Learning"

Language: Python - Size: 4.38 GB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 4

Svalorzen/AI-Toolbox

A C++ framework for MDPs and POMDPs with Python bindings

Language: C++ - Size: 20.2 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 99

danieljsharpe/DISCOTRESS_tutorials

Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology 🦜🌴 economics 💸📈 biophysics 🧬🦠 and more!

Language: Brainfuck - Size: 5.43 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

aai-institute/tfl-training-probabilistic-model-checking

TfL course on probabilistic model checking using storm

Language: Jupyter Notebook - Size: 59.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Maleniski/SemiMarkov-MeanField

Dashboard documentation for simulating the evolution of object proportions under a mean field approach.

Language: Python - Size: 641 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

lucaslopes/lp-learner

Dynamically adjusts load balancers coupled with auto scalers in response to workload changes using weakly coupled Markov Decision Processes (MDPs) and a two-timescale online learning approach.

Size: 33.2 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

LeoMartinezTAMUK/Markov_Decision_Process

This project implements a Markov Decision Process (MDP) using Reinforcement Learning in Python.

Language: Python - Size: 5.86 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

IsmaelMousa/mdp-value-iteration

Implementation of the MDP algorithm for optimal decision-making, focusing on value iteration and policy determination.

Language: Python - Size: 114 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

aws-samples/amazon-sagemaker-amazon-routing-challenge-sol

AWS Last Mile Route Sequence Optimization

Language: Python - Size: 2.02 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 51 - Forks: 12

RodneyShag/GridWorldMDP

Uses Markov decision processes (MDPs) and Temporal Difference (TD) Q-learning to maximize reward in a "grid world".

Language: Java - Size: 1.97 MB - Last synced at: 30 days ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 3

juradohja/itesm-intsys-trafficlights

Intelligent Traffic Lights System built with C++ and OpenGL.

Language: C - Size: 818 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

camargomau/markovian-decisions

Repository for the final project for Procesos Estocásticos. S1.63.10

Language: Python - Size: 93.8 KB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

andre0xFF/ISEL-LEIM-IASA 📦

IASA (Artificial Intelligence of Autonomous Systems) class projects and resources of LEIM course at ISEL

Language: Java - Size: 95.6 MB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

FreakDev/DWNTNF

Don't Waste Neither Time Nor Food (meal planner) - Machine Learning experiment to reduce food waste

Language: TypeScript - Size: 10.7 KB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

MohandHAMADOUCHE/Comparison_of_V-Iter_Vs_P-Iter_Vs_Q-learn

Comparison of Value Iteration, Policy Iteration and Q-Learning for solving Decision-Making problems

Language: MATLAB - Size: 1.18 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

amajji/Markov-Chain

Markov Chain overview and their implementations in Finance

Language: Jupyter Notebook - Size: 1.29 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

gobind452/OptimalBlackJack

Solving BlackJack using Policy Iteration

Language: C++ - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Gaby-253/Markov-Decision-Process

I had to choose the best policy for a given agent in a given world by formulating it as a Markov decision problem.

Language: MATLAB - Size: 625 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

yanshengjia/link

Undergraduate graduation project (Entity Linking System in Web Tables with Multiple Linked Knowledge Bases) at SEU.

Language: HTML - Size: 39.1 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 2

laurimi/pydpomdp

Python package for Dec-POMDP files in the .dpomdp format

Language: C++ - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

victor-iyi/simple-Q-network

A Q-learning reinforcement learning agent using a simple feed-forward neural net.

Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

victor-iyi/contextual-bandit

A Reinforcement Learning approach to a contextual bandit problem.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

victor-iyi/basic-Q-learning-algorithm

Implementation of a basic Q-learning algorithm in OpenAI's Gym environment

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

abhinand5/lunar-lander-deep-rl

Solving OpenAI Gym's Lunar Lander environment using Deep Reinforcement Learning

Language: Python - Size: 16.6 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 3

bermed28/cs7641-assignment4

Project that experiments with algorithms used to solve Markov Decision Processes

Language: Python - Size: 995 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vipul2001/Component-Wise-Markov-Decision-Process

This repository provides code for the paper "Vipul Bansal, Yong Chen, Shiyu Zhou, Component-Wise Markov Decision Process for Solving Condition Based Maintenance of Large Multi-Component Systems with Economic Dependence"

Language: Jupyter Notebook - Size: 214 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Related Keywords
markov-decision-processes (336), reinforcement-learning (147), value-iteration (52), q-learning (50), artificial-intelligence (50), mdp (45), python (44), machine-learning (43), markov-chain (34), policy-iteration (31), dynamic-programming (21), reinforcement-learning-algorithms (19), ai (18), markov-model (15), sarsa (14), deep-reinforcement-learning (14), monte-carlo (13), markov (13), qlearning (12), policy-gradient (12), deep-learning (11), decision-making (10), openai-gym (9), gridworld (9), mdps (8), optimization (7), temporal-differencing-learning (7), python3 (7), deep-q-network (7), neural-network (7), bellman-equation (7), pomdps (7), planning (6), stochastic-processes (6), astar-algorithm (6), julia (6), rl (6), constraint-satisfaction-problem (5), pomdp (5), sarsa-lambda (5), jupyter-notebook (5), hidden-markov-model (5), multi-armed-bandit (5), model-checking (5), random-walk (5), tensorflow (5), pytorch (5), pygame (5), neural-networks (5), policy-evaluation (5), proximal-policy-optimization (4), bfs (4), monte-carlo-tree-search (4), gym (4), alpha-beta-pruning (4), probabilistic-graphical-models (4), dqn (4), simulation (4), control-theory (4), qlearning-algorithm (4), numpy (4), temporal-difference (4), javascript (4), deep-q-learning (4), grid-world (4), reinforcement-learning-agent (4), algorithm (4), linear-programming (4), rust (4), multi-agent-systems (4), probabilistic-models (4), markov-decision-process (4), value-iteration-algorithm (4), optimal-control (4), adversarial-search (4), solver (3), economics (3), statistics (3), multi-armed-bandits (3), robotics (3), r (3), monte-carlo-simulation (3), probabilistic-programming (3), expectimax (3), minimax-algorithm (3), heuristic-search-algorithms (3), agent-based-modeling (3), search (3), operations-research (3), monte-carlo-methods (3), reinforce (3), dyna-q (3), bandit-algorithms (3), reinforcement-learning-environments (3), epsilon-greedy (3), game-theory (3), minimax (3), game-development (3), agent (3), csharp (3)