GitHub topics: markov-decision-processes
robervz22/Optimal-Play-Pig
Replication and reproduction of the results in the article: Optimal Play of the Dice Game Pig by Neller and Presser 2004
Language: Python - Size: 71.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

VincentPinet/421-solver
Computing optimal strategy for the dice game 421
Language: C++ - Size: 26.4 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

JuliaPOMDP/CompressedBeliefMDPs.jl
Compressed belief-state MDPs in Julia for reinforcement learning and sequential decision making. Part of the POMDPs.jl community.
Language: Julia - Size: 643 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5 - Forks: 0

JuliaPOMDP/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
Language: Julia - Size: 10.2 MB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 710 - Forks: 104

ds4dm/ecole
Extensible Combinatorial Optimization Learning Environments
Language: C++ - Size: 2.29 MB - Last synced at: about 11 hours ago - Pushed at: 16 days ago - Stars: 339 - Forks: 72

bmarroc/reinforcement-learning
Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow
Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 1

sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 25

Limmen/csle
A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.
Language: Python - Size: 140 MB - Last synced at: about 13 hours ago - Pushed at: about 2 months ago - Stars: 126 - Forks: 21

TolgaOk/jaxdp
A Dynamic Programming package for discrete MDPs implemented in JAX
Language: Python - Size: 549 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 5 - Forks: 1

h2r/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Language: Python - Size: 6.85 MB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 245 - Forks: 53

thiagopbueno/awesome-probabilistic-planning
A curated list of online resources for probabilistic planning: papers, software and research groups around the world!
Size: 18.6 KB - Last synced at: about 23 hours ago - Pushed at: about 7 years ago - Stars: 62 - Forks: 12

odow/SDDP.jl
A JuMP extension for Stochastic Dual Dynamic Programming
Language: Julia - Size: 24.9 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 326 - Forks: 66

Matheussoranco/Decision-Making-Via-Markov-chains
A decision making model that uses Markov chains to do it, opening way for a kind of reasoning
Language: Python - Size: 5.86 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

DES-Lab/AALpy
An Automata Learning Library Written in Python
Language: Python - Size: 25.6 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 179 - Forks: 28

madupite/madupite
a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and C++
Language: C++ - Size: 36.5 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 1

mhahsler/pomdp
R package for Partially Observable Markov Decision Processes
Language: R - Size: 2.86 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 6

ImanRHT/QECO
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.
Language: Python - Size: 17.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 204 - Forks: 37

vladimirhristovski/Agent-basedSystems
Agent-based Systems Exercises
Language: Python - Size: 106 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

who-else-but-arjun/Course_Project_DA221M
This project was made a part of the course project for DA221M - Artificial Intelligence course emphasizing on the limitations of AI in language understanding. .
Language: Python - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

soheil-mp/Reinforcement-Learning-Algorithms
Step by Step Reinforcement Learning Tutorials.
Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 6

IBM/IBM-Extended-Markov-Ratio-Decision-Process
This repo includes code referenced in the paper A Rigorous Risk-aware Linear Approach to Extended Markov Ratio Decision Processes with Embedded Learning by Alexander Zadorojniy, Takayuki Osogami, and Orit Davidovich to appear in IJCAI 2023.
Language: Jupyter Notebook - Size: 905 KB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

sameysimon/MoralPlanner
Probabilistic Moral Planner based on heuristic Dynamic Programming AO* and Machine Ethics Hypothetical Retrospection argumentation. Works with conflicting moral theories and non-moral costs/goals.
Language: C++ - Size: 10.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

afshinea/stanford-cs-221-artificial-intelligence
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
Size: 10.1 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 2,676 - Forks: 507

florianvazelle/unity-rl
Markov Decision Process and Temporal Difference algorithms
Language: C# - Size: 291 KB - Last synced at: 27 days ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 0

iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++ - Size: 20.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 58 - Forks: 9

eleurent/finite-mdp
Gym environment for MDPs with finite state and action spaces
Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 4

zafarali/emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Language: Python - Size: 82 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 49 - Forks: 14

upupming/Lab3-markov-decision-process
Language: HTML - Size: 1.2 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 3

cipryyyy/Markov
Text generator with a Markov chain
Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

harmim/vut-mba-projects
Analýza systémů založená na modelech - Projekty
Language: TeX - Size: 2.61 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

OpenSourceEconomics/respy
Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.
Language: Python - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 77 - Forks: 32

OMB227/RL_Collaborative-Practicals
This repo was dedicated for the RL_Collaborative Work
Language: Jupyter Notebook - Size: 8.1 MB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

LEAP-HI-ClimACT/Coastal-Infrastructure-Planning
Climate change-related risk mitigation for infrastructure systems often requires adaptation. A computational framework for optimal decision-making under uncertainty based on dynamically changing conditions observed in time is developed in response.
Language: MATLAB - Size: 4.56 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

rennaMAhcuS/Hands-on-RL
Hands-on-RL exploration and development.
Language: Python - Size: 29.4 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects
Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡
Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

colinskow/move37
Coding Demos from the School of AI's Move37 Course
Language: Python - Size: 59.6 KB - Last synced at: 17 days ago - Pushed at: over 6 years ago - Stars: 184 - Forks: 118

ComprisedAxis/Leveraging-Reinforcement-Learning-for-Cost-Effective-Medical-Diagnostics
The project focuses on dynamic diagnosis policies that can reduce costs while maintaining or improving diagnostic accuracy. Specifically, RL methods have been employed to balance the trade-off between medical testing budgets and prediction accuracy by identifying Pareto-optimal policies.
Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

thiagodsd/echecs-par-renforcement
Studies on MDP and reinforcement learning in chess, focusing on position representation & encoding.
Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JuliaPOMDP/quickpomdps
Interface for defining discrete and continuous-space MDPs and POMDPs in python. Compatible with the POMDPs.jl ecosystem.
Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 4

liAmirali/UIAI-MDP Fork of InFluX-M/UIAI-MDP
Cliff Walking Project: An implementation of classic MDP algorithms (Policy Iteration, Value Iteration)
Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Chaoukia/branches
The Branches algorithm, fast Dynamic Programming and Branch and Bound search for seeking optimal Decision Trees
Language: Python - Size: 2.28 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

mhahsler/pomdpSolve
Provides Cassandra's pomdp-solve program.
Language: C - Size: 830 KB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

necrashter/PowerRAFT
PowerRAFT: Power Restoration Application with Field Teams. Implemented in Rust.
Language: Rust - Size: 4.08 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

bd2720/ML-CPP
A simple machine learning library for C++
Language: C++ - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

meetps/CS-747
Assignment codes for CS747 Intelligent and Learning Agents
Language: Python - Size: 34.4 MB - Last synced at: 3 days ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 1

Rapfff/jajapy
Baum-Welch for all kind of Markov models
Language: Python - Size: 8.23 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 21 - Forks: 2

ossef/MDP_Battery
MDP Battery decision-making framework, 2024-2025.
Language: C - Size: 17 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

thiagopbueno/mdp-problog
MDP-ProbLog is a framework to represent and solve (infinite-horizon) MDPs specified by probabilistic logic programming.
Language: Python - Size: 634 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 4

masouduut94/MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.
Language: Python - Size: 695 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 9

JuliaPOMDP/QuickPOMDPs.jl
Concise and friendly interfaces for defining MDP and POMDP models for use with POMDPs.jl solvers
Language: Julia - Size: 435 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 28 - Forks: 7

kmock930/Mahjong-Strategy-Simulation
Simulating agents in a Cantonese-style Mahjong game as a Multi-agent system.
Language: Jupyter Notebook - Size: 9.07 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

eya-methnani/Assignment-1---Deep-Reinforcement-Learning-Course
This notebook is part of the first assignment for the Deep Reinforcement Learning (DRL) course. It implements a simplified grid-world environment modeled as a deterministic Markov Decision Process (MDP). The purpose of the notebook is to practice key reinforcement learning concepts, including state transitions, rewards, and termination conditions.
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

MaxNaeg/ncmdp
Code for the paper "Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning".
Language: Jupyter Notebook - Size: 2.02 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

dsietz/test-data-generation
Test Data Generation
Language: Rust - Size: 2.83 MB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 37 - Forks: 3

changkun/ws-18-19-deep-learning-tutorial
Deep Learning and Artificial Intelligence Tutorial @ LMU WS 2018/19
Language: Jupyter Notebook - Size: 24.3 MB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 2

florentdelgrange/vae_mdp
Implementation of Variational Markov Decision Processes, a framework allowing to (i) distill policies learned through (deep) reinforcement learning and (ii) learn discrete abstractions of continuous environments, the two with bisimulation guarantees.
Language: Jupyter Notebook - Size: 236 MB - Last synced at: about 17 hours ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

lsunsi/markovjs
Reinforcement Learning in JavaScript
Language: JavaScript - Size: 47.9 KB - Last synced at: 11 days ago - Pushed at: over 8 years ago - Stars: 76 - Forks: 4

rllab-snu/tsallis_actor_critic_mujoco
Implementation of Tsallis Actor Critic method
Language: Jupyter Notebook - Size: 810 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 61 - Forks: 9

madhura711/LENOVO---Stochastic-Optimization-and-Predictive-Modeling
Language: R - Size: 6.09 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 1

callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

Mahmood-Anaam/stochastic-dynamic-programming
This repository provides solutions and implementations for Stochastic Dynamic Programming (SDP) problems. It includes theoretical insights, practical coding examples, and detailed explanations for addressing various challenges in decision-making under uncertainty and stochastic processes.
Language: Jupyter Notebook - Size: 97.7 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

Networks-Learning/counterfactual-continuous-mdp
Code for "Finding Counterfactually Optimal Action Sequences in Continuous State Spaces", NeurIPS 2023.
Language: Python - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

raachelssss/pacman
Implemented a new variation of Pac-Man using AI
Language: Python - Size: 19.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

devspaceship/madepro
A minimal Rust library for solving finite deterministic Markov decision processes
Language: Rust - Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

HridayM25/ReinforcementLearning
Some algorithms of Reinforcement Learning implemented by me, in accordance to "Introduction to Reinforcement Learning" by Richard Sutton and Andrew Barto.
Language: Jupyter Notebook - Size: 538 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

DariMe20/GoGameProject
Go AI Reinforcement Learning Project - This repository is dedicated to exploring and comparing two reinforcement learning methods—gradient descent and Q-value learning—in developing intelligent agents for the board game Go. The goal is to observe the model’s evolution after generating thousands of self-played games and compare agents’ results.
Language: Python - Size: 511 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

krichelj/AI_BGU_2021
Artificial Intelligence course, Computer Science M.Sc., Ben Gurion University of the Negev, 2021
Language: Python - Size: 463 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

fardinabbasi/Tabulated_RL
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
Language: Jupyter Notebook - Size: 4.96 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

franciscoengenheiro/ai-autonomous-agents
Project for developing autonomous agents with AI, using both reactive and deliberative architectures
Language: TeX - Size: 13.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

dibyendu/Reinforcement-Learning
A playground for reinforcement learning algorithms
Language: Jupyter Notebook - Size: 75.9 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

narenakash/Machine-Data-and-Learning
TLDR: Generic Algorithms, Decision Trees, Value Iteration, POMDPs, Bias-Variance. Data preprocessing using statistical techniques and visualization is crucial to understand and analyze the data before utilizing them to train a machine learning model. Several fundamental techniques for preprocessing are presented here.
Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

masouduut94/MCTS-agent-cythonized
MONTE Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.
Language: Python - Size: 230 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

MaxNaeg/ZXreinforce
Code for "Optimizing ZX-Diagrams with Deep Reinforcement Learning"
Language: Python - Size: 4.38 GB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 4

Svalorzen/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
Language: C++ - Size: 20.2 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 99

danieljsharpe/DISCOTRESS_tutorials
Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology 🦜🌴 economics 💸📈 biophysics 🧬🦠 and more!
Language: Brainfuck - Size: 5.43 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

aai-institute/tfl-training-probabilistic-model-checking
TfL course on probabilistic model checking using storm
Language: Jupyter Notebook - Size: 59.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Maleniski/SemiMarkov-MeanField
Dashboard documentation for the simulating the evolution of object proportions under a mean field approach.
Language: Python - Size: 641 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

lucaslopes/lp-learner
Dynamically adjusts load balancers coupled with auto scalers in response to workload changes using weakly coupled Markov Decision Processes (MDPs) and a two-timescale online learning approach.
Size: 33.2 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

LeoMartinezTAMUK/Markov_Decision_Process
This project implements a Markov Decision Process (MDP) using Reinforcement Learning in Python.
Language: Python - Size: 5.86 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

IsmaelMousa/mdp-value-iteration
Implementation of the MDP algorithm for optimal decision-making, focusing on value iteration and policy determination.
Language: Python - Size: 114 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

aws-samples/amazon-sagemaker-amazon-routing-challenge-sol
AWS Last Mile Route Sequence Optimization
Language: Python - Size: 2.02 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 51 - Forks: 12

RodneyShag/GridWorldMDP
Uses Markov decision processes (MDPs) and Temporal Difference (TD) Q-learning to maximize reward in a "grid world".
Language: Java - Size: 1.97 MB - Last synced at: 30 days ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 3

juradohja/itesm-intsys-trafficlights
Intelligent Traffic Lights System built with C++ and OpenGL.
Language: C - Size: 818 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

camargomau/markovian-decisions
Repository for the final project for Procesos Estocásticos. S1.63.10
Language: Python - Size: 93.8 KB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

andre0xFF/ISEL-LEIM-IASA 📦
IASA (Artificial Intelligence of Autonomous Systems) class projects and resources of LEIM course at ISEL
Language: Java - Size: 95.6 MB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

FreakDev/DWNTNF
Don't Waste Neither Time Nor Food (meal planner) - Machine Learning experiment to reduce food waste
Language: TypeScript - Size: 10.7 KB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

MohandHAMADOUCHE/Comparison_of_V-Iter_Vs_P-Iter_Vs_Q-learn
Comparison of Value Iteration, Policy Iteration and Q-Learning for solving Decision-Making problems
Language: MATLAB - Size: 1.18 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

amajji/Markov-Chain
Markov Chain overview and their implementations in Finance
Language: Jupyter Notebook - Size: 1.29 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

gobind452/OptimalBlackJack
Solving BlackJack using Policy Iteration
Language: C++ - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Gaby-253/Markov-Decision-Process
I had to choose the best policy for a certain agent in a certain world by using markov decision problem.
Language: MATLAB - Size: 625 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Prakhar-FF13/Reinforcement-Learning-With-Python
Reinforcement Learning Notebooks
Language: Python - Size: 115 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

yanshengjia/link
Undergraduate graduation project (Entity Linking System in Web Tables with Multiple Linked Knowledge Bases) at SEU.
Language: HTML - Size: 39.1 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 2

laurimi/pydpomdp
Python package for Dec-POMDP files in the .dpomdp format
Language: C++ - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

victor-iyi/simple-Q-network
A Q Learning Reinforcement agent using a simple feed forward neural net.
Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

victor-iyi/basic-Q-learning-algorithm
Implementation of a basic Q Learning algorithm in the OpenAI's gym environment
Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

abhinand5/lunar-lander-deep-rl
Solving OpenAI Gym's Lunar Lander environment using Deep Reinforcement Learning
Language: Python - Size: 16.6 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 3

bermed28/cs7641-assignment4
Project that experiments with algorithms used to solve Markov Decision Processes
Language: Python - Size: 995 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vipul2001/Component-Wise-Markov-Decision-Process
This repository provides code for the paper "Vipul Bansal, Yong Chen, Shiyu Zhou, Component-Wise Markov Decision Process for Solving Condition Based Maintenance of Large Multi-Component Systems with Economic Dependence"
Language: Jupyter Notebook - Size: 214 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
