Topic: "markov-decision-process"
odow/SDDP.jl
A JuMP extension for Stochastic Dual Dynamic Programming
Language: Julia - Size: 25.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 329 - Forks: 66

rares9301/anomaly-detection
simple but efficient kernel regression and anomaly detection algorithms
Language: MATLAB - Size: 3.3 MB - Last synced at: about 9 hours ago - Pushed at: 10 months ago - Stars: 202 - Forks: 403

iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++ - Size: 20.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 58 - Forks: 9

OpenSourceEconomics/ruspy
Python package for the simulation and estimation of a prototypical infinite-horizon dynamic discrete choice model based on Rust (1987)
Language: Python - Size: 38.7 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 6

ChaitanyaC22/Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver
The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is formulated as a Markov Decision Process i.e. MDP.
Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 3

mhahsler/markovDP
R package for Discrete-Time Markov Decision Processes
Language: R - Size: 2.15 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 7 - Forks: 0

areenberg/MDPSolver
A fast solver for Markov Decision Processes
Language: Python - Size: 23 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 1

leonlan/dynamic-dispatch-waves
Code for the paper "An iterative sample scenario approach for the dynamic dispatch waves problem."
Language: Jupyter Notebook - Size: 81.2 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

amajji/Markov-Chain
Markov Chain overview and their implementations in Finance
Language: Jupyter Notebook - Size: 1.29 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

jzsherlock4869/reinforcement-learning-sutton-code
Implementations of methods from the book "Reinforcement Learning: An Introduction" by Sutton and Barto, using Python.
Language: Python - Size: 1.69 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

danieljsharpe/DISCOTRESS_tutorials
Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology, economics, biophysics, and more!
Language: Brainfuck - Size: 5.43 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

NikolaZubic/AppliedGameTheoryHomeworkSolutions
Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences
Language: Jupyter Notebook - Size: 936 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

dpmerrell/yahtzee
A Yahtzee-solving python package and command line tool
Language: Python - Size: 102 KB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 5

yrevar/navigation_vis
Python3 library for visualizing high dimensional data.
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

AiminLi-Hi/itw2024-aoi-goal-oriented-comm
Code for our IEEE ITW 2024 paper: Learn to leverage Age of Information (AoI) for goal-oriented communication systems.
Language: MATLAB - Size: 49.8 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 3 - Forks: 1

mkhaled87/pFaces-AMYTISS
A tool for parallel automated controller synthesis for large-scale stochastic systems.
Language: C++ - Size: 26.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 3

ME-Msc/SwarmL
UAV swarm task description language with AI policies enhancement
Size: 257 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning
Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

YasPHP/UofT-LearnAI
An Introductory ML Educational Program hosted by the UofT AI Society. Topics include Data Manipulation, Classification & Regression, Neural Networks, Computer Vision (CNNs), Natural Language Processing (RNNs), Reinforcement Learning (RL), Markov Decision Process (MDP), Genetic Algorithms, Decision Trees, K-means Clustering, Minimax, and Hidden Markov Models.
Language: Jupyter Notebook - Size: 225 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

rssalessio/optimal-attack-control-channel-mdp
Language: Jupyter Notebook - Size: 164 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

ostad-ai/Reinforcement-Learning
This repository is about Reinforcement Learning (RL) and related topics
Language: Jupyter Notebook - Size: 192 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

ME-Msc/SwarmL-Interpreter
Interpreter of SwarmL
Language: Python - Size: 9.06 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

rjs02/inexact-policy-iteration
Benchmarking Distributed Inexact Policy Iteration for Large-Scale Markov Decision Processes
Language: C++ - Size: 442 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Develop-Packt/Markov-Decision-Processes-and-Bellman-Equations
The module covers the theory behind reinforcement learning and introduces Markov chains and Markov Decision Processes
Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

masouduut94/Monte-Carlo-Tree-Search-Hex-Cpp
I used this project to master C++. I reimplemented a Monte Carlo tree search agent for the game of Hex in C++.
Language: C++ - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

jamestiotio/ml
SUTD 2021 50.007 Machine Learning Code Dump
Language: Jupyter Notebook - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

demirayonur/CC-MMDP
Algorithms for Capacity Constrained Multi-model Markov Decision Processes
Language: Java - Size: 36.1 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

privateboss0/Artificial_Intelligence_Stanford
Stanford CS221 class practicals, assignments, and projects
Language: Python - Size: 698 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

moonight3547/value_iteration
DDA4300 Course Project: MDP Agents (Value Iteration) on Zero-Sum Game (Tic-Tac-Toe)
Language: Jupyter Notebook - Size: 899 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Rui0828/Grid-World-DRL
A web-based interactive Grid World environment for learning and visualizing reinforcement learning algorithms including policy evaluation, policy improvement, and value iteration. Built with Flask backend implementing RL algorithms and JavaScript frontend for grid visualization.
Language: JavaScript - Size: 41 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

s1dewalker/Markov-Model-for-Stocks
Markov Model for Stocks in Python. Clustering in Time Series data | Model Development | Stochastic Models
Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ceodaniyal/q_learning
Q-Learning Implementation for Process Optimization. A reinforcement learning project that calculates the shortest route between locations using the Q-Learning algorithm. This code demonstrates how AI can optimize processes in a simulated environment with predefined states and rewards.
Language: Python - Size: 1.95 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

matansuliman/Introduction-to-Artificial-Intelligence
Language: Python - Size: 1.19 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

exdsgift/MarkovChain_NLP
Build a rudimentary NLP system on a purely statistical model
Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

yrevar/navigation_mdp
Python3 library for specifying MDP tailored for navigation applications.
Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aygong/pcsma-asynchronous-mpr
Code for the paper "Generalized p-Persistent CSMA for Asynchronous Multiple-Packet Reception"
Language: MATLAB - Size: 230 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dpmerrell/TrialMDP-analyses
Analyses, experiments, and evaluations for the TrialMDP method
Language: Python - Size: 105 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dpmerrell/TrialMDP
An algorithm that designs blocked Response-Adaptive Randomized (RAR) clinical trials
Language: C++ - Size: 93.8 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gigug/PML
Final Project from the course "Probabilistic Machine Learning" @ Data Science & Scientific Computing, University of Trieste, year 2020/2021, written in ipynb.
Language: Jupyter Notebook - Size: 566 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SebastianLiando/CZ4046-1
CZ4046: Intelligent Agents - Assignment 1
Language: Kotlin - Size: 526 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rabieifk/Prison_Break_Machine_Learning
Machine-learning application in path finding using the n-step TD(lambda) algorithm
Language: Python - Size: 165 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

cameronhh/MCTS-project
An assignment for COMP3702 at UQ. Original problem was a Markov Decision Process. Solved using an implementation of a Monte-Carlo-Tree-Search. Final grade 110/100.
Language: Java - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

instance01/BRTDP-DS-MPI
BRTDP implemented including DS-MPI for upper bound
Language: Python - Size: 4.1 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 2

eeshadutta/Markov-Decision-Process
MDPs solved using Value Iteration and Linear Programming
Language: Python - Size: 281 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

neat-monte/markov-decision-process
The implementation of variable iteration and q-learning algorithms using Java programming language. The project was completed by @DavidLeeftink and @mantas-makelis for the Searching, Planning & Machine Learning (SPML) course @radbouduniversity. The framework of the grid world was created by Jered Vroon.
Language: Java - Size: 318 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
