An open API service providing repository metadata for many open source software ecosystems.

Topic: "markov-decision-process"

odow/SDDP.jl

A JuMP extension for Stochastic Dual Dynamic Programming

Language: Julia - Size: 25.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 329 - Forks: 66

rares9301/anomaly-detection

simple but efficient kernel regression and anomaly detection algorithms

Language: MATLAB - Size: 3.3 MB - Last synced at: about 9 hours ago - Pushed at: 10 months ago - Stars: 202 - Forks: 403

iisys-hof/map-matching-2

High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).

Language: C++ - Size: 20.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 58 - Forks: 9

OpenSourceEconomics/ruspy

Python package for the simulation and estimation of a prototypical infinite-horizon dynamic discrete choice model based on Rust (1987)

Language: Python - Size: 38.7 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 6

ChaitanyaC22/Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is formulated as a Markov Decision Process i.e. MDP.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 3

mhahsler/markovDP

R package for Discrete-Time Markov Decision Processes

Language: R - Size: 2.15 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 7 - Forks: 0

areenberg/MDPSolver

A fast solver for Markov Decision Processes

Language: Python - Size: 23 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 1

leonlan/dynamic-dispatch-waves πŸ“¦

Code for the paper "An iterative sample scenario approach for the dynamic dispatch waves problem."

Language: Jupyter Notebook - Size: 81.2 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

amajji/Markov-Chain

Markov Chain overview and their implementations in Finance

Language: Jupyter Notebook - Size: 1.29 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

jzsherlock4869/reinforcement-learning-sutton-code

Implementations of methods in book <Reinforcement Learning: an introduction> by Sutton Barto, using Python.

Language: Python - Size: 1.69 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

danieljsharpe/DISCOTRESS_tutorials

Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology 🦜🌴 economics πŸ’ΈπŸ“ˆ biophysics 🧬🦠 and more!

Language: Brainfuck - Size: 5.43 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

NikolaZubic/AppliedGameTheoryHomeworkSolutions

Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences

Language: Jupyter Notebook - Size: 936 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

dpmerrell/yahtzee

A Yahtzee-solving python package and command line tool

Language: Python - Size: 102 KB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 5

yrevar/navigation_vis

Python3 library for visualizing high dimensional data.

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

AiminLi-Hi/itw2024-aoi-goal-oriented-comm

🧠 Code for our IEEE ITW 2024 paper: Learn to leverage Age of Information (AoI) for goal-oriented communication systems.

Language: MATLAB - Size: 49.8 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 3 - Forks: 1

mkhaled87/pFaces-AMYTISS

A tool for parallel automated controller synthesis for large-scale stochastic systems.

Language: C++ - Size: 26.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 3

ME-Msc/SwarmL

UAV swarm task description language with AI policies enhancement

Size: 257 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning

Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

YasPHP/UofT-LearnAI

An Introductory ML Educational Program hosted by the UofT AI Society. Topics include, Data Manipulation, Classification & Regression, Neural Networks, Computer Vision (CNNs), Natural Language Processing (RNNs), Reinforcement Learning (RL), Markov Decision Process (MDP), Genetic Algorithms, Decision Trees, K-means Clustering, Minimax, Hidden Markov Model.

Language: Jupyter Notebook - Size: 225 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

rssalessio/optimal-attack-control-channel-mdp

Language: Jupyter Notebook - Size: 164 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

ostad-ai/Reinforcement-Learning

This repository is about Reinforcement Learning (RL) and related topics

Language: Jupyter Notebook - Size: 192 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

ME-Msc/SwarmL-Interpreter

Interpreter of SwarmL

Language: Python - Size: 9.06 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

rjs02/inexact-policy-iteration

Benchmarking Distributed Inexact Policy Iteration for Large-Scale Markov Decision Processes

Language: C++ - Size: 442 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Develop-Packt/Markov-Decision-Processes-and-Bellman-Equations

The module covers the theory behind reinforcement learning and introduces Markov chains and Markov Decision Processes

Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

masouduut94/Monte-Carlo-Tree-Search-Hex-Cpp

I used this project to master c++. I just reimplemented monte carlo tree search agent on the game of hex with cpp.

Language: C++ - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

jamestiotio/ml

SUTD 2021 50.007 Machine Learning Code Dump

Language: Jupyter Notebook - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

demirayonur/CC-MMDP

Algorithms for Capacity Constrained Multi-model Markov Decision Processes

Language: Java - Size: 36.1 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

privateboss0/Artificial_Intelligence_Stanford

Stanford-CS221 class practical's, Assignments and projects

Language: Python - Size: 698 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

moonight3547/value_iteration

DDA4300 Course Project: MDP Agents (Value Iteration) on Zero-Sum Game (Tic-Tac-Toe)

Language: Jupyter Notebook - Size: 899 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Rui0828/Grid-World-DRL

A web-based interactive Grid World environment for learning and visualizing reinforcement learning algorithms including policy evaluation, policy improvement, and value iteration. Built with Flask backend implementing RL algorithms and JavaScript frontend for grid visualization.

Language: JavaScript - Size: 41 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

s1dewalker/Markov-Model-for-Stocks

Markov Model for Stocks in Python. Clustering in Time Series data | Model Development | Stochastic Models

Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ceodaniyal/q_learning

Q-Learning Implementation for Process Optimization A reinforcement learning project that calculates the shortest route between locations using the Q-Learning algorithm. This code demonstrates how AI can optimize processes in a simulated environment with predefined states and rewards. πŸš€

Language: Python - Size: 1.95 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

matansuliman/Introduction-to-Artificial-Intelligence

Language: Python - Size: 1.19 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

exdsgift/MarkovChain_NLP

Build a rudimental NLP on pure statistic model

Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

yrevar/navigation_mdp

Python3 library for specifying MDP tailored for navigation applications.

Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aygong/pcsma-asynchronous-mpr

Code for the paper "Generalized p-Persistent CSMA for Asynchronous Multiple-Packet Reception"

Language: MATLAB - Size: 230 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dpmerrell/TrialMDP-analyses

Analyses, experiments, and evaluations for the TrialMDP method

Language: Python - Size: 105 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dpmerrell/TrialMDP

An algorithm that designs blocked Response-Adaptive Randomized (RAR) clinical trials

Language: C++ - Size: 93.8 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gigug/PML

Final Project from the course "Probabilistic Machine Learning" @ Data Science & Scientific Computing, University of Trieste, year 2020/2021, written in ipynb.

Language: Jupyter Notebook - Size: 566 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SebastianLiando/CZ4046-1

CZ4046: Intelligent Agents - Assignment 1

Language: Kotlin - Size: 526 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rabieifk/Prison_Break_Machine_Learning

Machine-learning application in path finding using the n-step TD(lambda) algorithm

Language: Python - Size: 165 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

cameronhh/MCTS-project

An assignment for COMP3702 at UQ. Original problem was a Markov Decision Process. Solved using an implementation of a Monte-Carlo-Tree-Search. Final grade 110/100.

Language: Java - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

instance01/BRTDP-DS-MPI

BRTDP implemented including DS-MPI for upper bound

Language: Python - Size: 4.1 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 2

eeshadutta/Markov-Decision-Process

MDPs solved using Value Iteration and Linear Programming

Language: Python - Size: 281 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

neat-monte/markov-decision-process

The implementation of variable iteration and q-learning algorithms using Java programming language. The project was completed by @DavidLeeftink and @mantas-makelis for the Searching, Planning & Machine Learning (SPML) course @radbouduniversity. The framework of the grid world was created by Jered Vroon.

Language: Java - Size: 318 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning 12 dynamic-programming 7 q-learning 6 machine-learning 5 value-iteration 5 python 5 markov-decision-processes 4 markov-chain 4 policy-iteration 4 artificial-intelligence 4 epsilon-greedy 3 optimization 3 sarsa-learning 3 markov-model 3 navigation 2 gridworld-environment 2 clinical-trials 2 actions 2 convergence 2 stochastic-programming 2 epsilon-decay 2 hyperparameter-tuning 2 mdp-framework 2 model-building 2 domain-specific-language 2 multi-agent-system 2 unmanned-aerial-vehicle 2 rewards 2 rl 2 states 2 logic 2 nlp-machine-learning 2 discrete-event-simulation 2 deep-learning 2 minimax-algorithm 2 monte-carlo-methods 2 multi-armed-bandit 2 mdp 2 ai 2 markov-chains 2 temporal-difference-learning 2 optimal-control 2 multistage-stochastic-optimization 1 visualization 1 informed-search 1 uniformed-search 1 programming-language-interpreter 1 automated-synthesis 1 symbolic-models 1 disturbances 1 symbolic-controller 1 stochastisity 1 reactive-synthesis 1 parallel-algorithm 1 noise 1 hybrid-systems 1 bellman-equation 1 k-medoids 1 logistic-regression 1 logistic-regression-algorithm 1 mathematics 1 natural-language-processing 1 neural-network 1 perceptron 1 svm 1 economics 1 structural-microeconometrics 1 dynamic-dispatch-waves 1 dynamic-vehicle-routing 1 sample-scenario 1 vehicle-routing-problem 1 kernel-functions 1 linear-algebra 1 outliers-detection 1 mdps 1 game-solver 1 python-package 1 yahtzee 1 mixed-integer-programming 1 td-lambda 1 markov 1 adversial-search 1 sddip 1 sddp 1 stochastic-dual-dynamic-programming 1 stochastic-integer 1 stochastic-optimization 1 control-theory 1 bellman-optimality-equation 1 deepq-learning 1 sarsa 1 decision-process 1 predictive-maintenance 1 solver 1 age-of-information 1 information-freshness 1 relative-value-iteration 1 linear-programming 1 nash-equilibrium 1 zero-sum-game 1