Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: markov-decision-processes

Limmen/csle

A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.

Language: Python - Size: 125 MB - Last synced: about 15 hours ago - Pushed: about 18 hours ago - Stars: 102 - Forks: 19

JuliaPOMDP/CompressedBeliefMDPs.jl

Compressed belief-state MDPs in Julia compatible with POMDPs.jl

Language: Julia - Size: 373 KB - Last synced: about 22 hours ago - Pushed: 1 day ago - Stars: 5 - Forks: 0

camargomau/markovian-decisions

Repository for the final project for Procesos Estocásticos. S1.63.10

Language: Python - Size: 34.2 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

gobind452/OptimalBlackJack

Solving BlackJack using Policy Iteration

Language: C++ - Size: 11.7 KB - Last synced: 10 days ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

mhahsler/pomdp

R package for Partially Observable Markov Decision Processes

Language: R - Size: 2.74 MB - Last synced: 9 days ago - Pushed: 11 days ago - Stars: 13 - Forks: 4

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language: Jupyter Notebook - Size: 41.9 MB - Last synced: 11 days ago - Pushed: over 3 years ago - Stars: 821 - Forks: 325

Gaby-253/Markov-Decision-Process

I had to choose the best policy for a certain agent in a certain world by using markov decision problem.

Language: MATLAB - Size: 625 KB - Last synced: 13 days ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

yanshengjia/link

Undergraduate graduation project (Entity Linking System in Web Tables with Multiple Linked Knowledge Bases) at SEU.

Language: HTML - Size: 39.1 MB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 8 - Forks: 2

simerplaha/reinforcement-learning

Reinforcement learning

Language: Scala - Size: 174 KB - Last synced: 15 days ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 2

rohitdavas/Reinforcement-Learning

RL models from base.

Language: Jupyter Notebook - Size: 130 MB - Last synced: 15 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

odow/SDDP.jl

Stochastic Dual Dynamic Programming in Julia

Language: Julia - Size: 21.4 MB - Last synced: 15 days ago - Pushed: 16 days ago - Stars: 277 - Forks: 56

laurimi/pydpomdp

Python package for Dec-POMDP files in the .dpomdp format

Language: C++ - Size: 24.4 KB - Last synced: 18 days ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1

DES-Lab/AALpy

An Automata Learning Library Written in Python

Language: Python - Size: 22.2 MB - Last synced: 23 days ago - Pushed: 28 days ago - Stars: 150 - Forks: 19

ds4dm/ecole

Extensible Combinatorial Optimization Learning Environments

Language: C++ - Size: 2.32 MB - Last synced: 10 days ago - Pushed: about 1 year ago - Stars: 306 - Forks: 67

victor-iyi/simple-Q-network

A Q Learning Reinforcement agent using a simple feed forward neural net.

Language: Python - Size: 50.8 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 2 - Forks: 1

victor-iyi/contextual-bandit

A Reinforcement Learning approach to a contextual bandit problem.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

victor-iyi/basic-Q-learning-algorithm

Implementation of a basic Q Learning algorithm in the OpenAI's gym environment

Language: Jupyter Notebook - Size: 10.7 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

abhinand5/lunar-lander-deep-rl

Solving OpenAI Gym's Lunar Lander environment using Deep Reinforcement Learning

Language: Python - Size: 16.6 MB - Last synced: 23 days ago - Pushed: over 1 year ago - Stars: 6 - Forks: 3

h2r/pomdp-py

A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/

Language: Python - Size: 6.85 MB - Last synced: 24 days ago - Pushed: 27 days ago - Stars: 193 - Forks: 45

iisys-hof/map-matching 📦

Map Matching with Markov Decision Processes [Proof-of-concept]

Language: Python - Size: 1.09 MB - Last synced: 30 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

bermed28/cs7641-assignment4

Project that experiments with algorithms used to solve Markov Decision Processes

Language: Python - Size: 995 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

OpenSourceEconomics/respy

Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.

Language: Python - Size: 123 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 74 - Forks: 31

sshkhr/Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

Language: Jupyter Notebook - Size: 9.91 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 50 - Forks: 25

laurimi/multiagent-prediction-reward

Multi-agent active perception with prediction rewards

Language: C++ - Size: 312 KB - Last synced: 18 days ago - Pushed: over 3 years ago - Stars: 10 - Forks: 0

Rapfff/jajapy

Baum-Welch for all kind of Markov models

Language: Python - Size: 8.23 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 12 - Forks: 1

JuliaPOMDP/QuickPOMDPs.jl

Concise and friendly interfaces for defining MDP and POMDP models for use with POMDPs.jl solvers

Language: Julia - Size: 412 KB - Last synced: 7 days ago - Pushed: 5 months ago - Stars: 29 - Forks: 6

JuliaPOMDP/POMDPs.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Language: Julia - Size: 8.23 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 633 - Forks: 97

Svalorzen/AI-Toolbox

A C++ framework for MDPs and POMDPs with Python bindings

Language: C++ - Size: 20.2 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 638 - Forks: 97

omniscientoctopus/decision_making_primer

Introduction to sequential decision making under uncertainty as part the course AR0202: Computational Intelligence @ the Faculty of Architecture, TU Delft

Language: Jupyter Notebook - Size: 24.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

vinaykakkad/robot-motion-planning

🤖✨ Robot Motion Planning using Markov Decison Processes and Quadtree Decomposition

Language: Python - Size: 2.55 MB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

JuliaPOMDP/POMCP.jl 📦

Julia Implementation of the POMCP algorithm for solving POMDPs

Language: Jupyter Notebook - Size: 615 KB - Last synced: 7 days ago - Pushed: almost 3 years ago - Stars: 12 - Forks: 6

thiagopbueno/awesome-probabilistic-planning

A curated list of online resources for probabilistic planning: papers, software and research groups around the world!

Size: 18.6 KB - Last synced: 25 days ago - Pushed: about 6 years ago - Stars: 53 - Forks: 12

TolgaOk/jaxdp

A Dynamic Programming package for discrete MDPs implemented in JAX

Language: Python - Size: 87.9 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

ImanRHT/QECO

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing

Language: Python - Size: 1.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 50 - Forks: 9

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 2 - Forks: 1

LaurentVeyssier/Optimizing-Warehouse-Flows-with-Q-Learning

calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)

Language: Jupyter Notebook - Size: 35.2 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 3

aai-institute/tfl-training-probabilistic-model-checking

TfL course on probabilistic model checking using storm

Language: Jupyter Notebook - Size: 59.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 83 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 35 - Forks: 12

fardinabbasi/MDP_RL

Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili

Size: 630 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

JuliaPOMDP/quickpomdps

Interface for defining discrete and continuous-space MDPs and POMDPs in python. Compatible with the POMDPs.jl ecosystem.

Language: Python - Size: 33.2 KB - Last synced: 7 days ago - Pushed: 10 months ago - Stars: 18 - Forks: 4

colinskow/move37

Coding Demos from the School of AI's Move37 Course

Language: Python - Size: 59.6 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 178 - Forks: 114

JeffreyTsa1/rl_task

Trained an artificial intelligence agent using reinforcement learning to play a simple version of the game "Snake". Implemented a Temporal Difference version of the Q-learning Algorithm. Completed for school.

Language: Python - Size: 26.4 KB - Last synced: about 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

sakamoto00/AI

Implementation of some important ideas in the AI world.

Language: Python - Size: 572 KB - Last synced: 2 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 2

Surya059/Disease-Monitoring-System

We proposed and implemented a model of how an epidemic spreads based on the interactions recorded, among humans. The system was assumed as a Markov process where the hidden variable is the state of the person, transition between the states was done by the interactions. These interactions will be detected by using RFID technology in smart phones.

Language: Python - Size: 186 KB - Last synced: 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 1

Sahil3201/maze-solver

Repo for maze generation and pathfinding algorithms, including BFS, DFS, A*, MDP Value Iteration, and MDP Policy Iteration, implemented in Python for solving mazes.

Language: Python - Size: 970 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

Neo-Zenith/sc3000-artificial-intelligence

Lab assignments for the course SC3000: Artificial Intelligence. The lab assignments consist of designing an RL agent for the balancing pole problem and programming a knowledge base using Prolog.

Language: Jupyter Notebook - Size: 2.35 MB - Last synced: 23 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

ivanbelenky/RL

R.L. methods and techniques.

Language: Python - Size: 1.3 MB - Last synced: 25 days ago - Pushed: 12 months ago - Stars: 2 - Forks: 1

lsunsi/markovjs

Reinforcement Learning in JavaScript

Language: JavaScript - Size: 47.9 KB - Last synced: 13 days ago - Pushed: over 7 years ago - Stars: 73 - Forks: 4

iisys-hof/map-matching-2

High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).

Language: C++ - Size: 16.5 MB - Last synced: 30 days ago - Pushed: 7 months ago - Stars: 31 - Forks: 7

afshinea/stanford-cs-221-artificial-intelligence

VIP cheatsheets for Stanford's CS 221 Artificial Intelligence

Size: 10.1 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 2,394 - Forks: 468

OpenSourceEconomics/ekw-promotion 📦

promotional material for our work on Eckstein-Keane-Wolpin models

Language: TeX - Size: 28.6 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 5

masouduut94/MCTS-agent-python

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.

Language: Python - Size: 695 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 58 - Forks: 8

fxswiatowicz/cogs298-project

COGS-298 Project: Understanding Reinforcement Learning Through Markov Decision Processes and Pong

Language: Python - Size: 615 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 1

l-vicen/Marketing-Campaign-Planner

Marketing Campaign Planner based on a Markov Decision Process.

Language: Python - Size: 192 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1

Networks-Learning/counterfactual-continuous-mdp

Code for "Finding Counterfactually Optimal Action Sequences in Continuous State Spaces", NeurIPS 2023.

Language: Python - Size: 85.9 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 4 - Forks: 1

MatteoM95/Network-Dynamics-and-Learning

Homework and exercises from course Network Dynamics and Learning

Language: Jupyter Notebook - Size: 7.07 MB - Last synced: 18 days ago - Pushed: 7 months ago - Stars: 4 - Forks: 6

danieljsharpe/DISCOTRESS_tutorials

Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology 🦜🌴 economics 💸📈 biophysics 🧬🦠 and more!

Language: Brainfuck - Size: 5.43 MB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 1

ariankhanjani/Frozen-Lake-Openai-Gym

Implementation of RL Algorithms in Openai Gym Frozen-Lake Environment

Language: Jupyter Notebook - Size: 2.71 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 5 - Forks: 0

SheidaAbedpour/MDP-CliffWalking

This project utilizes Markov Decision Process (MDP) principles to implement a custom "CliffWalking" environment in Gym, employing policy iteration to find an optimal policy for agent navigation.

Language: Python - Size: 817 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Hysperr/gridworld-v2

An agent learns the optimal path towards its goal from any starting point while avoiding obstacles.

Language: C# - Size: 21.5 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

divyajeettt/CSE564

A 5xx-level course, Reinforcement Learning, offered to undergrads at IIIT-Delhi.

Language: Jupyter Notebook - Size: 2.48 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Kytabyte/rl-playground

Implementation and experiments of reinforcement learning algorithms in CS885 @ UW

Language: Python - Size: 93.8 KB - Last synced: 4 months ago - Pushed: over 4 years ago - Stars: 3 - Forks: 1

devspaceship/madepro

A minimal Rust library for solving finite deterministic Markov decision processes

Language: Rust - Size: 59.6 KB - Last synced: 13 days ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

sanjaycg486/masters-seminar-rl-ss2023

A repository contains resources related to a seminar on Reinforcement Learning, focusing on Markov Decision Processes (MDPs).

Size: 1.15 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

PranavNyati/Reinforcement_Learning_Term_Project

This repository contains the codes for Term Projects as part of the Reinforcement Learning course (CS600077) that I am taking in the Autumn 2023 semester at IIT Kharagpur

Language: Python - Size: 55.8 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 2 - Forks: 0

zafarali/emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

Language: Python - Size: 82 KB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 47 - Forks: 14

parissashahabi/Game-Playing-Intelligent-Agent

Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.

Language: Python - Size: 331 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 1

mehdimo/reinforcement-learning-handson

Sample projects to learn reinforcement learning and deep reinforcement learning in practice.

Language: Python - Size: 213 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 5 - Forks: 7

kevin-hanselman/grid-world-rl

Value iteration, policy iteration, and Q-Learning in a grid-world MDP.

Language: Python - Size: 14.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 22 - Forks: 12

weirayao/causal-mdp

PGM final project

Language: Jupyter Notebook - Size: 3.73 MB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 3 - Forks: 0

dksifoua/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.14 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 1

nilskruse/mdp

Markov decision process master thesis

Language: Rust - Size: 180 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 3 - Forks: 0

IBM/IBM-Extended-Markov-Ratio-Decision-Process

This repo includes code referenced in the paper A Rigorous Risk-aware Linear Approach to Extended Markov Ratio Decision Processes with Embedded Learning by Alexander Zadorojniy, Takayuki Osogami, and Orit Davidovich to appear in IJCAI 2023.

Language: Jupyter Notebook - Size: 688 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 4 - Forks: 0

ALotov2000/gym-frozen-lake-and-taxi-solved-by-reinforcement-learning

This repository belongs to one of my computer assignments for an AI course I attended at the University of Tehran.

Language: HTML - Size: 865 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

oguzmes/StochasticAntibiotic

A Stochastic Programming Approach to the Antibiotics Time Machine Problem

Language: Jupyter Notebook - Size: 5.64 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 0

bmarroc/reinforcement-learning

Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow

Language: Jupyter Notebook - Size: 2.84 MB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

lkmuk/seq_decision_under_uncertainty

jupyter notebooks & notes on sequential decision under uncertainty

Language: Jupyter Notebook - Size: 147 KB - Last synced: 25 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

dsietz/test-data-generation

Test Data Generation

Language: Rust - Size: 2.83 MB - Last synced: 27 days ago - Pushed: over 2 years ago - Stars: 36 - Forks: 3

ameroyer/ReCA

(ReCA) Recommendation with Context Awareness using Multi-Environment Markov Decision Processes

Language: C++ - Size: 9.64 MB - Last synced: 6 months ago - Pushed: almost 6 years ago - Stars: 4 - Forks: 1

changkun/ws-18-19-deep-learning-tutorial

Deep Learning and Artificial Intelligence Tutorial @ LMU WS 2018/19

Language: Jupyter Notebook - Size: 24.3 MB - Last synced: 14 days ago - Pushed: over 5 years ago - Stars: 12 - Forks: 1

rllab-snu/tsallis_actor_critic_mujoco

Implementation of Tsallis Actor Critic method

Language: Jupyter Notebook - Size: 828 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 58 - Forks: 10

mircocarciani/MDP_dice_game

MDP Agent which dermines the best policy to obtain the best possible score in a Dice Game

Language: Python - Size: 8.79 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

nasa/pymdptoolbox Fork of sawcordwell/pymdptoolbox

Markov Decision Process (MDP) Toolbox for Python

Language: Python - Size: 455 KB - Last synced: about 1 month ago - Pushed: almost 9 years ago - Stars: 31 - Forks: 31

abhineet123/deep_mdp

Deep MDP: A Modular Framework for Multi-Object Tracking

Language: Python - Size: 5.49 MB - Last synced: 29 days ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1

etienneandre/ImpRator

ImpRator (Inverse Method for Policy with Reward AbstracT behaviOR) is a prototype implementation to compute parameter valuations in parametric Markov decision processes such that optimal policies remain optimal.

Language: OCaml - Size: 55.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

mjmaher987/Artificial-Intelligence

Artificial Intelligence + Deep Learning

Language: Jupyter Notebook - Size: 7.59 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 8 - Forks: 1

sachinbiradar9/Markov-Decision-Processes

Implementation of value iteration algorithm for calculating an optimal MDP policy

Language: Python - Size: 21.5 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 85 - Forks: 43

rldotai/mdpy

Markov Decision Processes in Python

Language: HTML - Size: 3.94 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 14 - Forks: 3

joanby/curso-algebra-lineal

Curso de Álgebra Lineal

Language: HTML - Size: 149 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 409 - Forks: 968

KHvic/Markov-Decision-Process-Value-Iteration-Policy-Iteration-Visualization

Computing an optimal Markov Decision Process (MDP) policy with Value Iteration and Policy Iteration

Language: Java - Size: 3.59 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 3

SiluPanda/maze-solver-reinforcement-learning

A smart agent which solves a escaping maze using MDP

Language: Python - Size: 63.5 KB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

beatrizgj/MDPbiome

microbiome engineering through prescriptive perturbations

Language: R - Size: 845 KB - Last synced: 7 months ago - Pushed: almost 5 years ago - Stars: 8 - Forks: 2

nirjarigandhi/Sokoban-Probability-Agents

Language: Python - Size: 118 KB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

jia-yi-chen/Bandit-and-Reinforcement-Learning

Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)

Language: Python - Size: 31.3 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1

madhura711/LENOVO---Stochastic-Optimization-and-Predictive-Modeling

Language: R - Size: 6.09 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 4 - Forks: 1

michaelw123/Reinforcement-Learning

Reinforcement Learning in Scala

Language: Scala - Size: 1.13 MB - Last synced: 7 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

Sudhansh6/Intelligent-Learning-Agents

A repository covering a range of topics from multi-arm bandits to reinforcement learning algorithms. Check out different applications of bandits, MDPs and RL algorithms along with theoretical aspects.

Language: Python - Size: 937 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

sourenaKhanzadeh/snakeAi

Reinforcement Learning with the classic snake game

Language: Python - Size: 3.98 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 13 - Forks: 4

karthikbhamidipati/reinforcement-learning

A repo for implementing reinforcement learning algorithms

Language: Python - Size: 90.8 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

maximkha/HdCMM

novel high dimensional continuous Markov chain predictor

Language: Python - Size: 725 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

Related Keywords
markov-decision-processes 299 reinforcement-learning 134 q-learning 48 value-iteration 46 artificial-intelligence 43 machine-learning 38 python 36 mdp 36 markov-chain 30 policy-iteration 26 dynamic-programming 18 reinforcement-learning-algorithms 16 ai 15 markov-model 15 sarsa 13 markov 13 qlearning 12 monte-carlo 11 deep-reinforcement-learning 11 policy-gradient 10 deep-learning 10 gridworld 9 openai-gym 9 mdps 8 decision-making 7 bellman-equation 7 pomdps 7 optimization 6 astar-algorithm 6 neural-network 6 julia 6 rl 6 model-checking 5 temporal-differencing-learning 5 random-walk 5 planning 5 stochastic-processes 5 python3 5 hidden-markov-model 5 constraint-satisfaction-problem 5 pytorch 5 sarsa-lambda 5 deep-q-network 5 neural-networks 5 multi-armed-bandit 5 tensorflow 5 pygame 4 algorithm 4 pomdp 4 monte-carlo-tree-search 4 gym 4 grid-world 4 rust 4 reinforcement-learning-agent 4 adversarial-search 4 qlearning-algorithm 4 temporal-difference 4 probabilistic-models 4 policy-evaluation 4 probabilistic-graphical-models 4 deep-q-learning 4 simulation 4 bfs 4 alpha-beta-pruning 4 javascript 4 markov-decision-process 4 value-iteration-algorithm 3 reinforcement-learning-environments 3 dfs 3 maze-solver 3 csharp 3 graph 3 reinforce 3 dyna-q 3 economics 3 monte-carlo-methods 3 planning-algorithms 3 gridworld-environment 3 search-algorithm 3 dqn 3 minimax 3 minimax-algorithm 3 heuristic-search-algorithms 3 r 3 nlp 3 pacman 3 travel-demand-modelling 3 expectimax 3 probabilistic-programming 3 markov-chains 3 dec-pomdp 3 multi-agent-systems 3 operations-research 3 statistics 3 monte-carlo-simulation 3 control-theory 3 linear-programming 3 robotics 3 jupyter-notebook 3 epsilon-greedy 3