Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: markov-decision-processes
Limmen/csle
A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.
Language: Python - Size: 125 MB - Last synced: about 15 hours ago - Pushed: about 18 hours ago - Stars: 102 - Forks: 19
JuliaPOMDP/CompressedBeliefMDPs.jl
Compressed belief-state MDPs in Julia compatible with POMDPs.jl
Language: Julia - Size: 373 KB - Last synced: about 22 hours ago - Pushed: 1 day ago - Stars: 5 - Forks: 0
camargomau/markovian-decisions
Repository for the final project for Procesos Estocásticos. S1.63.10
Language: Python - Size: 34.2 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0
gobind452/OptimalBlackJack
Solving BlackJack using Policy Iteration
Language: C++ - Size: 11.7 KB - Last synced: 10 days ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
mhahsler/pomdp
R package for Partially Observable Markov Decision Processes
Language: R - Size: 2.74 MB - Last synced: 9 days ago - Pushed: 11 days ago - Stars: 13 - Forks: 4
sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook - Size: 41.9 MB - Last synced: 11 days ago - Pushed: over 3 years ago - Stars: 821 - Forks: 325
Gaby-253/Markov-Decision-Process
I had to choose the best policy for a certain agent in a certain world by using markov decision problem.
Language: MATLAB - Size: 625 KB - Last synced: 13 days ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
yanshengjia/link
Undergraduate graduation project (Entity Linking System in Web Tables with Multiple Linked Knowledge Bases) at SEU.
Language: HTML - Size: 39.1 MB - Last synced: 14 days ago - Pushed: about 5 years ago - Stars: 8 - Forks: 2
simerplaha/reinforcement-learning
Reinforcement learning
Language: Scala - Size: 174 KB - Last synced: 15 days ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 2
rohitdavas/Reinforcement-Learning
RL models from base.
Language: Jupyter Notebook - Size: 130 MB - Last synced: 15 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
odow/SDDP.jl
Stochastic Dual Dynamic Programming in Julia
Language: Julia - Size: 21.4 MB - Last synced: 15 days ago - Pushed: 16 days ago - Stars: 277 - Forks: 56
laurimi/pydpomdp
Python package for Dec-POMDP files in the .dpomdp format
Language: C++ - Size: 24.4 KB - Last synced: 18 days ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1
DES-Lab/AALpy
An Automata Learning Library Written in Python
Language: Python - Size: 22.2 MB - Last synced: 23 days ago - Pushed: 28 days ago - Stars: 150 - Forks: 19
ds4dm/ecole
Extensible Combinatorial Optimization Learning Environments
Language: C++ - Size: 2.32 MB - Last synced: 10 days ago - Pushed: about 1 year ago - Stars: 306 - Forks: 67
victor-iyi/simple-Q-network
A Q Learning Reinforcement agent using a simple feed forward neural net.
Language: Python - Size: 50.8 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 2 - Forks: 1
victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
victor-iyi/basic-Q-learning-algorithm
Implementation of a basic Q Learning algorithm in the OpenAI's gym environment
Language: Jupyter Notebook - Size: 10.7 KB - Last synced: 22 days ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
abhinand5/lunar-lander-deep-rl
Solving OpenAI Gym's Lunar Lander environment using Deep Reinforcement Learning
Language: Python - Size: 16.6 MB - Last synced: 23 days ago - Pushed: over 1 year ago - Stars: 6 - Forks: 3
h2r/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Language: Python - Size: 6.85 MB - Last synced: 24 days ago - Pushed: 27 days ago - Stars: 193 - Forks: 45
iisys-hof/map-matching 📦
Map Matching with Markov Decision Processes [Proof-of-concept]
Language: Python - Size: 1.09 MB - Last synced: 30 days ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
bermed28/cs7641-assignment4
Project that experiments with algorithms used to solve Markov Decision Processes
Language: Python - Size: 995 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
OpenSourceEconomics/respy
Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.
Language: Python - Size: 123 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 74 - Forks: 31
sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Language: Jupyter Notebook - Size: 9.91 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 50 - Forks: 25
laurimi/multiagent-prediction-reward
Multi-agent active perception with prediction rewards
Language: C++ - Size: 312 KB - Last synced: 18 days ago - Pushed: over 3 years ago - Stars: 10 - Forks: 0
Rapfff/jajapy
Baum-Welch for all kind of Markov models
Language: Python - Size: 8.23 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 12 - Forks: 1
JuliaPOMDP/QuickPOMDPs.jl
Concise and friendly interfaces for defining MDP and POMDP models for use with POMDPs.jl solvers
Language: Julia - Size: 412 KB - Last synced: 7 days ago - Pushed: 5 months ago - Stars: 29 - Forks: 6
JuliaPOMDP/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
Language: Julia - Size: 8.23 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 633 - Forks: 97
Svalorzen/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
Language: C++ - Size: 20.2 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 638 - Forks: 97
omniscientoctopus/decision_making_primer
Introduction to sequential decision making under uncertainty as part the course AR0202: Computational Intelligence @ the Faculty of Architecture, TU Delft
Language: Jupyter Notebook - Size: 24.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
vinaykakkad/robot-motion-planning
🤖✨ Robot Motion Planning using Markov Decison Processes and Quadtree Decomposition
Language: Python - Size: 2.55 MB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
JuliaPOMDP/POMCP.jl 📦
Julia Implementation of the POMCP algorithm for solving POMDPs
Language: Jupyter Notebook - Size: 615 KB - Last synced: 7 days ago - Pushed: almost 3 years ago - Stars: 12 - Forks: 6
thiagopbueno/awesome-probabilistic-planning
A curated list of online resources for probabilistic planning: papers, software and research groups around the world!
Size: 18.6 KB - Last synced: 25 days ago - Pushed: about 6 years ago - Stars: 53 - Forks: 12
TolgaOk/jaxdp
A Dynamic Programming package for discrete MDPs implemented in JAX
Language: Python - Size: 87.9 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0
ImanRHT/QECO
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing
Language: Python - Size: 1.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 50 - Forks: 9
Prakhar-FF13/Reinforcement-Learning-With-Python
Reinforcement Learning Notebooks
Language: Python - Size: 115 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 2 - Forks: 1
LaurentVeyssier/Optimizing-Warehouse-Flows-with-Q-Learning
calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)
Language: Jupyter Notebook - Size: 35.2 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 3
aai-institute/tfl-training-probabilistic-model-checking
TfL course on probabilistic model checking using storm
Language: Jupyter Notebook - Size: 59.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0
callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook - Size: 83 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 35 - Forks: 12
fardinabbasi/MDP_RL
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
Size: 630 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
JuliaPOMDP/quickpomdps
Interface for defining discrete and continuous-space MDPs and POMDPs in python. Compatible with the POMDPs.jl ecosystem.
Language: Python - Size: 33.2 KB - Last synced: 7 days ago - Pushed: 10 months ago - Stars: 18 - Forks: 4
colinskow/move37
Coding Demos from the School of AI's Move37 Course
Language: Python - Size: 59.6 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 178 - Forks: 114
JeffreyTsa1/rl_task
Trained an artificial intelligence agent using reinforcement learning to play a simple version of the game "Snake". Implemented a Temporal Difference version of the Q-learning Algorithm. Completed for school.
Language: Python - Size: 26.4 KB - Last synced: about 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
sakamoto00/AI
Implementation of some important ideas in the AI world.
Language: Python - Size: 572 KB - Last synced: 2 months ago - Pushed: about 6 years ago - Stars: 1 - Forks: 2
Surya059/Disease-Monitoring-System
We proposed and implemented a model of how an epidemic spreads based on the interactions recorded, among humans. The system was assumed as a Markov process where the hidden variable is the state of the person, transition between the states was done by the interactions. These interactions will be detected by using RFID technology in smart phones.
Language: Python - Size: 186 KB - Last synced: 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 1
Sahil3201/maze-solver
Repo for maze generation and pathfinding algorithms, including BFS, DFS, A*, MDP Value Iteration, and MDP Policy Iteration, implemented in Python for solving mazes.
Language: Python - Size: 970 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
Neo-Zenith/sc3000-artificial-intelligence
Lab assignments for the course SC3000: Artificial Intelligence. The lab assignments consist of designing an RL agent for the balancing pole problem and programming a knowledge base using Prolog.
Language: Jupyter Notebook - Size: 2.35 MB - Last synced: 23 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
ivanbelenky/RL
R.L. methods and techniques.
Language: Python - Size: 1.3 MB - Last synced: 25 days ago - Pushed: 12 months ago - Stars: 2 - Forks: 1
lsunsi/markovjs
Reinforcement Learning in JavaScript
Language: JavaScript - Size: 47.9 KB - Last synced: 13 days ago - Pushed: over 7 years ago - Stars: 73 - Forks: 4
iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++ - Size: 16.5 MB - Last synced: 30 days ago - Pushed: 7 months ago - Stars: 31 - Forks: 7
afshinea/stanford-cs-221-artificial-intelligence
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
Size: 10.1 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 2,394 - Forks: 468
OpenSourceEconomics/ekw-promotion 📦
promotional material for our work on Eckstein-Keane-Wolpin models
Language: TeX - Size: 28.6 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 5
masouduut94/MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.
Language: Python - Size: 695 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 58 - Forks: 8
fxswiatowicz/cogs298-project
COGS-298 Project: Understanding Reinforcement Learning Through Markov Decision Processes and Pong
Language: Python - Size: 615 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 1
l-vicen/Marketing-Campaign-Planner
Marketing Campaign Planner based on a Markov Decision Process.
Language: Python - Size: 192 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1
Networks-Learning/counterfactual-continuous-mdp
Code for "Finding Counterfactually Optimal Action Sequences in Continuous State Spaces", NeurIPS 2023.
Language: Python - Size: 85.9 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 4 - Forks: 1
MatteoM95/Network-Dynamics-and-Learning
Homework and exercises from course Network Dynamics and Learning
Language: Jupyter Notebook - Size: 7.07 MB - Last synced: 18 days ago - Pushed: 7 months ago - Stars: 4 - Forks: 6
danieljsharpe/DISCOTRESS_tutorials
Learn to get started using DISCOTRESS with these tutorials! Then apply to your own Markov chains in ecology 🦜🌴 economics 💸📈 biophysics 🧬🦠 and more!
Language: Brainfuck - Size: 5.43 MB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 1
ariankhanjani/Frozen-Lake-Openai-Gym
Implementation of RL Algorithms in Openai Gym Frozen-Lake Environment
Language: Jupyter Notebook - Size: 2.71 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 5 - Forks: 0
SheidaAbedpour/MDP-CliffWalking
This project utilizes Markov Decision Process (MDP) principles to implement a custom "CliffWalking" environment in Gym, employing policy iteration to find an optimal policy for agent navigation.
Language: Python - Size: 817 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Hysperr/gridworld-v2
An agent learns the optimal path towards its goal from any starting point while avoiding obstacles.
Language: C# - Size: 21.5 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
divyajeettt/CSE564
A 5xx-level course, Reinforcement Learning, offered to undergrads at IIIT-Delhi.
Language: Jupyter Notebook - Size: 2.48 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Kytabyte/rl-playground
Implementation and experiments of reinforcement learning algorithms in CS885 @ UW
Language: Python - Size: 93.8 KB - Last synced: 4 months ago - Pushed: over 4 years ago - Stars: 3 - Forks: 1
devspaceship/madepro
A minimal Rust library for solving finite deterministic Markov decision processes
Language: Rust - Size: 59.6 KB - Last synced: 13 days ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
sanjaycg486/masters-seminar-rl-ss2023
A repository contains resources related to a seminar on Reinforcement Learning, focusing on Markov Decision Processes (MDPs).
Size: 1.15 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
PranavNyati/Reinforcement_Learning_Term_Project
This repository contains the codes for Term Projects as part of the Reinforcement Learning course (CS600077) that I am taking in the Autumn 2023 semester at IIT Kharagpur
Language: Python - Size: 55.8 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 2 - Forks: 0
zafarali/emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Language: Python - Size: 82 KB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 47 - Forks: 14
parissashahabi/Game-Playing-Intelligent-Agent
Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.
Language: Python - Size: 331 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 1
mehdimo/reinforcement-learning-handson
Sample projects to learn reinforcement learning and deep reinforcement learning in practice.
Language: Python - Size: 213 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 5 - Forks: 7
kevin-hanselman/grid-world-rl
Value iteration, policy iteration, and Q-Learning in a grid-world MDP.
Language: Python - Size: 14.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 22 - Forks: 12
weirayao/causal-mdp
PGM final project
Language: Jupyter Notebook - Size: 3.73 MB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 3 - Forks: 0
dksifoua/Reinforcement-Learning
Language: Jupyter Notebook - Size: 1.14 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 1
nilskruse/mdp
Markov decision process master thesis
Language: Rust - Size: 180 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 3 - Forks: 0
IBM/IBM-Extended-Markov-Ratio-Decision-Process
This repo includes code referenced in the paper A Rigorous Risk-aware Linear Approach to Extended Markov Ratio Decision Processes with Embedded Learning by Alexander Zadorojniy, Takayuki Osogami, and Orit Davidovich to appear in IJCAI 2023.
Language: Jupyter Notebook - Size: 688 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 4 - Forks: 0
ALotov2000/gym-frozen-lake-and-taxi-solved-by-reinforcement-learning
This repository belongs to one of my computer assignments for an AI course I attended at the University of Tehran.
Language: HTML - Size: 865 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
oguzmes/StochasticAntibiotic
A Stochastic Programming Approach to the Antibiotics Time Machine Problem
Language: Jupyter Notebook - Size: 5.64 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 0
bmarroc/reinforcement-learning
Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow
Language: Jupyter Notebook - Size: 2.84 MB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
lkmuk/seq_decision_under_uncertainty
jupyter notebooks & notes on sequential decision under uncertainty
Language: Jupyter Notebook - Size: 147 KB - Last synced: 25 days ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0
dsietz/test-data-generation
Test Data Generation
Language: Rust - Size: 2.83 MB - Last synced: 27 days ago - Pushed: over 2 years ago - Stars: 36 - Forks: 3
ameroyer/ReCA
(ReCA) Recommendation with Context Awareness using Multi-Environment Markov Decision Processes
Language: C++ - Size: 9.64 MB - Last synced: 6 months ago - Pushed: almost 6 years ago - Stars: 4 - Forks: 1
changkun/ws-18-19-deep-learning-tutorial
Deep Learning and Artificial Intelligence Tutorial @ LMU WS 2018/19
Language: Jupyter Notebook - Size: 24.3 MB - Last synced: 14 days ago - Pushed: over 5 years ago - Stars: 12 - Forks: 1
rllab-snu/tsallis_actor_critic_mujoco
Implementation of Tsallis Actor Critic method
Language: Jupyter Notebook - Size: 828 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 58 - Forks: 10
mircocarciani/MDP_dice_game
MDP Agent which dermines the best policy to obtain the best possible score in a Dice Game
Language: Python - Size: 8.79 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
nasa/pymdptoolbox Fork of sawcordwell/pymdptoolbox
Markov Decision Process (MDP) Toolbox for Python
Language: Python - Size: 455 KB - Last synced: about 1 month ago - Pushed: almost 9 years ago - Stars: 31 - Forks: 31
abhineet123/deep_mdp
Deep MDP: A Modular Framework for Multi-Object Tracking
Language: Python - Size: 5.49 MB - Last synced: 29 days ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1
etienneandre/ImpRator
ImpRator (Inverse Method for Policy with Reward AbstracT behaviOR) is a prototype implementation to compute parameter valuations in parametric Markov decision processes such that optimal policies remain optimal.
Language: OCaml - Size: 55.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
mjmaher987/Artificial-Intelligence
Artificial Intelligence + Deep Learning
Language: Jupyter Notebook - Size: 7.59 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 8 - Forks: 1
sachinbiradar9/Markov-Decision-Processes
Implementation of value iteration algorithm for calculating an optimal MDP policy
Language: Python - Size: 21.5 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 85 - Forks: 43
rldotai/mdpy
Markov Decision Processes in Python
Language: HTML - Size: 3.94 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 14 - Forks: 3
joanby/curso-algebra-lineal
Curso de Álgebra Lineal
Language: HTML - Size: 149 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 409 - Forks: 968
KHvic/Markov-Decision-Process-Value-Iteration-Policy-Iteration-Visualization
Computing an optimal Markov Decision Process (MDP) policy with Value Iteration and Policy Iteration
Language: Java - Size: 3.59 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 3
SiluPanda/maze-solver-reinforcement-learning
A smart agent which solves a escaping maze using MDP
Language: Python - Size: 63.5 KB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
beatrizgj/MDPbiome
microbiome engineering through prescriptive perturbations
Language: R - Size: 845 KB - Last synced: 7 months ago - Pushed: almost 5 years ago - Stars: 8 - Forks: 2
nirjarigandhi/Sokoban-Probability-Agents
Language: Python - Size: 118 KB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
jia-yi-chen/Bandit-and-Reinforcement-Learning
Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)
Language: Python - Size: 31.3 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1
madhura711/LENOVO---Stochastic-Optimization-and-Predictive-Modeling
Language: R - Size: 6.09 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 4 - Forks: 1
michaelw123/Reinforcement-Learning
Reinforcement Learning in Scala
Language: Scala - Size: 1.13 MB - Last synced: 7 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0
Sudhansh6/Intelligent-Learning-Agents
A repository covering a range of topics from multi-arm bandits to reinforcement learning algorithms. Check out different applications of bandits, MDPs and RL algorithms along with theoretical aspects.
Language: Python - Size: 937 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
sourenaKhanzadeh/snakeAi
Reinforcement Learning with the classic snake game
Language: Python - Size: 3.98 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 13 - Forks: 4
karthikbhamidipati/reinforcement-learning
A repo for implementing reinforcement learning algorithms
Language: Python - Size: 90.8 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
maximkha/HdCMM
novel high dimensional continuous Markov chain predictor
Language: Python - Size: 725 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0