Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bellman-equation

krichelj/PyDiffGame

PyDiffGame is a Python implementation of a Nash Equilibrium solution to Differential Games, based on a reduction of Game Hamilton-Bellman-Jacobi (GHJB) equations to Game Algebraic and Differential Riccati equations, associated with Multi-Objective Dynamical Control Systems

Language: Python - Size: 7.47 MB - Last synced: 1 day ago - Pushed: 5 months ago - Stars: 41 - Forks: 6

nicoRomeroCuruchet/DynamicProgramming

Policy Iteration for Continuous Dynamics

Language: Jupyter Notebook - Size: 42.3 MB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 3 - Forks: 0

akain0/Reinforcement-Learning-

Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.

Language: Jupyter Notebook - Size: 693 KB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 0 - Forks: 0

simerplaha/reinforcement-learning

Reinforcement learning

Language: Scala - Size: 174 KB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 2

YuriAntonelli/Bellman-Equation-Economics

Dynamic Optimization project working on an economic model

Language: Python - Size: 422 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 1

robotsorcerer/levelsetpy

A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spatial-by-spatial dimension in resolving the classical value in dynamic programming (respectively optimal control and differential games) problems.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 1

LaurentVeyssier/Route-planner-algorithm

Find the shortest route using A* algorithm and graphs (Route Planner application)

Language: Jupyter Notebook - Size: 288 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 18 - Forks: 1

LaurentVeyssier/Optimizing-Warehouse-Flows-with-Q-Learning

calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)

Language: Jupyter Notebook - Size: 35.2 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 3

bwe587/GridWorld

My Grid World Application. Presented in class on April 7th, 2023.

Language: Python - Size: 7.94 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

kyomangold/ETH-DynamicProgrammingOptimalControl

Repository for the code of the "Dynamic Programming and Optimal Control" (DPOC) lecture at the "Institute for Dynamic Systems and Control" at ETH Zurich.

Language: MATLAB - Size: 1.77 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

bmarroc/reinforcement-learning

Jupyter notebooks implementing Reinforcement Learning algorithms in Numpy and Tensorflow

Language: Jupyter Notebook - Size: 2.84 MB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

syed-azim-git/Routing_Algorithms

Simulation of Routing Algorithms used in communication networks in python

Language: Python - Size: 7.81 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

sudharsan13296/Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Language: Jupyter Notebook - Size: 23.9 MB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 272 - Forks: 113

renan-siqueira/reinforcement-learning-frozen-lake

This project aims to explore the basic concepts of Reinforcement Learning using the FrozenLake environment from the OpenAI Gym library.

Language: Python - Size: 89.8 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 2

hanjongho/rl

Reinforcement Learning을 이용한 Pac-Man 최적 경로 구하기

Language: Python - Size: 7.81 KB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

uonliaquat/RL_Visualizer

A visualization tool for policy iteration and value iteration

Language: JavaScript - Size: 37.3 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 4 - Forks: 1

nicolaloi/Dynamic-Programming-and-Optimal-Control

Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".

Language: MATLAB - Size: 758 KB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 2

rickyhan24/RL_Linear_Proofs_Policy_Evaluation

Iterative Policy Evaluation for the world of linear-equation-solving proofs. Given a policy for how to solve a linear equation, we find the corresponding value function--that is, the function that assigns values to each state.

Language: Jupyter Notebook - Size: 5.86 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1

alizindari/Reinforcement-Learning

Implementation of several algorithms in RL based on Prof. sutton's book

Language: Jupyter Notebook - Size: 510 KB - Last synced: 9 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 2

GiacomoFerro/Q_Learning_Games_v3

Implementation of Policy Iteration and Value Iteration Agents for Taxi game of OpenAI gym

Language: Python - Size: 126 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1

xujiachang1024/MDP-Pac-Man

Design and Implementation of Pac-Man Strategies with Embedded Markov Decision Process in a Dynamic, Non-Deterministic, Fully Observable Environment

Language: Python - Size: 2.46 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 1

MartinSeeler/rllr-bot

Reinforcement Learning Light Riders Bot

Language: Python - Size: 16.6 KB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

vishaal27/RL-M2019

Repository for the Reinforcement Learning (CSE564) Fall'19 course at IIIT Delhi

Language: Jupyter Notebook - Size: 9.48 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

Suchetaaa/CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

Language: Python - Size: 3.04 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 5 - Forks: 0

cdeliens/dynamic_programming_algorithms

Bellman Equation, Needleman-Warsh, Smith-Waterman Algorithms test and written in Ruby

Language: Ruby - Size: 7.81 KB - Last synced: about 1 year ago - Pushed: over 8 years ago - Stars: 1 - Forks: 0

krichelj/gymAsteroids

Asteroids evasion using OpenAI's gym Reinforcement Learning (RL) package - M.Sc. Thesis in Computer Science, Ben Gurion University Ben Gurion University of the Negev, Israe

Language: Python - Size: 20.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

andrejlukic/dicegame-solver

Solving a game of dice using value iteration / Bellman optimality equations

Language: Python - Size: 333 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

krichelj/AI_BGU_2021

Artificial Intelligence course, Computer Science M.Sc., Ben Gurion University of the Negev, 2021

Language: Python - Size: 463 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

piyush2896/Q-Learning

Q-Learning from scratch in Python

Language: Python - Size: 164 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 4 - Forks: 3

antonio-f/Dynamic-Programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

Language: Jupyter Notebook - Size: 179 KB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 6 - Forks: 2

Sohaib1424/Reinforcement-Learning-projects

Language: Python - Size: 1.34 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ar8372/Image-Feature-extraction-using-Reinforcement-Learning

In this project we use Reinforcement Learning to extract features from an image.

Language: Jupyter Notebook - Size: 3.43 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 0

tissimich/Reinforcement-learning

Reinforcement learning notebooks

Language: Jupyter Notebook - Size: 6.84 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

rasimandiran/EvoTrader

Evolutionary algorithm to make better trade decisions based on Bellman equation. (Experimental)

Language: Python - Size: 42 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

MCCiupek/DQL-CartPole

Using Deep Neural Network to solve the CartPole environment

Language: Jupyter Notebook - Size: 128 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

andreabac3/RL-Autonomous-Networking

Reinforcement Learning applied to Autonomous Networking to issue scheduling and decision to drones.

Language: Python - Size: 2.51 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 6 - Forks: 0

Naharul98/Pacman-AI-agent-for-stochastic-environment

A Markov Decision Process (MDP) based implementation of a Pacman agent, to survive and battle through a handicapped stochastic environment.

Language: Python - Size: 342 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

SebastianLiando/CZ4046-1

CZ4046: Intelligent Agents - Assignment 1

Language: Kotlin - Size: 526 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Jogima-cyber/2048-TDLearning

Personal implementation in C++ of http://www.cs.put.poznan.pl/mszubert/pub/szubert2014cig.pdf. Results could be reproduced. It's an algorithm that learns by itself to solve the 2048 game. It doesn't use deep learning (aka. neural networks). But it learns by itself using the Bellman equations.

Language: JavaScript - Size: 4.6 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ypleong/SeALS

Solving high dimensional HJB equation using tensor decomposition

Language: Mathematica - Size: 12.3 MB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 6 - Forks: 5

FarzamTP/Q-Learning-Mountain-Car

Mountain Car is a Gym environment. I used this environment to train my model using Q-Learning which is a reinforcement learning technic.

Language: Python - Size: 3.55 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Atin17/Reinforcement-Learning

Language: Jupyter Notebook - Size: 199 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

piyush2896/Q-Value-RL

Q-Value (Reinforcement Learning) on Grid World

Language: Python - Size: 151 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

Related Keywords
bellman-equation 43 reinforcement-learning 23 q-learning 12 policy-iteration 10 value-iteration 10 dynamic-programming 8 markov-decision-processes 7 python 7 python3 5 machine-learning 5 sarsa 4 policy-evaluation 4 openai-gym 4 artificial-intelligence 4 deep-reinforcement-learning 4 ai 3 computer-science 3 reinforcement-learning-algorithms 3 linear-programming 3 monte-carlo 3 deep-learning 3 optimal-control 3 gym 3 policy-improvement 3 hamilton-jacobi-bellman 3 sarsa-learning 3 frozenlake 2 multi-armed-bandits 2 deep-q-learning 2 a-star-algorithm 2 gym-environment 2 policy-gradient 2 dynamical-systems 2 optimization-algorithms 2 pontryagin-maximum-principle 2 control-theory 2 a3c 2 numpy 2 optimization-methods 1 open-ai-gym 1 montecarlo-methods 1 artificial-intelligence-algorithms 1 temporal-difference 1 iteration-agent 1 open-ai-api 1 open-ai 1 taxi-game 1 evasion 1 asteroids-game 1 smith-waterman 1 intelligent-agent 1 ruby 1 needleman-warsh 1 windy-gridworld 1 ucb 1 maximum-expected-utility 1 thompson-sampling 1 temporal-differencing-learning 1 kl-ucb 1 intelligent-learning-agents 1 howards-pi 1 modular-programming 1 non-deterministic 1 parameter-tuning 1 bootstrapping 1 uml-diagrams 1 neural-network 1 pytorch 1 aloha 1 autonomous-vehicles 1 drone 1 drones 1 mac 1 reinforcement-learning-environments 1 simulator 1 mdp 1 pacman-agent 1 intelligent-agents 1 markov-decision-process 1 2048 1 2048-game 1 tensor-decomposition 1 epsilon-decay 1 mountaincar-v0 1 peak 1 tensorflow 1 predictiveprogrammer 1 q-value 1 bayesian-network 1 game-theory-algorithms 1 heuristic-search-algorithms 1 kruskal-algorithm 1 minimal-spanning-tree 1 minimax-algorithm 1 policy-iteration-algorithm 1 prim-algorithm 1 value-iteration-algorithm 1 action-value-function 1 state-value-function 1 actor-critic-algorithm 1