An open API service providing repository metadata for many open source software ecosystems.

Topic: "sarsa-algorithm"

zaid-24/The-RL-Agent

Path planning using the Q-learning algorithm

Language: Python - Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

RajK853/RL_OpenAI_Gym

Solutions for OpenAI Gym RL environments

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Eshaancoding/gridworld

Using SARSA to solve the Windy Gridworld environment. Implemented in C++.

Language: C++ - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

viniciusenari/Q-Learning-and-SARSA-Mountain-Car-v0

Demonstration of the Q-Learning and SARSA algorithms using Python and OpenAI Gym

Language: Python - Size: 428 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

lucadivit/Reinforcement_Learning_Maze_Solver

This repository contains a simple OpenAI Gym maze environment and (for now) one RL algorithm to solve it.

Language: Python - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

Ostyk/population-based-training-of-NNs

Applying the population-based training (PBT) optimization technique to different domains

Language: Python - Size: 205 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

Alberto-00/Super-Mario-Bros-AI

This project develops an intelligent agent for Nintendo's Super Mario Bros. Specifically, the goal was to design, implement, and train an agent with the Q-learning reinforcement learning algorithm.

Language: Python - Size: 28.3 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Eng-ZeyadTarek/reinforcement-learning-dojo

Implementations of reinforcement learning techniques (Q-learning, SARSA, DQN) across two assignments and one larger project.

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ayush29/Game-of-Pac-Man-RL

Pac-Man RL Agent

Language: Python - Size: 177 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

liAmirali/UIAI-RL-project

An agent being trained to find its way in a stochastic and partially observable maze using SARSA and Q-Learning algorithms.

Language: Jupyter Notebook - Size: 21.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 2

Taabannn/intro-rl

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

kaylaipp/RacetraceReinforcementLearning

Various Reinforcement Learning Algorithms on Racetrack Simulations

Language: Python - Size: 583 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Sphincz/PacmanRL

PacmanRL - Reinforcement Learning for Pacman (Q-Learning / SARSA)

Language: Python - Size: 1.92 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

nikapotato/sarsa-agent

Implementation of an agent capable of playing a simplified version of blackjack using the SARSA algorithm.

Language: Python - Size: 325 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shrinidhi1/Optimal-Placement-of-VNFs-and-SFC-in-Edge-Computing-Environment

Optimal Placement of VNFs using Genetic & Tabu Search Algorithms and Service Function Chaining using Q-Learning & SARSA Algorithms in a Multi-Access Edge Computing Environment

Language: Jupyter Notebook - Size: 193 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lusk4r/algoRL

A set of reinforcement learning algorithms implemented from scratch

Language: Python - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Satoing/AI-experiment

Experiments for an artificial intelligence course

Language: Python - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

johng034/RL-Maze

Two reinforcement learning algorithms (standard SARSA control and tabular Dyna-Q) with which an agent learns to traverse a randomly generated maze

Language: Jupyter Notebook - Size: 72.3 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

sinsinan/SARSA

Implementation of the SARSA algorithm for path planning

Language: Python - Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

rusty-sj/Reinforcement-Learning-Algorithms-with-Ray-Framework-and-Intel-DevCloud

Implementation of certain crucial algorithms in the field of reinforcement learning.

Language: Jupyter Notebook - Size: 4.41 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

tarunk04/OpenGym-Taxi-v3

OpenAI Gym Taxi-v3 environment solved using the Sarsamax algorithm (Q-learning)

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

antonio-f/TD-methods-SARSA

Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.

Language: Jupyter Notebook - Size: 248 KB - Last synced at: 28 days ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (15), sarsa (8), q-learning (7), reinforcement-learning-algorithms (6), qlearning-algorithm (5), q-learning-algorithm (4), openai-gym (4), machine-learning (4), dqn (4), reinforcement-learning-environments (3), epsilon-greedy (3), python (3), policy-iteration (2), deep-reinforcement-learning (2), maze-solver (2), deep-neural-networks (2), tabular-q-learning (2), sarsa-learning (2), q-learning-vs-sarsa (2), reinforcement-learning-agent (2), ddpg (2), qlearning (2), rl-algorithm (1), dqn-algorithm (1), policy (1), openai-gym-environment (1), maze-generator (1), maze-enviroment (1), maze (1), epsilon-decay (1), boltzmann-exploration (1), blackjack (1), vnf-placement (1), tabu-search (1), sfc (1), service-function-chain (1), genetic-algorithm (1), stochastic-environment (1), reinforement-learning (1), partially-observable-environment (1), pacman (1), agent-based-simulation (1), windy-gridworld (1), policy-sarsa (1), egreedy-policies (1), double-deep-q-networks (1), double-deep-q-network (1), double-deep-q-learning (1), value-iteration-algorithm (1), ray-framework (1), frozenlake (1), distributed-reinforcement-learning (1), devcloud (1), cartpole-environment (1), atari (1), grid-world (1), flappy-bird-game (1), flappy-bird (1), id3-algorithm (1), decision-tree (1), astar-algorithm (1), tensorflow (1), rl-openai-gym (1), reinforce (1), dqn-tensorflow (1), deep-q-network (1), deep-learning (1), ddqn-lunar-lander (1), ddqn (1), ddpg-algorithm (1), bayesian-optimization (1), pbt (1), nmt (1), image-classification (1), hyperparameter-optimization (1), gan (1), deepmind (1), gridworld-environment (1), dyna-q (1), temporal-difference (1), td-methods (1), simple (1), gym-environment (1), gym (1), cliffwalking (1), algorithm (1), 101 (1), value-iteration (1), ucb-algorithm (1), statistical-inference (1), monte-carlo (1), gradient-bandit (1), rl-agents (1), rl (1), openai (1), deep-q-learning (1)