An open API service providing repository metadata for many open source software ecosystems.

Topic: "sarsa-algorithm"

linesd/tabular-methods

Tabular methods for reinforcement learning

Language: Python - Size: 1.51 MB - Last synced at: 5 days ago - Pushed at: over 5 years ago - Stars: 38 - Forks: 8

zaid-24/The-RL-Agent

Path planning using the Q-learning algorithm

Language: Python - Size: 34.2 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 1

RajK853/RL_OpenAI_Gym

Solutions for OpenAI Gym RL environments

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

Eshaancoding/gridworld

Uses SARSA to solve the Windy Gridworld environment. Implemented in C++.

Language: C++ - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

viniciusenari/Q-Learning-and-SARSA-Mountain-Car-v0

Demonstration of the Q-Learning and SARSA algorithms using Python and OpenAI Gym

Language: Python - Size: 428 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

lucadivit/Reinforcement_Learning_Maze_Solver

This repository contains a simple OpenAI Gym maze environment and (for now) one RL algorithm to solve it.

Language: Python - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Ostyk/population-based-training-of-NNs

Applying PBT optimization technique to different domains

Language: Python - Size: 205 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

Alberto-00/Super-Mario-Bros-AI

This project develops an intelligent agent for Nintendo's Super Mario Bros. The goal was to design, implement, and train an agent with the Q-learning reinforcement learning algorithm.

Language: Python - Size: 28.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Eng-ZeyadTarek/reinforcement-learning-dojo

Implementations of several reinforcement learning techniques (Q-learning, SARSA, DQN) across two assignments and one larger project.

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

ayush29/Game-of-Pac-Man-RL

Pac-Man RL Agent

Language: Python - Size: 177 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

jonifat/SARSA-FIS

Dissertation project, Faculty of Industrial Technology, Universitas Trisakti. Contains pseudocode, rule bases, and reproducibility results for the SARSA–FIS Hybrid Decision Framework for Sustainable Forex Trading. Source code is under patent review and will be released after approval.

Language: HTML - Size: 202 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

nogafriedman/reinforcement_learning_ai

This project is part of the Introduction to Artificial Intelligence course (Exercise 6). The focus of this assignment is on Reinforcement Learning, specifically implementing and experimenting with Q-learning and SARSA algorithms in a grid-world environment.

Language: Python - Size: 117 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Sphincz/PacmanRL

PacmanRL - Reinforcement Learning for Pacman (Q-Learning / SARSA)

Language: Python - Size: 1.92 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

liAmirali/UIAI-RL-project

An agent being trained to find its way in a stochastic and partially observable maze using SARSA and Q-Learning algorithms.

Language: Jupyter Notebook - Size: 21.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 2

Taabannn/intro-rl

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

kaylaipp/RacetraceReinforcementLearning

Various Reinforcement Learning Algorithms on Racetrack Simulations

Language: Python - Size: 583 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nikapotato/sarsa-agent

Implementation of an agent capable of playing a simplified version of blackjack using the SARSA algorithm.

Language: Python - Size: 325 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shrinidhi1/Optimal-Placement-of-VNFs-and-SFC-in-Edge-Computing-Environment

Optimal Placement of VNFs using Genetic & Tabu Search Algorithms and Service Function Chaining using Q-Learning & SARSA Algorithms in a Multi-Access Edge Computing Environment

Language: Jupyter Notebook - Size: 193 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lusk4r/algoRL

A set of reinforcement learning algorithms implemented from scratch

Language: Python - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Satoing/AI-experiment

Experiments for an artificial intelligence course

Language: Python - Size: 595 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

johng034/RL-Maze

Two reinforcement learning algorithms (Standard SARSA Control and Tabular Dyna-Q) where an agent learns to traverse a randomly generated maze

Language: Jupyter Notebook - Size: 72.3 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sinsinan/SARSA

Implementation of SARSA algorithm for path planning

Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

rusty-sj/Reinforcement-Learning-Algorithms-with-Ray-Framework-and-Intel-DevCloud

Implementation of certain crucial algorithms in the field of reinforcement learning.

Language: Jupyter Notebook - Size: 4.41 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

tarunk04/OpenGym-Taxi-v3

OpenAI Gym Taxi-v3 environment solved using the Sarsamax algorithm (Q-Learning)

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

antonio-f/TD-methods-SARSA

Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.

Language: Jupyter Notebook - Size: 248 KB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (17), q-learning (9), sarsa (9), reinforcement-learning-algorithms (7), q-learning-algorithm (5), qlearning-algorithm (5), openai-gym (4), machine-learning (4), dqn (4), sarsa-learning (3), q-learning-vs-sarsa (3), reinforcement-learning-agent (3), reinforcement-learning-environments (3), policy-iteration (3), tabular-q-learning (3), python (3), epsilon-greedy (3), qlearning (2), ddpg (2), value-iteration (2), gridworld-environment (2), maze-solver (2), cliffwalking (2), deep-neural-networks (2), deep-reinforcement-learning (2), algorithm (2), blackjack (1), vnf-placement (1), atari (1), boltzmann-exploration (1), epsilon-decay (1), tabu-search (1), maze (1), grid-world (1), flappy-bird-game (1), flappy-bird (1), maze-enviroment (1), maze-generator (1), dqn-algorithm (1), rl-algorithm (1), policy (1), openai-gym-environment (1), ai (1), tabular-methods (1), tabular-environments (1), policy-evaluation (1), gridworld-cliff (1), gridworld (1), pacman (1), agent-based-simulation (1), stochastic-environment (1), reinforement-learning (1), partially-observable-environment (1), windy-gridworld (1), policy-sarsa (1), egreedy-policies (1), double-deep-q-networks (1), double-deep-q-network (1), double-deep-q-learning (1), value-iteration-algorithm (1), ray-framework (1), frozenlake (1), distributed-reinforcement-learning (1), devcloud (1), cartpole-environment (1), sfc (1), deep-q-learning (1), id3-algorithm (1), decision-tree (1), astar-algorithm (1), tensorflow (1), rl-openai-gym (1), reinforce (1), dqn-tensorflow (1), deep-q-network (1), deep-learning (1), ddqn-lunar-lander (1), ddqn (1), ddpg-algorithm (1), bayesian-optimization (1), pbt (1), nmt (1), image-classification (1), hyperparameter-optimization (1), gan (1), deepmind (1), service-function-chain (1), genetic-algorithm (1), dyna-q (1), temporal-difference (1), td-methods (1), simple (1), gym-environment (1), gym (1), 101 (1), machine-learning-algorithms (1), game-theory (1), fuzzy-inference-system (1), artificial-intelligence (1), ucb-algorithm (1)