Topic: "td-methods"
ShreeshaN/ReinforcementLearningTutorials
This repo contains implementations of algorithms such as Q-learning, SARSA, TD, and policy gradient
Language: Python - Size: 4.32 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 6

katnoria/td-methods
Notebooks covering temporal difference methods using OpenAI Gym
Language: Jupyter Notebook - Size: 50.8 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

kanji95/Topics-in-Machine-Learning-CS7.502
Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)
Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

antonio-f/TD-methods-SARSA
Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.
Language: Jupyter Notebook - Size: 248 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0
