td-methods | Topic | Ecosyste.ms: Repos

Topic: "td-methods"

This repo contains implementations of algorithms such as Q-learning, SARSA, TD, and policy gradient

Language: Python - Size: 4.32 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 6

Notebooks covering temporal difference methods using OpenAI Gym

Language: Jupyter Notebook - Size: 50.8 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)

Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.

Language: Jupyter Notebook - Size: 248 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0