GitHub topics: action-value-function
antonio-f/Dynamic-Programming
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, and Value Iteration. From Udacity's Deep Reinforcement Learning Nanodegree program.
Language: Jupyter Notebook - Size: 179 KB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 4
antonio-f/MonteCarlo-methods
Monte Carlo methods for Reinforcement Learning (from Udacity's "Deep Reinforcement Learning Nanodegree Program").
Language: Jupyter Notebook - Size: 650 KB - Last synced at: 24 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1