An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: action-value-function

antonio-f/Dynamic-Programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, and Value Iteration. From Udacity's Deep Reinforcement Learning Nanodegree program.

Language: Jupyter Notebook - Size: 179 KB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 4

antonio-f/MonteCarlo-methods

Monte Carlo methods for Reinforcement Learning (from Udacity's "Deep Reinforcement Learning Nanodegree Program").

Language: Jupyter Notebook - Size: 650 KB - Last synced at: 24 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1