An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: state-value-function

tashi-2004/Deep-Learning-Grid-World-Q-Learning

Deep Learning Grid World Q-Learning . Implement Q-learning in a 5x5 grid where an agent navigates obstacles and rewards. Train the agent with varying learning rates, visualize its progress, and see Q-values as heatmaps. Run the script to start training and view results. Contributions are welcome!

Language: Python - Size: 280 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

antonio-f/Dynamic-Programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

Language: Jupyter Notebook - Size: 179 KB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 4

Amine-Zitoun/Music_Recommender_RL

Recommending music using reinforcement learning

Language: Python - Size: 620 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

TanushGoel/Atari-Games-RL

A collection of ipython notebooks in which agents learn to play Atari games in Open AI gym environments using different methods of reinforcement learning.

Language: Jupyter Notebook - Size: 2.38 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1