ecosyste.ms

Repos

An open API service providing repository metadata for many open source software ecosystems.

Topic: "q-value-iteration"

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning

Build a reinforcement learning (RL) agent that learns to play Numerical Tic-Tac-Toe using Q-learning.
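For readers unfamiliar with the technique named in the description, here is a minimal sketch of a generic tabular Q-learning update with epsilon-greedy action selection. It is not taken from the repository: the function names (`q_update`, `epsilon_greedy`), the dictionary-based Q-table, and the default `alpha` and `gamma` values are all assumptions chosen for illustration.

```python
import random
from collections import defaultdict


def q_update(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update and return the new Q-value.

    q is a mapping from (state, action) to a float (hypothetical layout).
    next_actions lists the actions available in next_state; an empty
    list is treated as a terminal state with a future value of 0.
    """
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


# Illustrative usage with made-up states and actions.
q_table = defaultdict(float)
first = q_update(q_table, "s0", "a", reward=1.0,
                 next_state="s1", next_actions=[])
```

In a game such as Numerical Tic-Tac-Toe, the state would typically encode the board and the actions would be the legal (position, number) moves, but how the repository represents them is not stated here.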

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

Related Topics

actions (1), convergence (1), episodes (1), epsilon-decay (1), epsilon-greedy (1), hyperparameter-tuning (1), learning-rate (1), markov-decision-process (1), mdp-framework (1), model-building (1), policy (1), q-learning (1), q-learning-algorithm (1), q-value (1), reinforcement-learning (1), rewards (1), rl (1), states (1)