An open API service providing repository metadata for many open source software ecosystems.

Topic: "q-value-iteration"

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning

Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1