Topic: "q-value"
puolival/multipy
Multiple hypothesis testing in Python
Language: Python - Size: 1.1 MB - Last synced at: 23 days ago - Pushed at: 9 months ago - Stars: 105 - Forks: 24

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning
Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

flowun/gardnerChessAi
Implementation of the Double Deep Q-Learning algorithm with a prioritized experience replay memory to train an agent to play the minichess variante Gardner Chess
Language: Python - Size: 3.56 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

IsmaelMousa/mdp-value-iteration
Implementation of the MDP algorithm for optimal decision-making, focusing on value iteration and policy determination.
Language: Python - Size: 114 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

piyush2896/Q-Value-RL
Q-Value (Reinforcement Learning) on Grid World
Language: Python - Size: 151 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Pradnya1208/Snake-Game-using-Deep-Reinforcement-Learning
🐍 The Project is based on Reinforcement Learning which trains the snake to eat the food present in the environment.
Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

lingfeiwang/fdrtoolw
Estimate (Local) False Discovery Rates with Weights, adapted from fdrtool
Language: R - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0
