GitHub topics: ucb-algorithm
vismaychuriwala/Optimal-Strategies-in-Multi-Armed-Bandits
This repository contains several implementations of multi-armed bandit (MAB) agents applied to a simulated cricket match where an agent selects among different strategies with the goal of maximizing runs while minimizing the risk of getting out.
Language: Python - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Taabannn/intro-rl
Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pacificrm/Simulating-the-Multi-Armed-Bandit
Simulating-the-Multi-Armed-Bandit with 10 arms using algorithms like Greedy, Epsilon-Greedy and UCB.
Language: Jupyter Notebook - Size: 712 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Asterinos1/RL_n_Dynamic_Optimization
This rep contains the projects made for the course "Reinforcement Learning and Dynamic Optimization" at TUC (2024).
Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Shlok1810/Ad-Selection-Algorithm-using-Machine-learning
Which Advertisement is the best fit for our business we can directly get through UCB Algorithm
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-
Language: Jupyter Notebook - Size: 209 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

rachelsng/Multiarmed-Bandits-Website-Tuning
[Python] 4 multi-armed bandit algorithms are implemented to determine which one can most effectively determine the best website configuration that maximise signups.
Language: Jupyter Notebook - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

alxndrTL/RL-essais-cliniques
Size: 4.07 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

amirbalef/PS_MOMAB
Multi-Objective Multi-Armed Bandit
Language: Python - Size: 608 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

rmitsuboshi/bandit
A small collection of Bandit algorithms, written in Rust 🦀.
Language: Rust - Size: 1.72 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

theheisenberg10/Marketing-Mix-for-Leading-Hospitality-Company
Sending personalized marketing offers (called free play in a casino setting) to players by observing data on their gaming behavior and demographic information
Size: 1.31 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Anjali001/Reinforcement-Learning
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0
