GitHub topics: ucb-algorithm

Repositories

amirbalef/PS_MOMAB

Multi-Objective Multi-Armed Bandit

Language: Python - Size: 608 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 2

meezys/Bernoulli-Bandits

An implementation of Bandit Algorithms, focusing on the case of Bernoulli Rewards.

Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vismaychuriwala/Optimal-Strategies-in-Multi-Armed-Bandits

This repository contains several implementations of multi-armed bandit (MAB) agents applied to a simulated cricket match where an agent selects among different strategies with the goal of maximizing runs while minimizing the risk of getting out.

Language: Python - Size: 17.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Taabannn/intro-rl

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

pacificrm/Simulating-the-Multi-Armed-Bandit

Simulating-the-Multi-Armed-Bandit with 10 arms using algorithms like Greedy, Epsilon-Greedy and UCB.

Language: Jupyter Notebook - Size: 712 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Asterinos1/RL_n_Dynamic_Optimization

This rep contains the projects made for the course "Reinforcement Learning and Dynamic Optimization" at TUC (2024).

Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Shlok1810/Ad-Selection-Algorithm-using-Machine-learning

Which Advertisement is the best fit for our business we can directly get through UCB Algorithm

Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sahandkhoshdel99/Reinforcement-Learning-

Language: Jupyter Notebook - Size: 209 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

rachelsng/Multiarmed-Bandits-Website-Tuning

[Python] 4 multi-armed bandit algorithms are implemented to determine which one can most effectively determine the best website configuration that maximise signups.

Language: Jupyter Notebook - Size: 1.07 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

alxndrTL/RL-essais-cliniques

Size: 4.07 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

rmitsuboshi/bandit

A small collection of Bandit algorithms, written in Rust 🦀.

Language: Rust - Size: 1.72 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

theheisenberg10/Marketing-Mix-for-Leading-Hospitality-Company

Sending personalized marketing offers (called free play in a casino setting) to players by observing data on their gaming behavior and demographic information

Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Anjali001/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Related Keywords

ucb-algorithm 13 reinforcement-learning 8 multiarmed-bandits 4 epsilon-greedy-exploration 3 epsilon-greedy 3 exploration-exploitation 2 sarsa-learning 2 policy-gradient 2 machine-learning 2 reinforcement-learning-algorithms 2 value-iteration 2 q-learning 2 policy-iteration 2 monte-carlo 2 dqn 2 thompson-sampling 2 stochastic 2 multi-armed-bandit 2 bandit-algorithms 2 kl-divergence 1 dynamic-programming 1 model-based-rl 1 model-learning 1 n-armed-bandit-problem 1 n-step-expected-sarsa 1 n-step-tree-backup 1 multi-objective 1 transfer-learning 1 greedy-algorithms 1 python 1 clinical-trials 1 essais-cliniques 1 asymptotically-optimal-ucb-algorithm 1 bandit 1 etc-algorithm 1 abtesting 1 bayesian-neural-networks 1 greedy-algorithm 1 reinforce 1 td-lambda 1 td-learning 1 proababilistic 1 regret-minimization 1 risk-management 1 ddpg 1 moss 1 gradient-bandit 1 lattimore 1 kl-ucb 1 explore-then-commit 1 q-learning-vs-sarsa 1 sarsa-algorithm 1 statistical-inference 1 bernoulli 1 bandits 1 adaucb 1 multi-armed-bandits 1 multiplicative-weights 1 advertisement 1 non-stationary 1 selection-algorithm 1 behavioral-economics 1 cognitive-fallacies 1 double-q-learning 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos