GitHub topics: contextual-bandit
Digitalized-Energy-Systems/opfgym
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Language: Python - Size: 527 KB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

niffler92/Bandit
Bandit algorithms
Language: Python - Size: 300 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 29 - Forks: 6

doerlbh/BerlinUCB
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
Language: MATLAB - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

bsteenwi/ContextualBandit
Contextual bandit implementation using Keras
Language: Python - Size: 4.88 KB - Last synced at: 6 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

Bilkent-CYBORG/ACC-UCB
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
Language: Python - Size: 55.8 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

Hins-Hu/Bandit-Algorithms
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms
Language: Python - Size: 902 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

ej0cl6/cbpr
Contextual Bandit with Piled Rewards
Language: Python - Size: 57.6 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

SC5/bandits
Language: Python - Size: 95.7 KB - Last synced at: 4 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 5
