GitHub topics: ucb1
alextanhongpin/go-bandit
Multi-Armed Bandit (MAB) algorithm implementation in Go
Language: Go - Size: 77.1 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 7

gokhanmeteerturk/adaptive-shots
Few-shot prompting using Contextual Combinatorial Bandit optimizations
Language: Python - Size: 49.8 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

mykeels/multi-armed-bandit-problem
An implementation of solvers for the multi-armed bandit problem in JavaScript.
Language: JavaScript - Size: 5.86 KB - Last synced at: 21 days ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

sanxore/py-mcts
Python implementation of Monte Carlo Tree Search
Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

HoangTran0410/Reversi-mcts
Reversi (Othello) AI game in C#, using the Monte Carlo Tree Search algorithm and the BTMM algorithm.
Language: C# - Size: 474 KB - Last synced at: 28 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

VladMarianCimpeanu/OLA_project
Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)
Language: Jupyter Notebook - Size: 52.6 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

kochlisGit/Reinforcement-Learning-Algorithms
This project compares different reinforcement learning algorithms, including Monte Carlo, Q-learning, lambda Q-learning epsilon-greedy variations, etc.
Language: Python - Size: 460 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

zzmtsvv/ml_sandbox
Language: Jupyter Notebook - Size: 422 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

viswanath57/Bandit-Algorithms
Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 4

akshaykhadse/reinforcement-learning
Implementations of basic concepts covered under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

Nikita-Kudrin/funcorp-bandit
REST service that returns content sorted by the UCB1 algorithm (a Multi-Armed Bandit algorithm). Built with Spring Boot and Kotlin.
Language: Kotlin - Size: 211 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

EmanuelAlogna/Data-Intelligence-Applications
Pricing and social influence maximization using reinforcement learning algorithms, from the Data Intelligence Applications projects at Politecnico di Milano
Language: Python - Size: 9.52 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 2

Twice22/Reinforcement-Learning
My reports for the reinforcement learning class given at the ENS
Language: Jupyter Notebook - Size: 6.64 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
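
For readers unfamiliar with the topic tag, the following is a minimal sketch of the standard UCB1 selection rule on a Bernoulli bandit. It is not taken from any repository listed above; the function names `ucb1_select` and `run_bandit`, the reward probabilities, and the seed are all illustrative assumptions.

```python
import math
import random


def ucb1_select(counts, rewards):
    """Return the index of the arm to pull next under UCB1.

    counts[i]  -- number of times arm i has been pulled so far
    rewards[i] -- sum of rewards observed from arm i (each reward in [0, 1])
    """
    # Pull every arm once before the confidence bound is defined.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    total = sum(counts)
    # Score = empirical mean + exploration bonus sqrt(2 ln t / n_i).
    scores = [
        rewards[arm] / counts[arm]
        + math.sqrt(2.0 * math.log(total) / counts[arm])
        for arm in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def run_bandit(probabilities, rounds, seed=0):
    """Simulate UCB1 on hypothetical Bernoulli arms; return pull counts."""
    rng = random.Random(seed)
    counts = [0] * len(probabilities)
    rewards = [0.0] * len(probabilities)
    for _ in range(rounds):
        arm = ucb1_select(counts, rewards)
        reward = 1.0 if rng.random() < probabilities[arm] else 0.0
        counts[arm] += 1
        rewards[arm] += reward
    return counts
```

Calling `run_bandit([0.1, 0.9], 1000)` should concentrate most pulls on the second arm, since the exploration bonus shrinks as an arm accumulates pulls and the higher empirical mean then dominates.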
