GitHub topics: ucb1

Repositories

alextanhongpin/go-bandit

Multi-Armed Bandit (MAB) algorithm implementation in go

Language: Go - Size: 77.1 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 7

gokhanmeteerturk/adaptive-shots

Few-shot prompting using Contextual Combinatorial Bandit optimizations

Language: Python - Size: 49.8 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

mykeels/multi-armed-bandit-problem

An implementation of solvers for the multi-armed-bandit-problem in JavaScript.

Language: JavaScript - Size: 5.86 KB - Last synced at: 21 days ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

sanxore/py-mcts

Python implementation of Monte Carlo Tree Search

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

HoangTran0410/Reversi-mcts

Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.

Language: C# - Size: 474 KB - Last synced at: 28 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

VladMarianCimpeanu/OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

Language: Jupyter Notebook - Size: 52.6 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

kochlisGit/Reinforcement-Learning-Algorithms

This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.

Language: Python - Size: 460 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

zzmtsvv/ml_sandbox

Language: Jupyter Notebook - Size: 422 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

viswanath57/Bandit-Algorithms

Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 4

akshaykhadse/reinforcement-learning

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

Nikita-Kudrin/funcorp-bandit

REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin

Language: Kotlin - Size: 211 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

EmanuelAlogna/Data-Intelligence-Applications

Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan

Language: Python - Size: 9.52 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 2

Twice22/Reinforcement-Learning

My reports for the reinforcement learning class given at the ENS

Language: Jupyter Notebook - Size: 6.64 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Keywords

ucb1 13 reinforcement-learning 6 epsilon-greedy 4 thompson-sampling 4 multi-armed-bandit 3 pricing 2 multiarm-bandit 2 policy-iteration 2 monte-carlo-tree-search 2 python 2 mcts 2 vq-vae 1 algorithms 1 softmax-algorithm 1 batch-switching 1 howards-pi 1 kl-divergence 1 value-iteration 1 variational-autoencoder 1 style-transfer 1 spectral-normalization 1 self-organizing-map 1 self-normalizing-neural-networks 1 regression 1 mlp 1 knearest-neighbor-algorithm 1 gradient-boosting 1 gan 1 reinforce 1 policy-gradient 1 social-network 1 social-influence 1 reinforcement-learning-algorithms 1 spring-boot 1 kotlin 1 ucb 1 reinforcement-learning-excercises 1 reinforcement-learning-analysis 1 randomized-policy-iteration 1 randomised-algorithms 1 policy-evaluation 1 multi-armed-bandits 1 mdps 1 markovian-epidemic-processes 1 linear-programming 1 online-learning-applications 1 montecarlo-simulation 1 mab 1 reversi-game 1 othello-game 1 othello-ai 1 mcts-algorithm 1 machine-learning 1 csharp 1 board-game 1 bitboard 1 uct 1 few-shot 1 contextual-bandits 1 ai 1 mulit-arm-bandit 1 greedy-epsilon 1 go 1 diffusion-models 1 cyclegan 1 cnn-visualization 1 classification 1 calibration 1 thomson-sampling 1 rl-agents 1 q-learning 1 q-lambda 1 policy 1 openai-gym 1 multi-bandit-army 1 monte-carlo 1 markov-chains 1 frozen-lake 1 exploration-exploitation 1 dynamic-programming 1 approximation-algorithms 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos