An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ucb1

alextanhongpin/go-bandit

Multi-Armed Bandit (MAB) algorithm implementation in go

Language: Go - Size: 77.1 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 7

gokhanmeteerturk/adaptive-shots

Few-shot prompting using Contextual Combinatorial Bandit optimizations

Language: Python - Size: 49.8 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

mykeels/multi-armed-bandit-problem

An implementation of solvers for the multi-armed-bandit-problem in JavaScript.

Language: JavaScript - Size: 5.86 KB - Last synced at: 21 days ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

sanxore/py-mcts

Python implementation of Monte Carlo Tree Search

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

HoangTran0410/Reversi-mcts

Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.

Language: C# - Size: 474 KB - Last synced at: 28 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

VladMarianCimpeanu/OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

Language: Jupyter Notebook - Size: 52.6 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

kochlisGit/Reinforcement-Learning-Algorithms

This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.

Language: Python - Size: 460 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

zzmtsvv/ml_sandbox

Language: Jupyter Notebook - Size: 422 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

viswanath57/Bandit-Algorithms

Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 4

akshaykhadse/reinforcement-learning

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

Nikita-Kudrin/funcorp-bandit

REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin

Language: Kotlin - Size: 211 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

EmanuelAlogna/Data-Intelligence-Applications

Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan

Language: Python - Size: 9.52 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 2

Twice22/Reinforcement-Learning

My reports for the reinforcement learning class given at the ENS

Language: Jupyter Notebook - Size: 6.64 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0