An open API service providing repository metadata for many open source software ecosystems.

Topic: "multi-armed-bandits"

tensorflow/agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Language: Python - Size: 12.9 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 2,907 - Forks: 738

st-tech/zr-obp

Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation

Language: Python - Size: 28.6 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 666 - Forks: 91

fidelity/mabwiser

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

Language: Python - Size: 73.3 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 234 - Forks: 43

rlberry-py/rlberry

An easy-to-use reinforcement learning library for research and education.

Language: Python - Size: 16 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 167 - Forks: 30

fidelity/mab2rec

[AAAI 2024] Mab2Rec: Multi-Armed Bandits Recommender

Language: Jupyter Notebook - Size: 3.51 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 142 - Forks: 27

bayesianbandits/bayesianbandits

A Pythonic microframework for multi-armed bandit problems

Language: Python - Size: 5.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 109 - Forks: 3

Nth-iteration-labs/contextual

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

Language: R - Size: 107 MB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 80 - Forks: 26

antonismand/Personalized-News-Recommendation

Multi-armed bandit implementation using the Yahoo! Front Page Today Module User Click Log Dataset

Language: Jupyter Notebook - Size: 3.63 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 22

stitchfix/mab

Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.

Language: Go - Size: 78.1 KB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 54 - Forks: 7

cfoh/Multi-Armed-Bandit-Example

Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, Deep MAB.

Language: Python - Size: 705 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 6

wbwang2020/MP-MAB

This project was created for the simulations in the paper: [Wang2021] Wenbo Wang, Amir Leshem, Dusit Niyato and Zhu Han, "Decentralized Learning for Channel Allocation in IoT Networks over Unlicensed Bandwidth as a Contextual Multi-player Multi-armed Bandit Game", to appear in IEEE Transactions on Wireless Communications, 2021.

Language: Python - Size: 126 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 24 - Forks: 10

kulinshah98/Multi-Armed-Bandit-Algorithms

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

Language: Python - Size: 760 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 9

ir-uam/kNNBandit

Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation"

Language: Java - Size: 110 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 4

Kenza-AI/mab-ranking

Online Ranking with Multi-Armed-Bandits

Language: Python - Size: 705 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 3

nphdang/Bandit-BO

Bayesian Optimization for Categorical and Continuous Inputs

Language: Python - Size: 188 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 16 - Forks: 2

xuedong/machine-learning-summer-schools

Curated materials for different machine learning related summer schools

Language: Jupyter Notebook - Size: 1.19 GB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 15 - Forks: 9

akshaykhadse/reinforcement-learning

Implementations of basic concepts under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 6

jtcho/FairMachineLearning

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 15 - Forks: 4

rotationalio/honu

Adaptive consistency replication with reinforcement learning for large scale globally distributed storage.

Language: Go - Size: 6.68 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13 - Forks: 2

adik993/reinforcement-learning-sutton

Language: Python - Size: 75.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 3

machinelearningnuremberg/DyHPO

[NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations

Language: Python - Size: 13.7 MB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 2

RonyAbecidan/Neural-Thompson-Sampling

Study of the paper 'Neural Thompson Sampling' published in October 2020

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

paulozip/beer-recommender-mab

A beer recommendation system using multi-armed bandit approach to solve cold start problems

Language: Python - Size: 4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 1

thetawom/mabby

A multi-armed bandit (MAB) simulation library in Python

Language: Python - Size: 3.86 MB - Last synced at: about 24 hours ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

kkm24132/ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

Language: Jupyter Notebook - Size: 7.56 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 3

Shahul-Rahman/SPGD-Search-Party-Gradient-Descent-algorithm

SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: https://www.mdpi.com/2227-7390/10/5/800

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

nphdang/turbo_bbo_neurips_2020

An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020

Language: Python - Size: 110 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 0

FlynnOwen/multi-armed-bandits

Multi-Armed Bandit method of accurately estimating the largest parameter out of a set of candidates.

Language: Python - Size: 1.48 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

DURUII/Replica-EUWR

🐯 Replica of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"

Language: Jupyter Notebook - Size: 18 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

neural-bandits/calvera

Python Library for Neural Multi Armed Bandits

Language: Python - Size: 51.6 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

songzhg/RLOP

RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms

Language: C++ - Size: 331 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 2

jzsherlock4869/reinforcement-learning-sutton-code

Python implementations of methods from the book "Reinforcement Learning: An Introduction" by Sutton and Barto.

Language: Python - Size: 1.69 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

g0ulash/awesome-bandits

Awesome list of anything related to bandit problems

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

Suchetaaa/CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

Language: Python - Size: 3.04 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

manome/python-mab

This project provides a simulation of multi-armed bandit problems. The implementation is based on the following paper: https://arxiv.org/abs/2308.14350

Language: Python - Size: 1.2 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

rctzeng/NeurIPS2021-FWS

This is the repo for Fast Pure Exploration via Frank-Wolfe (NeurIPS 2021).

Language: Julia - Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

banyikun/LOCB

Code for "Local Clustering in Contextual Multi-Armed Bandits".

Language: Python - Size: 4.19 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 2

nicoleorzan/Multi-armed-bandit-RL

C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)

Language: C++ - Size: 73.2 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

marlesson/MaB-Experiments

Multi-armed Bandits

Language: Jupyter Notebook - Size: 6.2 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

crenwick/Swiper

🦊 A series of bandit algorithms in Swift

Language: Swift - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 9 years ago - Stars: 4 - Forks: 0

navarog/cross-validated-ope

The source code to Cross-Validated Off-Policy Evaluation

Language: Python - Size: 2.23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

cezarbulancea/Multi-Armed-Bandits

Implementation of several multi-armed bandit problems.

Language: Python - Size: 371 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

hubbs5/django-bandits

A Django middleware package for site optimization

Language: Python - Size: 208 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Shahul-Rahman/NPROS

NPROS: A Not So Pure Random Orthogonal Search Algorithm – A Suite of Random Optimization Algorithms Driven by Reinforcement Learning

Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

Megha-Bose/WebAppMTD

Contains code for the papers "Moving Target Defense under Uncertainty for Web Applications" and "Learning Effective Strategies for Moving Target Defense with Switching Costs" by Vignesh Viswanathan, Megha Bose and Praveen Paruchuri.

Language: Python - Size: 146 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

yubin1219/multi_armed_bandits_recommendation_system

Reinforcement learning project using multi-armed bandits for recommendation system

Language: Jupyter Notebook - Size: 229 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

MaxenceGiraud/ucb-nonstationary

On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems

Language: Python - Size: 384 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

anishacharya/Bandits-Online-Learning

Simple implementations of bandit algorithms in Python

Language: Jupyter Notebook - Size: 120 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

TanguyUrvoy/pmlib

A Python library for (finite) Partial Monitoring algorithms

Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

royhzq/bayesian-ab-django

An implementation of a Bayesian A/B testing framework in Django. Implements multi-armed bandit algorithms such as Thompson Sampling and UCB1. The API for registering impressions and conversions is implemented with Django REST framework.

Language: Python - Size: 224 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

ygrayson/multi-armed-bandit

Belief Resilience Algorithm developed to solve the Multi-armed Bandits problem, implemented in NetLogo

Language: NetLogo - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

rssalessio/DPE

DPE code - Code used in "Optimal Algorithms for Multiplayer Multi-Armed Bandits" (AISTATS 2020)

Language: Python - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

pm3310/pulpo

WIP: A library and AWS SDK for non-contextual and contextual Multi-Armed-Bandit (MAB) algorithms for multiple use cases

Language: Python - Size: 26.4 KB - Last synced at: 11 days ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 2

pm3310/mab-covid19

Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

nathanael-fijalkow/Multi-Armed-Bandits

Simple implementations of some algorithms for the multi-armed bandit problem and some plots to compare the algorithms

Language: Jupyter Notebook - Size: 421 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

Aditya-Ramesh-10/Basic_Bandit

Implementation of common bandit algorithms for the Bernoulli setting.

Language: Jupyter Notebook - Size: 165 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

Vinit-4689/Multi-Armed-Bandit

Efficient exploration and exploitation strategies using Epsilon-Greedy, UCB1, and Thompson Sampling — with code, math, and intuition.

Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

tejassp2002/SoC-EconML

Language: Jupyter Notebook - Size: 7.31 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

JurajZelman/multi-armed-bandits

Several multi-armed bandit strategies with additional holding option for smoother exploration.

Language: Jupyter Notebook - Size: 297 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mobarski/kraken

Contextual Bandit Engine

Language: Python - Size: 906 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

neildhir/non-stationary-causal-bandits

Non-stationary multi-armed bandit under a causal perspective

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

sarthakmittal92/multi-armed-bandits

Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

Language: Python - Size: 330 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

MehranTaghian/prophet-inequlity-implementation

Implementation of the prophet inequalities

Language: Python - Size: 1.47 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

JavierSanzCruza/IRBandits

Java library for interactive recommendation.

Language: Java - Size: 1020 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

proceduralia/randomist

Code for Policy Optimization as Online Learning with Mediator Feedback

Language: Python - Size: 31.3 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

hstojic/BGSE_ReinforcementLearning

Teaching materials for a reinforcement learning course at the Barcelona Graduate School of Economics, Barcelona

Size: 25.7 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

abhinavcreed13/Multi-armed-bandits-MAB

This project implements well-known MAB algorithms and evaluates their performance: EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson.

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

swasun/BanditProblem 📦

A collection of implementations of the bandit problem.

Language: Jupyter Notebook - Size: 580 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

victor-iyi/policy-gradient

A policy gradient approach to a multi-armed bandit problem

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

The-RL-Hub/The-RL-Hub.github.io

The RL Hub is a Persian-language website dedicated to providing educational content on reinforcement learning. It offers a structured curriculum that begins with fundamental concepts, such as the definition of RL, its primary components, and illustrative examples to enhance understanding.

Language: JavaScript - Size: 838 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Asterinos1/RL_n_Dynamic_Optimization

This repo contains the projects made for the course "Reinforcement Learning and Dynamic Optimization" at TUC (2024).

Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

riccardopoiani/adv-mf-bai

Code for the paper "Optimal Multi-Fidelity Best-Arm Identification" (NeurIPS 2024)

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dmolitor/pyssed

The Mixture Adaptive Design (MAD): An experimental design for anytime-valid causal inference on Multi-Armed Bandits.

Language: Python - Size: 12.3 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

diyabodiwala/FlicksMAB

FlicksMAB is a movie recommendation system that leverages the power of multi-armed bandits (MAB) to personalize movie suggestions for users. Built using PyTorch, this system uses the MovieLens 100K dataset to learn user preferences and recommend movies that are likely to engage them.

Language: Python - Size: 55.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Alanjamlu34/Multi-Armed-Bandit--Adaptive-epsilon-greedy-

Final-year project repository on Multi-Armed Bandit

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rafonsor/unRL

unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.

Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Sudhansh6/Intelligent-Learning-Agents

A repository covering a range of topics from multi-arm bandits to reinforcement learning algorithms. Check out different applications of bandits, MDPs and RL algorithms along with theoretical aspects.

Language: Python - Size: 937 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

DennisSoemers/MultiArmedBanditsTutorial

Multi-Armed Bandits Tutorial

Language: Jupyter Notebook - Size: 63.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hritikb/Reinforcement-Learning-Algorithms

Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yiming-fang/hierarchical_bandits

Hierarchical Bandits for Music Recommendation

Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

amirhosein-prdv/Reinforcement-learning-exercises

This repository contains answers to the exercises of the "Coursera RL Specialization" course

Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

JS2498/CS420-Reinforcement-Learning

CS420: Reinforcement Learning

Language: Jupyter Notebook - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

Orange-OpenSource/MAB_IoT

A C framework including multi-player multi-armed bandit (MP-MAB) algorithms applied to IoT networks, and a LoRa network simulator

Language: C - Size: 3.66 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

elina-israyelyan/thompson-sampling

Package to implement the Thompson Sampling algorithm.

Language: Python - Size: 466 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

posgnu/bayesian-active-learning-on-multi-armed-bandit

Bayesian active learning algorithm with Thompson sampling on multi-armed bandits, implemented with NumPy

Language: Python - Size: 121 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

n4i9kita/ExploratoryProject

Analysis of various multi-armed bandit algorithms over normal and heavy-tailed distributions.

Language: Jupyter Notebook - Size: 798 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Murtazali05/Multi-armed-bandit

Multi Armed Bandits implementation using the Jester Dataset

Language: Python - Size: 4.14 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

kyminbb/multi-armed-bandits

Implementation of multi-armed bandits from scratch

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Nikhilkohli1/Multi-Armed-bandit-Bayesian-A-B-Testing

Language: Jupyter Notebook - Size: 155 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

VigHub/Exam_IA

Development of algorithms for reinforcement learning. Specifically, a software implementation of the algorithms and policies described in the paper "Batched Multi-armed Bandits Problem" by Zijun Gao, Yanjun Han, Zhimei Ren, and Zhengqing Zhou.

Language: Python - Size: 6.46 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

vishaal27/RL-M2019

Repository for the Reinforcement Learning (CSE564) Fall'19 course at IIIT Delhi

Language: Jupyter Notebook - Size: 9.48 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

harshsiloiya98/CS747-Assignments

Assignments for CS747 - Foundations of Intelligent and Learning Agents

Language: Python - Size: 692 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

NekuSakuraba/reinforcement-learning

A simple repository to highlight and explain Reinforcement Learning topics and concepts

Language: Jupyter Notebook - Size: 158 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

cotrane/Thompson-Sampler

Simple Thompson Sampler for a multi-armed bandit problem

Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

julbright/hastur

A multi-armed bandit implementation in Python

Language: Python - Size: 92.8 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (44), thompson-sampling (15), python (15), machine-learning (12), contextual-bandits (9), bandit-algorithms (7), multi-armed-bandit (7), ucb (7), bandits (6), optimization (6), epsilon-greedy (6), recommendation-system (5), q-learning (4), sarsa (4), gaussian-processes (4), multiarmed-bandits (4), bandit-learning (4), bandit (4), deep-reinforcement-learning (3), statistics (3), deep-learning (3), causal-inference (3), mab (3), temporal-differencing-learning (3), policy-iteration (3), reinforcement-learning-algorithms (3), hyperparameter-optimization (3), bayesian-statistics (3), policy-gradient (3), online-learning (3), markov-decision-processes (3), ab-testing (3), dqn (3), bayesian-optimization (3), python3 (3), pytorch (3), bellman-equation (2), recommender-systems (2), ppo (2), stochastic-bandit-algorithms (2), artificial-intelligence (2), simulation (2), recsys (2), e-greedy (2), linucb (2), evaluation (2), howards-pi (2), kl-ucb (2), linear-programming (2), sarsa-learning (2), sagemaker (2), multiarm-bandit (2), aws (2), hyperparameter-tuning (2), batch-bayesian-optimization (2), multi-fidelity (2), automated-machine-learning (2), value-iteration (2), acquisition-functions (2), tensorflow (2), metaheuristics (2), online-learning-algorithms (2), mcmc (2), exploration (2), dynamic-programming (2), django (2), softmax (2), alpha-beta-search (1), research (1), off-policy-evaluation (1), datasets (1), multi-agent-reinforcement-learning (1), machine-learning-algorithms (1), smac (1), hyperopt (1), gpyopt (1), continuous-variable (1), categorical-variables (1), cplusplus (1), cpp (1), gymnasium (1), libtorch (1), local-search (1), mcts (1), operation-research (1), pybind11 (1), sac (1), search (1), simulated-annealing (1), tabu-search (1), vrp (1), ddpg (1), jax (1), kfac (1), global-optimization (1), upenn (1), jupyter-notebook (1), python-3 (1), open-gym-ai (1), sarsa-lambda (1)