Topic: "multi-armed-bandits"
tensorflow/agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language: Python - Size: 12.9 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 2,907 - Forks: 738

st-tech/zr-obp
Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation
Language: Python - Size: 28.6 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 666 - Forks: 91

fidelity/mabwiser
[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
Language: Python - Size: 73.3 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 234 - Forks: 43

rlberry-py/rlberry
An easy-to-use reinforcement learning library for research and education.
Language: Python - Size: 16 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 167 - Forks: 30

fidelity/mab2rec
[AAAI 2024] Mab2Rec: Multi-Armed Bandits Recommender
Language: Jupyter Notebook - Size: 3.51 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 142 - Forks: 27

bayesianbandits/bayesianbandits
A Pythonic microframework for multi-armed bandit problems
Language: Python - Size: 5.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 109 - Forks: 3

Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Language: R - Size: 107 MB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 80 - Forks: 26

antonismand/Personalized-News-Recommendation
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
Language: Jupyter Notebook - Size: 3.63 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 22

stitchfix/mab
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Language: Go - Size: 78.1 KB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 54 - Forks: 7

cfoh/Multi-Armed-Bandit-Example
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, Deep MAB.
Language: Python - Size: 705 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 6

wbwang2020/MP-MAB
This project is created for the simulations of the paper: [Wang2021] Wenbo Wang, Amir Leshem, Dusit Niyato and Zhu Han, "Decentralized Learning for Channel Allocation in IoT Networks over Unlicensed Bandwidth as a Contextual Multi-player Multi-armed Bandit Game", to appear in IEEE Transactions on Wireless Communications, 2021.
Language: Python - Size: 126 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 24 - Forks: 10

kulinshah98/Multi-Armed-Bandit-Algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
Language: Python - Size: 760 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 9

ir-uam/kNNBandit
Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation"
Language: Java - Size: 110 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 4

Kenza-AI/mab-ranking
Online Ranking with Multi-Armed-Bandits
Language: Python - Size: 705 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 3

nphdang/Bandit-BO
Bayesian Optimization for Categorical and Continuous Inputs
Language: Python - Size: 188 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 16 - Forks: 2

xuedong/machine-learning-summer-schools
Curated materials for different machine learning related summer schools
Language: Jupyter Notebook - Size: 1.19 GB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 15 - Forks: 9

akshaykhadse/reinforcement-learning
Implementations of basic concepts covered under the Reinforcement Learning umbrella. This project is a collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 6

jtcho/FairMachineLearning
Implementation of provably Rawlsian fair ML algorithms for contextual bandits.
Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 15 - Forks: 4

rotationalio/honu
Adaptive consistency replication with reinforcement learning for large scale globally distributed storage.
Language: Go - Size: 6.68 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13 - Forks: 2

adik993/reinforcement-learning-sutton
Language: Python - Size: 75.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 3

machinelearningnuremberg/DyHPO
[NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations
Language: Python - Size: 13.7 MB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 2

RonyAbecidan/Neural-Thompson-Sampling
Study of the paper 'Neural Thompson Sampling' published in October 2020
Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

paulozip/beer-recommender-mab
A beer recommendation system using multi-armed bandit approach to solve cold start problems
Language: Python - Size: 4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 1

thetawom/mabby
A multi-armed bandit (MAB) simulation library in Python
Language: Python - Size: 3.86 MB - Last synced at: about 24 hours ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

kkm24132/ReinforcementLearning
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
Language: Jupyter Notebook - Size: 7.56 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 3

Shahul-Rahman/SPGD-Search-Party-Gradient-Descent-algorithm
SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: https://www.mdpi.com/2227-7390/10/5/800
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

nphdang/turbo_bbo_neurips_2020
An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020
Language: Python - Size: 110 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 0

FlynnOwen/multi-armed-bandits
Multi-Armed Bandit method of accurately estimating the largest parameter out of a set of candidates.
Language: Python - Size: 1.48 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

DURUII/Replica-EUWR
🐯REPLICA of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

neural-bandits/calvera
Python Library for Neural Multi Armed Bandits
Language: Python - Size: 51.6 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

songzhg/RLOP
RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms
Language: C++ - Size: 331 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 2

jzsherlock4869/reinforcement-learning-sutton-code
Implementations of methods in the book "Reinforcement Learning: An Introduction" by Sutton and Barto, using Python.
Language: Python - Size: 1.69 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

g0ulash/awesome-bandits
Awesome list of anything related to bandit problems
Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

Suchetaaa/CS747-Assignments
Foundations Of Intelligent Learning Agents (FILA) Assignments
Language: Python - Size: 3.04 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

manome/python-mab
This project provides a simulation of multi-armed bandit problems. The implementation is based on the paper at https://arxiv.org/abs/2308.14350.
Language: Python - Size: 1.2 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

rctzeng/NeurIPS2021-FWS
This is the repo for Fast Pure Exploration via Frank-Wolfe (NeurIPS 2021).
Language: Julia - Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

banyikun/LOCB
Code for "Local Clustering in Contextual Multi-Armed Bandits".
Language: Python - Size: 4.19 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 2

nicoleorzan/Multi-armed-bandit-RL
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
Language: C++ - Size: 73.2 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

marlesson/MaB-Experiments
Multi-armed Bandits
Language: Jupyter Notebook - Size: 6.2 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

crenwick/Swiper
🦊 A series of bandit algorithms in Swift
Language: Swift - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 9 years ago - Stars: 4 - Forks: 0

navarog/cross-validated-ope
The source code to Cross-Validated Off-Policy Evaluation
Language: Python - Size: 2.23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

cezarbulancea/Multi-Armed-Bandits
Implementation of several multi-armed bandit problems.
Language: Python - Size: 371 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

hubbs5/django-bandits
A middleware package for site optimization and Django
Language: Python - Size: 208 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Shahul-Rahman/NPROS
NPROS: A Not So Pure Random Orthogonal Search Algorithm - A Suite of Random Optimization Algorithms Driven by Reinforcement Learning
Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

Megha-Bose/WebAppMTD
Contains Code for the Papers "Moving Target Defense under Uncertainty for Web Applications" and "Learning Effective Strategies for Moving Target Defense with Switching Costs" written by Vignesh Viswanathan, Megha Bose and Praveen Paruchuri.
Language: Python - Size: 146 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

yubin1219/multi_armed_bandits_recommendation_system
Reinforcement learning project using multi-armed bandits for recommendation system
Language: Jupyter Notebook - Size: 229 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

MaxenceGiraud/ucb-nonstationary
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems
Language: Python - Size: 384 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in Python
Language: Jupyter Notebook - Size: 120 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

TanguyUrvoy/pmlib
A Python library for (finite) Partial Monitoring algorithms
Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

royhzq/bayesian-ab-django
An implementation of a Bayesian A/B testing framework in Django. Implements multi-armed bandit algorithms such as Thompson Sampling and UCB1. The API for registering impressions/conversions is implemented with Django REST framework.
Language: Python - Size: 224 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

ygrayson/multi-armed-bandit
Belief Resilience Algorithm developed to solve the Multi-armed Bandits problem, implemented in NetLogo
Language: NetLogo - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

rssalessio/DPE
DPE code - Code used in "Optimal Algorithms for Multiplayer Multi-Armed Bandits" (AISTATS 2020)
Language: Python - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

pm3310/pulpo
WIP: A library and AWS SDK for non-contextual and contextual Multi-Armed-Bandit (MAB) algorithms for multiple use cases
Language: Python - Size: 26.4 KB - Last synced at: 11 days ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 2

pm3310/mab-covid19
Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively
Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

nathanael-fijalkow/Multi-Armed-Bandits
Simple implementations of some algorithms for the multi-armed bandit problem and some plots to compare the algorithms
Language: Jupyter Notebook - Size: 421 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

Aditya-Ramesh-10/Basic_Bandit
Implementation of common bandit algorithms for the Bernoulli setting.
Language: Jupyter Notebook - Size: 165 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

Vinit-4689/Multi-Armed-Bandit
Efficient exploration and exploitation strategies using Epsilon-Greedy, UCB1, and Thompson Sampling — with code, math, and intuition.
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

tejassp2002/SoC-EconML
Language: Jupyter Notebook - Size: 7.31 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

JurajZelman/multi-armed-bandits
Several multi-armed bandit strategies with additional holding option for smoother exploration.
Language: Jupyter Notebook - Size: 297 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mobarski/kraken
Contextual Bandit Engine
Language: Python - Size: 906 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

neildhir/non-stationary-causal-bandits
Non-stationary multi-armed bandit under a causal perspective
Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

sarthakmittal92/multi-armed-bandits
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
Language: Python - Size: 330 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

MehranTaghian/prophet-inequlity-implementation
Implementation of the prophet inequalities
Language: Python - Size: 1.47 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

JavierSanzCruza/IRBandits
Java library for interactive recommendation.
Language: Java - Size: 1020 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

proceduralia/randomist
Code for Policy Optimization as Online Learning with Mediator Feedback
Language: Python - Size: 31.3 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

hstojic/BGSE_ReinforcementLearning
Teaching materials for a reinforcement learning course at the Barcelona Graduate School of Economics, Barcelona
Size: 25.7 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

abhinavcreed13/Multi-armed-bandits-MAB
This project implements famous MAB algorithms and evaluates them on the basis of their performance - EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson.
Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

swasun/BanditProblem 📦
A collection of implementations of the bandit problem.
Language: Jupyter Notebook - Size: 580 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

victor-iyi/policy-gradient
A policy gradient approach to a multi-armed bandit problem
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

The-RL-Hub/The-RL-Hub.github.io
The RL Hub is a Persian-language website dedicated to providing educational content on reinforcement learning. It offers a structured curriculum that begins with fundamental concepts, such as the definition of RL, its primary components, and illustrative examples to enhance understanding.
Language: JavaScript - Size: 838 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Asterinos1/RL_n_Dynamic_Optimization
This repo contains the projects made for the course "Reinforcement Learning and Dynamic Optimization" at TUC (2024).
Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

riccardopoiani/adv-mf-bai
Code for the paper "Optimal Multi-Fidelity Best-Arm Identification" (NeurIPS 2024)
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dmolitor/pyssed
The Mixture Adaptive Design (MAD): An experimental design for anytime-valid causal inference on Multi-Armed Bandits.
Language: Python - Size: 12.3 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

diyabodiwala/FlicksMAB
FlicksMAB is a movie recommendation system that leverages the power of multi-armed bandits (MAB) to personalize movie suggestions for users. Built using PyTorch, this system uses the MovieLens 100K dataset to learn user preferences and recommend movies that are likely to engage them.
Language: Python - Size: 55.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Alanjamlu34/Multi-Armed-Bandit--Adaptive-epsilon-greedy-
Final project repository on Multi-Armed Bandit
Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rafonsor/unRL
unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.
Language: Python - Size: 354 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Sudhansh6/Intelligent-Learning-Agents
A repository covering a range of topics from multi-arm bandits to reinforcement learning algorithms. Check out different applications of bandits, MDPs and RL algorithms along with theoretical aspects.
Language: Python - Size: 937 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

DennisSoemers/MultiArmedBanditsTutorial
Multi-Armed Bandits Tutorial
Language: Jupyter Notebook - Size: 63.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hritikb/Reinforcement-Learning-Algorithms
Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yiming-fang/hierarchical_bandits
Hierarchical Bandits for Music Recommendation
Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

amirhosein-prdv/Reinforcement-learning-exercises
This repository contains the answers to the "Coursera RL Specialization" course exercises
Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

JS2498/CS420-Reinforcement-Learning
CS420: Reinforcement Learning
Language: Jupyter Notebook - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

Orange-OpenSource/MAB_IoT
A C framework including Multi-player Multi-armed bandit (MP-MAB) algorithms applied to IoT networks and a LoRa network simulator
Language: C - Size: 3.66 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

elina-israyelyan/thompson-sampling
Package to implement the Thompson Sampling algorithm.
Language: Python - Size: 466 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

posgnu/bayesian-active-learning-on-multi-armed-bandit
Bayesian active learning algorithm with Thompson sampling on multi-armed bandit with Numpy
Language: Python - Size: 121 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

n4i9kita/ExploratoryProject
Analysis of various multi armed bandit algorithms over normal and heavy-tailed distributions.
Language: Jupyter Notebook - Size: 798 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Murtazali05/Multi-armed-bandit
Multi Armed Bandits implementation using the Jester Dataset
Language: Python - Size: 4.14 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

kyminbb/multi-armed-bandits
Implementation of multi-armed bandits from scratch
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Nikhilkohli1/Multi-Armed-bandit-Bayesian-A-B-Testing
Language: Jupyter Notebook - Size: 155 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

VigHub/Exam_IA
Development of algorithms for reinforcement learning. Specifically, software implementation of the algorithms and policies described in the paper Batched Multi-armed Bandits Problems, by Zijun Gao, Yanjun Han, Zhimei Ren, Zhengqing Zhou.
Language: Python - Size: 6.46 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

vishaal27/RL-M2019
Repository for the Reinforcement Learning (CSE564) Fall'19 course at IIIT Delhi
Language: Jupyter Notebook - Size: 9.48 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

harshsiloiya98/CS747-Assignments
Assignments for CS747 - Foundations of Intelligent and Learning Agents
Language: Python - Size: 692 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

NekuSakuraba/reinforcement-learning
A simple repository to highlight and explain Reinforcement Learning topics and concepts
Language: Jupyter Notebook - Size: 158 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

cotrane/Thompson-Sampler
Simple Thompson Sampler for a multi-armed bandit problem
Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

julbright/hastur
A multi-armed bandit implementation in Python
Language: Python - Size: 92.8 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0
