Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: thompson-sampling
thetawom/mabby
A multi-armed bandit (MAB) simulation library in Python
Language: Python - Size: 4.18 MB - Last synced: about 3 hours ago - Pushed: 1 day ago - Stars: 8 - Forks: 1
atse0612/Machine-Learning-A-Z
Language: Jupyter Notebook - Size: 706 KB - Last synced: 18 days ago - Pushed: about 6 years ago - Stars: 12 - Forks: 40
Shlok1810/Ad-Selection-Algorithm-using-Machine-learning
Which Advertisement is the best fit for our business we can directly get through UCB Algorithm
Language: Jupyter Notebook - Size: 18.6 KB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
alison-carrera/onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Language: Python - Size: 66.4 KB - Last synced: 24 days ago - Pushed: over 4 years ago - Stars: 171 - Forks: 43
krishnaaxo/Reinforcement-UCB-ThompsonSampling
Language: Jupyter Notebook - Size: 198 KB - Last synced: about 1 month ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
mykeels/multi-armed-bandit-problem
An implementation of solvers for the multi-armed-bandit-problem in JavaScript.
Language: JavaScript - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0
marmiskarian/AB-testing
An implementation of the Epsilon Greedy and Thompson Sampling algorithms using NumPy, pandas and Matplotlib.
Language: Jupyter Notebook - Size: 2.11 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
LaurentVeyssier/Maximize-Revenues-with-Thompson-Sampling
Maximize revenues of Online Retail Business with Thompson Sampling algorithm
Language: Jupyter Notebook - Size: 659 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0
alison-carrera/mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
Language: Python - Size: 2.1 MB - Last synced: 27 days ago - Pushed: over 1 year ago - Stars: 126 - Forks: 26
v-i-s-h/MAB.jl
A Julia Package for providing Multi Armed Bandit Experiments
Language: Julia - Size: 274 KB - Last synced: 2 months ago - Pushed: almost 6 years ago - Stars: 20 - Forks: 8
hmishfaq/LMC-LSVI
The official code release for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo, ICLR 2024.
Language: Python - Size: 32.9 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 2
graciangelica/Ads_CTR_Optimisation
🖱 Figure out which ad has the highest click rate
Language: Jupyter Notebook - Size: 60.5 KB - Last synced: 3 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
GuilongAaron/beta_distribution_adprediction
This program deploys Thompson Bandit algorithm to solve an ad prediction for highest probability of clicking.
Language: Python - Size: 68.4 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
cormac-rynne/bandits
Variety of Multi-Arm Bandit (MAB) algorithms using classic and advanced strategies, including tools for experiments and simulations in stationary and nonstationary environments
Language: Jupyter Notebook - Size: 4.07 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
stitchfix/mab
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Language: Go - Size: 82 KB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 44 - Forks: 6
Eric-Bradford/TS-EMO
This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).
Language: MATLAB - Size: 1.83 MB - Last synced: 4 months ago - Pushed: almost 4 years ago - Stars: 81 - Forks: 14
StivenMetaj/Data_Intelligence_Applications_Exam_Project
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing techniques applied to networks.
Language: Python - Size: 1.52 MB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 3
volvo-cars/eene-nav-bandit-sim
EENE Navigation Bandit Simulator
Language: Python - Size: 66.4 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
alexandrulita91/multi-armed-bandit
The Multi-armed bandit problem is one of the classical reinforcements learning problems that describe the friction between the agent's exploration and exploitation.
Language: Python - Size: 233 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
ZIYU-DEEP/Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
Size: 40 KB - Last synced: 25 days ago - Pushed: about 3 years ago - Stars: 13 - Forks: 0
sharmaroshan/Ads-Optimization
Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.
Language: Jupyter Notebook - Size: 84 KB - Last synced: 7 months ago - Pushed: almost 5 years ago - Stars: 13 - Forks: 8
andrecianflone/thompson
Thompson Sampling Tutorial
Language: Jupyter Notebook - Size: 305 KB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 41 - Forks: 15
Jayavathsan/MachineLearning-SciKitLearn
Using SciKit Learn few Deep Learning Rules and Algorithms are implemented
Language: Jupyter Notebook - Size: 1.09 MB - Last synced: 4 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
cotrane/Thompson-Sampler
Simple Thompson Sampler for a multi-armed bandit problem
Language: Jupyter Notebook - Size: 43.9 KB - Last synced: 9 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
Sagarnandeshwar/Bandit_Algorithms
Reinforcement Learning (COMP 579) Project
Language: Jupyter Notebook - Size: 3.03 MB - Last synced: 4 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0
posgnu/bayesian-active-learning-on-multi-armed-bandit
Bayesian active learning algorithm with Thompson sampling on multi-armed bandit with Numpy
Language: Python - Size: 121 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
annieyan/Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
Language: Python - Size: 3.91 KB - Last synced: 9 months ago - Pushed: almost 7 years ago - Stars: 10 - Forks: 3
twkillian/nonstationary_contextual_bandits
Repo for course CSC2558: "Intelligent Adaptive Interventions" project in nonstationary contextual bandits.
Language: Python - Size: 634 KB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 2
niffler92/Bandit
Bandit algorithms
Language: Python - Size: 300 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 29 - Forks: 6
michaelosthege/pyrff
pyrff: Python implementation of random fourier feature approximations for gaussian processes
Language: Jupyter Notebook - Size: 54.5 MB - Last synced: 11 days ago - Pushed: almost 2 years ago - Stars: 26 - Forks: 2
nphdang/Bandit-BO
Bayesian Optimization for Categorical and Continuous Inputs
Language: Python - Size: 188 KB - Last synced: 11 months ago - Pushed: almost 4 years ago - Stars: 16 - Forks: 2
GjjvdBurg/ThompsonSampling
Source code for blog post on Thompson Sampling
Language: JavaScript - Size: 18.6 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 5 - Forks: 1
VladMarianCimpeanu/OLA_project
Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)
Language: Jupyter Notebook - Size: 52.6 MB - Last synced: 12 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 3
Amshra267/Thompson-Greedy-Comparison-for-MultiArmed-Bandits
Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits
Language: Python - Size: 12.9 MB - Last synced: 9 months ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 0
Nikronic/Machine-Learning-Models 📦
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Language: Python - Size: 716 KB - Last synced: 25 days ago - Pushed: over 2 years ago - Stars: 31 - Forks: 12
NaynaJahan/Best-Ad-selection-to-optimize-CTR-using-Reinforcement-Learning-Thompson-Sampling-
Best Ad selection among many advertisements shown to different users/website viewer to optimize Click-through rate using Thompson Sampling - a reinforcement learning approach. As the customer navigates the website, they will suddenly get a pop-up ad, suggesting to them that they subscribe to the premium plan. For each customer browsing the website, only one of the nine strategies will be displayed. Then the user will choose, or not, to take action and subscribe to the premium plan. If the customer subscribes, the strategy is a success; otherwise, it is a failure. The more customers we do this with, the more feedback we collect, and the better idea we get of what the best strategy isThe data is taken through simulation and the best ad is shown using a histogram.
Language: Jupyter Notebook - Size: 43.9 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
Suchetaaa/CS747-Assignments
Foundations Of Intelligent Learning Agents (FILA) Assignments
Language: Python - Size: 3.04 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 5 - Forks: 0
sarthakmittal92/multi-armed-bandits
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
Language: Python - Size: 330 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
farhanchoudhary/Machine_Learning_A-Z_All_Codes_and_Templates
All codes, both created and optimized for best results from the SuperDataScience Course
Language: Python - Size: 5.7 MB - Last synced: 11 months ago - Pushed: over 6 years ago - Stars: 31 - Forks: 32
ishank-juneja/Correlated-AoI-Bandits
Author's implementation of the paper Correlated Age-of-Information Bandits.
Language: Python - Size: 176 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 9 - Forks: 0
doguilmak/Rock-Paper-Scissors-with-Thompson-Sampling
The aim of this project is to achieve maximum success by the algorithm in the rock-paper-scissors game with the reinforcement learning.
Language: Python - Size: 168 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
nimily/linear-ts
Codes for simulations in the paper "On Worst-case Regret of Linear Thompson Sampling"
Language: Python - Size: 2.33 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 2
antoine-hochart/bandit_algo_evaluation
Offline evaluation of multi-armed bandit algorithms
Language: Python - Size: 340 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 13 - Forks: 3
LukasRinder/bayesian-neural-networks
Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient exploration in reinforcement learning.
Language: Python - Size: 1.73 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1
swasun/BanditProblem 📦
A collection of implementations of the bandit problem.
Language: Jupyter Notebook - Size: 580 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0
rudrajit1729/Machine-Learning-Codes-And-Templates
Codes and templates for ML algorithms created, modified and optimized in Python and R.
Language: Python - Size: 21 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 4 - Forks: 3
Ralami1859/Stochastic-Multi-Armed-Bandit
Implementation of 9 multi-armed bandit algorithm for the stationary stochastic environment
Language: MATLAB - Size: 48.8 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 4 - Forks: 2
elina-israyelyan/thompson-sampling
Package to implement the Thompson Sampling algorithm.
Language: Python - Size: 466 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
RonyAbecidan/Neural-Thompson-Sampling
Study of the paper 'Neural Thompson Sampling' published in October 2020
Language: Jupyter Notebook - Size: 10.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 10 - Forks: 3
lionelsamrat10/Machine-learning-a-to-z
This repo contains code templates of all the machine learning algorithms that are used, like Regression, Classification, Clustering, etc.
Language: Jupyter Notebook - Size: 13.4 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
featmate/thompsonsampling-orderrpc
汤普森采样的通用服务,用于从redis中获得目标物品的alpha,beta值,然后过beta分布随机出一个数值后做排序
Language: Go - Size: 117 KB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
AlexandroLuis/Hyper-Heuristic
Thompson Sampling HH implementation to solve TSP in a genetic algorithm configuration.
Language: Jupyter Notebook - Size: 4.62 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
lucko515/ads-strategy-reinforcement-learning
The example of using reinforcement learning algorithms in the business, specifically finding what ads to use in our campaign.
Language: Jupyter Notebook - Size: 37.1 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 7 - Forks: 3
nphdang/turbo_bbo_neurips_2020
An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020
Language: Python - Size: 110 KB - Last synced: 11 months ago - Pushed: over 3 years ago - Stars: 7 - Forks: 0
naman2040/Ads_Optimisation
(REINFORCEMENT LEARNING) : We are given a dataset that contains information about the ads clicked by the visitors at each visit to a webpage (amongst 10 different ads). Our Task is to find the most viewed ad i.e ad having the highest distribution of the viewers in Minimum number of Rounds and Resources. Here I have used "Upper Confidence Bound" and "Thompson Sampling" models to get the insights.
Language: Jupyter Notebook - Size: 52.7 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1
vidits-kth/bayesla-link-adaptation
Bayesian Link Adaptation under a BLER Target
Language: Jupyter Notebook - Size: 2.4 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 3 - Forks: 2
doguilmak/Random-Seleciton-Upper-Confidence-Bound-and-Thompson-Sampling-on-Advertising-Preference
The purpose of this study is to predict which ad will be the most preferred by the customers over the fictitious ads clicked by the users.
Language: Python - Size: 28.3 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
akshaykhadse/reinforcement-learning
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Language: Python - Size: 20.4 MB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 15 - Forks: 6
CavenaghiEmanuele/Multi-armed-bandit
Library on Multi-armed bandit
Language: Python - Size: 29 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2
paramrathour/Intelligent-and-Learning-Agents
My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22
Language: Python - Size: 19.2 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
proceduralia/randomist
Code for Policy Optimization as Online Learning with Mediator Feedback
Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
rssalessio/Parallel-Bayesian-Optimization-Thompson-Sampling
Language: Python - Size: 29.3 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 2
k9luo/Deep-Preference-Elicitation
A Comparative Evaluation of Active Learning Methods in Deep Recommendation
Language: Jupyter Notebook - Size: 14.6 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 3 - Forks: 1
aijunbai/thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Language: C++ - Size: 948 KB - Last synced: about 1 year ago - Pushed: almost 8 years ago - Stars: 14 - Forks: 0
aashish22bansal/Best-Ads-Predictor
Predicting the best Ad from the given Ads.
Language: Jupyter Notebook - Size: 63.5 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
hardhik-99/Thompsom_Sampling_GoF
Thompson Sampling equipped with Goodness of Fit test based active change-point detection in Non-Stationary Bandit environment
Language: Python - Size: 96.7 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
SamueleMeta/data-intelligence-applications Fork of StivenMetaj/Data_Intelligence_Applications_Exam_Project
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing online learning techniques applied to networks.
Language: Python - Size: 1.52 MB - Last synced: 12 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0
amirabbasii/A3C_Kung_Fu
Language: Jupyter Notebook - Size: 1.53 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
kevinzakka/learn-mab
Learning about Stochastic Multi-Armed Bandit Algorithms
Language: Python - Size: 2.73 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 8 - Forks: 1
rueian/gobandit
A golang library for solving multi armed bandit problem which can optimize your business choice on the fly without A/B testing
Language: Go - Size: 6.84 KB - Last synced: 9 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0
R4j4n/Maximizing-Revenue-of-an-Online-Retail-Business
Language: Python - Size: 32.2 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 7 - Forks: 0
Sanket758/Machine-Learning-A-Z
Language: Jupyter Notebook - Size: 2.94 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1
idjoannachen/botorch Fork of pytorch/botorch
Bayesian optimization in PyTorch
Language: Python - Size: 15.4 MB - Last synced: 12 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
preferred-pictures/python
A python client for the PreferedPictures API
Language: Python - Size: 183 KB - Last synced: 13 days ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
preferred-pictures/ruby
A Ruby client for the PreferredPictures API.
Language: Ruby - Size: 9.77 KB - Last synced: 27 days ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
preferred-pictures/php
A PHP client for the PreferredPictures API.
Language: PHP - Size: 597 KB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
preferred-pictures/node
A Node.js client for PreferredPictures API.
Language: TypeScript - Size: 113 KB - Last synced: 3 days ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
EmanuelAlogna/Data-Intelligence-Applications
Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan
Language: Python - Size: 9.52 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 2
vmarchaud/ts-mab
Typescript implementation of a multi-armed bandit
Language: TypeScript - Size: 107 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
Eric-Su-2718/Reinforcement-learning-methods-for-the-multi-armed-bandit-problem
Implementation of the Upper confidence bounds and Thompson sampling algorithms in R for the multi armed bandit problem
Size: 7.81 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1
etscrivner/thompson_sampling
Prototype use of Thompson Sampling for interview candidate evaluation.
Language: D - Size: 173 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0