An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multiarm-bandit

mobarski/kraken

Contextual Bandit Engine

Language: Python - Size: 906 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

duoan/OpenMultiarmedBandits

A open source multi arm bandit framework for optimize your website quickly. You’ll quickly use the benefits of several simple algorithms—including the epsilon-Greedy, Softmax, and Upper Confidence Bound (UCB) algorithms—by working through this framework written in Java, which you can easily adapt for deployment on your own website.

Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

duoan/recsys

A end-to-end open source recommender platform, include data collection, feature engineering and ABTest, recommend algorithm.

Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

cormac-rynne/bandits

Variety of Multi-Arm Bandit (MAB) algorithms using classic and advanced strategies, including tools for experiments and simulations in stationary and nonstationary environments

Language: Jupyter Notebook - Size: 4.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Xinjie-Lan/Multi-Armed_Bandit

python implementation of e-Greedy, UCB, LinUCB, LinThompson, and offline evaluator

Language: Jupyter Notebook - Size: 723 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

FanchenBao/reinforcement_learning

Code examples for simple reinforcement learning projects

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

niazangels/bandits

An introduction to multi arm bandits

Language: Jupyter Notebook - Size: 2.46 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Shahul-Rahman/MABSearch-Learning-the-learning-rate

MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent

Language: Jupyter Notebook - Size: 506 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

niffler92/Bandit

Bandit algorithms

Language: Python - Size: 300 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 29 - Forks: 6

sourcecode369/ml-algorithms-on-scikit-and-keras

Implementation scripts of Machine Learning algorithms on Scikit-learn and Keras for complete novice..

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 17 - Forks: 12

viswanath57/Bandit-Algorithms

Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 4

akshaykhadse/reinforcement-learning

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

Language: Python - Size: 20.4 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

CavenaghiEmanuele/Multi-armed-bandit

Library on Multi-armed bandit

Language: Python - Size: 29 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

afiliot/Information-Directed-Sampling-For-Multi-Arm-Bandit-Problems

Review project on Information Directed Sampling - MVA MSc

Language: Python - Size: 1.58 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

MassimoGennaro/DIA_Project_PoliMi

Data Intelligence Application project

Language: Jupyter Notebook - Size: 7.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 2

Nth-iteration-labs/streamingbandit-ui

Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.

Language: JavaScript - Size: 13.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

himeag/Class--Machine-Learning

University of Utah—MKTG 66420 | Taken: Fall 2020

Language: HTML - Size: 1.12 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

Bilkent-CYBORG/FeedBAL

Implementation of the FeedBack Adaptive Learning (FeedBAL) algorithm for the episodic multi-armed bandit (eMAB) setting.

Language: Python - Size: 51.8 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Related Keywords
multiarm-bandit 18 reinforcement-learning 8 machine-learning 5 multiarmed-bandits 4 thompson-sampling 4 bandit-algorithms 3 epsilon-greedy 2 ucb1 2 linucb 2 ucb 2 deep-learning 2 recommendation-system 2 recommendation-engine 2 contextual-bandits 2 multi-armed-bandit 2 multi-armed-bandits 2 abtest 2 outlier-detection 1 reinforcement-learning-analysis 1 randomized-policy-iteration 1 randomised-algorithms 1 policy-iteration 1 policy-evaluation 1 mdps 1 markovian-epidemic-processes 1 linear-programming 1 kl-divergence 1 howards-pi 1 batch-switching 1 softmax-algorithm 1 algorithms 1 xgboost 1 t-sne 1 scikit-learn 1 logistic-regression 1 cluster-analysis 1 webapp 1 pca-analysis 1 streamingbandit-client 1 react 1 javascript 1 client 1 bandit-learning 1 svm-classifier 1 time-series-analysis 1 bandit-algorithm 1 pricing 1 advertising 1 mva 1 information-directed-sampling 1 thompson-algorithm 1 reinforcement-learning-excercises 1 regression 1 gradient-descent 1 global-optimization-algorithms 1 global-optimization 1 global-minimum 1 tabular-methods 1 actor-critic 1 multi-arm 1 exp3-algorithm 1 recsys 1 recommender-system 1 neural-network 1 data-collection 1 data-cleaning 1 website-optimization 1 statistical-models 1 optimization-algorithms 1 openmultiarmedbandits 1 distribution 1 pandas 1 numpy 1 natural-language-processing 1 matplotlib 1 kfold-cross-validation 1 keras 1 grid-search 1 dimensionality-reduction 1 data-preprocessing 1 clustering 1 classification 1 association-rule-learning 1 simulation 1 contextual-bandit 1 python 1 optimization 1 metaheuristics 1 learning-rate 1