An open API service providing repository metadata for many open source software ecosystems.

Topic: "bandit-algorithm"

andrecianflone/thompson

Thompson Sampling Tutorial

Language: Jupyter Notebook - Size: 305 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 41 - Forks: 15

thunfischtoast/LinUCB

LinUCB (Linear Upper Confidence Bound) contextual bandit algorithm, as proposed by Li, Chu, Langford, and Schapire

Language: Java - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 11

iheartradio/thomas

Another A/B test library

Language: Scala - Size: 5.12 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 8

mmalekzadeh/privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 6

raklokesh/ReinforcementLearning_Sutton-Barto_Solutions

Solutions and figures for problems from Reinforcement Learning: An Introduction by Sutton & Barto

Language: Python - Size: 4.47 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 20 - Forks: 4

adik993/reinforcement-learning-sutton

Language: Python - Size: 75.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 3

clreda/NORDic

Network-Oriented Repurposing of Drugs Python Package

Language: Jupyter Notebook - Size: 101 MB - Last synced at: 7 days ago - Pushed at: 17 days ago - Stars: 8 - Forks: 2

Nth-iteration-labs/streamingbandit-ui

Client that handles the administration of StreamingBandit online or straight from your desktop. Set up and run streaming (contextual) bandit experiments in your browser.

Language: JavaScript - Size: 13.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

niravnb/Movie-Recommendation-using-Cascading-Bandits

Movie recommendation using cascading bandits, namely CascadeLinTS and CascadeLinUCB

Language: MATLAB - Size: 9.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 2

simerplaha/reinforcement-learning

Reinforcement learning

Language: Scala - Size: 174 KB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

Ralami1859/Adversarial-Multi-Armed-bandit

Adversarial multi-armed bandit algorithms

Language: MATLAB - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

alextanhongpin/go-a-b

A/B testing metrics collection with golang

Language: Go - Size: 45.9 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

NickKaparinos/Stanford-CS-234-RL-2022

Solutions to the Stanford CS234 Reinforcement Learning 2022 course assignments.

Language: Python - Size: 228 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

vwang0/causal_inference

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

rasros/evolutionarybandit

Research project on automated A/B testing of software by evolutionary bandits.

Language: MATLAB - Size: 454 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

znreza/RL_Best_Presentation

This presentation gives a precise yet detailed explanation of the concepts of reinforcement learning.

Size: 4.82 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

kapshaul/Join-Game (fork of OSU-IDEA-Lab/Join-Game)

Online learning approaches to optimize database join operations in PostgreSQL.

Language: C - Size: 65.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mgpopinjay/bandit-algorithms

A small collection of Bandit Algorithms (ETC, E-Greedy, Elimination, UCB, Exp3, LinearUCB, and Thompson Sampling)

Language: Python - Size: 447 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

KavishBhatia/MachineLearning

Language: Jupyter Notebook - Size: 888 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (7), bandit-learning (5), contextual-bandits (3), sarsa (3), online-learning (2), sutton-book (2), machine-learning (2), ab-testing (2), bandit (2), recommender-system (2), postgresql (1), deep-reinforcement-learning (1), dqn (1), policy-gradients (1), pytorch (1), stanford-university (1), active-learning (1), alphago (1), exploitation (1), gene-regulatory-network-inference (1), database-join (1), c (1), thompson-sampling (1), linucb (1), java (1), simulation (1), experiment (1), movie-recommendation (1), sutton-gridworld (1), sutton-gambler (1), short-corridor (1), semi-gradient-sarsa (1), gene-regulatory-network (1), drug-simulation (1), drug-repurposing (1), boolean-network (1), scala (1), public (1), functional-reactive-programming (1), functional-programming (1), bayesian-analysis (1), bayesian (1), bandits (1), nomad (1), golang (1), go (1), td-learning (1), sarsa-learning (1), rl-vs-unsupervised-learning (1), rl-vs-supervised-learning (1), reinforcement-learning-algorithms (1), passive-learning (1), model-free (1), model-based-rl (1), exploration (1), td-lambda (1), random-walk (1), racecar (1), q-learning (1), multi-armed-bandits (1), gridworld (1), dyna-q (1), cliffwalking (1), adversarial-machine-learning (1), support-vector-machines (1), stochastic-gradient-descent (1), gaussian-discriminant-analysis (1), binary-logistic-regression (1), adaboost (1), recommendation (1), privacy-preserving-machine-learning (1), privacy-preserving-bandits (1), online-machine-learning (1), federated-learning (1), differentially-private (1), differential-privacy (1), criteo-dataset (1), bandit-algorithms (1), rl-sutton (1), qlearning (1), optimal-policy (1), mountain-car (1), maximization-bias (1), infinite-variance (1), gradient-descent (1), feature-engineering (1), dynaq (1), blackjack-montecarlo (1), batch-update (1), webapp (1), streamingbandit-client (1), react (1), multiarm-bandit (1), javascript (1), client (1), monte-carlo (1), markov-decision-processes (1), bellman-equation (1), matlab (1), genetic-algorithm (1)