GitHub topics: bandit-learning
bapungiri/BanditPy
Language: Python - Size: 126 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Language: R - Size: 107 MB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 80 - Forks: 26

cair/TsetlinMachine
Code and datasets for the Tsetlin Machine
Language: Cython - Size: 491 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 475 - Forks: 51

cair/pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
Language: C - Size: 611 KB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 135 - Forks: 28

cair/convolutional-tsetlin-machine-tutorial
Tutorial on the Convolutional Tsetlin Machine
Language: Python - Size: 316 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 53 - Forks: 13

cair/pyTsetlinMachineParallel
Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
Language: C - Size: 290 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 9

SamRagusa/Checkers-Reinforcement-Learning
A checkers reinforcement learning AI, and all the tools needed to train it.
Language: Python - Size: 49.8 KB - Last synced at: 4 months ago - Pushed at: almost 5 years ago - Stars: 55 - Forks: 14

hartikainen/information-theoretic-bandit 📦
Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

kapshaul/OnlineLearning
Repository of Online Learning algorithms, including Bandits, UCB, and more.
Language: Python - Size: 5.13 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

dscolby/Whiteboard
A virtual whiteboard so I don't forget the ideas that come to me
Size: 31.3 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shashankp914/Over-the-wire-wargames-Solutions
Detailed solution of solving wargames of over the wire which includes bandit and in future many more.
Size: 39.1 KB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

AntoineG92/Online-Clustering-of-Bandits-ENSAE
Based on Gentile-Li-Zapella article "Online Clustering of Bandits"
Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

victor-iyi/policy-gradient
A policy gradient approach to a multi-armed bandit problem
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

crenwick/Swiper
🦊 A series of bandit algorithms in Swift
Language: Swift - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: almost 9 years ago - Stars: 4 - Forks: 0

zeroinfiniti/bandit-wargames
Leveling up on the Bandit Wargames
Size: 485 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

DenzilFrancisCrasta/bandit
Language: Python - Size: 721 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

florian/reinforcement-learning
Implementing RL algorithms
Language: Jupyter Notebook - Size: 315 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 1

vwang0/causal_inference
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

rasros/combo
Language: Kotlin - Size: 13.5 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

juliakreutzer/bandit-neuralmonkey Fork of ufal/neuralmonkey
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
Language: Python - Size: 6.97 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 4

juliakreutzer/bandit-cdec Fork of redpony/cdec
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
Language: C++ - Size: 232 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0

thunfischtoast/LinUCB
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
Language: Java - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 11

fouratifares/RGL
Randomized Greedy Learning Under Full-bandit Feedback
Language: Python - Size: 150 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 6

anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
Language: Jupyter Notebook - Size: 120 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

etiennekintzler/visualize_bandit_algorithms
Some visualizations of bandit algorithm outputs.
Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 4

ad0x99/linux-4-fun
My Linux Notes
Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

vitorhugo13/feup-mssi
Repository of code developed for the course MSSI @FEUP.
Language: Python - Size: 22.9 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Nth-iteration-labs/streamingbandit-ui
Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.
Language: JavaScript - Size: 13.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

SFV-CORE/Bandit_OverTheWire
Aqui irei explicar como passar de cada nÃvel do CTF Bandit fornecido pela Over The Wire
Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

znreza/RL_Best_Presentation
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
Size: 4.82 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

thiagopbueno/pybayesbandit
Bayesian bandits in Python3.
Language: Python - Size: 60.5 KB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

jpthanga/10-Arm-Bandit
Implementation of 10 Arm Bandit using RLGlue
Language: C - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

jonad/smartcab
Train a SmartCab how to drive using reinforcement learning.
Language: Jupyter Notebook - Size: 632 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
