An open API service providing repository metadata for many open source software ecosystems.

Topic: "bandit-learning"

cair/TsetlinMachine

Code and datasets for the Tsetlin Machine

Language: Cython - Size: 491 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 475 - Forks: 51

cair/pyTsetlinMachine

Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget

Language: C - Size: 611 KB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 138 - Forks: 29

Nth-iteration-labs/contextual

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

Language: R - Size: 107 MB - Last synced at: 10 days ago - Pushed at: almost 5 years ago - Stars: 80 - Forks: 26

SamRagusa/Checkers-Reinforcement-Learning

A checkers reinforcement learning AI, and all the tools needed to train it.

Language: Python - Size: 49.8 KB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 55 - Forks: 14

cair/convolutional-tsetlin-machine-tutorial

Tutorial on the Convolutional Tsetlin Machine

Language: Python - Size: 316 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 53 - Forks: 13

cair/pyTsetlinMachineParallel

Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.

Language: C - Size: 290 KB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 9

thunfischtoast/LinUCB

LinUCB (Linear Upper Confidence Bound), a contextual bandit algorithm proposed by Li, Chu, Langford, and Schapire

Language: Java - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 11

mmalekzadeh/privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 6

Nth-iteration-labs/streamingbandit-ui

Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.

Language: JavaScript - Size: 13.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

etiennekintzler/visualize_bandit_algorithms

Some visualizations of bandit algorithm outputs.

Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 4

juliakreutzer/bandit-neuralmonkey (fork of ufal/neuralmonkey)

Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).

Language: Python - Size: 6.97 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 4

crenwick/Swiper

🦊 A series of bandit algorithms in Swift

Language: Swift - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: about 9 years ago - Stars: 4 - Forks: 0

anishacharya/Bandits-Online-Learning

Simple implementations of bandit algorithms in Python

Language: Jupyter Notebook - Size: 120 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

thiagopbueno/pybayesbandit

Bayesian bandits in Python3.

Language: Python - Size: 60.5 KB - Last synced at: 5 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

AntoineG92/Online-Clustering-of-Bandits-ENSAE

Based on the article "Online Clustering of Bandits" by Gentile, Li, and Zappella

Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: 12 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

shashankp914/Over-the-wire-wargames-Solutions

Detailed solutions to the OverTheWire wargames, currently covering Bandit, with more to be added in the future.

Size: 39.1 KB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

rasros/combo

Language: Kotlin - Size: 13.5 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

florian/reinforcement-learning

Implementing RL algorithms

Language: Jupyter Notebook - Size: 315 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 1

juliakreutzer/bandit-cdec (fork of redpony/cdec)

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

Language: C++ - Size: 232 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0

dhawale-lab/BanditPy

Language: Python - Size: 169 KB - Last synced at: 4 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 1

kapshaul/OnlineLearning

Repository of Online Learning algorithms, including Bandits, UCB, and more.

Language: Python - Size: 5.13 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

vwang0/causal_inference

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

znreza/RL_Best_Presentation

This presentation contains a precise yet detailed explanation of the concepts behind a very interesting topic: Reinforcement Learning.

Size: 4.82 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

victor-iyi/policy-gradient

A policy gradient approach to a multi-armed bandit problem

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

dscolby/Whiteboard

A virtual whiteboard so I don't forget the ideas that come to me

Size: 31.3 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

zeroinfiniti/bandit-wargames

Leveling up on the Bandit Wargames

Size: 485 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

fouratifares/RGL

Randomized Greedy Learning Under Full-bandit Feedback

Language: Python - Size: 150 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vitorhugo13/feup-mssi

Repository of code developed for the course MSSI @FEUP.

Language: Python - Size: 22.9 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ad0x99/linux-4-fun

My Linux Notes

Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

SFV-CORE/Bandit_OverTheWire

Here I will explain how to get past each level of the Bandit CTF provided by OverTheWire

Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

jpthanga/10-Arm-Bandit

Implementation of 10 Arm Bandit using RLGlue

Language: C - Size: 24.4 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

victor-iyi/contextual-bandit

A Reinforcement Learning approach to a contextual bandit problem.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

jonad/smartcab

Train a SmartCab how to drive using reinforcement learning.

Language: Jupyter Notebook - Size: 632 KB - Last synced at: 20 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

hartikainen/information-theoretic-bandit 📦

Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

DenzilFrancisCrasta/bandit

Language: Python - Size: 721 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (13), machine-learning (10), bandit (6), bandit-algorithms (5), bandit-algorithm (5), tsetlin-machine (4), propositional-logic (4), frequent-pattern-mining (4), reinforcement-learning-algorithms (4), contextual-bandits (4), multi-armed-bandits (4), convolution (3), online-learning (3), rule-based (3), linux (3), markov-decision-processes (2), interpretable-machine-learning (2), pattern-recognition (2), adversarial-learning (2), ab-testing (2), machine-translation (2), overthewire (2), multi-arm-bandits (2), weak-feedback (2), active-learning (2), cybersecurity (2), simulation (2), game-theory (2), exploration (2), exploitation (2), classification (2), regression (2), linucb (2), bandit-experiments (1), value-to-go (1), perception-action-cycle (1), cmab (1), contextual (1), contextual-bandit-policies (1), cran (1), k-armed-bandit (1), information-to-go (1), information-theory (1), td-learning (1), sarsa-learning (1), sarsa (1), rl-vs-unsupervised-learning (1), rl-vs-supervised-learning (1), passive-learning (1), model-free (1), model-based-rl (1), webapp (1), board-game (1), checker (1), checkers (1), checkers-reinforcement-learning (1), draughts (1), dynamic-programming (1), game (1), game-board (1), q-learning (1), embedding (1), interpretable (1), pygame (1), python (1), bayesian (1), belief-planning (1), rl (1), evaluation (1), multi-armed (1), multi-armed-bandit (1), offline-bandit (1), reinforcement (1), statistics (1), sumo (1), traci (1), learning-automata (1), adaptive-ad (1), linear-regression (1), upper-confidence-bound (1), adversarial (1), ai (1), alpha-beta-pruning (1), artificial-intelligence (1), streamingbandit-client (1), react (1), multiarm-bandit (1), javascript (1), client (1), online-learning-algorithms (1), contextual-bandit (1), tensorflow (1), policy-gradient (1), optimization (1), kotlin-library (1), kotlin (1), genetic-algorithm (1), swift (1), softmax (1), epsilon (1)